Archive

Archive for the ‘quantitative’ Category

Tableau Public

June 1, 2011 Leave a comment

After downloading but just not getting around to using the software on three separate occasions, I finally created my first chart (or should I say “visualization”) using the free Tableau Public platform. The visualization I created was for Give Me The Rock and it graphed the top 200 fantasy basketball players on a scatter plot based on how similar or dissimilar they played during the 2010-11 season.

While I am by no means an expert on using the software, here are my first impressions on Tableau Public.

The Good Stuff

I mean, let’s start with the fact that Tableau Public is free to use. In a world where I pay to drink water, that is a very good thing. Loading up the software, I noticed that the interface is very slick and creating a basic chart is fairly easy to do. I’m a guy who likes to jump in head first without reading a manual, and I appreciate I could do that and still create something cool. For the most part, the software is drag and drop with your data displayed on the left hand side of the screen and the visualization on the right. You simply start dragging your data into the appropriate place and the magic happens.

Also, the visualizations look great – on par or better than what Excel can produce. The shapes, colors, labels are all sharp and really pop on the screen. But the best thing about the software is that Tableau Public knows exactly what it is and what people use it for. The chart options that it provides seem spot on and the ease and speed at which the software allows you to slice and dice your data and the way you present it is something that Excel can’t come close to touching. For example, my visualization of 200 NBA players was original crammed on the screen, but Tableau Public has a pages option that allows you to split out your presentation into different pages which can be viewed individually. I ended up splitting my graph into pages by player position to make it more readable.

If your the type of person that likes to experiment with different ways to present data, then Tableau Public has the potential to save you a lot of time.

Finally, Tableau Public makes it incredibly easy to share visualizations on the web. It provides scripts that you can copy and paste right into your blog, as well as an option to email it to others. You can also download any visualization on the web onto your computer if you have Tableau Public. No more worrying about versions of Excel and compatibility issues.

The Bad Stuff

Like all cloud software that I’ve used, Tableau Public is a touch slow for my tastes, especially in regard to loading and saving data. For a one off chart (excuse me, visualization) I can deal with it, but given the spastic way I typically work, I wouldn’t want to work with it all day long. Tableau has professional versions of the software (for $999 and $1,999 depending on the version) that I’d assume solves this problem by not requiring you to work in a cloud.

And while the software is generally easy to use, I’d say it’s still a step or two away from being perfectly intuitive. There were a few things that took me a while to figure out. I didn’t realize you could drag the interactive legends/tools around the screen and they would be presented in that exact location when you published the visualization (very handy). And while I split my graph by player position to make it easier to read, I still wanted to add a total page that presented all the data on the screen at once. I never did figure out how to do that.

Finally – and this may bother some people more than others – while Tableau Public is free, you are required to save your visualization to their servers. Once published, your data is in the public domain for all to see. Obviously if you have proprietary data, Tableau Public is not for you (although again, the paid versions will solve this problem).

The Verdict

Overall, I’m impressed with Tableau Public. As a guy who uses Excel on a daily basis, it’s not going to replace that for my day-to-day work, but it has definite advantages over Excel like its ability to slice and dice data in any number of ways and the fact that Tableau makes it very easy to share visualizations with others. The next time I create a graph that is going to be displayed on a website, I’d hands down use Tableau Public do to that. And did I mention it’s free?

>Quantitative Analytical Techniques

July 9, 2010 Leave a comment

>Statistics is a branch of mathematics that deals with the collection, analysis, interpretation, and presentation of masses of numerical data. When a measurement is calculated for an entire population, say the average age, it’s called a parameter. When we look across a sample and calculate a measurement, also the average age, we call it a statistic. Since people make entire careers out of the study of statistics, the point of this post is to present a birds-eye overview and brief description of common terms you’ll hear in conversations about quantitative analysis.

When discussing statistics, researchers usually talk about the data in terms of “variables.” A variable is a characteristic that may assume more than one set of values (age, income, birth place can all have more than one value). A variable can either be nominal, ordinal, interval, or ratio in its scale. Nominal variables are also referred to as categorical variables because they represent categories of responses. The color of a car would be represented by a categorical value (for example, black, red, or silver). Categorical variables have no set order, meaning that a black car is not necessarily any better than a silver car.

The level of satisfaction with one’s car on a 1 to 10 scale is an example an ordinal variable (where a 10 is a better score than a 5 and a 5 is better than a 1). Ordinal variables have a clear, set order, but they still represent categories of responses. Interval and ratio variables are numerical variables whose numbers have direct meaning. The age of a car would be ratio variable because it can be measured precisely and at equal intervals (in hours, years, or decades).

Variables can also be discrete or continuous. Continuous variables, such as time, have an infinite number of possible values, while discrete variables, such as a satisfaction scale, have a finite (in this case 10) number of possible values.

Descriptive statistics are simple portrayals of what the variables show. They are summaries of the frequency of the different values (like percentages); the central tendency (mean, median or mode); and the dispersion (like the range and the standard deviation). Cross tabs (short for tabulations) are popular for displaying the joint distribution of two or more variables. They are usually presented in a matrix called a contingency table. In a cross tab table, each cell gives the number of respondents that gave a particular combination of responses.

Measures of association summarize the relationship between two variables (correlation and regression, for instance). Two variables are associated when information about one can help us predict information about the other. A variety of techniques to measure association are available, each better suited to different classes of variables. When analyzing data, most statisticians use multivariate analysis where the effects of many variables are considered.

Tests of statistical significance are used to determine how sure we can feel about the associations found in the data — Could it just be chance? Can we infer that the result can be generalized to the study population? Confidence intervals, chi square tests and t-tests are the most common statistics used to indicate the probability of saying that there is a difference between two groups when actually there is none (level of significance).

Measures of association can be used in very sophisticated ways. Conjoint analysis can be used to determine trade-offs customers are willing to make among product or service attributes. In addition to understanding current preferences, this technique allows modeling of the impact of the introduction of new factors on preferences.

Discrete choice analysis models selection of a product or concept with many attributes from a set of products or concepts. In essence, it models how people make decisions in the real world. For example, one could test products with varying combinations of features to assess which consumers prefer. As with conjoint analysis, discrete choice analysis allows modeling of the impact of the introduction of a new product or concept on factors such as market share.

Cluster analysis identifies population segments using groups of variables. This provides information to better understand and communicate with customers, or help you understand your place in the market place. In general, whenever one needs to classify a mass of information into manageable and meaningful results, cluster analysis is a technique of great usefulness.

Discriminant analysis is used to define which variables best differentiate between predefined groups. The key difference is that discriminant analysis relies on previously defined groups whereas cluster analysis uses the data to discover these groups.

Factor analysis finds the underlying construct behind answers to a series of questions. In other words, factor analysis is designed to classify variables. For clients, it simplifies the interpretation of answers to many questions to a few “factors” that seem to drive answers to all questions. It can be used to determine the key factors that drive aspects like satisfaction, image or customer retention. In addition, factor analysis is used when designing surveys. Often complex concepts (like “leadership”) need to be turned into a group of concrete questions in order to query meaningfully.

Regression Analysis (linear, non-linear and logistic) is widely used for forecasting. It compares the effects of one or more variables on another. The objective of regression analysis is to understand the relationship between several independent or predictor variables on a dependent or criterion variable. This allows forecasting or estimation of the change in a dependent variable based on the change in an independent variable.

***

http://rcm.amazon.com/e/cm?t=patrmadd0a-20&o=1&p=8&l=bpl&asins=0761925767&fc1=000000&IS2=1&lt1=_blank&m=amazon&lc1=0000FF&bc1=000000&bg1=FFFFFF&f=ifrhttp://rcm.amazon.com/e/cm?t=patrmadd0a-20&o=1&p=8&l=bpl&asins=0521674654&fc1=000000&IS2=1&lt1=_blank&m=amazon&lc1=0000FF&bc1=000000&bg1=FFFFFF&f=ifrhttp://rcm.amazon.com/e/cm?t=patrmadd0a-20&o=1&p=8&l=bpl&asins=B00387FOGM&fc1=000000&IS2=1&lt1=_blank&m=amazon&lc1=0000FF&bc1=000000&bg1=FFFFFF&f=ifr

>Qualitative vs. Quantitative Research

April 30, 2010 2 comments

>Discussing research methods can be confusing because what is often called a research method may actually be a data collection technique (such as a focus group) or it may be an analytical technique (such as a correlation study, where data is analyzed to discover relationships between things). Even more confusing are the way the terms “qualitative” and “quantitative” tend to be used in the survey research industry.

What is Qualitative Research?

Qualitative research is used to understand how and why people think the way they do using data collection techniques such as observation, storytelling and in-depth discussion. Qualitative research does not include numerical analysis of responses, at least in any way that can be generalized beyond the group of people you are studying. For example, if I was thinking about starting a service called “Man Maids” where men could be hired to perform chores around the house, I might first sit down with some woman and collect qualitative information about what they think about that service. Using the results of this discussion, I could form hypotheses about the service and its ability to succeed, but would not get any actual numerical information, such as the percentage of women in my area who would use that service.

Qualitative research is a wide field of study that attempt to examine and interpret people’s attitudes, perceptions and beliefs using techniques that probe or explore. In reality, a lot of times the term qualitative research is used when people really mean “focus groups” because it is easy to visualize a bunch of people sitting in a room talking about nonsense. However, the term also applies to other research approaches (such as in-depth interviews, content analysis, and ethnography)

What is Quantitative Research?

Quantitative investigation is analysis designed to determine the amounts or proportions of the components of something being studied. For example, while my discussions with women would allow me to develop hypotheses about my potential service man maid service, I would need to conduct quantitative research to test those hypotheses and measure the level of interest in my potential service area.

Like qualitative research, quantitative survey research attempts to investigate human activity. Unlike it, however, quantitative research does this by measuring how many times certain behaviors take place (buying a product, hiring man maids), or how many times people respond to a question in a specific way. We can then make reasonable estimates, projections and predictions of how a population of people may behave.

***

In the course of any particular research study, a combination of qualitative and quantitative methods can be employed. And, different combinations of data collection and analysis techniques, also referred to as “research methods,” will be utilized.

Coming up will be more detailed descriptions of frequently used research methods. Of course, the type of data you will be collecting (qualitative vs. quantitative) will have an effect on the data collection and analysis techniques you use and the types of interpretation that can be done. Focus groups, for instance, are by design representative only of the participants in the groups. They provide information that can be used to develop hypotheses. These hypotheses would need to be tested using some type of quantitative research in order to confirm whether they are true. Random quantitative surveys are conducted to be representative of a population, but they only if the survey questions, sampling and data collection processes are conducted in a scientific way.

Follow

Get every new post delivered to your Inbox.