education category. Segment Overall No degree High school Some college Bachelor's + *In the United States, from 2000 to 2013, inflation adjusted The median wage increased by 0.9%* Change in Median Wage (%) +0.9% -7.9% -4.7% -7.6% -1.2%
More people with Bachelor's degrees working. Fewer people without a degree working. Mix Effects: the labor force shifted. Change in Median Wage (%) +0.9% -7.9% -4.7% -7.6% -1.2% $827 $472 $651 $748 $1194 Segment Overall No degree High school Some college Bachelor's +
More people with Bachelor's degrees working. Fewer people without a degree working. Higher educated, and higher paid, segments increased in size. Change in Number Employed (%) +6.4% -21.3% -10.6% +5.4% +33.0% Change in Median Wage (%) +0.9% -7.9% -4.7% -7.6% -1.2% Median Wage, 2013 $827 $472 $651 $748 $1194 Mix Effects: the labor force shifted. Segment Overall No degree High school Some college Bachelor's +
weight/size" "value" Change in Median Wage (%) +0.9% -7.9% -4.7% -7.6% -1.2% Change in Number Employed (%) +6.4% -21.3% -10.6% +5.4% +33.0% Change in Median Wage (%) +0.9% -7.9% -4.7% -7.6% -1.2% Median Wage, 2013 $827 $472 $651 $748 $1194 Segment Overall No degree High school Some college Bachelor's +
Race and the death penalty in Florida Standardized test scores/education spending Active debate about diagnosis and treatment of meningococcal disease The "hot hand" in basketball; Baseball hitting averages Treatments for kidney stones My motivation: analyzing financial data at Google
or Size before Weight/Size after Change or difference in weight/size ex: median wage before, or unemployment rate before ex: median wage after, or unemployment rate before ex: change in median wage, or change in unemployment rate ex: percent of total labor force, or number of people in labor force before ex: labor force after ex: change in percent of total workers, or change in number of workers 6 Variables that (might) matter
the size of the box and the color of the box value or weight values or weights change in value, or change in weight Standard vis: only a few variables size represents geographic size rather than size metric
"Disaggregated spatial modelling for areal unit categorical data." Journal of the Royal Statistical Society: Series C (Applied Statistics) 59.1 (2010): 175-190. Example: maps Question: what part of North Carolina has the worst "low birth weight" percentage? (shown in darker colors)
people in labor force start: end Replace the points with a comet. 6 variables values, weights before, after, change Danny Holten and Jarke J. van Wijk. 2009. A user study on visualizing directed edges in graphs. InProceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '09). ACM, New York, NY, USA, 2299-2308. DOI=10.1145/1518701. 1519054 http://doi.acm.org/10.1145/1518701.1519054
in size while orange segments decreased in size. Comet Chart weight/size value ex: unemployment rate ex: number of people in labor force size increasing size decreasing
Fetal Death Rate: deaths per 1000 born (log) Number of babies born (log) Each comet represents babies born in 10 different US states and birth-weight category (very small, small, etc). Fetal Death Rate and number of babies born by birth-weight categories and 10 US states increase in number employed decrease in # workers late 1990's late 2000's
effects first! Visualization can help us see the flow of our data and to ask data-driven questions Try Comet Charts as a way to discover mix in your data! Summary Thank you to Martin Wattenberg, Fernanda Viégas and their Google Data Visualization Research Team!
that although these paradoxes reveal the perils of using statistical criteria to guide causal analysis, they hold neither the explanations of the phenomenon they depict nor the pointers on how to avoid them.” - Arah "The inclusion of additional control variables may increase or decrease the bias, and we cannot know for sure which is the case in any particular situation." - Clarke Arah, Onyebuchi A. "Emerging Themes in." Emerging Themes in Epidemiology5 (2008): 5. Clarke, Kevin A. "The phantom menace: Omitted variable bias in econometric research." Conflict Management and Peace Science 22.4 (2005): 341-352.