horrible ‣ Talks are the best medium to get your point across ‣ https://youtu.be/WAwDvbIfkos ‣ http://journals.plos.org/ploscompbiol/ article?id=10.1371/journal.pcbi.0030077
❖ A critical part of being a successful scientist ❖ Should be done at all stages of the scientific process ❖ Data exploration ❖ Data analysis ❖ Final presentation ❖ Do NOT be the guy/girl that makes plots like this for a paper or ends up at http://wtfviz.net
(2005) Rational inferences about departures from Hardy- Weinberg equilibrium. American Journal of Human Genetics 76:967-986, Figure 1 http://www.biostat.wisc.edu/~kbroman/topten_worstgraphs/
Inference on haplotype effects in case-control studies using unphased genotype data. American Journal of Human Genetics 73:1316-1329, Figure 1 http://www.biostat.wisc.edu/~kbroman/topten_worstgraphs/
Hassel BA (2001) Role for p53 in gene induction by double-stranded RNA. J Virol 75:7774-7777, Figure 4 http://www.biostat.wisc.edu/~kbroman/topten_worstgraphs/
mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs. Cell 116:499-509, Figure 1 http://www.biostat.wisc.edu/~kbroman/topten_worstgraphs/
Higher levels of serum triglyceride and dietary carbohydrate intake are associated with smaller LDL particle size in healthy Korean women. Nutrition Research and Practice 6:120-125, Figure 1 http://www.biostat.wisc.edu/~kbroman/topten_worstgraphs/
validated as a surrogate endpoint for survival amoung epoetin- treated hemodialysis patients. Journal of Clinical Epidemiology 57:1086-1095, Figure 2 http://www.biostat.wisc.edu/~kbroman/topten_worstgraphs/
VC, White RL, Weber JL (1998) Comprehensive human genetic maps: Individual and sex-specific variation in recombination. American Journal of Human Genetics 63:861-869, Figure 1 http://www.biostat.wisc.edu/~kbroman/topten_worstgraphs/
demonstrates how the aspect ratio of a line chart can affect an analyst's perception of trends in the data. Cleveland proposes an optimization technique for computing the aspect ratio such that the average absolute orientation of line segments in the chart is equal to 45 degrees. This technique, called banking to 45 degrees, is designed to maximize the discriminability of the orientations of the line segments in the chart.” http://vis.berkeley.edu/papers/banking/ Two plots of monthly atmospheric carbon dioxide measurements, taken from 1959 to 1990. The first plot, with an aspect ratio of 1.17, reveals an accelerating increase in CO2 levels. The second plot, with an aspect ratio of 7.87, facilitates closer inspection of seasonal fluctuations, revealing a gradual attack followed by a steeper decay. These aspect ratios were automatically determined using multi-scale banking.
I., Jones, S., & Marra, M. (2012). Getting into visualization of large biological data sets: 20 imperatives of information design. Poster presented at 2nd IEEE Symposium on Biological Data Visualization (BioVis 2012), Seattle, WA. ❖ Human eye acuity is ~50 cycles/degree or about 1/200 (0.3 pt) at 10 inches
acuity is ~50 cycles/degree or about 1/200 (0.3 pt) at 10 inches Krzywinski, M., Brol, I., Jones, S., & Marra, M. (2012). Getting into visualization of large biological data sets: 20 imperatives of information design. Poster presented at 2nd IEEE Symposium on Biological Data Visualization (BioVis 2012), Seattle, WA.
& Marra, M. (2012). Getting into visualization of large biological data sets: 20 imperatives of information design. Poster presented at 2nd IEEE Symposium on Biological Data Visualization (BioVis 2012), Seattle, WA. Approaches to encoding min/avg/max values of downsampled data. In the top hi-low trace, the vertical bars are perceived as a separate layer and effectively show variance without obscuring trends in the average.
S., & Marra, M. (2012). Getting into visualization of large biological data sets: 20 imperatives of information design. Poster presented at 2nd IEEE Symposium on Biological Data Visualization (BioVis 2012), Seattle, WA. When drawing the position and size of densely packed genes, encode the gene’s size using a non-linear mapping. When the number of data values is large, such as in the OMIM gene track, hollow glyphs are effective. For even greater number of points, a density map is preferred. chr 1 <10 10-30 30-50 50-100 100-200 >200 size (kb) RAD54L G>A rs121908690 RNASEL C>T rs74315365 EPHB2 SFPQ TPM3 PBX1 PAX7 RBM15 BCL9 PRCC PRRX1 ABL2 LHX4 CDC73 LCK MYCL1 MUTYH TAL1 BCL10 CSF1 CSDE1 ARNT RIT1 NTRK1 TPR PRG4 CANCER CENSUS SNP OMIM 50 100 150 200 Mb
M. (2012). Getting into visualization of large biological data sets: 20 imperatives of information design. Poster presented at 2nd IEEE Symposium on Biological Data Visualization (BioVis 2012), Seattle, WA. 12 54 82 29 25 22 67 61 23 79 ed theme. What is communicated? (A) The raw data imparts no clear message.(B). Binning indicates ranges, not individual values, are important. (C). Frequency distribution suggests that there is a shortage of medium-sized values. (D) Individual data RQKPVUECPDGTGOQXGFVQGORJCUK\GVTGPFCPFUKIPKƁECPEG 0-30 31-60 61-100 30 60 * A B C D 30 60 29 25 23 22 12 54 82 79 67 61