We are all statisticians now

4bd13719da0ba2c5bd2a446e14f78187?s=47 Jeff L.
May 29, 2014

We are all statisticians now

Talk at the Data Intensive Biology Conference in the Johns Hopkins Department of Biology 2014.

4bd13719da0ba2c5bd2a446e14f78187?s=128

Jeff L.

May 29, 2014
Tweet

Transcript

  1. 2.
  2. 9.
  3. 11.
  4. 14.
  5. 15.

    from: jtleek@gmail.com Roger let me know you gave him a

    ballpark figure for the number of students registered for his course "Computing for Data Analysis”. Could you give me an idea of how many have registered for my course "Data Analysis?”    
  6. 16.

    from: pangwei@coursera.org Hi Jeff, 7,000 students! It's pretty awesome. (You'll

    be able to check this out yourself next week, once the class sites are up.)  
  7. 21.
  8. 22.
  9. 23.
  10. 27.

    what went wrong? expertise They used silly prediction rules (Pr(FEC)

     =  5/8[Pr(F)  +  Pr(E)  +  Pr(C)]  –  ¼)  
  11. 29.

    what went wrong? expertise Their predictions weren’t locked down Today:

     Pr(FEC)  =  0.8   Tomorrow:  Pr(FEC)  =  0.1    
  12. 30.

    At the end of the day the Potti analysis was

    fully reproducible The problem is that the analysis was wrong
  13. 31.
  14. 32.
  15. 33.
  16. 39.
  17. 40.
  18. 42.

    Yes, we are witnessing the birth of Yet another “pet

    bioinformatician”. What I mean by this term is a single bioinformatician employed within a laboratory based group. hJp://biomickwatson.wordpress.com/2013/04/23/a-­‐guide-­‐for-­‐the-­‐lonely-­‐bioinforma[cian/   “ ”
  19. 44.

    One year of biology One year of physics One year

    of English Two years of chemistry (through organic chemistry) https://www.aamc.org/students/applying/requirements/ med school entrance requirements
  20. 46.

    I  am  a  postdoctoral  fellow  in  redacted  group   I

     collected  data  on  redacted   …     Preliminary  analysis  has  pulled  out  some  interes[ng   things  but  we  need  some  professional  assistance   …     We  want  to  submit  at  the  end  of  next  month.   To: jtleek@gmail.com
  21. 49.
  22. 50.
  23. 51.
  24. 52.
  25. 53.
  26. 54.
  27. 55.
  28. 57.
  29. 59.

    1. Statistical thinking is (often) an afterthought 2. Most data

    analysts are untrained   3. Statistics is not math (and data analysis isn’t statistics)   4. How do we balance skepticism & excitement?