Growing your inner data scientist

Mine Cetinkaya-Rundel

September 27, 2021

Transcript

  1. growing your inner data scientist: tips to success in data science, from questions to results
    Mine Çetinkaya-Rundel, Duke University + RStudio
    🔗 bit.ly/grow-ds-21 · mine-cetinkaya-rundel · cetinkaya.mine@gmail.com · @minebocek
  2. professor of statistics, data scientist, book author
    minebocek · mine-cetinkaya-rundel · mine · citizenstatistician
  3. what is data science?

  4. datascience.berkeley.edu/about/what-is-data-science r4ds.had.co.nz/explore-intro.html oreilly.com/library/view/doing-data-science/9781449363871/ch01.html

  5. data science is vague and evolving…

  6. 1 always be curious

  7. keep informed: books, articles, blogs

  8. None
  9. keep current

  10. None
  11. keep engaged: conferences, workshops, meetups, webinars

  12. None
  13. 2 improve your workflow

  14. rstats.wtf

  15. r4ds.had.co.nz

  16. 3 share your output

  17. David Robinson (@drob), rstudio::conf 2019, The Unreasonable Effectiveness of Public Work
    How I used to think of my goals (less valuable → more valuable):
    idea → preliminary results → draft manuscript → completed manuscript → published paper
    How I should have been thinking of them (less valuable → more valuable):
    anything still on your computer (data, code, results, draft, finished paper) → anything out in the world (paper, preprint, product, blog post, open source, tweet)
  18. share the things you create

  19. share the things you create big

  20. datasciencebox.org

  21. None
  22. share the things you create little

  23. None
  24. share the things you learn

  25. Mara Averick (@dataandme), EARL 2017, leaRning out loud
    SOMETIMES I GO ON TWITTER, AND I TEND TO LEARN OUT LOUD
  26. None
  27. None
  28. # March 2019
    library(tidyverse)
    ggplot(mtcars, aes(x = wt, y = mpg)) %>%
      geom_point()
    #> Error: `mapping` must be created by `aes()`
    #> Did you use %>% instead of +?
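
A minimal sketch of the corrected call from slide 28, assuming the intent was a scatterplot of `mpg` against `wt`. ggplot2 layers are combined with `+`; piping with `%>%` hands the plot object to `geom_point()` as its first argument, `mapping`, which is why the error mentions `aes()`. Loading ggplot2 on its own rather than the whole tidyverse, and the variable name `p`, are choices made for this illustration and are not on the slide.

```r
library(ggplot2)

# Layers are added to a plot with `+`, not piped in with %>%
p <- ggplot(mtcars, aes(x = wt, y = mpg)) +
  geom_point()

# Printing `p` at the console (or calling print(p)) draws the scatterplot.
```
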
  29. share your questions

  30. Thiago Macieira, “The Art of Problem Solving.” In Open Advice: FOSS: What We Wish We Had Known When We Started, edited by Lydia Pintscher, 55–61.
    THE MOST USELESS PROBLEM STATEMENT THAT ONE CAN FACE IS “IT DOESN’T WORK”, YET WE SEEM TO GET IT FAR TOO OFTEN.
  31. TEN SIMPLE RULES FOR GETTING HELP FROM ONLINE SCIENTIFIC COMMUNITIES
    1. Don’t be afraid to ask a question
    2. State the question clearly
    3. Learn established customs before posting
    4. Don’t ask what has already been answered
    5. Always use a good title
    6. Do your homework before posting
    7. Proofread your post
    8. Be courteous to other forum members
    9. Remember that the archive of your question can be helpful to others
    10. Give back to the community
    Dall’Olio, Giovanni M., Jacopo Marino, Michael Schubert, Kevin L. Keys, Melanie I. Stefan, Colin S. Gillespie, Pierre Poulain, et al. 2011. “Ten Simple Rules for Getting Help from Online Scientific Communities.” PLoS Computational Biology 7 (9): 10–12. doi:10.1371/journal.pcbi.1002202.
  32. suppose…
    # Goal: "1 a" "2 b" "3 c" "4 d" "5 e"
  33. I’m trying to create the following vector in R: "1 a" "2 b" "3 c" "4 d" "5 e" So I define X to be 1:5 and Y to be the first 5 letters of the alphabet, but when I add them I get the following error. 🤷 Q
    Error in x + y : non-numeric argument to binary operator
  34. I’m trying to create the following vector in R: "1 a" "2 b" "3 c" "4 d" "5 e" Below is a screenshot of what I tried. Why is it not working? 🤷 Q
  35. writing good questions
    library(reprex)
    Prepare reproducible examples for posting to GitHub issues, StackOverflow, or Slack snippets.
  36. I’m trying to create the following vector in R: "1 a" "2 b" "3 c" "4 d" "5 e" Below is what I tried. What does this error mean, and how can I fix it? 🤷 Q
    x <- 1:5
    y <- letters[1:5]
    x + y
    #> Error in x + y: non-numeric argument to binary operator
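
One possible answer to the question on slide 36, offered as a sketch and not as part of the talk: `+` is defined only for numeric (and similar) types, so it fails when `y` is a character vector. Base R's `paste()` converts both inputs to character and joins them element by element, with a single space as the default separator. The name `result` is introduced here for illustration.

```r
x <- 1:5
y <- letters[1:5]

# paste() coerces x to character and joins it with y element-wise;
# the default separator is a single space.
result <- paste(x, y)
print(result)
#> [1] "1 a" "2 b" "3 c" "4 d" "5 e"
```

If no space were wanted between the two parts, `paste0(x, y)` or `paste(x, y, sep = "")` would be the equivalent calls.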
  37. 4 contribute to community

  38. find open source projects you enjoy, and start contributing

  39. contribute to books

  40. None
  41. None
  42. contribute to packages

  43. None
  44. contributing to oss
    get the pulse of a project
    read the code
    watch the repo
    discuss your ideas
    make a pull request
    review CoC + contributing guide
  45. 5 collaborate with others

  46. collaborate on process

  47. None
  48. collaborate in class

  49. blog post, portfolio entry, competition submission, …
  50. collaborate outside class

  51. John M. Chambers Statistical Software Award 🔗 stat-computing.org/awards/jmc
    ASA StatComp Student Paper Competition 🔗 stat-computing.org/awards/student
    Kaggle: Prediction competition 🔗 kaggle.com/competitions
  52. 6 broadcast your work

  53. make data visualizations

  54. 🗓 Every Tuesday 🔗 github.com/rfordatascience/tidytuesday 🐦 #TidyTuesday

  55. speak at events

  56. None
  57. write blog posts

  58. bookdown.org/yihui/blogdown apreshill.com/blog/2020-12-new-year-new-blogdown

  59. keeping a blog alive
    find co-authors
    keep it regular
    write themed posts
    review events
  60. 1 always be curious
    2 improve your workflow
    3 share your output
    4 contribute to community
    5 collaborate with others
    6 broadcast your work
  61. None
  62. growing your inner data scientist
    Mine Çetinkaya-Rundel, Duke University + RStudio
    🔗 bit.ly/grow-ds-21 · mine-cetinkaya-rundel · cetinkaya.mine@gmail.com · @minebocek