PSYC 560 EDA Week 1

Introduction to the course PSYC 560: Exploratory Data Analysis and Visualization.


Jamie D. Bedics, PhD, ABPP

September 02, 2020


  1. EDA AND VISUALIZATION: PSY 560. Introduction to the Course. Instructor: Jamie Bedics, PhD, ABPP
  2. WHAT IS R? R is a statistical programming language and computing environment for data management, data visualization, and statistical analysis. It is a popular statistical language for data scientists across a variety of scientific disciplines (Python is a competitor).
  3. Can be a little intimidating.

  4. R STUDIO: R Studio is an integrated development environment (IDE). It provides an appealing visual space for working with R, along with added functionality. R runs through R Studio.
  6. R and R Studio: They are separate. Update them both separately.
  7. R Studio runs using R.

  8. PACKAGES IN R: Your phone comes with several applications out of the box. They serve specific functions and purposes. Are they ever enough?
  9. PACKAGES IN R: Packages are collections of R functions, data, and compiled code in a well-defined format. They allow us to do things in R. R comes with several packages (“applications”) out of the box. They are never enough to do what we need to do in R. We can get more from R’s version of the “app store”, called CRAN (it’s free): install.packages("NAME")
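As a minimal sketch of the idea on this slide, the snippet below lists the packages that ship with R and works out which requested packages still need to come from CRAN. The variable names (`shipped`, `wanted`, `to_install`) and the choice of example packages are assumptions for illustration only; the actual download is left commented out because it needs an internet connection.

```r
# Packages that come with R "out of the box" (base and recommended priority).
shipped <- rownames(installed.packages(priority = c("base", "recommended")))
print(shipped)

# Example packages we might want (illustrative choice, not from the slides).
wanted <- c("stats", "tidyverse")

# Keep only the ones that are not installed yet.
to_install <- setdiff(wanted, rownames(installed.packages()))
print(to_install)

# Uncomment to download the missing packages from CRAN:
# install.packages(to_install)
```

Installing is done once per computer; after that, a package is loaded in each R session with `library()`.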
  10. Packages are developed by users around the world, submitted to CRAN for approval, sent to different “mirror” sites (e.g., universities), and used by you to do some good!

    The “Comprehensive R Archive Network” (CRAN) is a collection of sites which carry identical material, consisting of the R distribution(s), the contributed extensions, documentation for R, and binaries. Other packages are available for download and installation.
  12. The “Comprehensive R Archive Network” (CRAN): Instead of shopping in an “app store”, you learn about different packages by reading books or by attending trainings or courses. We learn several helpful packages from our book, R4DS: ‘ggplot2’ for graphs and ‘dplyr’ for transforming data. There are so many helpful packages in our book that they have been ‘grouped’ into one package called the ‘tidyverse’, which we install early.
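To give a rough feel for what these two packages do, here is a small sketch that uses ‘dplyr’ to transform data and ‘ggplot2’ to graph the result. It assumes the tidyverse has already been installed, and it uses R's built-in `mtcars` data set; the object names `mpg_by_cyl` and `mpg_plot` are made up for this illustration and do not come from the course materials.

```r
# Assumes the tidyverse is installed, e.g. via install.packages("tidyverse").
library(dplyr)    # transforming data
library(ggplot2)  # graphs

# Transform: mean miles per gallon for each cylinder count in mtcars.
mpg_by_cyl <- mtcars %>%
  group_by(cyl) %>%
  summarise(mean_mpg = mean(mpg))

# Graph: one bar per cylinder group.
mpg_plot <- ggplot(mpg_by_cyl, aes(x = factor(cyl), y = mean_mpg)) +
  geom_col() +
  labs(x = "Cylinders", y = "Mean miles per gallon")

print(mpg_plot)
```

Running `library(tidyverse)` instead of the two separate `library()` calls would load both packages at once, along with the rest of the grouped set.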
  15. PSYC562: Regression, ANOVA, T-Test, Chi-Square, Correlation

  18. What is Version Control? Look familiar? final.doc, final(2).doc, final(12-15-17).doc, final(12-15-17)r.doc, final_final.doc. Version control is a type of software that manages, maintains, and stores your code. Like Google Docs but on steroids.
  19. Git was created by Linus Torvalds, the creator of the Linux system. He was fed up with other systems of version control and thought he could make something better in two weeks. He did. He names products after himself, including Linux and now “Git”. Git is the most widely used software for version control.
  20. COMMAND LINE FUNCTIONALITY: Yes, it’s moderately horrifying. Helpful to know some command lines. Mac: Terminal works. PC: would need to download. Thankfully….
  21. GitHub: Web hosting version for Git. All the functionality of Git + more. Great place for collaboration and working on teams or in the classroom!
  22. Git is like the engine for GitHub.


  24. Putting it all together…. PSYC 565 Research Practicum

  25. Adapted from Jenny Bryan, Stat545! Diagram labels: PSY565, PSY 560, Knitr.
  26. Humility; Patience & Perseverance; Humility; Mastery

  27. TRUISM: There’s always more to learn. Be gentle with ourselves. Be gentle towards others. Create an environment that is motivating! Give yourself a lot of time to explore, practice, play, read, watch!
  29. Not So Standard Statistics: Active and vibrant community. Strong social media presence. Multidisciplinary. Welcoming of all levels. Dedicated to the improvement of science and critical thinking across disciplines. Meet and mingle.
  30. Listen while you drive, run, or relax! Podcast: Not So Standard Deviations
  31. Read!

  34. HOMEWORK: Getting Organized

