Upgrade to Pro — share decks privately, control downloads, hide ads and more …

SOC 4930 & SOC 5050 - Week 10

SOC 4930 & SOC 5050 - Week 10

Lecture slides for Week 10 of the Saint Louis University Course Quantitative Analysis: Applied Inferential Statistics. These slides cover the topics related to correlation analyses. Handouts created in RMarkdown are also discussed.

Christopher Prener

October 29, 2018
Tweet

More Decks by Christopher Prener

Other Decks in Education

Transcript

  1. Make sure you clone the lecture-10 repo using GitHub Desktop

    so that we can get right in to today’s lecture! WELCOME! GETTING STARTED
  2. AGENDA QUANTITATIVE ANALYSIS / WEEK 10 / LECTURE 10 1.

    Front Matter 2. Public Polling 3. Handouts 4. Covariance 5. Scatterplots 6. Pearson’s R 7. Back Matter
  3. 1. FRONT MATTER ANNOUNCEMENTS How have the ITS related issues

    been? Final project presentation and handout drafts due next Monday! Lab 09 due next Monday! We will not have any additional lecture preps after today. Midterm grades were very strong. Well done!
  4. This section supersedes final project instructions - this is the

    method you should use for creating your handout!
  5. ▸ s2 = variance ▸ x = value of observation

    ▸ x = mean of x ▸ n = sample size 4. COVARIANCE Let: VARIANCE Second Moment
  6. 4. COVARIANCE COVARIANCE ▸ s2 = variance ▸ x =

    value of observation ▸ x = mean of x ▸ y = value of observation ▸ y = mean of y ▸ n = sample size Let:
  7. ▸ English mathematician ▸ Student of Sir Francis Galton (and,

    like Galton, he was a eugenicist and social Darwinist) ▸ Developed the “product-moment correlation coefficient” based on work Galton had done in 19th century. ▸ Also introduced “moments”, histograms, and core concepts related to statistical significance (including the “p-value”)! 6. PEARSON’S R KARL PEARSON
  8. 6. PEARSON’S R CORRELATION ▸ x = value of observation

    ▸ x = mean of x ▸ y = value of observation ▸ y = mean of y ▸ sx = std. deviation of x ▸ sy = std. deviation of y ▸ n = sample size Let:
  9. 6. PEARSON’S R PEARSON’S R 
 Assumptions: 1. Both x

    and y should be continuous, normally distributed variables 2. There should be a linear relationship between x and y 3. Sufficiently large sample size (n >= 30) 4. There should be no extreme outliers
  10. 5. DEPENDENT SAMPLES SAMPLE MEAN z = [ 1, 3,

    4, 16, 18, 19, 22, 36, 52, 64, 81 ] a = [ 2, 2, 4, 10, 16, 21, 28, 36, 52, 64, 81 ] b = [ 2, 3, 3, 5, 6, 10, 14, 15, 17, 22, 219 ]
  11. 6. PEARSON’S R PEARSON’S R 
 Interpretation: 1. Range: -1

    <= r <= 1 2. Direction: negative or positive?
  12. 6. PEARSON’S R PEARSON’S R 
 Interpretation: 1. Range: -1

    <= r <= 1 2. Direction: negative or positive? 3. Effect size (absolute value): •Weak: 0 <= r < .3 •Moderate: .3 <= r < .6 •Strong: .6 <= r < 1
  13. ▸ Squaring r will give us a value of 


    0 <= r2 <= 1 ▸ r2 value corresponds to the amount of variation that x accounts for in y (or vice versa) • This is not casual ▸ A r2 value of .8 suggests that x accounts for 80% of the variance in y 6. PEARSON’S R PERCENT OF VARIANCE
  14. ▸ t = t value ▸ r = correlation coefficient

    ▸ r2 = variance accounted for ▸ v = degrees of freedom (n-2) 6. PEARSON’S R Let: STATISTICAL SIGNIFICANCE Use function from Week-07 to find the p value associated with t.
  15. 6. PEARSON’S R PEARSON’S R 
 Report: 1. The value

    of r and the associated p value 2. The direction of the relationship (positive or negative) 3. The strength of the relationship (weak, moderate, or strong) 4. The r2 value - how much of the variation does x account for in y? 5. A plain English interpretation of the observed relationship.
  16. COMMON PITFALLS ▸ Correlation != causation, but we should not

    dismiss strong correlations as not being suggestions of causality either ▸ r ≈ 0 does not imply independence (check for non-linearity) ▸ Ecological fallacy - correlations between groups are stronger than correlations within groups ▸ Simpson’s Paradox - correlations within groups may have different directions than correlation overall 6. PEARSON’S R
  17. AGENDA REVIEW 7. BACK MATTER 2. Public Polling 3. Handouts

    4. Covariance 5. Scatterplots 6. Pearson’s R
  18. We will not have any additional lecture preps after today.

    Midterm grades were very strong. Well done! REMINDERS 7. BACK MATTER Final project presentation and handout drafts due next Monday! Lab 09 due next Monday!