Slide 1

Slide 1 text

Bivariate analysis in SPSS SPSS LABORATORY 6

Slide 2

Slide 2 text

1. Calendar 2. Bivariate analysis: quick theory refresher 3. Bivariate analysis in SPSS 4. Exercise INDEX

Slide 3

Slide 3 text

No content

Slide 4

Slide 4 text

Photo: Startup Weekend Hackathon. Nov.2014 SURVEY Do not forget to take the survey by Friday, 18 at midnight. Canvas > Quiz > CMNS201 Survey 2022

Slide 5

Slide 5 text

Photo: Startup Weekend Hackathon. Nov.2014 BIVARIATE ANALYSIS: DEFINITION Bivariate analyses are conducted to determine whether a statistical association exists between two variables. For example, bivariate analyses could be used to answer the question of whether there is an association between income and quality of life. Source: Sandilands D.. (2014) Bivariate Analysis. In: Michalos A.C. (eds) Encyclopedia of Quality of Life and Well-Being Research. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-0753-5_222

Slide 6

Slide 6 text

Photo: Startup Weekend Hackathon. Nov.2014 BIVARIATE ANALYSIS: DEFINITION More specifically, bivariate analysis explores how the dependent variable depends by the independent variable. There are 2 types of relationship between the dependent and independent variable: ■ A direct relationship (also called positive correlation) – that means if the independent variable increases, then the dependent variable would also increase and vice versa. ■ A inverse relationship (negative correlation) – when the independent variable increases and the dependent variable decrease and vice versa.

Slide 7

Slide 7 text

Photo: Startup Weekend Hackathon. Nov.2014 BIVARIATE ANALYSIS: EXAMPLE Average daily temperature Ice cream sales Example of a direct relation. If average daily temperature increases, ice cream sales also increase

Slide 8

Slide 8 text

Photo: Startup Weekend Hackathon. Nov.2014 BIVARIATE ANALYSIS: EXAMPLE Average daily temperature Ramen sales Example of an inverse relation. If average daily temperature increases, ramen sales decrease

Slide 9

Slide 9 text

Photo: Startup Weekend Hackathon. Nov.2014 HOW TO CONDUCT BIVARIATE ANALYSIS IN SPSS ■ Go to Canvas ○ Assignments > SPSS Lab 6 - Bivariate analysis ○ Download the Week-6.sav file ■ Open the file on SPSS ○ Launch SPP ■ File > Open > Data… ■ Find and open the Week-6.sav file

Slide 10

Slide 10 text

Photo: Startup Weekend Hackathon. Nov.2014 LET’S BEGIN BY FORMULATING A HYPOTHESIS SFU Students with Canadian citizenship will tend to have higher GPA because they grew up within the Canadian educational system. On the contrary, SFU students on a study permit will have lower GPA because in addition to entering a new educational system, they also experience disconnection from their families and communities. Null hypothesis: there is no relation between Immigration Status and GPA

Slide 11

Slide 11 text

Photo: Startup Weekend Hackathon. Nov.2014 LET’S BEGIN BY FORMULATING A HYPOTHESIS SFU Students with Canadian citizenship will tend to have higher GPA because they grew up within the Canadian teaching system. On the contrary, SFU students on a study permit will have lower GPA because in addition to entering a new educational system, they also experience disconnection from their families and communities. Immigration status: Independent variable GPA: dependent variable Null hypothesis: there is no relation between Immigration Status and GPA

Slide 12

Slide 12 text

Photo: Startup Weekend Hackathon. Nov.2014 BIVARIATE ANALYSIS IN SPSS

Slide 13

Slide 13 text

Photo: Startup Weekend Hackathon. Nov.2014 SETTING UP YOUR CROSSTAB Independent variable in the Rows Dependent variable in the Columns

Slide 14

Slide 14 text

Photo: Startup Weekend Hackathon. Nov.2014 SETTING UP YOUR CROSSTAB 1. Put Residence_status in the Rows 2. Put GPA in the Columns 3. Click Cells

Slide 15

Slide 15 text

Select Observed and Expected counts Photo: Startup Weekend Hackathon. Nov.2014 SETTING UP YOUR CROSSTAB Select Row Click Continue

Slide 16

Slide 16 text

Photo: Startup Weekend Hackathon. Nov.2014 SETTING UP YOUR CROSSTAB 4. Click Ok

Slide 17

Slide 17 text

Photo: Startup Weekend Hackathon. Nov.2014 HOW TO READ A CROSSTABULATION Independent variable Dependent variable

Slide 18

Slide 18 text

Photo: Startup Weekend Hackathon. Nov.2014 HOW TO READ A CROSSTABULATION Count: is the observed frequency in a particular cell of the crosstabs table. For example, In this case, 18 students with Canadian Citizenship have a GPA of 2.99 or lower. Expected Count: The expected count is the predicted frequency for a cell under the assumption that the null hypothesis is true. In our case, the null hypothesis is that there is no association between Immigration Status and GPA. If the Null hypothesis was true, then we expected 17 students with a GPA of 2.99 or lower.

Slide 19

Slide 19 text

Photo: Startup Weekend Hackathon. Nov.2014 HOW TO READ A CROSSTABULATION *: “significant” is a very broad qualifier. In general, if you see a difference equal or greater of 10%, you can assume there is a relation between the two variables (which can confirm or not your hypothesis). If the difference is small (around 5%), you can assume there is no relation between the two variables. To understand if our initial hypothesis is confirmed or not, we need to compare % by column. To confirm our hypothesis, the % of Canadian students with a GPA greater than 3 should have been significantly* above the % of study permit students with a GPA above 3. This results do not support the hypothesis. Actually show the opposite: there is a relation between Residence_Status and GPA but a relatively larger share of Study permit students (4.4%) compared to Canadian Citizen appear to have a GPA of 3 or higher.

Slide 20

Slide 20 text

Photo: Startup Weekend Hackathon. Nov.2014 CROSSTABULATION AND VARIABLE RECODING Hypothesis: People who sleep less tend to achieve a higher GPA. Null hypothesis: there is no relation between Sleep_hrs and GPA

Slide 21

Slide 21 text

Photo: Startup Weekend Hackathon. Nov.2014 CROSSTABULATION AND VARIABLE RECODING

Slide 22

Slide 22 text

Photo: Startup Weekend Hackathon. Nov.2014 CROSSTABULATION AND VARIABLE RECODING 8 is usually indicated as the ideal number of hours of sleep per night. Based on this information, let’s create 2 segments. In formula: ■ If Sleep_hrs <= 7.5, then Sleep_new = “Restless” ■ If Sleep_hrs => 8, then Sleep_new = “Sleepy”

Slide 23

Slide 23 text

Photo: Startup Weekend Hackathon. Nov.2014 CROSSTABULATION AND VARIABLE RECODING If you do not remember how to recode a variable, check Week 3 slides.

Slide 24

Slide 24 text

Photo: Startup Weekend Hackathon. Nov.2014 1: select the variable to recode 2: type the new variable name and add descriptive label 3: click change 4: click Old and New Values 5: set your range 6: tick the checkbox and define width 15 7: define the new variable value for the range 8: click add and repeat steps 5-7-8 for the second range. 10: click OK 9: once defined all ranges, click Continue

Slide 25

Slide 25 text

Photo: Startup Weekend Hackathon. Nov.2014 CROSSTABULATION AND VARIABLE RECODING

Slide 26

Slide 26 text

Photo: Startup Weekend Hackathon. Nov.2014 CROSSTABULATION AND VARIABLE RECODING The findings do not support the hypothesis. The % of students with a GPA of 3 and above is nearly identical for both Sleepy and Restless students. The difference is just 1.5%!

Slide 27

Slide 27 text

Photo: Startup Weekend Hackathon. Nov.2014 EXERCISE Run a Crosstab for the variables Gender (independent Variable) and GPA (dependent variable). Is there a relation between the two variables? Upload a screenshot of the crosstab on Canvas alongside a one sentence comment about the relation (or lack thereof) between the two variables.

Slide 28

Slide 28 text

THANK YOU Alberto Lusoli [email protected] Office hour: Thursday, 12.30pm - 1.20pm (please book an appointment in advance via email).