Introduction to Data Science, case-by-case

mine-cetinkaya-rundel [email protected] @minebocek mine çetinkaya-rundel duke university + rstudio intro
to data science: case by case bit.ly/enar19-cases

FIRST YEAR Introductory courses: survey of methods / tools /
approaches SENIOR Capstone / case studies SOPHOMORE Intermediary courses: deeper look at fundamentals JUNIOR Applied and theoretical electives bit.ly/enar19-cases typical undergraduate curriculum

QHow can we design an introductory curriculum around case studies?
bit.ly/enar19-cases

APPLICATION FIRST BATCH OF CONNECTED LEARNING GOALS DESIGNED AROUND CASE
STUDIES learning modules bit.ly/enar19-cases

QWhy should we consider designing an introductory curriculum around case
studies? bit.ly/enar19-cases

illustrate process provide motivation dem onstrate am biguity

Q Which of the following is more likely to be
motivating for a wide range of students? bit.ly/enar19-cases

option 1: ✓ Topic: Web scraping & mapping ✓ What
will we learn about? ✓ rvest: A new package for harvesting data oﬀ the web ✓ regular expressions ✓ ggplot2’s mapping features ✓ Functions and automation bit.ly/enar19-cases

option 2: ✓ Case study: Money in Politics regex fn
bit.ly/enar19-cases

Money in politics OPENSECRETS.ORG bit.ly/enar19-cases

students will encounter lots of new challenges along the way
— let that happen, and then provide a solution bit.ly/enar19-cases

Lesson: Web scraping essentials for turning a structured table into
a data frame in R. bit.ly/enar19-cases

a data frame in R. Ex 1: Scrape the table oﬀ the web and save as a data frame. bit.ly/enar19-cases

a data frame in R. Ex 1: Scrape the table oﬀ the web and save as a data frame. Ex 2: What other information do we need represented as variables in the data to obtain the desired facets? bit.ly/enar19-cases

a data frame in R. Ex 1: Scrape the table oﬀ the web and save as a data frame. Ex 2: What other information do we need represented as variables in the data to obtain the desired facets? Lesson: “Just enough” string parsing and regular expressions to go from to bit.ly/enar19-cases

bit.ly/enar19-cases

FIND A DATASET PIN DOWN A RESEARCH QUESTION IDENTIFY METHODS
AND TECHNIQUES PERFORM ANALYSIS WRITE REPORT / PREPARE PRESENTATION how we might expect students to approach a final project… bit.ly/enar19-cases

FIND A DATASET PIN DOWN A RESEARCH QUESTION IDENTIFY METHODS
AND TECHNIQUES PERFORM ANALYSIS WRITE REPORT / PREPARE PRESENTATION …how students might approach a final project bit.ly/enar19-cases

Poverty index HOUSEHOLD ASSET HOLDINGS + DEMOGRAPHICS FROM VARIOUS COUNTRIES
bit.ly/enar19-cases

there is no one clear answer, allow students to brainstorm
approaches and take them through your (expert) reasoning for what might / might not work bit.ly/enar19-cases

ASA DataFest is a data analysis competition where teams of
up to five students attack a large, complex, and surprise dataset over a weekend. inspiration… bit.ly/enar19-cases

Paris Paintings PAINTING AUCTION DATA 1764 - 1780 bit.ly/enar19-cases

introduce students to the formulation of research questions, and help
them understand what questions can (and cannot) be answered with a given dataset bit.ly/enar19-cases

Two paintings very rich in composition, of a beautiful execution,
and whose merit is very remarkable, each 17 inches 3 lines high, 23 inches wide; the ﬁrst, painted on wood, comes from the Cabinet of Madame la Comtesse de Verrue; it represents a departure for the hunt: it shows in the front a child on a white horse, a man who gives the horn to gather the dogs, a falconer and other ﬁgures nicely distributed across the width of the painting; two horses drinking from a fountain; on the right in the corner a lovely country house topped by a terrace, on which people are at the table, others who play instruments; trees and fabriques pleasantly enrich the background. bit.ly/enar19-cases

bit.ly/enar19-cases

QWhat are resources for case studies for an introductory curriculum?
bit.ly/enar19-cases

data expeditions PAIR OF GRAD STUDENTS, WORK WITH COURSE INSTRUCTOR
TO FORMULATE A QUESTION, + A PATHWAY THROUGH A DATASET TO EXPLORE THE QUESTION ELEMENT OF AN UNDERGRADUATE COURSE THAT INTRODUCES STUDENTS TO EXPLORATORY DATA ANALYSIS GRADUATE STUDENT PARTICIPANTS RECEIVE A TRAVEL GRANT bit.ly/enar19-cases

datasciencebox.org bit.ly/enar19-cases

Visualizing data Wrangling data Making rigorous conclusions Looking forward Fundamentals
of data & data viz, confounding variables, Simpson’s paradox + R / RStudio, R Markdown, simple git Tidy data, data frames vs. summary tables, recoding and transforming, web scraping and iteration + collaboration on GitHub Building & selecting models, visualizing interactions, prediction & validation, inference via simulation Data science ethics, interactive viz & reporting, text analysis, Bayesian inference + communication, dissemination bit.ly/enar19-cases

mine-cetinkaya-rundel [email protected] @minebocek mine çetinkaya-rundel duke university + rstudio intro
to data science: case by case bit.ly/enar19-cases

Introduction to Data Science, case-by-case

Introduction to Data Science, case-by-case

Mine Cetinkaya-Rundel

More Decks by Mine Cetinkaya-Rundel

Other Decks in Education

Featured

Transcript

mine-cetinkaya-rundel [email protected] @minebocek mine çetinkaya-rundel duke university + rstudio intro

FIRST YEAR Introductory courses: survey of methods / tools /

QHow can we design an introductory curriculum around case studies?

APPLICATION FIRST BATCH OF CONNECTED LEARNING GOALS DESIGNED AROUND CASE

QWhy should we consider designing an introductory curriculum around case

illustrate process provide motivation dem onstrate am biguity

Q Which of the following is more likely to be

option 1: ✓ Topic: Web scraping & mapping ✓ What

option 2: ✓ Case study: Money in Politics regex fn

Money in politics OPENSECRETS.ORG bit.ly/enar19-cases

students will encounter lots of new challenges along the way

Lesson: Web scraping essentials for turning a structured table into

Lesson: Web scraping essentials for turning a structured table into

Lesson: Web scraping essentials for turning a structured table into

Lesson: Web scraping essentials for turning a structured table into

illustrate process provide motivation dem onstrate am biguity

bit.ly/enar19-cases

FIND A DATASET PIN DOWN A RESEARCH QUESTION IDENTIFY METHODS

FIND A DATASET PIN DOWN A RESEARCH QUESTION IDENTIFY METHODS

Poverty index HOUSEHOLD ASSET HOLDINGS + DEMOGRAPHICS FROM VARIOUS COUNTRIES

there is no one clear answer, allow students to brainstorm

illustrate process provide motivation dem onstrate am biguity

ASA DataFest is a data analysis competition where teams of

Paris Paintings PAINTING AUCTION DATA 1764 - 1780 bit.ly/enar19-cases

introduce students to the formulation of research questions, and help

Two paintings very rich in composition, of a beautiful execution,

bit.ly/enar19-cases

QWhat are resources for case studies for an introductory curriculum?

data expeditions PAIR OF GRAD STUDENTS, WORK WITH COURSE INSTRUCTOR

datasciencebox.org bit.ly/enar19-cases

Visualizing data Wrangling data Making rigorous conclusions Looking forward Fundamentals

mine-cetinkaya-rundel [email protected] @minebocek mine çetinkaya-rundel duke university + rstudio intro