BeginneR Session - Data Analysis - #86 Tokyo.R 2020.06.27 @kilometer00

Who!? Who is this?

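Who!? Name: Mimura @kilometer Occupation: Postdoc (Doctor of Engineering) Specialty: Behavioral neuroscience (primates), brain imaging, medical systems engineering R experience: about 10 years Current craze: The Three-Body Problem (三体)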

・RStudio ・Readable coding

・RStudio ・Readable coding ・Tidy data ・Tidyverse

Input Output

Integrated Development Environment RStudio https:// Recommended!!

RStudio File > New File > R Script (⌘ + ⇧ + N) Script Editor Console Files, Plots, Pkgs, ... Env., History, Git, ...

RStudio File > New File > R Script (⌘ + ⇧ + N) Script Editor Console Files, Plots, Pkgs, ... Env., History, Git, ... ≈ Run

RStudio Select code, then ⌘ + ↩ ≈ Run

Integrated Development Environment RStudio

Projects RStudio

File > New Project > New Directory > New Project > Create New Project

~/Documents/R/
  my_project/           Project folder (project root folder)
    my_project.Rproj    Workspace settings; open this file to launch RStudio
    .Rproj.user/        Auto-saved working information (unsaved edits, tab order, etc.)

my_project my_project.Rproj ~/Documents/R Project Folder data fig script.R

my_project my_project.Rproj ~/Documents/R Project Folder data fig script.R data_raw data_all.csv dat1.csv dat2.csv dat3.csv

my_project my_project.Rproj ~/Documents/R Project Folder data fig script.R data_raw data_all.csv dat <- read.csv("data/data_all.csv") Relative path from the Project Root dat1.csv dat2.csv dat3.csv
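The relative-path idea on this slide can be sketched as follows. This is only a minimal sketch: it assumes a throwaway folder under `tempdir()` as a stand-in for `my_project`, and the columns `id` and `value` are hypothetical sample data, not taken from the talk.

```r
# Stand-in for the project root (assumption: a temporary folder,
# not a real RStudio project)
project_root <- file.path(tempdir(), "my_project")
dir.create(file.path(project_root, "data"),
           recursive = TRUE, showWarnings = FALSE)

# Hypothetical sample data written to data/data_all.csv
write.csv(data.frame(id = 1:3, value = c(10, 20, 30)),
          file.path(project_root, "data", "data_all.csv"),
          row.names = FALSE)

# Opening my_project.Rproj sets the working directory to the project root;
# setwd() imitates that here and returns the previous directory.
old_wd <- setwd(project_root)

# Relative path, resolved from the project root
dat <- read.csv("data/data_all.csv")

# Restore the previous working directory
setwd(old_wd)
```

Because the path is relative to the project root, the same line keeps working when the whole project folder is moved or shared.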

~/Documents/R project1 project2 project3 project4

Project ≠ sandbox

Sandbox http:// http://

Sandbox A http:// http:// Sandbox B

Sandbox A http:// http:// Sandbox B Isolated & Independent

"Sandbox" in Python
[python]
version = "3.7"
[packages]
cycler==0.10.0
kiwisolver==1.1.0
matplotlib==3.1.1
numpy==1.16.4
opencv-python==
pandas==0.25.0
pyparsing==2.4.0
PypeR==1.1.2
...

[python]
version = "2.7"
[packages]
numpy==1.16.4
...

cf. https://

Sandbox environments?

R & RStudio Projects

https:// If you want,

my_project my_project.Rproj ~/Documents/R "tidy" folder structure data fig script.R data_raw data_all.csv dat1.csv dat2.csv dat3.csv
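A folder layout like the one on this slide could be created from R itself. The sketch below is an illustration under assumptions: it builds the folders under `tempdir()` (the name `my_tidy_project` is hypothetical) so that it does not touch real files, and it creates only an empty placeholder for `script.R`.

```r
# Assumption: built under tempdir() so the sketch does not touch real files
project_root <- file.path(tempdir(), "my_tidy_project")

# Sub-folders named on the slide
for (d in c("data", "data_raw", "fig")) {
  dir.create(file.path(project_root, d),
             recursive = TRUE, showWarnings = FALSE)
}

# Empty placeholder for the analysis script
file.create(file.path(project_root, "script.R"))

# List what was created at the top level of the project root
created <- sort(list.files(project_root))
print(created)
```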

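『アンチ整理術』 (Anti-Organization Techniques), Mori Hiroshi, 2019, Nippon Jitsugyo Publishing

『アンチ整理術』 (Anti-Organization Techniques), Mori Hiroshi, 2019, Nippon Jitsugyo Publishing "The act of sorting and tidying up presupposes a designed policy."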

Loss Symbol grounding problem Divergence Real Info Data Apple Encoding

Exploratory Data Analysis

Exploratory Data Analysis "The Future of Data Analysis" Tukey, J. W., 1962 Three of the main strategies of data analysis are: 1. graphical presentation, 2. provision of flexibility in viewpoint and in facilities, 3. intensive search for parsimony and simplicity. Brillinger, D. R., 2011

Exploratory Data Analysis "The Future of Data Analysis" Tukey, J. W., 1962 "Exploratory Data Analysis" Tukey, J. W., 1970 "Exploratory data analysis isolates patterns and features of the data and reveals these forcefully to the analyst" "'Exploratory data analysis' is an attitude, a state of flexibility, a willingness to look for those things that we believe are not there, as well as those we believe to be there" Brillinger, D. R., 2011
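As a rough illustration of the exploratory attitude quoted above (graphical presentation plus a flexible change of viewpoint), here is a small sketch in base R. It assumes the built-in `iris` dataset as a stand-in for real data; the dataset and the chosen variables are not from the talk.

```r
# Assumption: the built-in iris dataset stands in for real data
data(iris)

# Numeric overview first
summary(iris)

# Graphical presentation: one variable split by group.
# pdf(NULL) keeps the sketch runnable without a display;
# drop pdf(NULL) and dev.off() when working in RStudio.
pdf(NULL)
boxplot(Sepal.Length ~ Species, data = iris)
dev.off()

# A second viewpoint on the same question: group means as a table
group_means <- aggregate(Sepal.Length ~ Species, data = iris, FUN = mean)
print(group_means)
```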

Confirmatory (Hypothesis testing) Exploratory Data analysis

Confirmatory (Hypothesis testing) Exploratory Data analysis Tidy (needless to say) Untidy Need to think about "tidy data" (because it isn't)

Confirmatory (Hypothesis testing) Exploratory Data analysis

Confirmatory (Hypothesis testing) Exploratory Data analysis Generalized procedures Improvisation

Data Pipeline spaghetti code!! spaghetti code? readable coding

Programming Write Run Read Think

Run!!! https://

Programming Write Run Read Think

Programming Write Run Read Think coding style

R coding style guides

The tidyverse style guide https://style.
"Good coding style is like correct punctuation: you can manage without it, butitsuremakesthingseasiertoread."

Google's R Style Guide https://style.
"The goal of the R Programming Style Guide is to make our R code easier to read, share, and verify."
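To make the point about style concrete, the sketch below writes the same computation twice: once cramped, once following conventions from the tidyverse style guide (snake_case names, spaces around operators, `<-` for assignment). The function names `f` and `weighted_mean_complete` and the sample vectors are hypothetical, chosen only for illustration.

```r
# Hard to read: no spacing, cryptic names, everything on one line
f<-function(x,y){z=x[!is.na(x)];sum(z*y[!is.na(x)])/sum(y[!is.na(x)])}

# Same computation, formatted for the reader:
# descriptive snake_case names, spaces around operators, one step per line
weighted_mean_complete <- function(values, weights) {
  is_complete <- !is.na(values)
  sum(values[is_complete] * weights[is_complete]) / sum(weights[is_complete])
}

# Hypothetical sample input with one missing value
values <- c(1, NA, 3)
weights <- c(1, 1, 2)

# Both versions return the same number; only readability differs
all.equal(f(values, weights), weighted_mean_complete(values, weights))
```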

Programming Write Run Read Think Write Run Read Think Share

Text Figure Information Intention Data decode encode feedback Programming

・RStudio ・Readable coding ・Tidy data ・Tidyverse

