Upgrade to PRO for Only $50/Year—Limited-Time Offer! 🔥
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fostering the next gen of open science with R
Search
Karthik Ram
December 18, 2013
Science
0
160
Fostering the next gen of open science with R
My talk at the American Geophysical Union meeting in SF, December 2013.
Karthik Ram
December 18, 2013
Tweet
Share
More Decks by Karthik Ram
See All by Karthik Ram
R summit talk
karthik
1
66
Introduction to ggplot2
karthik
11
16k
Other Decks in Science
See All in Science
会社でMLモデルを作るとは @電気通信大学 データアントレプレナーフェロープログラム
yuto16
1
420
データベース06: SQL (3/3) 副問い合わせ
trycycle
PRO
1
700
データベース09: 実体関連モデル上の一貫性制約
trycycle
PRO
0
1k
データマイニング - コミュニティ発見
trycycle
PRO
0
180
【RSJ2025】PAMIQ Core: リアルタイム継続学習のための⾮同期推論・学習フレームワーク
gesonanko
0
400
データベース11: 正規化(1/2) - 望ましくない関係スキーマ
trycycle
PRO
0
1k
風の力で振れ幅が大きくなる振り子!? 〜タコマナローズ橋はなぜ落ちたのか〜
syotasasaki593876
1
160
MCMCのR-hatは分散分析である
moricup
0
520
ランサムウェア対策にも考慮したVMware、Hyper-V、Azure、AWS間のリアルタイムレプリケーション「Zerto」を徹底解説
climbteam
0
170
機械学習 - 決定木からはじめる機械学習
trycycle
PRO
0
1.2k
力学系から見た現代的な機械学習
hanbao
3
3.5k
データマイニング - グラフ構造の諸指標
trycycle
PRO
0
220
Featured
See All Featured
Imperfection Machines: The Place of Print at Facebook
scottboms
269
13k
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
162
15k
Writing Fast Ruby
sferik
630
62k
Statistics for Hackers
jakevdp
799
230k
Building Better People: How to give real-time feedback that sticks.
wjessup
370
20k
GitHub's CSS Performance
jonrohan
1032
470k
The Language of Interfaces
destraynor
162
25k
YesSQL, Process and Tooling at Scale
rocio
174
15k
VelocityConf: Rendering Performance Case Studies
addyosmani
333
24k
Building a Scalable Design System with Sketch
lauravandoore
463
34k
jQuery: Nuts, Bolts and Bling
dougneiner
65
8.2k
The Cult of Friendly URLs
andyhume
79
6.7k
Transcript
OPEN SCIENCE fostering NEXT GEN . . . . .
. . with . . . . . . .
@_inundata
None
None
None
MOST OF THESE DATA ARE NOT EASILY ACCESSIBLE
The data life cycle
IT LEADS TO THIS
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
THE NARRATIVE THE DATA THE CODE A RESEARCH PAPER
Scientists don’t share code or data few incentives
But incentive structures are slowly changing
Instructions for preparation of the Biographical Sketch have been revised
to rename the "Publications" section to "Products" and amend terminology and instructions accordingly. This change makes clear that products may include, but are not limited to, publications, data sets, software, patents, and copyrights. Funders:
community:
community:
Programmatic workflows increase reproducibility
None
glm(y ~ -1 + a + c + z +
a:z, data = mydata, maxit = 30) This is reproducible, repeatable and can serve as a analytic workflow
INCORPORATE OPEN SCIENCE INTO existing workflows
R + open science
Why R? 5,000 + Packages Widely used by environmental scientists
Our mission is to develop tools (mostly around R) to
access scientific data repositories, full-text of articles, and science metrics.
ACQUIRE DATA { 50+ CURRENT OFFERINGS packages ADD METADATA DEPOSIT
DATA
Search over 100K articles rplos
Retrieve data underlying papers in 200+ journals rdryad
Validate results from papers treebase
Past and future climate projections rWBclimate
Map biodiversity data spocc Danaus plexippus
Model species distributions
Deposit data into permanent archives library(figshare) fs_auth() id <- fs_create()
fs_upload(id, r_object) rfigshare
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
CODE/ DATA Acquisition Analysis Documentation Deposition
DATA SHOULD BE machine readable
AUTOMATE BORING TASKS paint drying adding metadata
dryaddat <- download_url("10255/dryad.1759") ! dat <- dryad_getfile(dryaddat) ! eml_write(dat =
dat, meta, title = "Landings Data", description = description, creator = "Karthik Ram <
[email protected]
>", file = "landings.xml") ! ! eml_publish("landings.xml", description = description, categories = "Ecology", tags = "fisheries", destination = "figshare", visibility = "public")
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
None
ropensci.org @ropensci