Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fostering the next gen of open science with R
Search
Karthik Ram
December 18, 2013
Science
0
150
Fostering the next gen of open science with R
My talk at the American Geophysical Union meeting in SF, December 2013.
Karthik Ram
December 18, 2013
Tweet
Share
More Decks by Karthik Ram
See All by Karthik Ram
R summit talk
karthik
1
64
Introduction to ggplot2
karthik
11
16k
Other Decks in Science
See All in Science
データベース10: 拡張実体関連モデル
trycycle
PRO
0
990
データマイニング - グラフ構造の諸指標
trycycle
PRO
0
190
知能とはなにかーヒトとAIのあいだー
tagtag
0
140
mathematics of indirect reciprocity
yohm
1
190
mOrganic™ Holdings, LLC.
hyperlocalnetwork
0
110
傾向スコアによる効果検証 / Propensity Score Analysis and Causal Effect Estimation
ikuma_w
0
140
地質研究者が苦労しながら運用する情報公開システムの実例
naito2000
0
280
データベース14: B+木 & ハッシュ索引
trycycle
PRO
0
470
データベース05: SQL(2/3) 結合質問
trycycle
PRO
0
810
ランサムウェア対策にも考慮したVMware、Hyper-V、Azure、AWS間のリアルタイムレプリケーション「Zerto」を徹底解説
climbteam
0
120
AIによる科学の加速: 各領域での革新と共創の未来
masayamoriofficial
0
150
高校生就活へのDA導入の提案
shunyanoda
0
6k
Featured
See All Featured
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
15
1.7k
Building an army of robots
kneath
306
46k
Scaling GitHub
holman
463
140k
Fireside Chat
paigeccino
40
3.7k
It's Worth the Effort
3n
187
28k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
132
19k
Gamification - CAS2011
davidbonilla
81
5.5k
Raft: Consensus for Rubyists
vanstee
139
7.1k
Mobile First: as difficult as doing things right
swwweet
224
10k
Thoughts on Productivity
jonyablonski
70
4.9k
Designing for Performance
lara
610
69k
Docker and Python
trallard
46
3.6k
Transcript
OPEN SCIENCE fostering NEXT GEN . . . . .
. . with . . . . . . .
@_inundata
None
None
None
MOST OF THESE DATA ARE NOT EASILY ACCESSIBLE
The data life cycle
IT LEADS TO THIS
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
THE NARRATIVE THE DATA THE CODE A RESEARCH PAPER
Scientists don’t share code or data few incentives
But incentive structures are slowly changing
Instructions for preparation of the Biographical Sketch have been revised
to rename the "Publications" section to "Products" and amend terminology and instructions accordingly. This change makes clear that products may include, but are not limited to, publications, data sets, software, patents, and copyrights. Funders:
community:
community:
Programmatic workflows increase reproducibility
None
glm(y ~ -1 + a + c + z +
a:z, data = mydata, maxit = 30) This is reproducible, repeatable and can serve as a analytic workflow
INCORPORATE OPEN SCIENCE INTO existing workflows
R + open science
Why R? 5,000 + Packages Widely used by environmental scientists
Our mission is to develop tools (mostly around R) to
access scientific data repositories, full-text of articles, and science metrics.
ACQUIRE DATA { 50+ CURRENT OFFERINGS packages ADD METADATA DEPOSIT
DATA
Search over 100K articles rplos
Retrieve data underlying papers in 200+ journals rdryad
Validate results from papers treebase
Past and future climate projections rWBclimate
Map biodiversity data spocc Danaus plexippus
Model species distributions
Deposit data into permanent archives library(figshare) fs_auth() id <- fs_create()
fs_upload(id, r_object) rfigshare
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
CODE/ DATA Acquisition Analysis Documentation Deposition
DATA SHOULD BE machine readable
AUTOMATE BORING TASKS paint drying adding metadata
dryaddat <- download_url("10255/dryad.1759") ! dat <- dryad_getfile(dryaddat) ! eml_write(dat =
dat, meta, title = "Landings Data", description = description, creator = "Karthik Ram <
[email protected]
>", file = "landings.xml") ! ! eml_publish("landings.xml", description = description, categories = "Ecology", tags = "fisheries", destination = "figshare", visibility = "public")
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
None
ropensci.org @ropensci