Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fostering the next gen of open science with R
Search
Karthik Ram
December 18, 2013
Science
0
150
Fostering the next gen of open science with R
My talk at the American Geophysical Union meeting in SF, December 2013.
Karthik Ram
December 18, 2013
Tweet
Share
More Decks by Karthik Ram
See All by Karthik Ram
R summit talk
karthik
1
66
Introduction to ggplot2
karthik
11
16k
Other Decks in Science
See All in Science
機械学習 - DBSCAN
trycycle
PRO
0
1.2k
データマイニング - グラフデータと経路
trycycle
PRO
1
240
機械学習 - SVM
trycycle
PRO
1
910
風の力で振れ幅が大きくなる振り子!? 〜タコマナローズ橋はなぜ落ちたのか〜
syotasasaki593876
1
130
[Paper Introduction] From Bytes to Ideas:Language Modeling with Autoregressive U-Nets
haruumiomoto
0
160
AIに仕事を奪われる 最初の医師たちへ
ikora128
0
990
「美は世界を救う」を心理学で実証したい~クラファンを通じた新しい研究方法
jimpe_hitsuwari
1
170
白金鉱業Meetup_Vol.20 効果検証ことはじめ / Introduction to Impact Evaluation
brainpadpr
2
1.3k
Ignite の1年間の軌跡
ktombow
0
170
深層学習を用いた根菜類の個数カウントによる収量推定法の開発
kentaitakura
0
200
デジタルアーカイブの教育利用促進を目指したメタデータLOD基盤に関する研究 / Research on a Metadata LOD Platform for Promoting Educational Uses of Digital Archives
masao
0
110
AI(人工知能)の過去・現在・未来 —AIは人間を超えるのか—
tagtag
0
120
Featured
See All Featured
GitHub's CSS Performance
jonrohan
1032
470k
StorybookのUI Testing Handbookを読んだ
zakiyama
31
6.3k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
234
17k
Agile that works and the tools we love
rasmusluckow
331
21k
Typedesign – Prime Four
hannesfritz
42
2.9k
The Cult of Friendly URLs
andyhume
79
6.7k
Performance Is Good for Brains [We Love Speed 2024]
tammyeverts
12
1.3k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
52
5.7k
A Tale of Four Properties
chriscoyier
161
23k
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
21
1.2k
How to train your dragon (web standard)
notwaldorf
97
6.3k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
48
9.8k
Transcript
OPEN SCIENCE fostering NEXT GEN . . . . .
. . with . . . . . . .
@_inundata
None
None
None
MOST OF THESE DATA ARE NOT EASILY ACCESSIBLE
The data life cycle
IT LEADS TO THIS
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
THE NARRATIVE THE DATA THE CODE A RESEARCH PAPER
Scientists don’t share code or data few incentives
But incentive structures are slowly changing
Instructions for preparation of the Biographical Sketch have been revised
to rename the "Publications" section to "Products" and amend terminology and instructions accordingly. This change makes clear that products may include, but are not limited to, publications, data sets, software, patents, and copyrights. Funders:
community:
community:
Programmatic workflows increase reproducibility
None
glm(y ~ -1 + a + c + z +
a:z, data = mydata, maxit = 30) This is reproducible, repeatable and can serve as a analytic workflow
INCORPORATE OPEN SCIENCE INTO existing workflows
R + open science
Why R? 5,000 + Packages Widely used by environmental scientists
Our mission is to develop tools (mostly around R) to
access scientific data repositories, full-text of articles, and science metrics.
ACQUIRE DATA { 50+ CURRENT OFFERINGS packages ADD METADATA DEPOSIT
DATA
Search over 100K articles rplos
Retrieve data underlying papers in 200+ journals rdryad
Validate results from papers treebase
Past and future climate projections rWBclimate
Map biodiversity data spocc Danaus plexippus
Model species distributions
Deposit data into permanent archives library(figshare) fs_auth() id <- fs_create()
fs_upload(id, r_object) rfigshare
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
CODE/ DATA Acquisition Analysis Documentation Deposition
DATA SHOULD BE machine readable
AUTOMATE BORING TASKS paint drying adding metadata
dryaddat <- download_url("10255/dryad.1759") ! dat <- dryad_getfile(dryaddat) ! eml_write(dat =
dat, meta, title = "Landings Data", description = description, creator = "Karthik Ram <
[email protected]
>", file = "landings.xml") ! ! eml_publish("landings.xml", description = description, categories = "Ecology", tags = "fisheries", destination = "figshare", visibility = "public")
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
None
ropensci.org @ropensci