Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Fostering the next gen of open science with R
Search
Karthik Ram
December 18, 2013
Science
0
140
Fostering the next gen of open science with R
My talk at the American Geophysical Union meeting in SF, December 2013.
Karthik Ram
December 18, 2013
Tweet
Share
More Decks by Karthik Ram
See All by Karthik Ram
R summit talk
karthik
1
58
Introduction to ggplot2
karthik
11
16k
Other Decks in Science
See All in Science
白金鉱業Meetup Vol.16_【初学者向け発表】 数理最適化のはじめの一歩 〜身近な問題で学ぶ最適化の面白さ〜
brainpadpr
10
2k
Improving Search @scale with efficient query experimentation @BerlinBuzzwords 2024
searchhub
0
270
(論文読み)贈り物の交換による地位の競争と社会構造の変化 - 文化人類学への統計物理学的アプローチ -
__ymgc__
1
180
The Incredible Machine: Developer Productivity and the Impact of AI
tomzimmermann
0
560
Design of three-dimensional binary manipulators for pick-and-place task avoiding obstacles (IECON2024)
konakalab
0
130
Reconciling Accuracy, Cost, and Latency of Inference Serving Systems
pjamshidi
0
120
地表面抽出の方法であるSMRFについて紹介
kentaitakura
1
350
メール送信サーバの集約における透過型SMTP プロキシの定量評価 / Quantitative Evaluation of Transparent SMTP Proxy in Email Sending Server Aggregation
linyows
0
680
マテリアルズ・インフォマティクスの先端で起きていること / What's Happening at the Cutting Edge of Materials Informatics
snhryt
1
190
Tensor Representations in Signal Processing and Machine Learning (Tutorial at APSIPA-ASC 2020)
yokotatsuya
0
120
SciPyDataJapan 2025
schwalbe10
0
140
Planted Clique Conjectures are Equivalent
nobushimi
0
120
Featured
See All Featured
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
44
7k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
21
2.5k
Statistics for Hackers
jakevdp
797
220k
RailsConf 2023
tenderlove
29
1k
Side Projects
sachag
452
42k
Building Better People: How to give real-time feedback that sticks.
wjessup
367
19k
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
3.7k
Music & Morning Musume
bryan
46
6.4k
Navigating Team Friction
lara
183
15k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
280
13k
Site-Speed That Sticks
csswizardry
4
410
Faster Mobile Websites
deanohume
306
31k
Transcript
OPEN SCIENCE fostering NEXT GEN . . . . .
. . with . . . . . . .
@_inundata
None
None
None
MOST OF THESE DATA ARE NOT EASILY ACCESSIBLE
The data life cycle
IT LEADS TO THIS
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
THE NARRATIVE THE DATA THE CODE A RESEARCH PAPER
Scientists don’t share code or data few incentives
But incentive structures are slowly changing
Instructions for preparation of the Biographical Sketch have been revised
to rename the "Publications" section to "Products" and amend terminology and instructions accordingly. This change makes clear that products may include, but are not limited to, publications, data sets, software, patents, and copyrights. Funders:
community:
community:
Programmatic workflows increase reproducibility
None
glm(y ~ -1 + a + c + z +
a:z, data = mydata, maxit = 30) This is reproducible, repeatable and can serve as a analytic workflow
INCORPORATE OPEN SCIENCE INTO existing workflows
R + open science
Why R? 5,000 + Packages Widely used by environmental scientists
Our mission is to develop tools (mostly around R) to
access scientific data repositories, full-text of articles, and science metrics.
ACQUIRE DATA { 50+ CURRENT OFFERINGS packages ADD METADATA DEPOSIT
DATA
Search over 100K articles rplos
Retrieve data underlying papers in 200+ journals rdryad
Validate results from papers treebase
Past and future climate projections rWBclimate
Map biodiversity data spocc Danaus plexippus
Model species distributions
Deposit data into permanent archives library(figshare) fs_auth() id <- fs_create()
fs_upload(id, r_object) rfigshare
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
CODE/ DATA Acquisition Analysis Documentation Deposition
DATA SHOULD BE machine readable
AUTOMATE BORING TASKS paint drying adding metadata
dryaddat <- download_url("10255/dryad.1759") ! dat <- dryad_getfile(dryaddat) ! eml_write(dat =
dat, meta, title = "Landings Data", description = description, creator = "Karthik Ram <karthik@ropensci.org>", file = "landings.xml") ! ! eml_publish("landings.xml", description = description, categories = "Ecology", tags = "fisheries", destination = "figshare", visibility = "public")
A RESEARCH PAPER THE NARRATIVE THE DATA THE CODE
None
ropensci.org @ropensci