Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
R Based tools for open and collaborative science
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Scott Chamberlain
July 26, 2013
Science
0
1.4k
R Based tools for open and collaborative science
Ignite talk at ESA on R-based tools for open and collaborative science
Scott Chamberlain
July 26, 2013
Tweet
Share
More Decks by Scott Chamberlain
See All by Scott Chamberlain
Contribution of traits, phenology, & phylogenetic history to plant-pollinator network structure
myrmecocystus
0
84
Programmatic access for Altmetrics
myrmecocystus
1
110
Contribution of traits and phylogenetic history to plant-pollinator network
myrmecocystus
0
110
Other Decks in Science
See All in Science
機械学習 - ニューラルネットワーク入門
trycycle
PRO
0
960
データベース11: 正規化(1/2) - 望ましくない関係スキーマ
trycycle
PRO
0
1.1k
AIによる科学の加速: 各領域での革新と共創の未来
masayamoriofficial
0
460
[Paper Introduction] From Bytes to Ideas:Language Modeling with Autoregressive U-Nets
haruumiomoto
0
220
やるべきときにMLをやる AIエージェント開発
fufufukakaka
2
1.2k
20251212_LT忘年会_データサイエンス枠_新川.pdf
shinpsan
0
250
データベース08: 実体関連モデルとは?
trycycle
PRO
0
1k
俺たちは本当に分かり合えるのか? ~ PdMとスクラムチームの “ずれ” を科学する
bonotake
2
2k
KISHIMOTO Atsuo
genomethica
0
110
【RSJ2025】PAMIQ Core: リアルタイム継続学習のための⾮同期推論・学習フレームワーク
gesonanko
0
690
データマイニング - ノードの中心性
trycycle
PRO
0
350
Text-to-SQLの既存の評価指標を問い直す
gotalab555
1
180
Featured
See All Featured
Making the Leap to Tech Lead
cromwellryan
135
9.8k
Breaking role norms: Why Content Design is so much more than writing copy - Taylor Woolridge
uxyall
0
200
How to Talk to Developers About Accessibility
jct
2
150
Imperfection Machines: The Place of Print at Facebook
scottboms
269
14k
It's Worth the Effort
3n
188
29k
The Illustrated Guide to Node.js - THAT Conference 2024
reverentgeek
1
300
Automating Front-end Workflow
addyosmani
1370
200k
The Power of CSS Pseudo Elements
geoffreycrofte
82
6.2k
Future Trends and Review - Lecture 12 - Web Technologies (1019888BNR)
signer
PRO
0
3.3k
Building Better People: How to give real-time feedback that sticks.
wjessup
370
20k
Noah Learner - AI + Me: how we built a GSC Bulk Export data pipeline
techseoconnect
PRO
0
130
Max Prin - Stacking Signals: How International SEO Comes Together (And Falls Apart)
techseoconnect
PRO
0
110
Transcript
R-based tools for open and collaborative science @recology_ Scott Chamberlain
Science needs to be more open
http://everyoneknowsbest.files.wordpress.com/2008/08/bodysculpture.jpg We build on the knowledge of others http://everyoneknowsbest.files.wordpress.com/2008/08/bodysculpture.jpg
Less mistakes More things can happen b/c data is open
http://everyoneknowsbest.files.wordpress.com/2008/08/bodysculpture.jpg The public paid for it!
http://www.fotopedia.com/items/flickr-4796633039 But we need tools to do it!!!!!
What kinds of tools? Not these
These!!!!!!
• Collect data • Manipulate data • Visualize • Analyze
• Write What does an ecologist do?
R is a good solution
• R is Open source = Free + Rapid change
• R = entire workflow in 1 place • R = reproducible science Why?
Get some data from the web library(RCurl); library(RJSONIO) dat <-
fromJSON(getURL("https://api.github.com/users/hadley/repos")) Manipulate the data library(plyr); library(reshape2) dat_melt <- melt(ldply(dat, function(x) data.frame(x[names(x) %in% c("name","watchers_count","forks")]))) Run some statistical model lm(value ~ variable, data = dat_melt) Visualize results library(ggplot2) ggplot(dat_melt, aes(name, value, colour = variable)) + geom_point() + coord_flip() Write the paper # Introduction...
Data increasingly on the web
The toolbelt
Literature library(rplos) plot_throughtime('phylogeny', 300) + geom_line(size=2)
Taxonomy library(taxize) classification("Abies procera", db = "itis") rankName taxonName
tsn Kingdom Plantae 202422 Subkingdom Viridaeplantae 846492 Infrakingdom Streptophyta 846494 Division Tracheophyta 846496 Subdivision Spermatophytina 846504 Infradivision Gymnospermae 846506 Class Pinopsida 500009 Order Pinales 500028 Family Pinaceae 18030 Genus Abies 18031 Species Abies procera 181835
Species occurrences from GBIF library(rgbif) splist <- c('Accipiter erythronemius', 'Junco
hyemalis', 'Aix sponsa', 'Podiceps cristatus') out <- occurrencelist_many(splist) gbifmap_list(out)
Occurrence from USGS’s BISON service library(rbison) out <- bison(species="Helianthus annuus",
count=500) bisonmap(input=out, tomap="county")
Climate data from the World Bank library(rWBclimate) country.list <- c("USA",
"MEX") country.dat <- get_historical_temp(country.list, "year") ggplot(country.dat, aes(x = year, y = data, group = locator)) + geom_point() + geom_path() + labs(y="Average annual temperature of Canada", x="Year") + theme_bw() + stat_smooth(se = F, colour = "black") + facet_wrap(~locator, scale = "free")
Take action! FontAwesome http://fortawesome.github.io/Font-Awesome/ fontawesome 2 png https://github.com/odyniec/font-awesome-to-png Presentation available
here: http://bit.ly/16tuVbu