R Based tools for open and collaborative science
Scott Chamberlain
July 26, 2013
Ignite talk at ESA on R-based tools for open and collaborative science
Transcript
R-based tools for open and collaborative science
Scott Chamberlain (@recology_)
Science needs to be more open
http://everyoneknowsbest.files.wordpress.com/2008/08/bodysculpture.jpg We build on the knowledge of others
Fewer mistakes. More things can happen because data is open.
http://everyoneknowsbest.files.wordpress.com/2008/08/bodysculpture.jpg The public paid for it!
http://www.fotopedia.com/items/flickr-4796633039 But we need tools to do it!!!!!
What kinds of tools? Not these
These!!!!!!
What does an ecologist do?
• Collect data
• Manipulate data
• Visualize
• Analyze
• Write
R is a good solution
Why?
• R is open source = free + rapid change
• R = entire workflow in 1 place
• R = reproducible science
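The "reproducible science" point can be illustrated with a minimal base-R sketch. The data are simulated purely for illustration and the variable names are assumptions, not part of the talk. Fixing the random seed and recording the session details means anyone rerunning the script should get the same numbers.

```r
# Fix the random seed so the simulated values are identical on every run.
set.seed(42)

# Hypothetical measurements, simulated purely for illustration.
x <- rnorm(20)
y <- 2 * x + rnorm(20, sd = 0.5)

# Fit a model; rerunning this script reproduces the same coefficients.
fit <- lm(y ~ x)
print(coef(fit))

# Record the R version and attached packages alongside the results.
print(sessionInfo())
```

Keeping the seed, the analysis, and the session record in one script is one simple way to let collaborators check a result end to end.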
Get some data from the web
    library(RCurl); library(RJSONIO)
    # GitHub's API rejects requests that lack a User-Agent header
    dat <- fromJSON(getURL("https://api.github.com/users/hadley/repos", useragent = "RCurl"))
Manipulate the data
    library(plyr); library(reshape2)
    dat_melt <- melt(ldply(dat, function(x) data.frame(x[names(x) %in% c("name", "watchers_count", "forks")])))
Run some statistical model
    lm(value ~ variable, data = dat_melt)
Visualize results
    library(ggplot2)
    ggplot(dat_melt, aes(name, value, colour = variable)) + geom_point() + coord_flip()
Write the paper
    # Introduction...
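For readers who want to try the manipulate and analyze steps without network access or extra packages, here is a rough base-R sketch of the same idea. The repository names and counts are invented for illustration, and base `reshape()` stands in for the `melt()` call used on the slide.

```r
# Hypothetical repository statistics, invented for illustration only.
repos <- data.frame(
  name = c("repo_a", "repo_b", "repo_c"),
  watchers_count = c(120, 45, 300),
  forks = c(30, 10, 80),
  stringsAsFactors = FALSE
)

# Manipulate: reshape from wide to long, one row per (name, variable) pair.
long <- reshape(
  repos,
  direction = "long",
  varying = c("watchers_count", "forks"),
  v.names = "value",
  timevar = "variable",
  times = c("watchers_count", "forks"),
  idvar = "name"
)
rownames(long) <- NULL

# Analyze: compare the mean value between the two variables.
fit <- lm(value ~ variable, data = long)
print(long)
print(coef(fit))
```

The long layout (one measurement per row) is what both `lm()` and `ggplot()` expect, which is why the reshaping step sits between collecting and modelling.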
Data increasingly on the web
The toolbelt
Literature
    library(rplos); library(ggplot2)
    plot_throughtime('phylogeny', 300) + geom_line(size=2)
Taxonomy
    library(taxize)
    classification("Abies procera", db = "itis")

    rankName       taxonName        tsn
    Kingdom        Plantae          202422
    Subkingdom     Viridaeplantae   846492
    Infrakingdom   Streptophyta     846494
    Division       Tracheophyta     846496
    Subdivision    Spermatophytina  846504
    Infradivision  Gymnospermae     846506
    Class          Pinopsida        500009
    Order          Pinales          500028
    Family         Pinaceae         18030
    Genus          Abies            18031
    Species        Abies procera    181835
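Once a classification like this is in hand, it can be treated as an ordinary data frame. The sketch below rebuilds the slide's output by hand in base R, so no call to taxize or ITIS is made. The helper `taxon_at_rank()` is hypothetical, written here for illustration, and is not a taxize function.

```r
# Rebuild the classification shown on the slide as a data frame.
abies <- data.frame(
  rankName = c("Kingdom", "Subkingdom", "Infrakingdom", "Division",
               "Subdivision", "Infradivision", "Class", "Order",
               "Family", "Genus", "Species"),
  taxonName = c("Plantae", "Viridaeplantae", "Streptophyta", "Tracheophyta",
                "Spermatophytina", "Gymnospermae", "Pinopsida", "Pinales",
                "Pinaceae", "Abies", "Abies procera"),
  tsn = c(202422, 846492, 846494, 846496, 846504, 846506,
          500009, 500028, 18030, 18031, 181835),
  stringsAsFactors = FALSE
)

# Hypothetical helper (not part of taxize): taxon name at a given rank.
taxon_at_rank <- function(classification, rank) {
  classification$taxonName[classification$rankName == rank]
}

print(taxon_at_rank(abies, "Family"))
print(taxon_at_rank(abies, "Order"))
```

Pulling out a single rank this way is handy when grouping a species list by family or order before analysis.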
Species occurrences from GBIF
    library(rgbif)
    splist <- c('Accipiter erythronemius', 'Junco hyemalis', 'Aix sponsa', 'Podiceps cristatus')
    out <- occurrencelist_many(splist)
    gbifmap_list(out)
Occurrences from USGS’s BISON service
    library(rbison)
    out <- bison(species="Helianthus annuus", count=500)
    bisonmap(input=out, tomap="county")
Climate data from the World Bank
    library(rWBclimate); library(ggplot2)
    country.list <- c("USA", "MEX")
    country.dat <- get_historical_temp(country.list, "year")
    # One panel per country, each with its own axis scales
    ggplot(country.dat, aes(x = year, y = data, group = locator)) +
      geom_point() + geom_path() +
      labs(y = "Average annual temperature", x = "Year") +
      theme_bw() +
      stat_smooth(se = FALSE, colour = "black") +
      facet_wrap(~locator, scales = "free")
Take action!
FontAwesome: http://fortawesome.github.io/Font-Awesome/
fontawesome 2 png: https://github.com/odyniec/font-awesome-to-png
Presentation available here: http://bit.ly/16tuVbu