and maintain a centralized resource for expert bioinformatics consulting & data analysis and to help collaborators fund & publish their work - 1. Service - 2. Training - 3. Infrastructure building 5
disparate data types? - See “data integration” talk at stephenturner.us/slides • New technologies: how to best support new and emerging technologies? - See “new technologies” talk at stephenturner.us/slides • Transparency & reproducibility ! • Training 10
R, RStudio, knitr: Markdown + embedded R code » HTML/PDF report - IPython notebook • Galaxy - Web-based bioinformatics toolkit - Tracks history, versions, parameters, data • Wiki - Version controlled place to code, scripts, data, and results used for client projects • Training 12
basic computing skills to scientists - Core curriculum: ‣ Basic programming ‣ Version control ‣ Automation ‣ Testing - Two-day bootcamps - Coming soon: train-the-trainer program • Workshops (bioconnector.github.io/workshops) - All course material on GitHub - All R-related materials compiled as RMarkdown dynamic document - Courses: ‣ Introduction to R for life scientists ‣ RNA-seq data analysis (coming soon) ‣ Data visualization with R and ggplot2 (coming soon) ‣ Data manipulation with data.table and dplyr (coming soon) 13
2. How to incentivize open science + reproducibility for “traditional” scientists? How to change culture at the senior faculty level? 3. What are the technical barriers (if any) to open science and reproducibility? How solve? 4. Training: how to make sustainable & scalable? 14 twitter @genetics_blog web stephenturner.us core bioinformatics.virginia.edu email [email protected]