learning to using to teaching to developing R Leonardo Collado Torres @fellgernon #rstats #teaching #CDSBMexico https://speakerdeck.com/lcolladotor/CDSBMexico
+ BioC2011 Developer’s day + 2 conference days + Europe Bioc 2010 http://www- huber.embl.de/biocdeveleurope2010/ With support from: @Bioconductor, @lcgunam, @WINTERGENOMICS
:P, it was a pilot for OpenCourseware) TAs: Alejandro Reyes @areyesq José Víctor Moreno Mayar https://geogenetics.ku.dk/staff/?pure=en/persons/475726 José Reyes http://sysbiophd.harvard.edu/people/student-profiles/jose-reyes
or lodging • Support for teaching: Robert Gentleman gave me free copies of books he had in his office (authors normally get several free copies of books) • Support for community building: almost had Bioconductor’s support in 2010ish for 1 visit, we didn’t give up! #CDSBMexico • Feel free to ask for help! We all started somewhere!! Check your spam box and filters: • Almost lost a scholarship for user!2013 that way :P Check the dates for applying for support! Ask for emails and keep in touch • I asked for PhD application and career advice to Davis McCarthy @davisjmcc in 2010 • That’s how I got into my PhD Socialize! Take advantage of opportunities offered to you!
(BacterialTranscription): Transcription initiation mapping and transcription unit identification in E. coli Rafael Irizarry https://rafalab.github.io/ @rafalab Ingo Ruczinski http://www.biostat.jhsph.edu/~iruczins/ Them: Have you heard about Johns Hopkins? Me: Johns???? No idea Them: come join us at @jhubiostat !!
= 36 Discovery data Jaffe et al, Nat. Neuroscience, 2015 Postmortem Human Brain Samples Fetal Infant Child Teen Adult 50+ 6 / group, N = 36 Replication data @andrewejaffe
male 1240 Male 141 Total 3640 Even when information is provided, it’s not always clear… sra_meta$S ex “1 Male, 2 Female”, “2 Male, 1 Female”, “3 Female”, “DK”, “male and female” “Male (note: ….)”, “missing”, “mixed”, “mixture”, “N/A”, “Not available”, “not applicable”, “not collected”, “not determined”, “pooled male and female”, “U”, “unknown”, “Unknown” slide adapted from Shannon Ellis @Shannon_E_Ellis
Race Age 662 0 NA female liver NA NA 662 1 NA female liver NA NA 662 2 NA female liver NA NA 662 3 NA female liver NA NA 662 4 NA female liver NA NA 662 5 NA male liver NA NA 662 6 NA male liver NA NA 662 7 NA male liver NA NA 662 8 NA male liver NA NA z z z z slide adapted from Shannon Ellis @Shannon_E_Ellis
samples in recount gene, exon, exon-exon junction and expressed region RNA-Seq data SRA Sequence Read Archive N=49,848 TCGA The Cancer Genome Atlas N=11,284 GTEx Genotype Tissue Expression Project N=9,662 slide adapted from Shannon Ellis @Shannon_E_Ellis
samples in recount gene, exon, exon-exon junction and expressed region RNA-Seq data SRA Sequence Read Archive N=49,848 GTEx Genotype Tissue Expression Project N=9,662 divide samples build and optimize phenotype predictor training set test accurac y of predicto r test set TCGA The Cancer Genome Atlas N=11,284 slide adapted from Shannon Ellis @Shannon_E_Ellis
samples in recount gene, exon, exon-exon junction and expressed region RNA-Seq data SRA Sequence Read Archive N=49,848 GTEx Genotype Tissue Expression Project N=9,662 divide samples build and optimize phenotype predictor training set test accurac y of predicto r predict phenotypes across samples in TCGA test set TCGA The Cancer Genome Atlas N=11,284 slide adapted from Shannon Ellis @Shannon_E_Ellis
samples in recount gene, exon, exon-exon junction and expressed region RNA-Seq data SRA Sequence Read Archive N=49,848 GTEx Genotype Tissue Expression Project N=9,662 divide samples build and optimize phenotype predictor training set predict phenotypes across SRA samples test accurac y of predicto r predict phenotypes across samples in TCGA test set TCGA The Cancer Genome Atlas N=11,284 slide adapted from Shannon Ellis @Shannon_E_Ellis
N=9,962 TCGA N=11,284 SRA N=49,848 samples expression estimates gene exon junctions ERs Answer meaningful questions about human biology and expression sex tissue M Blood F Heart F Liver slide adapted from Shannon Ellis @Shannon_E_Ellis
N=9,962 TCGA N=11,284 SRA N=49,848 samples expression estimates gene exon junctions ERs Answer meaningful questions about human biology and expression sex tissue M Blood F Heart F Liver slide adapted from Shannon Ellis @Shannon_E_Ellis
Black Hispanic White Tissue Site 1 Cerebral cortex Hippocampus Brainstem Cerebellum Tissue Site 2 Frontal lobe Temporal lobe Midbrain Basal ganglia Tissue Site 3 Dorsolateral prefrontal cortex Superior temporal gyrus Substantia nigra Caudate Hemisphere Left Right Brodmann Area 1-52 Disease Status Disease Neurological control Disease Brain tumor Alzheimer’s disease Parkinson’s disease Bipolar disorder Tumor Type Glioblastoma Astrocytoma Oligodendroglioma Ependymoma Clinical Stage 1 Grade I Grade II Grade III Grade IV Clinical Stage 2 Primary Secondary Recurrent Viability Postmortem Biopsy Preparation Frozen Thawed
to - develop methods to process and analyze data from recount2 - use recount2 to address specific biological questions This project involves the Hansen, Leek, Langmead and Battle labs at JHU Contact: Kasper D. Hansen ([email protected] | www.hansenlab.org) @KasperDHansen @jtleek @BenLangmead @alexisjbattle