difference(s) between biological conditions or over time? Problems • Annotation may be incomplete • Assembly with short reads is challenging • Counting is harder than it looks
between cocaine and alcohol addicts in the human hippocampus? 2. in blood in a natural timecourse for a single individual? 3. at multiple developmental stages? 4. in the dorsolateral prefrontal complex over lifespan? Jaffe*, Shin, Collado-Torres, Leek et al, In review, 2014 Zhou et al, PNAS, 2011 Chen et al, Cell, 2012 Xie et al, Cell, 2013
Build base-resolution models for artifacts: – RNA quality, cell composition, batch effects 3. Improve annotation of DERs 4. Make available via Bioconductor https://blogs.warwick.ac.uk/nichols/entry/spm5_gem_6 Nichols and Holmes, Human Brain Mapping, 2001
from >1300 individuals from DC/VA/MD Medical Examiners Offices • Non-psychiatric controls from across the lifespan (fetal through aged) and individuals with brain disorders (schizophrenia, bipolar, major depression) • Generating genomic data from brain regions of interest: genotypes, gene expression microarrays, RNA-seq, DNA methylation, etc
Colantuoni 2011, Kang 2011 • Previous approaches relied on microarray technologies à pre-defined probe sequences that capture only a limited proportion of transcriptome diversity Background
utilized gene- and/or exon-level count- based summarizations (www.brainspan.org) • Feature-based read counts lack the ability to reliably identify novel transcriptional activity • Transcript assembly using short reads or counting are hard
50+ 6 / group, N = 36 Discovery data Fetal Infant Child Teen Adult 50+ 6 / group, N = 36 Independent samples! Jaffe et al, In review, 2014 Replication data
Adult 50+ 6 / group, N = 36 Discovery data Null: Alt: Models Cutoff Details • Rank DERs by area • 1000 permutations • Control FWER (≤ 5%) by max area per permutation Results 63,135 DERs 20.509 Corresponds to p-value 10-08 Jaffe et al, In review, 2014
Adult 50+ 6 / group, N = 36 Replication data Null: Alt: Models Cutoff Details • Per sample and per DER calculate average expression • Use the 36 numbers to calculate F-statistic Results 50,650 DERs replicated Single F-statistic per DER p-value < 0.05 Jaffe et al, In review, 2014
of the DERs have the highest expression levels in fetal life (81.7%) • Overlap genes enriched for neurogenesis, signaling, development; genes involved in brain development, e.g. SOX11, DCX, GAT1, NRGN, CAMK2A, CNTNAP1 Jaffe et al, In review, 2014
in many brain regions (www.brainspan.org) – Gene/exon counts – Coverage-level data • Downloaded and processed RNA-seq data from ~500 samples in 16 brain regions (11 from neocortex), extracting coverage levels within the DERs Coverage Data from BrainSpan: h3p://download.alleninsStute.org/brainspan/MRF_BigWig_Gencode_v10/
• Downloaded and reprocessed RNA-seq data from stem cell and somatic tissue • Majority of the DERs had on average > 5 reads in at least one stem cell (86.4%) or tissue (84.0%) type • 53.3% of all DERs, and 26.5% of non- exonic DERs were expressed in all five stem cell conditions Jaffe et al, In review, 2014 Illumina BodyMap data
Utilized DNA methylation data from: – flow-sorted cortex GuinSvano 2013 – and stem cell developmental system Kim 2014 • Performed composition estimation using recently published approaches for Illumina 450k Houseman et al 2012, Jaffe and Irizarry 2014 Jaffe et al, In review, 2014
• Identified DERs that replicate • Validated DERs (cytosol vs total mRNA) • Confirmed with BrainSpan data • Identified DERs conserved in mouse • DERs expressed in other tissues (BodyMap data) • Estimated cell composition from DNA methylation
development • Incomplete annotation of the human brain transcriptome • Differences in expression occurring across birth, may be driven principally by changing neuronal phenotypes, rather than the rise of non-neuronal cell types Discussion
Ben Langmead LIBD Andrew Jaffe Jooheon Shin Nikolay Ivanov Amy Deep Ran Tao Yankai Jia Thomas Hyde Joel Kleinman Daniel Weinberger Harvard Rafael Irizarry Funding NIH LIBD CONACyT México