Goal of the VGP: to generate at least one high-quality, error-free, near-gapless, chromosome-level, haplotype-phased, and annotated reference genome assembly for all extant vertebrate species, and to utilize those genomes to address fundamental questions in biology, disease, and conservation.
specimens – 85 of 260 species
2. Add and define a metric for haplotype phasing
3. Non-trio approach for complete phasing of haplotypes, at all steps
4. Missing genomic sequence from VGP assemblies
5. Assembly and identification of mitochondrial genomes
6. Assembly and identification of sex chromosomes
7. Defining “assembled chromosome” and “chromosomal-level assembly”
8. Scaling up production of genome assemblies to 6 per week
9. Improve multilayered and reference-free alignments
10. Planning publications with VGP Phase 1 genomes
scaffold, chromosome, base accuracy, phasing
Metric format: x.y.z.QV.p%
h2: 3.4.2.QV40.h2.90% (maternal, paternal)
G10K-VGP Assembly working group (Durbin, Lewin, Jarvis, Myers et al.)
group (Koren et al. 2018, Nature Biotech; Rhie et al., in preparation)
Trio approach proves it is theoretically possible to assemble at higher quality
maternal: 3.4.2.QV50.h1.98%
paternal: 3.4.2.QV50.h1.98%
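The metric strings quoted on these slides (3.4.2.QV40.h2.90% and 3.4.2.QV50.h1.98%) follow the x.y.z.QV.p% pattern, so they can be split into fields and compared mechanically. The following is a minimal Python sketch, not a VGP tool. The names `AssemblyMetric`, `parse_metric`, and `at_least` are hypothetical. The mapping of the slide labels (scaffold, chromosome, base accuracy, phasing) onto field positions is an assumption, the label of the first field is cut off in the slide text and is left as `x`, the `h1`/`h2` token is kept as an uninterpreted tag, and the comparison assumes that a larger value is better in every numeric field.

```python
import re
from dataclasses import dataclass

# Pattern for strings such as "3.4.2.QV40.h2.90%".
_METRIC_RE = re.compile(
    r"^(?P<x>\d+)\.(?P<y>\d+)\.(?P<z>\d+)"  # three integer fields
    r"\.QV(?P<qv>\d+)"                      # base accuracy as a QV value
    r"\.(?P<h>h\d+)"                        # tag such as h1 or h2
    r"\.(?P<p>\d+(?:\.\d+)?)%$"             # phasing percentage
)


@dataclass(frozen=True)
class AssemblyMetric:
    x: int                  # first field; its label is truncated in the source
    scaffold: int           # assumed to be the "scaffold" field
    chromosome: int         # assumed to be the "chromosome" field
    base_accuracy_qv: int   # number following "QV"
    haplotype_tag: str      # "h1" or "h2", kept uninterpreted
    phasing_percent: float  # number preceding "%"


def parse_metric(text: str) -> AssemblyMetric:
    """Split a metric string into its fields, or raise ValueError."""
    match = _METRIC_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a recognised metric string: {text!r}")
    return AssemblyMetric(
        x=int(match["x"]),
        scaffold=int(match["y"]),
        chromosome=int(match["z"]),
        base_accuracy_qv=int(match["qv"]),
        haplotype_tag=match["h"],
        phasing_percent=float(match["p"]),
    )


def at_least(candidate: AssemblyMetric, minimum: AssemblyMetric) -> bool:
    """True if every numeric field of candidate is >= that of minimum.

    Assumes larger is better in each field; the haplotype tag is ignored.
    """
    return (
        candidate.x >= minimum.x
        and candidate.scaffold >= minimum.scaffold
        and candidate.chromosome >= minimum.chromosome
        and candidate.base_accuracy_qv >= minimum.base_accuracy_qv
        and candidate.phasing_percent >= minimum.phasing_percent
    )


if __name__ == "__main__":
    baseline = parse_metric("3.4.2.QV40.h2.90%")
    trio = parse_metric("3.4.2.QV50.h1.98%")
    print(trio)
    print("trio meets baseline:", at_least(trio, baseline))
```

Keeping the fields as plain numbers, without converting them to sequence lengths or error rates, avoids committing to units that the slide text does not state.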
read on each sex (Guojie Zhang)
• Trio approach (Arang Rhie)
• Bionano or Hi-C mapping?
G10K Assembly working group (planned analyses led by Kateryna Makova and Paul Medvedev)
is a bottleneck
• PacBio Sequel II and 8M SMRT cell will enable this goal
• Need to separate production from R&D
• Need dedicated effort to continuously obtain samples
• Need more rapid production of RNA-Seq/Iso-Seq
• Need VGP pipeline on DNAnexus to be functional
• Need more systematic upload of data to annotation archives
VGP submissions for 2019
genome quality and biology, and setting standards
2. DNA sample preparation method comparisons for high-quality genomes
3. Use of high-quality genomes to inform and reverse the current 6th mass extinction

VGP submissions for 2020
1. Genome-scale family tree of vertebrates
2. Comparative genomics of specialized traits
3. Genomics of vocal learning and spoken language
4. A universal vertebrate gene orthology and nomenclature
5. Deciphering vertebrate chromosomal genome evolution
6. Reconstruction of the ancestral genome of vertebrates and vertebrate clades
7. Evolution of bases and chromosomes of the human genome
8. Why are some lineages more resistant to diseases than others?
9. Conservation genomics of endangered species
10. The genomes of all remaining Kakapo parrots on the planet
11. Genetic signatures of domestication across vertebrates
12. Sex determination and sex chromosome evolution among vertebrates
13. Brain cell-type evolution and homologies across vertebrates