
PacBio Software Overview

PacBio
August 01, 2013


Transcript

  1. PacBio® Software Overview
    FIND MEANING IN COMPLEXITY
    © Copyright 2013 by Pacific Biosciences of California, Inc. All rights reserved.
  2. Learning Objectives
    After the training, the participant will know:
    • The names of the PacBio software programs
    • Which software to use for a specific need
    Bioinformaticians:
    • Interested in understanding how the different PacBio® software programs fit within the sequencing workflow
    • None
  3. PacBio® Software Suite
    • Fully automated analysis from run setup, with the option to run it manually later
    • Efficient integration with LIMS and third-party analysis tools
    • User-friendly UI design for advanced informatics researchers, as well as biologists and clinicians
    • Industry-standard output formats: FASTA, FASTQ, SAM/BAM, VCF
    • Open source through our developers’ network
  4. High Level Software Workflow
    [Diagram: workflow overview]
    • Stages: Sample Preparation, Primary Analysis, Secondary Analysis, Tertiary Analysis
    • Steps: Design Run, Load Instrument, Run & Monitor, Review Run
    • Software: RS Touch and RS Remote; SMRT® Portal and SMRT® View
    • Integration: Laboratory Information Management System API Layer
  5. RS Touch and RS Remote
    [Diagram: the Primary Analysis steps (Design Run, Load Instrument, Run & Monitor, Review Run) labeled with RS Touch and RS Remote, shown alongside Sample Preparation, Secondary Analysis, and Tertiary Analysis]
  6. RS Remote – Designing a Run
    • Design runs remotely
    • Can assign multiple SMRT® Cells per well with different movie times
  7. RS Touch – Monitoring a Run
    • Monitor at the instrument or remotely
    • View real-time base incorporations
    • Displays remaining run time
    • Shows SMRT® Cell status, from SMRT Cell prep to Quality Values
  8. Primary Analysis Pipeline
    [Diagram: SMRT® Cell → Movie → Trace → Pulse → Raw Base Calls → CC Base Calls → 2° Analysis]
    • Processing steps: Movie2Trace, Trace2Pulse, Pulse2Base, Base2CircularConsensus
    • Sequence labels: GCAACGATCA…GCAACGATCA…GCAACGATCA…GCAACGATCA…GCAA and ACGATCA
    Pulse Features
    • Pulse Height
    • Pulse Width
    • Interpulse Distance
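
The three pulse features named on this slide can be illustrated with a small sketch. Everything about the representation here is an assumption made for illustration: the deck does not describe how the primary analysis pipeline stores pulses, so the `Pulse` class, its frame-based `start`/`end` fields, and both helper functions are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Pulse:
    # Hypothetical representation; the deck does not describe the real data model.
    start: int     # first frame of the pulse
    end: int       # first frame after the pulse ends
    height: float  # peak signal intensity (arbitrary units)


def pulse_width(p: Pulse) -> int:
    """Pulse width: number of frames the pulse lasts."""
    return p.end - p.start


def interpulse_distances(pulses: List[Pulse]) -> List[int]:
    """Gap in frames between the end of each pulse and the start of the next."""
    ordered = sorted(pulses, key=lambda p: p.start)
    return [nxt.start - cur.end for cur, nxt in zip(ordered, ordered[1:])]


example_pulses = [Pulse(0, 4, 1.8), Pulse(10, 13, 2.1), Pulse(15, 21, 1.5)]
widths = [pulse_width(p) for p in example_pulses]
gaps = interpulse_distances(example_pulses)
```

Here three pulses yield three widths and two gaps, since an interpulse distance only exists between consecutive pulses.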
  9. High Performance Computing Analysis Setup
    [Diagram: PacBio® High Resolution Genetic Analyzer, Customer LAN, Data Repository; data label: Base calls + QVs]
    • Primary Analysis: Signal processing, base calling, quality assessment
    • Secondary Analysis: Alignment/Assembly
  10. RS Insight – Remote Monitoring
    • Proactive monitoring by PacBio Tech Support if RS Insight access is enabled
      – Sequencing QC metrics are retained on the Blade Center
    [Diagram labels: PacBio® RS, Customer-Controlled Policy Server, Internet, PacBio Remote Engineering Team, Service Management, Tech Support, Field Service, R & D]
  11. SMRT® Analysis – Secondary Analysis
    [Diagram: Secondary Analysis shown alongside Sample Preparation, Primary Analysis (Design Run, Load Instrument, Run & Monitor, Review Run), and Tertiary Analysis]
    • SMRT Portal: Alignment/Assembly
    • SMRT View: Visualization
  12. SMRT® Portal
    • Web-based interface
    • Accessible from any computer
    • Automated secondary analysis
  13. SMRT® Portal – Job Selection
    • Prior jobs are searchable
    • Can select one or more jobs
  14. SMRT® Portal – Report Viewing
    • Full complement of reports automatically generated
      – Quality Values
      – Read length distributions
      – Post-secondary accuracy distributions
    • Automatic generation of industry-standard data formats
      – FASTA/FASTQ
      – SRA
      – SAM/BAM
      – …others
    • Integrated genome browser
      – Direct connection to SMRT® View
  15. SMRT® View Base Modification Details
    • Kinetogram: IPD ratio by base position and strand
    • Modification density by mod, variants, and separate tracks per motif
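
A kinetogram plots an IPD ratio for each base position and strand. The sketch below shows one way such a ratio could be computed. It assumes the ratio is the mean interpulse duration observed in the sample divided by the mean observed in a control at the same site. That definition is an assumption in line with the case/control comparison mentioned elsewhere in the deck, not a description of what SMRT® View does internally, and the function and variable names are hypothetical.

```python
from statistics import mean
from typing import Dict, List, Tuple

Site = Tuple[int, str]  # (reference position, strand), strand is "+" or "-"


def ipd_ratios(sample: Dict[Site, List[float]],
               control: Dict[Site, List[float]]) -> Dict[Site, float]:
    """Assumed definition: mean sample IPD / mean control IPD per site."""
    ratios = {}
    for site, sample_ipds in sample.items():
        control_ipds = control.get(site)
        # Skip sites that lack observations in either data set.
        if not sample_ipds or not control_ipds:
            continue
        control_mean = mean(control_ipds)
        if control_mean > 0:
            ratios[site] = mean(sample_ipds) / control_mean
    return ratios


# Made-up illustrative values, not real measurements.
sample_ipds = {(100, "+"): [2.0, 4.0], (100, "-"): [1.0, 1.0], (101, "+"): [0.5]}
control_ipds = {(100, "+"): [1.0, 1.0], (100, "-"): [1.0, 1.0]}
ratios = ipd_ratios(sample_ipds, control_ipds)
```

With these made-up values, position 100 on the "+" strand has a ratio of 3.0 and the "-" strand a ratio of 1.0. Position 101 is dropped because it has no control data.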
  16. Application-Specific Analysis Software
    De Novo Assembly
    • HGAP: Generate high-quality assemblies from PacBio® long reads alone
    • ALLORA: Assemble pure PacBio long reads, then polish with Quiver
    • AHA: Fill gaps and join existing scaffolds with PacBio long reads
    • Celera® Assembler: Combine PacBio long reads with short reads from other technologies
    • SMRT® View: QC assemblies
    Targeted Sequencing
    • BLASR: Map reads against a reference
    • Quiver: Call haploid SNPs and indels with 99.999% accuracy
    • GATK: Identify haploid and diploid SNPs using the Broad’s Unified Genotyper
    • GMAP: Align full-length cDNA transcripts against genomic DNA to discover splicing
    • SMRT View: Browse coverage, variants and annotations
    DNA Base Modifications
    • Available as an extension of standard resequencing, or using a case/control comparison
    • Modification detection: Find specific modified sites in unamplified genomes
    • Bacterial methylomes: Discover recognition motifs for adenine and cytosine methylation
    • SMRT View: Visualize modified sites and sequence contexts
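
The 99.999% accuracy quoted for Quiver can be placed on the same scale as the Quality Values the deck mentions. The sketch below assumes the standard Phred definition, QV = -10 · log10(error probability). The deck does not say which scale its Quality Values use, so treat this as an illustration only. The function names are hypothetical.

```python
import math


def accuracy_to_qv(accuracy: float) -> float:
    """Phred-scaled quality for a given per-base accuracy (assumed scale)."""
    if not 0.0 <= accuracy < 1.0:
        raise ValueError("accuracy must be in [0, 1)")
    return -10.0 * math.log10(1.0 - accuracy)


def qv_to_accuracy(qv: float) -> float:
    """Inverse conversion: per-base accuracy implied by a Phred quality."""
    return 1.0 - 10.0 ** (-qv / 10.0)


quiver_qv = accuracy_to_qv(0.99999)
```

Under this assumption, 99.999% accuracy is roughly QV 50, or about one expected error per 100,000 bases.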
  17. Summary of Key Points
    • Suite of data management and analysis software tools
      – Supports granularity, scalability, and full functionality of the PacBio® RS instrument
    • Integrated software solution from beginning to end
      – Fully automated analysis, with the option for manual set-up
    • Accessible and usable
      – User-friendly UI design
    • Industry-standard output formats
      – FASTA, FASTQ, SAM/BAM, and so on
    • Open and extensible platform
      – Developer’s Network
  18. Where to Find More Information
    These documents are all available on DevNet:
      – SMRT Portal Help
      – SMRT View Help
      – SMRT Pipe Reference Guide (v1.4)
      – SMRT Analysis Software Installation (v1.4)
      – Software Getting Started Guide