Slide 1

Slide 1 text

FIND MEANING IN COMPLEXITY © Copyright 2013 by Pacific Biosciences of California, Inc. All rights reserved. PacBio® Software Overview

Slide 2

Slide 2 text

Learning Objectives 2 After the training, the participant will know: • The names of the PacBio software programs • Which software to use for a specific need Bioinformaticians: • Interested in understanding how the different PacBio® software programs fit within the sequencing workflow • None

Slide 3

Slide 3 text

PacBio® Software Suite 3 • Fully automated analysis from run setup, with the option to manually run later • Efficient integration with LIMS and third-party analysis tools • User-friendly UI design for advanced informatics researchers, as well as biologists and clinicians • Industry-standard output formats: FASTA, FASTQ, SAM/BAM, VCF • Open source through our developers’ network ®

Slide 4

Slide 4 text

High Level Software Workflow 4 Laboratory Information Management System API Layer Primary Analysis Secondary Analysis Tertiary Analysis Sample Preparation Design Run Load Instrument RS Touch and RS Remote SMRT® Portal and SMRT® View Run & Monitor Review Run

Slide 5

Slide 5 text

RS Touch and RS Remote 5 Primary Analysis Design Run Load Instrument Run & Monitor Review Run RS Touch RS Remote RS Remote Secondary Analysis Tertiary Analysis Sample Preparation RS Touch RS Remote

Slide 6

Slide 6 text

RS Remote – Designing a Run 6

Slide 7

Slide 7 text

RS Remote – Designing a Run 7 • Design runs remotely • Can assign multiple SMRT® Cells per well with different movie times

Slide 8

Slide 8 text

RS Touch – Loading a Run 8

Slide 9

Slide 9 text

RS Touch – Monitoring a Run 9 • Monitor at the instrument or remotely • View real-time base incorporations • Displays remaining run time • Shows SMRT® Cell status, from SMRT Cell prep to Quality Values

Slide 10

Slide 10 text

PacBio® High Resolution Genetic Analyzer 10

Slide 11

Slide 11 text

Movie Trace Pulse Raw Base Calls CC Base Calls SMRT® Cell Movie2Trace Trace2Pulse Pulse2Base Base2 CircularConsensus 2° Analysis Primary Analysis Pipeline 11 GCAACGATCA…GCAACGATCA…GCAACGATCA…GCAACGATCA…GCAA ACGATCA Pulse Features • Pulse Height • Pulse Width • Interpulse Distance t

Slide 12

Slide 12 text

High Performance Computing Analysis Setup 12 Data Repository Customer LAN PacBio® High Resolution Genetic Analyzer Base calls + QVs Primary Analysis: Signal processing, base calling, quality assessment Secondary Analysis: Alignment/Assembly

Slide 13

Slide 13 text

RS Insight – Remote Monitoring • Proactive monitoring by PacBio Tech Support if RS Insight access is enabled – Sequencing QC metrics are retained on the Blade Center 13 Service Management Tech Support Field Service R & D Customer- Controlled Policy Server Internet PacBio® RS PacBio Remote Engineering Team

Slide 14

Slide 14 text

SMRT® Analysis – Secondary Analysis 14 Secondary Analysis SMRT Portal Alignment/Assembly SMRT View Visualization Primary Analysis Design Run Load Instrument Run & Monitor Review Run Tertiary Analysis Sample Preparation

Slide 15

Slide 15 text

SMRT® Portal 15 • Web-based interface • Accessible from any computer • Automated secondary analysis

Slide 16

Slide 16 text

SMRT® Portal – Job Selection 16 • Prior jobs are searchable • Can select one or more jobs

Slide 17

Slide 17 text

SMRT® Portal – Report Viewing 17 Full complement of reports automatically generated Quality Values Readlength distributions Post-secondary accuracy distributions Automatic generation of industry- standard data formats FASTA/FASTQ SRA SAM/BAM …others Integrated genome browser Direct connection to SMRT® View

Slide 18

Slide 18 text

SMRT® View – Genome Browser 18

Slide 19

Slide 19 text

SMRT® View Base Modification Details 19 Kinetogram: IPD ratio by base position and strand Modification density by mod, variants, and separate tracks per motif

Slide 20

Slide 20 text

Application-Specific Analysis Software De Novo Assembly HGAP: Generate high quality assemblies from PacBio® long reads alone ALLORA: Assemble pure PacBio long reads, then polish with Quiver AHA: Fill gaps and join existing scaffolds with PacBio long reads Celera® Assembler: Combine PacBio long reads with short reads from other technologies SMRT® View: QC assemblies 20 Targeted Sequencing BLASR: Map reads against a reference Quiver: Call haploid SNPs and indels with 99.999% accuracy GATK: Identify haploid and diploid SNPs using the Broad’s Unified Genotyper GMAP: Align full-length cDNA transcripts against genomic DNA to discover splicing SMRT View: Browse coverage, variants and annotations DNA Base Modifications Available as an extension of standard resequencing, or using a case/control comparison Modification detection: Find specific modified sites in unamplified genomes Bacterial methylomes: Discover recognition motifs for adenine and cytosine methylation SMRT View: Visualize modified sites and sequence contexts

Slide 21

Slide 21 text

DevNet 21

Slide 22

Slide 22 text

Summary of Key Points • Suite of data management and analysis software tools - Supports granularity, scalability, and full functionality of the PacBio® RS instrument • Integrated software solution from beginning to end - Fully automated analysis, with the option for manual set-up • Accessible and usable - User-friendly UI design • Industry-standard output formats - FASTA, FASTQ, SAM/BAM, and so on. • Open and extensible platform - Developer’s Network 22

Slide 23

Slide 23 text

Where to Find More Information These documents are all available on DevNet: – SMRT Portal Help – SMRT View Help – SMRT Pipe Reference Guide (v1.4) – SMRT Analysis Software Installation (v1.4) – Software Getting Started Guide 23