Upgrade to Pro — share decks privately, control downloads, hide ads and more …

TSP Scrum

Sponsored · Your Podcast. Everywhere. Effortlessly. Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.

TSP Scrum

Avatar for Radhouane Aniba

Radhouane Aniba

July 14, 2014
Tweet

More Decks by Radhouane Aniba

Other Decks in Research

Transcript

  1. Targeted Sequencing Pipeline Features * Alignment using bwa-mem * alignment

    problems * improvements * Xenograft project showed some limitations * Binomial exact test * Background size * Code working on decent coverage * No tests yet * Variant tagging needs some more work
  2. Targeted Sequencing Pipeline git clone http://[email protected]/scm/pp/miseq-pipeline.git cd miseq-pipeline/pipeline/code ! python

    pipeline.py \ —num_cpus 10 \ —mode cluster \ —install_dir <path_to_miseq>/software \ <path_to_config_file> Limitations : ! - Manifest file - positions format - limited to point mutations ! ! If you want to run this version : ! - create_config_file_for_miseqpipeline.py One single code that generates all needed formats to run the pipeline
  3. Input processing Alignment recalibration Variant calling and filtering Variant characterization

    * Fastq QC (not yet implemented) * Alignment (BWA-MEM, bwa) * (Bowtie2 to test ?)
  4. Input processing Alignment recalibration Variant calling and filtering Variant characterization

    * Recalibrate Base Quality from sequencers * create targets for local realignments (indels) * local realignment (correction of small fraction of the alignment) * can be time consuming depending on the depth Broad best practices information : http://goo.gl/8sRWCF * Fastq QC * Alignment (BWA-MEM, bwa) * (Bowtie2 to test ?)
  5. Input processing Alignment recalibration Variant calling and filtering Variant characterization

    * Recalibrate Base Quality from sequencers * create targets for local realignments (indels) * local realignment (correction of small fraction of the alignment) * can be time consuming depending on the depth Broad best practices information : http://goo.gl/8sRWCF * Fastq QC * Alignment (BWA-MEM, bwa) * (Bowtie2 to test ?) * call variants (any caller), I tested UnifiedGenotyper and HaplotypeCaller * Merge all vcfs for all the samples * Intersect with amplicons positions and filter out call out of the targets * Get unique calls across all samples
  6. Input processing Alignment recalibration Variant calling and filtering Variant characterization

    * Recalibrate Base Quality from sequencers * create targets for local realignments (indels) * local realignment (correction of small fraction of the alignment) * can be time consuming depending on the depth Broad best practices information : http://goo.gl/8sRWCF * Fastq QC * Alignment (BWA-MEM, bwa) * (Bowtie2 to test ?) * call variants (any caller), I tested UnifiedGenotyper and HaplotypeCaller * Merge all vcfs for all the samples * Intersect with amplicons positions and filter out call out of the targets * Get unique calls across all samples * Build counts file per sample * Create foreground and background * binomial test and tag the variant * report generation
  7. Targeted Sequencing Pipeline git clone <stash>/scm/~raniba/targeted_sequencing_pipeline.git! cd targeted_sequencing_pipeline! git checkout

    dev! ! python pipeline.py \! ! ! --num_cpus 40 \! ! ! --mode cluster \! ! ! --install_dir <path_to_tsp>/software/ \! ! ! <path_to_config.yaml> \! ! ! --dedup no \! ! ! --targets <path_to_amplicon_targets>! ! ! --email your@email Installation instructions on the README.md of the repo
  8. Targeted Sequencing Pipeline |-- log |-- results `-- tmp |--

    amplicon_positions |-- bam |-- calls |-- counts |-- intervals |-- merged_calls |-- positions |-- realigned |-- reheaded `-- variant_status results