Skip to content

Latest commit

 

History

History
104 lines (66 loc) · 3.35 KB

step4.rst

File metadata and controls

104 lines (66 loc) · 3.35 KB

|CyVerse_logo|_

|Home_Icon|_ Learning Center Home

Resolve spliced alignments using GATK

In this step, we will process BAM files from previous step to resolve splice alignments. GATK is strict about chromosome order, and thus we include preventative re-sorting steps using Picard-ReorderSAM app.


Input Data:

Input Description Example
BAM files BAM files with uniquely mapped reads iplantcollaborative > example_data > HAMR_tutorial -> unique_mapping_reads_assignRG -> output.RG.bam

RUN Picard-ReorderSAM

  1. Click on "Apps" tab in the Discovery Environment and search for "Picard-ReorderSAM".
  2. Under Input section, provide input BAM file (output from Picard-AddOrReplaceReadGroups). Provide Reference sequence, dictionary and index files from the tutorial example data (iplantcollaborative > example_data > HAMR_tutorial -> reference_genome). Reference dict for your genome can be generated using GATK-CreateSequenceDictionary app.
  3. Provide an output file name or leave it to defaults and launch analysis.

Splits reads that contain Ns in their cigar string using GATK-SplitNCigarReads

  1. Search for "GATK-SplitNCigarReads v3.5" from the app tab. Click on the app icon.
  2. Provide Reference sequence, dictionary and index files from the tutorial example data. Input reordered BAM and index file from the previous step. Example data for this step is provided at iplantcollaborative > example_data > HAMR_tutorial -> reordered_BAM_files.
  3. Provide an output file name or leave it to defaults and launch analysis.

Output/Results

Output Description Example
BAM file BAM file with reads split at N CIGAR elements and CIGAR strings updated. iplantcollaborative > example_data > HAMR_tutorial -> resolved_splice_alignments

Fix or improve this documentation


|Home_Icon|_ Learning Center Home