This repository has been archived by the owner on Jun 21, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 67
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Part 5 of n: SNV caller initial calculations (#134)
* Set up the set up * Add circle CI test and Docker config * Add some more comments * Set up Rprojroot for circle CI test to work better * Fixing Circle CI file. * Change read_tsv to data.table::fread for big file * read in the .gz file * push plot function changes * Fix an error * Add missing package to Dockerfile * Reduce cosmic file to only the brain sample mutations * Update README with changes to cosmic file * re-updated Dockerfile * Ran a linter on set up script * Comment out of date * Get rid of old WGX/WXS bed file set up * Incorporate initial PR suggestions from @jashapiro and @cbethell * Push a working bash script * Add bash script to circle CI * Add usage section in README and change name of script * Add some more comments * Correct a couple things in the README * Get rid of remnant comment * Fix a typo! * Add Usage to the TOC * Add more documentation to the README * Update Circle CI * Push more exact bash script * Fix a couple issues with handling metadata file path * Get rid of dev remnants * Couple changes for readability * Add some things to README and get rid of part of bash that isn't there * Found dumb mistake * Changed [*] to [+] * I mean [*] to a [@] which makes more sense. * Fix some of the overwrite handling * Update comments * Temporarily remove VarDict while #135 is unresolved * Switch out WGS file for lancet * Add an if statement for no WXS samples * Get rid of development remnant * Fix an error with that last commit * One more change
- Loading branch information
1 parent
d4e8c5c
commit 542e3a6
Showing
6 changed files
with
526 additions
and
94 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,41 @@ | ||
#!/bin/bash | ||
# C. Savonen | ||
# CCDL for ALSF 2019 | ||
|
||
# Purpose:Run an intial evaluation of each variant caller's MAF file | ||
|
||
# Change directory | ||
cd kitematic | ||
|
||
# The files named in these arrays will be ran in the analysis. | ||
datasets=("strelka2" "mutect2" "lancet") | ||
wgs_files=("WGS.hg38.strelka2.unpadded.bed" "WGS.hg38.mutect2.unpadded.bed" "WGS.hg38.lancet.300bp_padded.bed") | ||
|
||
# Reference file paths | ||
cosmic=analyses/snv-callers/brain_cosmic_variants_coordinates.tsv | ||
annot_rds=scratch/hg38_genomic_region_annotation.rds | ||
|
||
############################ Set Up Reference Files ############################ | ||
# The original COSMIC file is obtained from: https://cancer.sanger.ac.uk/cosmic/download | ||
# These data are available if you register. The full, unfiltered somatic mutations | ||
# file CosmicMutantExport.tsv.gz for grch38 is used here. | ||
Rscript analyses/snv-callers/scripts/00-set_up.R \ | ||
--annot_rds $annot_rds \ | ||
--cosmic_og scratch/CosmicMutantExport.tsv.gz \ | ||
--cosmic_clean $cosmic | ||
|
||
########################## Calculate and Set Up Data ########################## | ||
# Create files that contain calculated VAF, TMB, and regional analyses. | ||
for ((i=0;i<${#datasets[@]};i++)); | ||
do | ||
echo "Processing dataset: ${datasets[$i]}" | ||
Rscript analyses/snv-callers/scripts/01-calculate_vaf_tmb.R \ | ||
--label ${datasets[$i]} \ | ||
--output analyses/snv-callers/results/${datasets[$i]} \ | ||
--maf data/pbta-snv-${datasets[$i]}.vep.maf.gz \ | ||
--metadata data/pbta-histologies.tsv \ | ||
--bed_wgs data/${wgs_files[$i]} \ | ||
--bed_wxs data/WXS.hg38.100bp_padded.bed \ | ||
--annot_rds $annot_rds \ | ||
--overwrite | ||
done |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.