Skip to content
This repository has been archived by the owner on Jun 21, 2023. It is now read-only.

Subset files for ATRT scripts #325

Merged
merged 13 commits into from
Dec 12, 2019

Conversation

cbethell
Copy link
Contributor

  • Add 00-subset-files-for-ATRT.R
  • Adapt 01-ATRT-molecular-subtyping data-prep.Rmd to use subset files

Purpose/implementation Section

The purpose of this PR is to subset the files for the ATRT molecular subtyping scripts.

What scientific question is your analysis addressing?

This analysis addresses the molecular subtyping of ATRT samples question.

What was your approach?

I subset the histologies, focal copy number, ssGSEA pathways, tumor mutation burden, and RNA expression files for ATRT samples using the sample_id variable in the histologies file to match samples.

What GitHub issue does your pull request address?

This PR addresses issue #244.

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

  • Does this analysis seem to be correct?
  • Are there any areas that can be refactored?

Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?

Yes, this analysis is ready for review.

Results

What types of results are included (e.g., table, figure)?

The output files of this PR include the subset files as follow:

-atrt-subset/atrt_focal_cn.tsv.gz
-atrt-subset/atrt_histologies.tsv
-atrt-subset/atrt_ssgsea.tsv
-atrt-subset/atrt_tmb.tsv
-atrt-subset/atrt_zscored_expression.RDS

What is your summary of the results?

Reproducibility Checklist

  • The dependencies required to run the code in this pull request have been added to the project Dockerfile.
  • This analysis has been added to continuous integration.

- Add `00-subset-files-for-ATRT.R`
- Adapt `01-ATRT-molecular-subtyping data-prep.Rmd` to use subset files
Comment on lines 213 to 217
dplyr::summarise(
HALLMARK_MYC_TARGETS_V1 = mean(HALLMARK_MYC_TARGETS_V1),
HALLMARK_MYC_TARGETS_V2 = mean(HALLMARK_MYC_TARGETS_V2),
HALLMARK_NOTCH_SIGNALING = mean(HALLMARK_NOTCH_SIGNALING)
)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You appear to be repeating this step in the next notebook.


#### Filter ssGSEA data --------------------------------------------------------

# Calculate ssGSEA mean and sd
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Before you calculated the z-score, I would subset this to ATRT samples. Which I don't think you've done unless I missed it.

Copy link
Member

@jaclyn-taroni jaclyn-taroni left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good! Two things I'd like to see before it gets merged:

  • The TODO for the GISTIC stuff (Proposed Analysis: Molecularly subtype ATRT tumors #244 (comment))
  • Add the start of the README here (not the usage, etc.) with a note that warns folks about what release of the data that you used to generate the subset files, e.g., if you run this again you might want to make sure the ATRT subset files are regenerated with the most recent release.

data.table::fread(file.path(root_dir,
"data",
"pbta-snv-consensus-mutation-tmb.tsv"))

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's add a TODO re: SV data / GISTIC output here please.

cbethell and others added 3 commits December 12, 2019 16:25
- Add TODO comment in `00-subset-files-for-ATRT` indicating plan to incorporate SV data/GISTIC output
@jaclyn-taroni jaclyn-taroni merged commit 5f0f7b8 into AlexsLemonade:master Dec 12, 2019
@cbethell cbethell deleted the subset-ATRT-files branch January 14, 2020 14:26
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants