CN Status Heatmap (PR 2 of 2) #603

cansavvy · 2020-03-04T21:11:40Z

Purpose/implementation Section

What scientific question is your analysis addressing?

We wanted a summary visualization of copy number status.

This second PR has the main notebook where the functions from the previous PR are implemented.

What was your approach?

Create a summary heatmap of copy number status from the consensus CNV call data.
This is done by binning the genome and calculating the segment's coverage of the
CNV consensus segments.
A bin is declared a particular copy number status if that status's base pair
coverage is a certain threshold percentage larger than the other statuses'
coverage.

What GitHub issue does your pull request address?

Issue: #594

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

Plot coordinates look wrong. Unsure why.
Do the cuttoff's seem reasonable?
Keep in mind colors will need to be updated with (Updated analysis: Unified color palette for plots #510) but are there other aesthetic issues with the plot?
Any gotchas?

Results

See the rendered notebook here:
https://cansavvy.github.io/openpbta-notebook-concept/cnv-chrom-plot/cn_status_heatmap.nb.html

Reproducibility Checklist

The dependencies required to run the code in this pull request have been added to the project Dockerfile.
This analysis has been added to continuous integration.

Documentation Checklist

**These items already existed but have been updated in #602

This analysis module has a README and it is up to date.
This analysis is recorded in the table in analyses/README.md and the entry is up to date.
The analytical code is documented and contains comments.

…s-heatmap

…tus-heatmap-1

…tus-heatmap

cbethell

Looks good @cansavvy!

I have some comments below around clarifying some steps.

analyses/cnv-chrom-plot/cn_status_heatmap.Rmd

cbethell

Hi @cansavvy, this looks just about ready to merge to me!

I did however notice that the bottom CN status annotation bar on the final heatmap does not appear to reflect all of the calls (it appears to reflect only losses and gains although the legend also includes neutral, unstable and uncallable).

That being said, is the bottom annotation based on the majority call for a whole chromosome? If so, would it be possible to add the chromosome labels to make it clear that the bottom annotation bar is relevant to each individual chromosome (although I am sure that this will be included in the figure description).

I am going to approve this PR because I believe it can be merged without said labels 👍 although they would be nice.

cansavvy · 2020-05-12T12:38:40Z

I did however notice that the bottom CN status annotation bar on the final heatmap does not appear to reflect all of the calls (it appears to reflect only losses and gains although the legend also includes neutral, unstable and uncallable).

I'm not seeing this? When I'm looking at the plot it looks like all of the labels are in the legend? Sometimes the Markdown document cuts off part of it in the preview, but if you look at the pdf, it looks fine.

cbethell · 2020-05-12T13:24:22Z

I'm not seeing this? When I'm looking at the plot it looks like all of the labels are in the legend? Sometimes the Markdown document cuts off part of it in the preview, but if you look at the pdf, it looks fine.

The legend looks fine, I am referring to the bottom annotation red and blue bar. That bottom bar reflects the chromosomal majority CN status, is that correct?

cansavvy · 2020-05-14T12:21:26Z

analyses/cnv-chrom-plot/util/bin-coverage.R

-        frac_loss > threshold ~ "loss",
-        frac_neutral > threshold ~ "neutral",
-        TRUE ~ "unstable"
+        frac_uncallable > frac_uncallable_val ~ "uncallable",


Is this what we wanted here? Or should we keep what's in master?

It looks like this is correct, as it now matches the function arguments. However, the docs for the function should also be changed to match.

Unless you were looking at the order of the final two options, but I think that what you have is correct, because you got strange results the other way? I don't fully remember.

Yes. I think we had settled on this. https://alexslemonade.slack.com/archives/CNH4FND1C/p1586786889017900

Will make sure docs are updated though.

jashapiro · 2020-05-14T13:42:36Z

I'm not seeing this? When I'm looking at the plot it looks like all of the labels are in the legend? Sometimes the Markdown document cuts off part of it in the preview, but if you look at the pdf, it looks fine.

The legend looks fine, I am referring to the bottom annotation red and blue bar. That bottom bar reflects the chromosomal majority CN status, is that correct?

I think what you are looking at is just the visual cue for where each chromosome starts and ends. Perhaps the color should be changed to avoid confusion with calls though.

jashapiro

This looks good! A few minor comments, and one with substance: lets see what happens if we upgrade the resolution of the final figure.

jashapiro · 2020-05-15T17:21:05Z

analyses/cnv-chrom-plot/cn_status_heatmap.Rmd

+seg_data <- data.table::fread(file.path(
+  input_dir,
+  "pbta-cnv-consensus.seg.gz"
+),
+data.table = FALSE
+)


Indentation here is strange.

Suggested change

seg_data <- data.table::fread(file.path(

input_dir,

"pbta-cnv-consensus.seg.gz"

),

data.table = FALSE

)

seg_data <- data.table::fread(

file.path(

input_dir,

"pbta-cnv-consensus.seg.gz"

),

data.table = FALSE

)

I ran styler on it. It doesn't always make perfect choices.

Bad styler, no biscuit!

jashapiro · 2020-05-15T17:23:12Z

analyses/cnv-chrom-plot/cn_status_heatmap.Rmd

+seg_data <- seg_data %>%
+  # Join the histology column to this data
+  dplyr::inner_join(dplyr::select(
+    metadata,
+    "Kids_First_Biospecimen_ID",
+    "short_histology",
+    "tumor_ploidy"
+  ),
+  by = c("ID" = "Kids_First_Biospecimen_ID")


Same thing... try not to open 2 sets of parens on the same line that don't then close together, as it makes for non-semantic indentation.

Suggested change

seg_data <- seg_data %>%

# Join the histology column to this data

dplyr::inner_join(dplyr::select(

metadata,

"Kids_First_Biospecimen_ID",

"short_histology",

"tumor_ploidy"

),

by = c("ID" = "Kids_First_Biospecimen_ID")

seg_data <- seg_data %>%

# Join the histology column to this data

dplyr::inner_join(

dplyr::select(

metadata,

"Kids_First_Biospecimen_ID",

"short_histology",

"tumor_ploidy"

),

by = c("ID" = "Kids_First_Biospecimen_ID")

jashapiro · 2020-05-15T17:36:07Z

analyses/cnv-chrom-plot/cn_status_heatmap.Rmd

+  show_row_names = FALSE,
+  bottom_annotation = chr_annot,
+  right_annotation = hist_annot,
+  heatmap_legend_param = list(nrow = 1)


The heatmap itself is drawn rasterized, which is mostly fine, but may be a bit lower resolution than ideal. Can we see what happens if we bump it up, both to file size and image quality? You might even go higher than this, depending on results. It is a bit hard to interpret the docs https://jokergoo.github.io/ComplexHeatmap-reference/book/a-single-heatmap.html#heatmap-as-raster-image, but if the general setting is equivalent to 72dpi, we might want to go as far as 4 here.

Suggested change

heatmap_legend_param = list(nrow = 1)

heatmap_legend_param = list(nrow = 1),

raster_quality = 2

jashapiro

Looks sharp!

This reverts commit 9bc1a10.

cansavvy added 25 commits March 3, 2020 16:24

The basic set up is there. Needs more work

97da925

Push this version of the heatmap though its gonna change

73b6292

Sort of working

99bd2d5

Merge remote-tracking branch 'origin/cn-status-heatmap' into cn-statu…

29aa5d9

…s-heatmap

Neatened up things

f441b27

Sorted out a few error handling items

dd63b5f

Almost there

574a75e

It's working. Needs more documentation and tweaks

f16075a

documentation!!

73c9c7d

Organize and make functions in their own util folder

ebba0d7

Add length filter and fix error

0b1219c

Merge branch 'cn-status-heatmap-1' into cn-status-heatmap

1791b6d

Refreshy the notebook

cf93927

Merge branch 'cn-status-heatmap' into cn-status-heatmap-1

9b71c3e

Add to CircleCI

6b69b8e

Merge branch 'cn-status-heatmap' into cn-status-heatmap-1

dfec47c

Streamline the PR to functions and README

2c9d4dd

Minor updates to READMEs

1a9b4dc

Update some minor comments and etc.

80ddd77

Merge branch 'master' into cn-status-heatmap-1

9f0b086

Merge remote-tracking branch 'upstream/master' into cn-status-heatmap-1

2709999

Merge remote-tracking branch 'origin/cn-status-heatmap-1' into cn-sta…

9927c61

…tus-heatmap-1

Merge branch 'master' into cn-status-heatmap

a34cddd

extra space in config file

691cb10

extra space in config file

95693cc

cansavvy added the work in progress Used to label (non-draft) pull requests that are not yet ready for review label Mar 4, 2020

cansavvy added 4 commits March 4, 2020 16:46

fix a minor error.

09f166c

Forgot this should not be in CI until next PR

3c16dc7

refreshing notebook

c735a62

Found some of the indexing problems I was having

06333b0

cansavvy added 4 commits April 13, 2020 09:47

Merge branch 'master' into cn-status-heatmap

054a059

Resolve reordering error

df23c14

Merge remote-tracking branch 'cansavvy/cn-status-heatmap' into cn-sta…

af29757

…tus-heatmap

Switch logic so neutral is default.

fe9b404

cansavvy requested review from jashapiro and cbethell and removed request for jashapiro April 13, 2020 19:29

cbethell reviewed Apr 17, 2020

View reviewed changes

cansavvy added 2 commits April 20, 2020 08:55

Incorporate @cbethell 's suggestions

2321c24

Merge branch 'master' into cn-status-heatmap

43fbcd1

cansavvy requested a review from cbethell April 21, 2020 13:31

cbethell approved these changes May 11, 2020

View reviewed changes

jaclyn-taroni requested a review from jashapiro May 12, 2020 12:18

cansavvy added 2 commits May 12, 2020 11:26

Heatmap legend is in right spot but not horizontal

6fb86c9

Horizontal Legend!!!

b999962

cansavvy commented May 14, 2020

View reviewed changes

linter it

9226a78

jashapiro and others added 2 commits May 15, 2020 09:21

Merge branch 'master' into cn-status-heatmap

7681562

Docs clarifications on thresholds and change chr back to original colors

3d0172b

jashapiro reviewed May 15, 2020

View reviewed changes

cansavvy added 2 commits May 15, 2020 14:16

Merge branch 'master' into cn-status-heatmap

0ec01d5

Up the resolution

d2051a2

jashapiro approved these changes May 15, 2020

View reviewed changes

Merge branch 'master' into cn-status-heatmap

8bbc84c

jashapiro merged commit 9bc1a10 into AlexsLemonade:master May 15, 2020

jashapiro added a commit that referenced this pull request May 15, 2020

Revert "CN Status Heatmap (PR 2 of 2) (#603)"

848ffa1

This reverts commit 9bc1a10.

cansavvy deleted the cn-status-heatmap branch August 13, 2020 11:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CN Status Heatmap (PR 2 of 2) #603

CN Status Heatmap (PR 2 of 2) #603

cansavvy commented Mar 4, 2020 •

edited

Loading

cbethell left a comment

cbethell left a comment

cansavvy commented May 12, 2020

cbethell commented May 12, 2020

cansavvy May 14, 2020

jashapiro May 14, 2020 •

edited

Loading

jashapiro May 14, 2020

cansavvy May 15, 2020

jashapiro commented May 14, 2020

jashapiro left a comment

jashapiro May 15, 2020

cansavvy May 15, 2020

jashapiro May 15, 2020

jashapiro May 15, 2020

jashapiro May 15, 2020 •

edited

Loading

jashapiro left a comment

	heatmap_legend_param = list(nrow = 1)
	heatmap_legend_param = list(nrow = 1),
	raster_quality = 2

CN Status Heatmap (PR 2 of 2) #603

CN Status Heatmap (PR 2 of 2) #603

Conversation

cansavvy commented Mar 4, 2020 • edited Loading

Purpose/implementation Section

What scientific question is your analysis addressing?

What was your approach?

What GitHub issue does your pull request address?

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

Results

Reproducibility Checklist

Documentation Checklist

cbethell left a comment

Choose a reason for hiding this comment

cbethell left a comment

Choose a reason for hiding this comment

cansavvy commented May 12, 2020

cbethell commented May 12, 2020

cansavvy May 14, 2020

Choose a reason for hiding this comment

jashapiro May 14, 2020 • edited Loading

Choose a reason for hiding this comment

jashapiro May 14, 2020

Choose a reason for hiding this comment

cansavvy May 15, 2020

Choose a reason for hiding this comment

jashapiro commented May 14, 2020

jashapiro left a comment

Choose a reason for hiding this comment

jashapiro May 15, 2020

Choose a reason for hiding this comment

cansavvy May 15, 2020

Choose a reason for hiding this comment

jashapiro May 15, 2020

Choose a reason for hiding this comment

jashapiro May 15, 2020

Choose a reason for hiding this comment

jashapiro May 15, 2020 • edited Loading

Choose a reason for hiding this comment

jashapiro left a comment

Choose a reason for hiding this comment

cansavvy commented Mar 4, 2020 •

edited

Loading

jashapiro May 14, 2020 •

edited

Loading

jashapiro May 15, 2020 •

edited

Loading