Update focal-cn-preparation to use consensus SEG file in data download #1130

jaclyn-taroni · 2021-08-09T20:20:03Z

The focal-cn-preparation module was taking over an hour to run in CI. This is because it was using the consensus SEG file that is committed to the repository, which is not for a subset of samples but instead contains all samples. With the release of v20, there's no reason to continue to use the file in analyses/copy_number_consensus_call/results so here I am changing the path in the second notebook of focal-cn-preparation to use the file in the data download. In the context of CI, this will be the subset file.

Closes #1129.

jaclyn-taroni · 2021-08-09T22:22:16Z

analyses/focal-cn-file-preparation/02-add-ploidy-consensus.Rmd

-# TODO: the consensus SEG file is not currently in the data download -- when it
-# gets included we will have to change the file path here
-consensus_seg_file <- file.path("..", "copy_number_consensus_call", "results", 
+consensus_seg_file <- file.path("..", "..", "data", 


@jharenza and @kgaonkar6 - is there any reason (that you are aware of) that prohibits me from making this change at this point?

I do not think so - the latest file is in the download.

We probably need to wait until all the v20 CNV changes go through though? Or would just getting #1123 through -> updating the release (if not done yet) be sufficient?

I've updated v20 release data with the latest consensus seg with #1123 b9284650be04df3538e6c6dba29b8eb0 pbta-cnv-consensus.seg.gz

I added the relative path so that we can run the code while running all the preprocessing steps of subtyping, so maybe we can add a logic to change to relative path or not within the if (params$base_run ==0)

OpenPBTA-analysis/analyses/focal-cn-file-preparation/02-add-ploidy-consensus.Rmd

Lines 48 to 58 in a6281c2

```{r}

# TODO: the consensus SEG file is not currently in the data download -- when it

# gets included we will have to change the file path here

consensus_seg_file <- file.path("..", "copy_number_consensus_call", "results",

"pbta-cnv-consensus.seg.gz")

if ( params$base_run ==0 ){

histologies_file <- file.path("..", "..", "data", "pbta-histologies.tsv")

} else {

histologies_file <- file.path("..", "..", "data", "pbta-histologies-base.tsv")

}

In which scenario would you use the file in analyses/copy_number_consensus_call/results? When you use pbta-histologies-base.tsv?

yes, we will rerun the consensus seg file module first so the latest consensus seg file can be used as input for the focal-cn module

This reverts commit e12a34e.

jaclyn-taroni · 2021-08-10T23:39:13Z

I'm not going to update the HTML for this notebook, because I expect that will happen as part of getting #1124 through anyway.

jaclyn-taroni · 2021-08-11T10:56:17Z

It is curious, though, that the consensus CN steps happens right before this one and I would expect then that the file in results for that module would then be from the subset of samples. There might be something snakemake-y happening that I don't understand.

jaclyn-taroni · 2021-08-11T14:31:15Z

Closing in favor of #1124

jaclyn-taroni added 15 commits January 11, 2021 09:26

Merge remote-tracking branch 'upstream/master'

0fe0890

Merge remote-tracking branch 'upstream/master'

7c0997c

Merge remote-tracking branch 'upstream/master'

8727fac

Merge remote-tracking branch 'upstream/master'

e7f1d0a

Merge remote-tracking branch 'upstream/master'

a199d3c

Merge remote-tracking branch 'upstream/master'

8431b1e

Merge branch 'AlexsLemonade:master' into master

c29d225

Merge branch 'AlexsLemonade:master' into master

5292b8e

Merge branch 'AlexsLemonade:master' into master

6175d61

Merge remote-tracking branch 'upstream/master'

43830e8

Merge branch 'AlexsLemonade:master' into master

ed4bb16

Merge remote-tracking branch 'upstream/master'

0a5093a

Merge remote-tracking branch 'upstream/master'

d9e5645

Try using the consensus seg in the download file

c1a5c8c

Comment out other steps

e12a34e

jaclyn-taroni commented Aug 9, 2021

View reviewed changes

jaclyn-taroni mentioned this pull request Aug 10, 2021

V20 CNV update part1: Update overlap criteria for consensus CNV #1123

Merged

5 tasks

jaclyn-taroni added 2 commits August 10, 2021 19:32

Merge branch 'master' into jaclyn-taroni/fix-focal-long-running

1ddda21

Revert "Comment out other steps"

9f1160a

This reverts commit e12a34e.

jaclyn-taroni changed the title ~~DRAFT Jaclyn taroni/fix focal long running~~ Update focal-cn-preparation to use consensus SEG file in data download Aug 10, 2021

jaclyn-taroni marked this pull request as ready for review August 10, 2021 23:37

jaclyn-taroni requested a review from jharenza August 10, 2021 23:38

jaclyn-taroni requested a review from jashapiro August 11, 2021 10:56

jaclyn-taroni mentioned this pull request Aug 11, 2021

V20 CNV update part2: focal cnv and oncoprint rerun #1124

Merged

5 tasks

jaclyn-taroni closed this Aug 11, 2021

jaclyn-taroni deleted the jaclyn-taroni/fix-focal-long-running branch August 11, 2021 14:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update focal-cn-preparation to use consensus SEG file in data download #1130

Update focal-cn-preparation to use consensus SEG file in data download #1130

jaclyn-taroni commented Aug 9, 2021 •

edited

Loading

jaclyn-taroni Aug 9, 2021

jharenza Aug 9, 2021

jaclyn-taroni Aug 10, 2021 •

edited

Loading

kgaonkar6 Aug 11, 2021

jaclyn-taroni Aug 11, 2021

kgaonkar6 Aug 11, 2021

jaclyn-taroni commented Aug 10, 2021

jaclyn-taroni commented Aug 11, 2021

jaclyn-taroni commented Aug 11, 2021

	```{r}
	# TODO: the consensus SEG file is not currently in the data download -- when it
	# gets included we will have to change the file path here
	consensus_seg_file <- file.path("..", "copy_number_consensus_call", "results",
	"pbta-cnv-consensus.seg.gz")
	if ( params$base_run ==0 ){
	histologies_file <- file.path("..", "..", "data", "pbta-histologies.tsv")
	} else {
	histologies_file <- file.path("..", "..", "data", "pbta-histologies-base.tsv")
	}

Update focal-cn-preparation to use consensus SEG file in data download #1130

Update focal-cn-preparation to use consensus SEG file in data download #1130

Conversation

jaclyn-taroni commented Aug 9, 2021 • edited Loading

jaclyn-taroni Aug 9, 2021

Choose a reason for hiding this comment

jharenza Aug 9, 2021

Choose a reason for hiding this comment

jaclyn-taroni Aug 10, 2021 • edited Loading

Choose a reason for hiding this comment

kgaonkar6 Aug 11, 2021

Choose a reason for hiding this comment

jaclyn-taroni Aug 11, 2021

Choose a reason for hiding this comment

kgaonkar6 Aug 11, 2021

Choose a reason for hiding this comment

jaclyn-taroni commented Aug 10, 2021

jaclyn-taroni commented Aug 11, 2021

jaclyn-taroni commented Aug 11, 2021

jaclyn-taroni commented Aug 9, 2021 •

edited

Loading

jaclyn-taroni Aug 10, 2021 •

edited

Loading