CNV consensus (8 of 6): Changing Step3 to use Bedtools Subtract #430

nhatduongnn · 2020-01-13T17:14:43Z

Purpose/implementation Section

Increase code run time and efficiency

What GitHub issue does your pull request address?

#128

Which areas should receive a particularly close look?

The changes in the cnv_consensus.tsv. The Manta CNV order changed for one singular consensus CNV call.

Reproducibility Checklist

The dependencies required to run the code in this pull request have been added to the project Dockerfile.
This analysis has been added to continuous integration.

Documentation Checklist

This analysis module has a README and it is up to date.
This analysis is recorded in the table in analyses/README.md and the entry is up to date.
The analytical code is documented and contains comments.

jashapiro

This looks good. I forgot that we had already made the intermediate files tab delimited! The one change is that I think we may as well go ahead and remove the now orphaned get_rid_bad_segments.py script from the repository (it will live on in the git history).

One other related comment: In some discussions around #392, we have decided we would very much like to preserve the cnvkit seg.mean column into this output file. I think this should be as simple as modifying the awk commands to include that column in the output (adding NA, perhaps for the other callers), modifying the first_merge bedtools command to also include that column, and finally altering restructure_column.py to include that column as well.

I don't know how much time you have left to work on this before your break ends, so let me know if you will be able to handle that modification this week, or if it should be handled by one of us.

analyses/copy_number_consensus_call/Snakefile

Co-Authored-By: jashapiro <josh.shapiro@ccdatalab.org>

…lysis into fix_step3

nhatduongnn · 2020-01-13T21:33:41Z

@jashapiro I just got rid of the get_rid_bad_segments.py file. Sad but a necessary step 😃 .

As for the discussion around #392, I just had a look and it seems that, just like you said, we will have to make adjustment to the awk command by giving cnvkit its seg.mean column and give each of the other two callers a column of NA. We then have to change first_merge and restructure_column.py. With this, the format of any CNV in columns 4, 5, and 6 would be chr:start:end:copy_number:seg.mean.

Sadly, my break ended last Monday and I have been back in school for a week now. That's why it took a while for me to make this PR. However, I would like to see my project until the end so I think I can handle the implementation of seg.mean this week and have you look over it. Will this be part (9 of 6)?

jashapiro · 2020-01-13T21:59:01Z

Sadly, my break ended last Monday and I have been back in school for a week now. That's why it took a while for me to make this PR. However, I would like to see my project until the end so I think I can handle the implementation of seg.mean this week and have you look over it. Will this be part (9 of 6)?

Okay, I will look forward to seeing your changes. You can just submit them as a new PR: no need to keep up the numbering if you don't want to!

jashapiro

This looks good! Thank you for your contributions!

Duong and others added 9 commits December 18, 2019 03:51

add to Snakefile

3f55855

resolve conflict

9a923a3

Merge remote-tracking branch 'upstream/master'

d38289c

updating fork

3a20aa0

Merge remote-tracking branch 'upstream/master'

d3d6431

changed output path and name

305bbbf

update Snakefile to master

eec2ffc

add bedtools subtract and new result file

8f3e136

Merge branch 'master' into fix_step3

bad5118

jashapiro reviewed Jan 13, 2020

View reviewed changes

analyses/copy_number_consensus_call/Snakefile Outdated Show resolved Hide resolved

nhatduongnn and others added 3 commits January 13, 2020 16:00

Update analyses/copy_number_consensus_call/Snakefile

e4c19d8

Co-Authored-By: jashapiro <josh.shapiro@ccdatalab.org>

removed get_rid_badsegment.py

20f3278

Merge branch 'fix_step3' of https://github.com/fingerfen/OpenPBTA-ana…

da01485

…lysis into fix_step3

jashapiro approved these changes Jan 13, 2020

View reviewed changes

Merge branch 'master' into fix_step3

8c25fc7

jaclyn-taroni merged commit 998b84e into AlexsLemonade:master Jan 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CNV consensus (8 of 6): Changing Step3 to use Bedtools Subtract #430

CNV consensus (8 of 6): Changing Step3 to use Bedtools Subtract #430

nhatduongnn commented Jan 13, 2020

jashapiro left a comment •

edited

Loading

nhatduongnn commented Jan 13, 2020

jashapiro commented Jan 13, 2020

jashapiro left a comment

CNV consensus (8 of 6): Changing Step3 to use Bedtools Subtract #430

CNV consensus (8 of 6): Changing Step3 to use Bedtools Subtract #430

Conversation

nhatduongnn commented Jan 13, 2020

Purpose/implementation Section

What GitHub issue does your pull request address?

Which areas should receive a particularly close look?

Reproducibility Checklist

Documentation Checklist

jashapiro left a comment • edited Loading

Choose a reason for hiding this comment

nhatduongnn commented Jan 13, 2020

jashapiro commented Jan 13, 2020

jashapiro left a comment

Choose a reason for hiding this comment

jashapiro left a comment •

edited

Loading