-
Notifications
You must be signed in to change notification settings - Fork 83
CNV consensus (8 of 6): Changing Step3 to use Bedtools Subtract #430
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good. I forgot that we had already made the intermediate files tab delimited! The one change is that I think we may as well go ahead and remove the now orphaned get_rid_bad_segments.py
script from the repository (it will live on in the git history).
One other related comment: In some discussions around #392, we have decided we would very much like to preserve the cnvkit seg.mean
column into this output file. I think this should be as simple as modifying the awk
commands to include that column in the output (adding NA, perhaps for the other callers), modifying the first_merge
bedtools command to also include that column, and finally altering restructure_column.py
to include that column as well.
I don't know how much time you have left to work on this before your break ends, so let me know if you will be able to handle that modification this week, or if it should be handled by one of us.
Co-Authored-By: jashapiro <josh.shapiro@ccdatalab.org>
@jashapiro I just got rid of the As for the discussion around #392, I just had a look and it seems that, just like you said, we will have to make adjustment to the Sadly, my break ended last Monday and I have been back in school for a week now. That's why it took a while for me to make this PR. However, I would like to see my project until the end so I think I can handle the implementation of |
Okay, I will look forward to seeing your changes. You can just submit them as a new PR: no need to keep up the numbering if you don't want to! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good! Thank you for your contributions!
Purpose/implementation Section
Increase code run time and efficiency
What GitHub issue does your pull request address?
#128
Which areas should receive a particularly close look?
The changes in the cnv_consensus.tsv. The Manta CNV order changed for one singular consensus CNV call.
Reproducibility Checklist
Documentation Checklist
README
and it is up to date.analyses/README.md
and the entry is up to date.