PBTA Histologies v19 Part 3 of N: Update fusion filtering #1041

kgaonkar6 · 2021-04-27T22:08:18Z

Purpose/implementation Section

What scientific question is your analysis addressing?

Module	Reason	Brief Description	output
`fusion_filtering`	BS_JXF8A2A6 is removed as per #862	Standardizes, filters, and prioritizes fusion calls	`results/pbta-fusion-putative-oncogenic.tsv`(included in data download) `results/pbta-fusion-recurrent-fusion-byhistology.tsv` (included in data download) `results/pbta-fusion-recurrent-fusion-bysample.tsv` (included in data download)

What was your approach?

Just re-run as part of the scripts/run-for-subtyping.sh in kgaonkar6:v19-release in #1028
Then create a new branch for module specific review:
git checkout v19-release analyses/fusion_filtering/

What GitHub issue does your pull request address?

#867

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

Not fully sure why the samples are missing from pbta-fusion-recurrent-fusion-bysample.tsv

Is there anything that you want to discuss further?

Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?

Results

What types of results are included (e.g., table, figure)?

What is your summary of the results?

Reproducibility Checklist

The dependencies required to run the code in this pull request have been added to the project Dockerfile.
This analysis has been added to continuous integration.

Documentation Checklist

This analysis module has a README and it is up to date.
This analysis is recorded in the table in analyses/README.md and the entry is up to date.
The analytical code is documented and contains comments.

jashapiro

There seem to be a lot of lines removed here in the recurrent lists, more than I would have expected. I initially thought this might be due to sample changes in the independent samples analysis, but then I would have expected some additions, not just removals.

FilteredFusion.tsv only seems to remove BS_JXF8A2A6 as expected, but other files have more samples removed, which I was not expecting.

Is it possible that there was a mismatch between the data files & sample list for some part of this module, making it so the changes in #1040 might not be fully reflected here?

One other note: pbta-fusion-putative-oncogenic.tsv has more changes than I would have expected. Spot checking it looks like most of this is ordering changes, but I didn't do a careful analysis.

kgaonkar6 · 2021-04-28T16:13:44Z

Thanks for the review! I was not able to figure out the issue before while filing the PR but I think I figured out:

https://github.com/kgaonkar6/OpenPBTA-analysis/blob/5dba0062ae333a74ff6ee14bea6086fc21703f79/analyses/fusion_filtering/06-recurrent-fusions-per-histology.R#L40-L43

This code needs to be change cohort name from PNOC003 to PNOC, That changed the 15 samples other than BS_JXF8A2A6.

Just wanted to also mention that all in #1028 I actually run the modules related to each subsequent module by running the https://github.com/kgaonkar6/OpenPBTA-analysis/blob/v19-release/scripts/run-for-subtyping.sh so the changes are propogated. But added each module by themselves in PBTA Histologies v19 Part * for easy PR review.

kgaonkar6 · 2021-04-28T18:28:05Z

Updating the code to use PNOC instead of PNOC003 fixed those unwanted changes.

jashapiro

This looks good now, but I'm still a bit confused by the large number of changes in pbta-fusion-putative-oncogenic.tsv As I said, I don't think it is a problem; it seems like there might be just an ordering change there.

If you want to fix it, it looks like just adding an arrange(Sample, FusionName) at line 211 of
05-QC_putative_onco_fusion_dustribution.Rmd would do the trick to prevent this from coming up again in the future, and make it easier to distinguish any changes that do occur after this point. (You might want to fix the typo in the file name while you are at it. 😉 )

jashapiro · 2021-04-28T20:35:13Z

@kgaonkar6 Thanks for the updates in efe4de5... you will want to make sure to delete the old files as well, so GH recognizes it as a move rather than new files.

kgaonkar6 · 2021-04-28T20:43:43Z

Ah yes removed those old files! All these commits for this module and I never saw that file name typo 😅

update fusion filtering v19

bc41ad4

kgaonkar6 mentioned this pull request Apr 27, 2021

PBTA Histologies v19 Part 8 of N: Run molecular subtyping #1028

Merged

5 tasks

kgaonkar6 added review before release v19-release labels Apr 27, 2021

jashapiro reviewed Apr 28, 2021

View reviewed changes

kgaonkar6 mentioned this pull request Apr 28, 2021

V19 ci files #1038

Merged

5 tasks

updated hist

a0f0626

kgaonkar6 requested a review from jashapiro April 28, 2021 18:27

Merge branch 'master' into update-fusion-filtering

9892986

jashapiro approved these changes Apr 28, 2021

View reviewed changes

jashapiro and others added 2 commits April 28, 2021 16:14

Merge branch 'master' into update-fusion-filtering

a469ecd

arrange Sample

efe4de5

kgaonkar6 added 2 commits April 28, 2021 16:42

Delete 05-QC_putative_onco_fusion_dustribution.Rmd

74bb6fc

Delete 05-QC_putative_onco_fusion_dustribution.nb.html

6425376

jashapiro merged commit 03e8360 into AlexsLemonade:master Apr 28, 2021

kgaonkar6 deleted the update-fusion-filtering branch May 11, 2021 13:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PBTA Histologies v19 Part 3 of N: Update fusion filtering #1041

PBTA Histologies v19 Part 3 of N: Update fusion filtering #1041

kgaonkar6 commented Apr 27, 2021 •

edited

Loading

jashapiro left a comment

kgaonkar6 commented Apr 28, 2021 •

edited

Loading

kgaonkar6 commented Apr 28, 2021

jashapiro left a comment

jashapiro commented Apr 28, 2021

kgaonkar6 commented Apr 28, 2021 •

edited

Loading

PBTA Histologies v19 Part 3 of N: Update fusion filtering #1041

PBTA Histologies v19 Part 3 of N: Update fusion filtering #1041

Conversation

kgaonkar6 commented Apr 27, 2021 • edited Loading

Purpose/implementation Section

What scientific question is your analysis addressing?

What was your approach?

What GitHub issue does your pull request address?

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

Is there anything that you want to discuss further?

Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?

Results

What types of results are included (e.g., table, figure)?

What is your summary of the results?

Reproducibility Checklist

Documentation Checklist

jashapiro left a comment

Choose a reason for hiding this comment

kgaonkar6 commented Apr 28, 2021 • edited Loading

kgaonkar6 commented Apr 28, 2021

jashapiro left a comment

Choose a reason for hiding this comment

jashapiro commented Apr 28, 2021

kgaonkar6 commented Apr 28, 2021 • edited Loading

kgaonkar6 commented Apr 27, 2021 •

edited

Loading

kgaonkar6 commented Apr 28, 2021 •

edited

Loading

kgaonkar6 commented Apr 28, 2021 •

edited

Loading