Skip to content
This repository has been archived by the owner on Jun 21, 2023. It is now read-only.

PBTA Histologies: Fusion filtering base (3 of N) #865

Merged
merged 14 commits into from
Jan 6, 2021

Conversation

kgaonkar6
Copy link
Collaborator

@kgaonkar6 kgaonkar6 commented Dec 7, 2020

Purpose/implementation Section

What scientific question is your analysis addressing?

Module Reason Brief Description output
fusion_filtering adding 8 samples #749 and used in subtyping Standardizes, filters, and prioritizes fusion calls updated in #808 and #830 results/pbta-fusion-putative-oncogenic.tsv(included in data download)
results/pbta-fusion-recurrent-fusion-byhistology.tsv (included in data download)
results/pbta-fusion-recurrent-fusion-bysample.tsv (included in data download)
results/fusion_summary_lgat_foi.tsv (included in data download)

What was your approach?

Added condition to use pbta-histologies-base.tsv if running module for subtyping to 04-project-specific-filtering.Rmd,
05-QC_putative_onco_fusion_dustribution.Rmd and run_fusion_merged.sh.

What GitHub issue does your pull request address?

#861

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

Broad_histology column was updated in v18 changes in output files requiring broad_histology are seen in pbta-fusion-recurrent-fusion-byhistology.tsv. Some samples that were Neuronal and mixed neuronal-glial tumor are now grouped with Low-grade astrocytic tumor . CC @jharenza for additional info if required for review.

Other modules that need to be rerun for subtyping are in #861

Results

What types of results are included (e.g., table, figure)?

table

What is your summary of the results?

8 new stranded samples were added to the module and broad_histology were updated to match v18 base histology.

Reproducibility Checklist

  • The dependencies required to run the code in this pull request have been added to the project Dockerfile.
  • This analysis has been added to continuous integration.

Documentation Checklist

  • This analysis module has a README and it is up to date.
  • This analysis is recorded in the table in analyses/README.md and the entry is up to date.
  • The analytical code is documented and contains comments.

@jharenza jharenza self-requested a review December 8, 2020 02:05
@jharenza
Copy link
Collaborator

jharenza commented Dec 8, 2020

Broad_histology column was updated in v18 changes in output files requiring broad_histology are seen in pbta-fusion-recurrent-fusion-byhistology.tsv. Some samples that were Neuronal and mixed neuronal-glial tumor are now grouped with Low-grade astrocytic tumor . CC @jharenza for additional info if required for review.

With #793, we added Ganglioglioma to LGAT subtyping and with the V18 release, we updated the broad_histology to Low-grade astrocytic tumor instead of Neuronal and mixed neuronal-glial tumor, per discussions with Cassie Kline, so these changes are expected.

Copy link
Collaborator

@jharenza jharenza left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This also looks good and runs as expected.

Copy link
Collaborator

@jharenza jharenza left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks for the update to the README. looks good!

@jharenza jharenza changed the title Fusion filtering base PBTA Histologies: Fusion filtering base (3 of N) Dec 9, 2020
@jaclyn-taroni jaclyn-taroni removed their request for review December 13, 2020 20:46
Copy link
Collaborator

@cansavvy cansavvy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good. My review is mainly two comments:

  1. I have the same comment about your if statements. If you just establish a histology file variable with your if statement, then you can have the main code written once. Since it looks like the histology file used is the only difference.
  2. I don't know enough about how we expected the base histology file to change these results, but as long as everything here is expected and makes sense then 👍.

@@ -88,8 +96,13 @@ fusion_calls<-QCGeneFiltered_filtFusion %>% mutate(FusionName=rm_between(.data$F
group<-params$group

# get histology file
if ( params$base_run ==0 ){
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like the only difference here is whether params$base_histology or params$histology is used, so I would reduce your if statement to specify a histology_file variable and then supply that to that code chunk rather than repeating the code chunk twice.

@@ -69,8 +77,14 @@ fusion_calls<-read_tsv(file.path(root_dir,params$dataPutativeFusion))
outputfolder<-params$outputfolder

#### get histology file
if ( params$base_run ==0 ){
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same comment here!

@@ -1,6 +1,5 @@
FusionName broad_histology count
KIAA1549--BRAF Low-grade astrocytic tumor 109
KIAA1549--BRAF Low-grade astrocytic tumor 114
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm assuming these results changes are expected, but just going to bring them to attention so we can check.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yup, these are expected since some samples were updated to Low-grade. More info in Jo Lynne's comment here #865 (comment)

@@ -26,7 +31,11 @@ set.seed(201910)

```{r load files}
root_dir <- rprojroot::find_root(rprojroot::has_dir(".git"))
if(params$base_run == 0) {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same comment here about if statements and making it so you don't have to repeat the same code.


if [[ RUN_FOR_SUBTYPING == "0" ]]
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same comment here about if statements. You can name a variable so you don't have to repeat whole chunks of code.

@kgaonkar6 kgaonkar6 deleted the fusion_filtering_base branch January 22, 2021 21:17
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants