Skip to content
This repository has been archived by the owner on Jun 21, 2023. It is now read-only.

Update integrated dx for 3 samples based on pathology review #879

Merged
merged 5 commits into from
Jan 5, 2021

Conversation

kgaonkar6
Copy link
Collaborator

@kgaonkar6 kgaonkar6 commented Dec 23, 2020

Purpose/implementation Section

What scientific question is your analysis addressing?

  • update integrated_diagnosis column values for samples where initial molecular_subtype was "HGG, H3 wildtype" which got updated to molecular_subtype== "DMG K28M" from clinical review in 02-incorporate-clinical-feedback.Rmd

  • update 03-incorporate-pathology-feedback.Rmd to add Notes to each pathology review updated sample_ids.

What was your approach?

  • I am updating integrated_diagnosis column values for samples where initial molecular_subtype was "HGG, H3 wildtype" OR " HGG, To be classified" which got updated to molecular_subtype== "DMG K28M" from clinical review

  • I'm also adding "Notes" = "Updated via OpenPBTA subtyping" in 03-incorporate-pathology-feedback.Rmd to cover all columns that need to be updated per updated molecular_subtype from pathology review. And specifically, since in v18 samples with "XYZ, To Be classified" the Notes and integrated_diagnosis is left NA (unlike in v17 histology file) so will create duplicate rows in v18 for same bs id (7316-506, 7316-765 and 7316-238) has Notes =c("NA", "Updated via OpenPBTA subtyping")

What GitHub issue does your pull request address?

#878

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

Is there some condition I'm missing in clinical review update in 02?

Is there anything that you want to discuss further?

The LGAT update to "Recurrence" was not expected, it happened because I had v18 files in data release. I've updated to v17 files in the latest commit 3fd999b

Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?

Yes

Results

What types of results are included (e.g., table, figure)?

compiled subtypes with updated int_dx for PNOC003 samples

What is your summary of the results?

Update compiled subtypes

Reproducibility Checklist

  • The dependencies required to run the code in this pull request have been added to the project Dockerfile.
  • This analysis has been added to continuous integration.

Documentation Checklist

  • This analysis module has a README and it is up to date.
  • This analysis is recorded in the table in analyses/README.md and the entry is up to date.
  • The analytical code is documented and contains comments.

@kgaonkar6
Copy link
Collaborator Author

kgaonkar6 commented Dec 23, 2020

hmm not sure why CI is erroring out with the EPN subtyping error that we see in v18 CI runs #876. In the "Checkout code" section of CI it does seem like we are using v17 histology file, oh maybe OPENPBTA_RELEASE=testing now has v18 testing files?

Regarding the actual error that fails CI : I believe (might be wrong) the error occurs because change in pathology_diagnosis from v17 to v18 discussed here: #876 (comment)

@jharenza
Copy link
Collaborator

hmm not sure why CI is erroring out with the EPN subtyping error that we see in v18 CI runs #876. In the "Checkout code" section of CI it does seem like we are using v17 histology file, oh maybe OPENPBTA_RELEASE=testing now has v18 testing files?

Regarding the actual error that fails CI : I believe (might be wrong) the error occurs because change in pathology_diagnosis from v17 to v18 discussed here: #876 (comment)

This looks good to me, but yes, we are now using v18 CI files, which I forgot about. I wonder if you should add then add this as an update within the v18 set of PRs?

Copy link
Collaborator

@jharenza jharenza left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

content looks right

@jaclyn-taroni jaclyn-taroni changed the title Update pathology dx Update integrated dx for 3 samples based on pathology review Dec 30, 2020
@jharenza
Copy link
Collaborator

jharenza commented Jan 4, 2021

Hi @jaclyn-taroni @jashapiro @cansavvy - When looking at the final v18 histologies file, I found the error prompting this PR. We did as a standalone PR instead of within the v18 release PRs since it was a bug in the code. However, it errors out with V18 CI files because of the EPN subset step that @jashapiro has subsequently fixed, but we do not yet have merged.

hmm not sure why CI is erroring out with the EPN subtyping error that we see in v18 CI runs #876. In the "Checkout code" section of CI it does seem like we are using v17 histology file, oh maybe OPENPBTA_RELEASE=testing now has v18 testing files?

The easiest thing may be for us to switch back to v17 CI files to get this run through CI, merge this, switch to v18 CI files (unless you are OK with us merging this without it passing V18 CI, so that we can get this updated into the V18 histologies files. Please advise :).

@cansavvy cansavvy self-requested a review January 5, 2021 13:23
Copy link
Collaborator

@cansavvy cansavvy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems reasonable to me and like it is doing what is described.

@@ -316,6 +323,10 @@ compiled_df <- compiled_df %>%
molecular_subtype = case_when(
sample_id == "7316-1102" ~ "DMG, H3 K28",
TRUE ~ molecular_subtype
),
Notes = case_when(
sample_id == "7316-1102" ~ "Updated via OpenPBTA subtyping",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't quite understand why it looks like there's recoding by individual samples -- it would probably be good to look into a more systematic way to do this, but that should be in a different PR, not now.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This notebook is designed to handle individual cases based on feedback from pathologists / clinical expertise. I think it's possible that coding for individual cases with lots of context is more clear.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants