-
Notifications
You must be signed in to change notification settings - Fork 83
Update integrated dx for 3 samples based on pathology review #879
Update integrated dx for 3 samples based on pathology review #879
Conversation
hmm not sure why CI is erroring out with the EPN subtyping error that we see in v18 CI runs #876. In the "Checkout code" section of CI it does seem like we are using v17 histology file, oh maybe OPENPBTA_RELEASE=testing now has v18 testing files? Regarding the actual error that fails CI : I believe (might be wrong) the error occurs because change in pathology_diagnosis from v17 to v18 discussed here: #876 (comment) |
This looks good to me, but yes, we are now using v18 CI files, which I forgot about. I wonder if you should add then add this as an update within the v18 set of PRs? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
content looks right
Hi @jaclyn-taroni @jashapiro @cansavvy - When looking at the final v18 histologies file, I found the error prompting this PR. We did as a standalone PR instead of within the v18 release PRs since it was a bug in the code. However, it errors out with V18 CI files because of the EPN subset step that @jashapiro has subsequently fixed, but we do not yet have merged.
The easiest thing may be for us to switch back to v17 CI files to get this run through CI, merge this, switch to v18 CI files (unless you are OK with us merging this without it passing V18 CI, so that we can get this updated into the V18 histologies files. Please advise :). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems reasonable to me and like it is doing what is described.
@@ -316,6 +323,10 @@ compiled_df <- compiled_df %>% | |||
molecular_subtype = case_when( | |||
sample_id == "7316-1102" ~ "DMG, H3 K28", | |||
TRUE ~ molecular_subtype | |||
), | |||
Notes = case_when( | |||
sample_id == "7316-1102" ~ "Updated via OpenPBTA subtyping", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't quite understand why it looks like there's recoding by individual samples -- it would probably be good to look into a more systematic way to do this, but that should be in a different PR, not now.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This notebook is designed to handle individual cases based on feedback from pathologists / clinical expertise. I think it's possible that coding for individual cases with lots of context is more clear.
Purpose/implementation Section
What scientific question is your analysis addressing?
update integrated_diagnosis column values for samples where initial molecular_subtype was "HGG, H3 wildtype" which got updated to molecular_subtype== "DMG K28M" from clinical review in 02-incorporate-clinical-feedback.Rmd
update 03-incorporate-pathology-feedback.Rmd to add Notes to each pathology review updated sample_ids.
What was your approach?
I am updating integrated_diagnosis column values for samples where initial molecular_subtype was "HGG, H3 wildtype" OR " HGG, To be classified" which got updated to molecular_subtype== "DMG K28M" from clinical review
I'm also adding "Notes" = "Updated via OpenPBTA subtyping" in 03-incorporate-pathology-feedback.Rmd to cover all columns that need to be updated per updated molecular_subtype from pathology review. And specifically, since in v18 samples with "XYZ, To Be classified" the Notes and integrated_diagnosis is left NA (unlike in v17 histology file) so will create duplicate rows in v18 for same bs id (7316-506, 7316-765 and 7316-238) has Notes =c("NA", "Updated via OpenPBTA subtyping")
What GitHub issue does your pull request address?
#878
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
Is there some condition I'm missing in clinical review update in 02?
Is there anything that you want to discuss further?
The LGAT update to "Recurrence" was not expected, it happened because I had v18 files in data release. I've updated to v17 files in the latest commit 3fd999b
Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?
Yes
Results
What types of results are included (e.g., table, figure)?
compiled subtypes with updated int_dx for PNOC003 samples
What is your summary of the results?
Update compiled subtypes
Reproducibility Checklist
Documentation Checklist
README
and it is up to date.analyses/README.md
and the entry is up to date.