-
Notifications
You must be signed in to change notification settings - Fork 83
Proposed Analysis: Add CNS_region mapping in histologies file to OpenPBTA analyses #838
Comments
Is this still a to-do ticket or the base histology already takes care of this? Base histology file seems to already have the "Updated CNS Region" terms assigned. |
The direct matches in the ticket wouldn't satisfy where primary_site values can be assigned to multiple Example list of multiple primary_site values:
|
Converting all of the extraction of data using DBT will mean that |
Closing, as this is now a part of the intermediate D3b workflow prior to release of |
@kgaonkar6 I am going to re-open this, as I think the logic was not captured entirely as anticipated for #849. For instance, some samples which should be mixed are annotated as midline or hemispheric. We can update this in #v19. |
One other thing to note is that for our "Mixed" samples, @jainpayal022 and Cassie Kline will be manually reviewing pathology and imaging reports of about 70 or so HGAT samples to assess the primary site of origin. We will then add these manually curated values for those CNS_regions. See this ticket. |
Adding the code here for CNS_region matching here https://github.com/d3b-center/D3b-codes/blob/master/OpenPBTA_v19_release_QC/QC_histology_v19.Rmd to record |
Just wanted to add the primary site changes from the latest pulls, so guide updates to the above primary_site ~ CNS_region matches
|
I'm not sure where exactly we will be using CNS_regions so just wondering if we have an updated dictionary matching primary_site to CNS_region to incorporate the new primary_sites assigned to PNOC autopsy samples? Currently the CNS_region is being assigned as NA since it doesn't match any terms in the issue description. |
Thank you for checking on this @kgaonkar6 - I had the answers from Cassie, but never posted here. See below with the only one in question being
|
Thank you for the update! |
Confirmed this will remain Mixed
|
Previously the first step of CNS_region assignment is "Mixed" is there are multiple values separated by ";" and then check if all individual values are part of on CNS_region. So the primary_site like https://github.com/d3b-center/D3b-codes/blob/20210212-release/OpenPBTA_v20_release_QC/code/util/primary_site_matched_CNS_region.R do we want to update the code or make these exeptions since "Adjacent" is not a brain location per se? |
To be honest, I was thinking about this and think that whoever ingests this into the BRP should match these regions with CBTN terms so we don't always have to update the terms. Let me check in with Jenn Mason and Shannon Robbins. In the meantime. I was envisioning these going into the JSON file for Midline. |
Just need a confirmation BS_J8EK6RNF seems to have primary_site "Pons" in the latest histology file pulls ( since 20210126-data release ) instead of the "Brain Stem-Medulla;Brain Stem- Midbrain/Tectum;Brain Stem- Pons;Cerebellum/Posterior Fossa;Thalamus" in the table above do we need to confirm this @jharenza ? |
Let me check on this. Are the other samples matching |
yes |
Ok, it looks like |
confirmed that this value should be |
we added this upstream during QC |
What are the scientific goals of the analysis?
Add CNS_region to pbta-histologies.tsv using logic utilized on CHOP D3b end
What methods do you plan to use to accomplish the scientific goals?
Regions: Hemispheric, Midline, Spine, Ventricles, Posterior fossa, Optic pathway, Suprasellar, Other
Utilize this table to perform CNS_region mapping:
Reference notion doc
from @baileyckelly
What input data are required for this analysis?
pbta-histologies.tsv
How long do you expect is needed to complete the analysis? Will it be a multi-step analysis?
0.5 day
Who will complete the analysis (please add a GitHub handle here if relevant)?
@kgaonkar6
What relevant scientific literature relates to this analysis?
NA, informed by physicians: Angela Waanders and Cassie Kline
The text was updated successfully, but these errors were encountered: