Skip to content
This repository has been archived by the owner on Jun 21, 2023. It is now read-only.

Updated analysis: update some figures to include annotations by cancer_group for manuscript #1144

Closed
3 of 4 tasks
jharenza opened this issue Aug 18, 2021 · 9 comments
Closed
3 of 4 tasks
Assignees
Labels
updated analysis work in progress Used to label (non-draft) pull requests that are not yet ready for review

Comments

@jharenza
Copy link
Collaborator

jharenza commented Aug 18, 2021

What analysis module should be updated and why? What changes need to be made? Please provide enough detail for another participant to make the update.

  • interaction-plots - update plotting to be by cancer_group
  • telomerase-activity-prediction - change broad_histology and short_histology plots to cancer_group for the main figure. Could keep all in the analysis module
  • transcriptomic-dimension-reduction - color by cancer_group
  • sample-distribution-analysis - add second level with cancer_group, 3rd/4th/5th with WGS/WXS/RNA, 7th with tumor descriptor, 8th with sex

What input data should be used? Which data were used in the version being updated?

v20

When do you expect the revised analysis will be completed?

this and next week

Who will complete the updated analysis?

@kgaonkar6

Note, ticket still a WIP

@jharenza jharenza added updated analysis work in progress Used to label (non-draft) pull requests that are not yet ready for review labels Aug 18, 2021
@kgaonkar6
Copy link
Collaborator

kgaonkar6 commented Aug 19, 2021

While working in the interaction plots I've identified some points that I'd like to get clarification on:

  1. Do we want to plot coocurrence plot per each cancer_group corresponding to the broad_histology here
    declare -A disease
    disease[Diffuse-astrocytic-and-oligodendroglial-tumor]="Diffuse astrocytic and oligodendroglial tumor"
    if [ "$ALL" -gt "0" ]; then
    disease[LGAT]="Low-grade astrocytic tumor"
    disease[Tumors-of-sellar-region]="Tumors of sellar region"
    disease[Embryonal-tumor]="Embryonal tumor"
    disease[Ependymal-tumor]="Ependymal tumor"
    disease[Meningioma]="Meningioma"
    disease[Neuronal-mixed-neuronal-glial-tumor]="Neuronal and mixed neuronal-glial tumor"
    fi
    ? Asking because dividing some cancer_groups only have a couple of samples
             broad_histology                        cancer_group   n
1 Low-grade astrocytic tumor      Diffuse fibrillary astrocytoma   7
2 Low-grade astrocytic tumor                       Ganglioglioma   1
3 Low-grade astrocytic tumor                 Gliomatosis cerebri   1
4 Low-grade astrocytic tumor                                 GNG  47
5 Low-grade astrocytic tumor                   Oligodendroglioma   1
6 Low-grade astrocytic tumor               Pilocytic astrocytoma 123
7 Low-grade astrocytic tumor       Pleomorphic xanthoastrocytoma  11
8 Low-grade astrocytic tumor Subependymal Giant Cell Astrocytoma   9

And since currently only genes that are mutated in minimum 5 samples are kept for plotting we end up erroring out on some of the smaller cancer_groups. So we can either drop them with a cutoff of samples or plot cooccurrence by broad_histology but plot the cancer_group in the barplots?

@jharenza
Copy link
Collaborator Author

@kgaonkar6 I was envisioning only the barplot would be changed to cancer group, but the analysis would remain as is.

jharenza pushed a commit that referenced this issue Aug 27, 2021
* update with v21

* adding cancer_group

* removing NAs

* rerun with v21

* update with more hst

* add hex codes

Co-authored-by: kgaonkar6 <gaonkark@chop.edu>
@kgaonkar6 kgaonkar6 self-assigned this Aug 30, 2021
@kgaonkar6
Copy link
Collaborator

For transcriptomic-dimension-reduction I get an error saying
Error: Insufficient values in manual scale. 58 needed but only 26 provided.

The manual scale has 26 colors here, do we want to add the 58 colors from the figures palette for cancer_group here?
https://github.com/jaclyn-taroni/OpenPBTA-analysis/blob/1ac140e1d93fcb9615d57265f9a11c55c9f4bc74/analyses/transcriptomic-dimension-reduction/util/dimension-reduction-functions.R#L256-L265

@jaclyn-taroni
Copy link
Member

@kgaonkar6 do you need to alter the module itself or would be altering the figure generation steps to use an updated palette be sufficient for what you're trying to accomplish (e.g., in here)?

@kgaonkar6
Copy link
Collaborator

umm good question, I thought we want to update the modules in this ticket and the figures/ will be recreated in #1132.
Let me know if I mis-understood.

@jaclyn-taroni
Copy link
Member

That question probably wasn't for me, but I'd say that I don't see any reason to update that module if it's only for display purposes. In fact, keeping figures/ separate is handy for this kind of tweak as it keeps the final touches of display separate from the analysis which can sometimes be exploratory in nature & is nice to have preserved.

For interaction plots, the figure is just copied from the module so changing the module is the right call:

## Interaction plots

@jaclyn-taroni
Copy link
Member

I'll also add: I'm not sure making a scatter plot with a 58 color palette is going to be helpful to or discernible by our readers?

@kgaonkar6
Copy link
Collaborator

That makes sense @jaclyn-taroni!

From additional discussions with @jharenza we decided to:

  • update the TelomeraseActivities.R and transcriptomic-overview.R in the figures folder itself with the cancer_group update, and if that look too busy then revert back to the original figures.
  • copy over figure for sample distribution from sample-distribution-analysis with updated code in #1144 part2: Sample distribution multi-layer plot #1161

jaclyn-taroni added a commit that referenced this issue Sep 1, 2021
* adding cancer_group update

* comment update

* replaced / with ; in display_group

* add adiitinal hex_codes for cancer_group

* add comments for cancer_group filtering

* Update figures/mapping-histology-labels.Rmd

Co-authored-by: Jo Lynne Rokita <jharenza@gmail.com>

* NA cancer_group as gray

* Update figures/mapping-histology-labels.Rmd

Co-authored-by: Jo Lynne Rokita <jharenza@gmail.com>

* Update figures/mapping-histology-labels.Rmd

Co-authored-by: Jo Lynne Rokita <jharenza@gmail.com>

* Update figures/mapping-histology-labels.Rmd

Co-authored-by: Jo Lynne Rokita <jharenza@gmail.com>

* comment to remove NA in color assignment

* comment to replace / with or

* updated colors

* new colors and v21

* updated distribution

* sample distribution multi-layer

* adding available and unavailble status

* Update README.md

* removing new file for updated 02

* add sample distribution only

* add distribution tree

* update with v21

* update colors

* rerun with latest hex codes

* add hex color update

* rerun

* remove old files

* fix color assignment

* fix prog colors

* add palette in figure folder

* remove temp QC RDS

* delete wrong filename

* rerun without devel temp outputs

* Delete tm.RDS

* rerun

* update DNA-Seq and color_pal script

* typo fix

* remove figures sample dist script

* Delete fig1-sample-distribution.R

* Update README.md

* add tumor_descriptor description

* Update .circleci/config.yml

Co-authored-by: Ubuntu <ubuntu@ip-10-21-3-107.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-10-21-3-207.ec2.internal>
Co-authored-by: Jo Lynne Rokita <jharenza@gmail.com>
Co-authored-by: Ubuntu <ubuntu@ip-10-21-3-40.ec2.internal>
Co-authored-by: krutika <krutika@krutikas-MacBook-Air.local>
Co-authored-by: kgaonkar6 <gaonkark@chop.edu>
Co-authored-by: Jo Lynne Rokita <jolynnerokita@d3b.center>
Co-authored-by: Jaclyn Taroni <jaclyn.n.taroni@gmail.com>
jaclyn-taroni added a commit that referenced this issue Sep 1, 2021
* update telo plot with cancer_groups

* remove NA

Co-authored-by: kgaonkar6 <gaonkark@chop.edu>
Co-authored-by: Jaclyn Taroni <jaclyn.n.taroni@gmail.com>
@jharenza
Copy link
Collaborator Author

This seems to be completed.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
updated analysis work in progress Used to label (non-draft) pull requests that are not yet ready for review
Projects
None yet
Development

No branches or pull requests

3 participants