You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jun 21, 2023. It is now read-only.
To determine whether or not the GISTIC results are in agreement when we compare those of a specific histology to those of the entire PBTA cohort.
For context, we found that there are some histologies in the PBTA cohort that contain more samples than others (ie. LGAT). This means that histologies with higher n samples may be driving the results of our analyses. This comparison analysis will help us decide whether or not we should handle our downstream analyses in a histology specific manner.
What methods do you plan to use to accomplish the scientific goals?
In analyses/run-gistic/results there are four zip files. One contains the GISTIC results for the entire cohort, and the other three contain the GISTIC results for three individual histologies (LGAT, HGAT, and medulloblastoma).
The plan for this analysis is to:
Visualize the agreement/disagreement in the scores.gistic files for each of the individual histologies compared to that for the entire cohort by adapting the code in analyses/cnv-chrom-plot/gistic_plot.Rmd.
Use a Venn diagram to visualize the agreement/disagreement in the lists of amplified and deleted genes for each of the individual histologies compared to that for the entire cohort.
Use appropriate visualizations to observe the degree of agreement between each of the GISTIC result files that the individual histologies have in common with the entire cohort.
This will also be represented in a R notebook.
What input data are required for this analysis?
The input data required for this analysis include:
What are the scientific goals of the analysis?
To determine whether or not the GISTIC results are in agreement when we compare those of a specific histology to those of the entire PBTA cohort.
For context, we found that there are some histologies in the PBTA cohort that contain more samples than others (ie. LGAT). This means that histologies with higher n samples may be driving the results of our analyses. This comparison analysis will help us decide whether or not we should handle our downstream analyses in a histology specific manner.
What methods do you plan to use to accomplish the scientific goals?
In
analyses/run-gistic/results
there are four zip files. One contains the GISTIC results for the entire cohort, and the other three contain the GISTIC results for three individual histologies (LGAT, HGAT, and medulloblastoma).The plan for this analysis is to:
scores.gistic
files for each of the individual histologies compared to that for the entire cohort by adapting the code inanalyses/cnv-chrom-plot/gistic_plot.Rmd
.This will also be represented in a R notebook.
What input data are required for this analysis?
The input data required for this analysis include:
analyses/run-gistic/results/pbta-cnv-consensus-gistic.zip
analyses/run-gistic/results/pbta-cnv-consensus-hgat-gistic.zip
analyses/run-gistic/results/pbta-cnv-consensus-lgat-gistic.zip
analyses/run-gistic/results/pbta-cnv-consensus-medulloblastoma-gistic.zip
How long do you expect is needed to complete the analysis? Will it be a multi-step analysis?
~2 days (rough estimate)
Who will complete the analysis (please add a GitHub handle here if relevant)?
@cbethell
The text was updated successfully, but these errors were encountered: