Skip to content
This repository has been archived by the owner on Jun 21, 2023. It is now read-only.

V20 CNV update part2: focal cnv and oncoprint rerun #1124

Merged
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
20 changes: 14 additions & 6 deletions analyses/focal-cn-file-preparation/02-add-ploidy-consensus.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,11 @@ output:
toc_float: TRUE
author: Chante Bethell and Jaclyn Taroni for ALSF CCDL
date: 2020
params:
base_run:
label: "1/0 to run with base histology"
value: 0
input: integer
---

The `pbta-histologies.tsv` file contains a `tumor_ploidy` column, which is tumor ploidy as inferred by ControlFreeC.
Expand Down Expand Up @@ -41,11 +46,15 @@ if(!dir.exists(output_dir)) {
```

```{r}
# TODO: the consensus SEG file is not currently in the data download -- when it
# gets included we will have to change the file path here
consensus_seg_file <- file.path("..", "copy_number_consensus_call", "results",
"pbta-cnv-consensus.seg.gz")
histologies_file <- file.path("..", "..", "data", "pbta-histologies.tsv")
if ( params$base_run ==0 ){
histologies_file <- file.path("..", "..", "data", "pbta-histologies.tsv")
consensus_seg_file <- file.path("..", "..", "data",
"pbta-cnv-consensus.seg.gz")
} else {
histologies_file <- file.path("..", "..", "data", "pbta-histologies-base.tsv")
consensus_seg_file <- file.path("..", "copy_number_consensus_call", "results",
"pbta-cnv-consensus.seg.gz")
}

consensus_seg_df <- read_tsv(consensus_seg_file)
histologies_df <- read_tsv(histologies_file,
Expand Down Expand Up @@ -215,4 +224,3 @@ temp_gain <- purrr::imap(bed_gain_list,
```{r}
sessionInfo()
```

Large diffs are not rendered by default.

Large diffs are not rendered by default.

Large diffs are not rendered by default.

Large diffs are not rendered by default.

11 changes: 10 additions & 1 deletion analyses/focal-cn-file-preparation/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,11 +8,20 @@ The purpose of this module is to map from those ranges to gene identifiers for c
### Running this analysis
*This analysis requires at least ~24 GB of RAM to run to completion*

To run this analysis _only on consensus SEG file_, use the following (from the root directory of the repository):
To run this analysis _only on consensus SEG file_,

use OPENPBTA_BASE_SUBTYPING=1 to run this module using the pbta-histologies-base.tsv from data folder and relative path to `copy_number_consensus_call/results/pbta-cnv-consensus.seg.gz` while running molecular-subtyping modules for release.

```
OPENPBTA_BASE_SUBTYPING=1 bash analyses/focal-cn-file-preparation/run-prepare-cn.sh
```

Or by default runs analyses using pbta-histologies.tsv and downloaded files from data release:

```
bash analyses/focal-cn-file-preparation/run-prepare-cn.sh
```

**Note**: The `run-bedtools.snakemake` script is implemented in `run-prepare-cn.sh` to run the bedtools coverage steps between the UCSC cytoband file and the samples in the copy number files produced in `02-add-ploidy-consensus.Rmd`.
This script currently takes a while to run, and therefore slows down the processing speed of the main shell script `run-prepare-cn.sh`.

Expand Down
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file not shown.
Binary file not shown.
Original file line number Diff line number Diff line change
@@ -1,39 +1,23 @@
gene
KDM7A
KDM7A-DT
PARP12
SLC37A3
TBXAS1
FMC1
FMC1-LUC7L2
LUC7L2
TTC26
UBN2
ZC3HAV1
ZC3HAV1L
CLEC2L
FMC1
FMC1-LUC7L2
HIPK2
KDM7A
KIAA1549
KLRG2
LUC7L2
PARP12
TBXAS1
KDM7A-DT
SLC37A3
MKRN1
RAB19
ZC3HAV1L
DENND2A
KIAA1549
ADCK2
BRAF
CLEC2L
FMC1
FMC1-LUC7L2
HIPK2
KDM7A
KDM7A-DT
KLRG2
LUC7L2
NDUFB2
NDUFB2-AS1
PARP12
SLC37A3
TBXAS1
TTC26
UBN2
ZC3HAV1
ZC3HAV1L
Binary file not shown.
Loading