V15 Fix molecular-subtype-chordoma analysis #590

cansavvy · 2020-03-02T18:43:41Z

Purpose/implementation Section

There are not enough chordoma samples in the subset CircleCI files to test so I switched to the approach that was used in the other subtyping modules.

Looks like the rename steps were not working because of library load and it was not calling the dplyr::rename but another rename. So to fix I added a series of dplyr::, reran the notebook with v15 and pushed the results.

What GitHub issue does your pull request address?

#574

Reproducibility Checklist

These items were all taken care of previously.

The dependencies required to run the code in this pull request have been added to the project Dockerfile.
This analysis has been added to continuous integration.

Documentation Checklist

This analysis module has a README and it is up to date.
This analysis is recorded in the table in analyses/README.md and the entry is up to date.
The analytical code is documented and contains comments.

cansavvy · 2020-03-02T19:05:42Z

@jaclyn-taroni I have two fix ideas for the lack of chordoma samples in subset for this notebook.

Force the subset to include chordoma samples by adding another step in create-subset-files (This is the more painful route since we'd have to re-run the subsetting steps).
Make this filter step in the notebook only occur when it is NOT being run in CircleCI:

OpenPBTA-analysis/analyses/molecular-subtyping-chordoma/01-Subtype-chordoma.Rmd

Line 147 in 0d9a85e

smarcb1_expression <- smarcb1_expression[, which(colnames(expression_data) %in% chordoma_samples) ]

At this point in time, we are thinking 2 would be okay, but I wanna clear it by you before I implement it.

jaclyn-taroni · 2020-03-02T19:12:43Z

There is another option - have the first step of this module be it’s own subsetting where all resulting files are committed to the repository. That is what we do for other subtyping modules.

cansavvy · 2020-03-02T19:13:45Z

There is another option - have the first step of this module be it’s own subsetting where all resulting files are committed to the repository. That is what we do for other subtyping modules.

I was just noticing this for the other subtyping modules. I will try to follow their suit then.

…hordoma

jashapiro

Just a few small things, while biggish changes are happening.

analyses/molecular-subtyping-chordoma/00-subset-files-for-chordoma.R

.circleci/config.yml

…hordoma

This reverts commit 400be46.

jaclyn-taroni · 2020-03-02T20:32:01Z

fusion-summary is commented out in a6e2923 #569 because it's going to require a bigger change (because the fusion files are part of what is changing in v15) which is tracked with #578. I intended for #578 to wait until after v15 so we don't hold things up.

jashapiro · 2020-03-02T20:32:53Z

Meant to leave that comment in the main thread...

cbethell

LGTM once the comments below are addressed 👍

cbethell · 2020-03-02T20:14:16Z

analyses/molecular-subtyping-chordoma/README.md

 ```
+This bash script the same regardless of where it is called and will first subset the data to `Chordoma` samples


Looks like you're missing a verb here, correct?

cbethell · 2020-03-02T20:24:13Z

analyses/molecular-subtyping-chordoma/01-Subtype-chordoma.Rmd

+# Subset metadata
+subset_metadata <- histologies_df %>%
+  dplyr::filter(short_histology == "Chordoma") %>%
+  select(


You're calling the filter function above using dplyr:: but not doing the same for select here. Is this because filter was behaving similarly to rename as you noted in your original comment? If so, then disregard this comment. If not, removing the dplyr:: altogether (except for in the case of the rename function) or adding the dplyr:: to the rest of the functions would be nice for consistency purposes.

filter also has a base:: function that can mess with stuff sometimes.

cbethell · 2020-03-02T20:34:43Z

analyses/molecular-subtyping-chordoma/01-Subtype-chordoma.Rmd

-# remove large expression matrix that's no longer needed
-rm(expression_data)
+# now only the columns correspond to chordoma samples
+smarcb1_expression <- smarcb1_expression[, which(colnames(subset_expression_data) %in% subset_metadata$Kids_First_Biospecimen_ID) ]


This line looks weird (format-wise), did you run this through a styler?

I did but it doesn't do anything with really long lines. I'd rather use dplyr::filter here anyway but was trying to keep the original code here.

jashapiro

LGTM!

jashapiro · 2020-03-02T20:37:59Z

Merging before expected CI failure.

cansavvy added 2 commits March 2, 2020 13:40

dplyr:: were needed to differentiate rename

a45c5cd

Comment out all tests except this one

d947e9f

cansavvy mentioned this pull request Mar 2, 2020

Fix v15 breaking changes #574

Closed

Merge branch 'master' into v15-fix-chordoma

3482801

cansavvy added 6 commits March 2, 2020 14:58

Restructure Chordoma to do subsetting and be have that as an option

287289c

Merge remote-tracking branch 'origin/v15-fix-chordoma' into v15-fix-c…

4fd9b95

…hordoma

Update CircleCI

e81d7d1

Add back the plot

3812d2b

Refresh everything

a410905

Linter and refresh everything

4164881

cansavvy requested a review from cbethell March 2, 2020 20:10

Merge branch 'master' into v15-fix-chordoma

b82c6f8

jashapiro reviewed Mar 2, 2020

View reviewed changes

cansavvy added 5 commits March 2, 2020 15:19

Combine select_metadata and chordoma_df as one thing

03b26e8

Merge remote-tracking branch 'origin/v15-fix-chordoma' into v15-fix-c…

f1b5771

…hordoma

Fix CircleCI spacing per @jashapiro 's request

563d2c2

Comment out everything except this thing

400be46

Revert "Comment out everything except this thing"

8e97f1d

This reverts commit 400be46.

cbethell approved these changes Mar 2, 2020

View reviewed changes

jashapiro approved these changes Mar 2, 2020

View reviewed changes

jashapiro merged commit 2a86c8e into AlexsLemonade:master Mar 2, 2020

cansavvy deleted the v15-fix-chordoma branch March 25, 2020 20:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

V15 Fix molecular-subtype-chordoma analysis #590

V15 Fix molecular-subtype-chordoma analysis #590

cansavvy commented Mar 2, 2020 •

edited

Loading

cansavvy commented Mar 2, 2020

jaclyn-taroni commented Mar 2, 2020

cansavvy commented Mar 2, 2020

jashapiro left a comment

jaclyn-taroni commented Mar 2, 2020

jashapiro commented Mar 2, 2020

cbethell left a comment

cbethell Mar 2, 2020

cbethell Mar 2, 2020

cansavvy Mar 2, 2020

cbethell Mar 2, 2020

cansavvy Mar 2, 2020

jashapiro left a comment

jashapiro commented Mar 2, 2020

		```
		This bash script the same regardless of where it is called and will first subset the data to `Chordoma` samples

V15 Fix molecular-subtype-chordoma analysis #590

V15 Fix molecular-subtype-chordoma analysis #590

Conversation

cansavvy commented Mar 2, 2020 • edited Loading

Purpose/implementation Section

What GitHub issue does your pull request address?

Reproducibility Checklist

Documentation Checklist

cansavvy commented Mar 2, 2020

jaclyn-taroni commented Mar 2, 2020

cansavvy commented Mar 2, 2020

jashapiro left a comment

Choose a reason for hiding this comment

jaclyn-taroni commented Mar 2, 2020

jashapiro commented Mar 2, 2020

cbethell left a comment

Choose a reason for hiding this comment

cbethell Mar 2, 2020

Choose a reason for hiding this comment

cbethell Mar 2, 2020

Choose a reason for hiding this comment

cansavvy Mar 2, 2020

Choose a reason for hiding this comment

cbethell Mar 2, 2020

Choose a reason for hiding this comment

cansavvy Mar 2, 2020

Choose a reason for hiding this comment

jashapiro left a comment

Choose a reason for hiding this comment

jashapiro commented Mar 2, 2020

cansavvy commented Mar 2, 2020 •

edited

Loading