Skip to content
This repository has been archived by the owner on Jun 21, 2023. It is now read-only.

Fix v15 breaking changes #574

Closed
jaclyn-taroni opened this issue Feb 28, 2020 · 11 comments
Closed

Fix v15 breaking changes #574

jaclyn-taroni opened this issue Feb 28, 2020 · 11 comments

Comments

@jaclyn-taroni
Copy link
Member

jaclyn-taroni commented Feb 28, 2020

To quote the release notes being added in #569, we're changing the names of well-enough-used columns in the clinical file:

  • Change disease_type_old to pathology_diagnosis and disease_type_new to integrated_diagnosis per request in comment.

I know this change to pbta-histologies.tsv will break a number of things. The purpose of this issue is to track what will need to be changed as a result. Not only will the column names need to be updated, but we will also need to rerun any notebooks, change documentation, etc.

Anticipated issues

Here I'll list what I know needs to change in modules that are not deprecated.

Issues that have arisen as part of #576

molecular-subtyping-chordoma fails with the following:

Quitting from lines 159-168 (01-Subtype-chordoma.Rmd) 
    Error in .f(.x[[i]], ...) : object 'SMARCB1' not found
    Calls: <Anonymous> ... <Anonymous> -> vars_rename_eval -> map_if -> map -> .f

That's from this chunk:


I suspect what is actually happening is that there are no chordoma samples in the expression data used in CI and this step
smarcb1_expression <- smarcb1_expression[, which(colnames(expression_data) %in% chordoma_samples) ]

We may want to take an approach that is similar to other subtyping modules and have the first step be a script that generates files that consist only of chordoma samples that are committed to the repository.


The Add Shatterseek step of sv-analysis, which is Rscript analyses/sv-analysis/02-shatterseek.R fails with:

Error in file(file, "rt") : cannot open the connection
Calls: read.table -> file
In addition: Warning message:
In file(file, "rt") :
  cannot open file 'scratch/sv-vcf/BS_K07KNTFY_withoutYandM.tsv': No such file or directory
Execution halted

analyses/sv-analysis/02-shatterseek.R uses an independent specimen file, which is included in its entirety in CI, to read in files:

bioid <- unique(independent_specimen_list$Kids_First_Biospecimen_ID)

The step that would have generated scratch/sv-vcf/BS_K07KNTFY_withoutYandM.tsv comes prior to this one in CI

So it will only have access to the subsetted Manta file. See #449 (comment) and #449 (comment) for more context. The sv-analysis module should be more robust to "missing" samples.

Next steps

@cansavvy @cbethell @jashapiro I'd recommend splitting this up such that modifications to each module are in separate pull requests so you can go through any make sure you catch any documentation stuff I may not have come across.

@jaclyn-taroni
Copy link
Member Author

Because of the extent of these breaking changes, I believe it's prudent to handle each module in a separate pull request as stated above. Unfortunately, that means that CI will fail for a bunch of these fixes until the last one goes in. So here is the general procedure I think we should follow:

  1. First and foremost, state which module you'll be working on on this issue so we don't duplicate effort!
  2. When you file a pull request for a particular module, you should comment out all the modules that come before it in CI to quickly demonstrate that your fix works. This should be done in a single commit.
  3. Once you're changes have been reviewed, you've made any changes as the result of review, and approved, revert the single commit that commented out all the modules before the step your PR pertains to.
  4. We should merge the fix without the check passing.

This procedure has a significant weakness in that there may be changes introduced in any one fix that will cause CI to fail unexpectedly once the final fix goes in and this issue is closed. Once this issue gets closed, #569 should be updated such that it is in sync with master and that will test all steps in CI (except fusion-summary but that is tracked in #578). Any additional CI fixes can go into the AlexsLemonade:update-release-docs-v15 branch provided that they are small in scope.

@cansavvy
Copy link
Collaborator

cansavvy commented Mar 2, 2020

So we can keep track of progress on these, I took @jaclyn-taroni 's list above and made it into a checklist. We can claim items and then check things off as we fix them. I'll start by claiming this first item. I'll put the PR number next to it too when I get it filed.

v15 breaking changes TODO list

@jaclyn-taroni
Copy link
Member Author

I just realized that the sv-analysis failure may simply be due to the fact that I had commented out the first script in that module. So the scope of that fix may be to group those near each other in .circleci/config.yml or to add a shell script to that module.

@cansavvy
Copy link
Collaborator

cansavvy commented Mar 2, 2020

Okay. Well I just started working on it now, I'll see if that's it.

@cansavvy
Copy link
Collaborator

cansavvy commented Mar 2, 2020

@jaclyn-taroni you were right. It is fine if the first script is ran. #587

@jaclyn-taroni
Copy link
Member Author

Okay 👍 - would love to see those organized such that the step that was failing was immediately after the step that it depends on (perhaps after v15 is out out). I think that would have increased the chances I noticed that immediately.

@cansavvy
Copy link
Collaborator

cansavvy commented Mar 2, 2020

Okay 👍 - would love to see those organized such that the step that was failing was immediately after the step that it depends on (perhaps after v15 is out out). I think that would have increased the chances I noticed that immediately.

I was about to just make this change when I had the branch open but I didn't know if there were particular sequential orders to some of the other tests and didn't want to throw another possible wrench in our testings here. But yeah, we may even want to have a bash script that calls both and make it one CircleCI test.

@cansavvy
Copy link
Collaborator

cansavvy commented Mar 2, 2020

@jaclyn-taroni In regards to selection-strategy-comparison and your comment:

We may want to just deprecate this analysis at this point rather than try to maintain it?

I don't know enough about this analysis module to make an informed decision on this. Do we want to retire it though?

@jaclyn-taroni
Copy link
Member Author

@jashapiro - what do you think, time to retire selection-strategy-comparison ?

@jashapiro
Copy link
Member

I think it can be deprecated. Will do that now.

@jashapiro
Copy link
Member

jashapiro commented Mar 2, 2020

We did it, everyone! Changes incorporated to master in #569

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants