Store analysis output-files from Balsamic in Housekeeper #475
Comments
@henrikstranneheim This is the structure I have for BALSAMIC. The files section is what Housekeeper should store, and the key names will be directory paths to keep them organized and tidy. How do you have it in MIP? Note: add keys under bam for the sample IDs (tumor, normal, etc.).
Here is my suggestion:
The point here is that the file produced by the workflows can list from 1 to n files, where each file can have a set of mandatory keys and values (if we like), but also other keys describing the file whenever appropriate. I think this will be more maintainable and extensible while still bringing consistency across workflows and the code that operates on this file. @hassanfa @jemten @barrystokman @patrikgrenfeldt @sylvinite What do you think of this?
Looks good. I like this! It is quite informative. Two comments:
The path_index would most likely not be mandatory, but it is probably a good idea to have it in the same hash entry as the path it indexes. That way Housekeeper could pick it up if it expects it, but skip it otherwise. Yeah, I know the
So we leave the logic to housekeeper to handle path_index, etc.
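The handling described above could look roughly like the following. This is only a sketch, assuming each file entry is a mapping with a mandatory `path` key and an optional `path_index` key. The function name `collect_paths`, the `expect_index` flag, and the example file names are hypothetical and not part of Housekeeper.

```python
from typing import Iterable, List, Mapping


def collect_paths(entries: Iterable[Mapping[str, str]], expect_index: bool = False) -> List[str]:
    """Return the paths to store, adding an index file only when it is expected and present."""
    paths: List[str] = []
    for entry in entries:
        # `path` is treated as mandatory here, so a missing key raises KeyError.
        paths.append(entry["path"])
        # `path_index` is optional: pick it up if expected, skip it otherwise.
        index = entry.get("path_index")
        if expect_index and index:
            paths.append(index)
    return paths


# Hypothetical entries, for illustration only.
entries = [
    {"path": "tumor.bam", "path_index": "tumor.bam.bai"},
    {"path": "multiqc_report.html"},
]
print(collect_paths(entries))
print(collect_paths(entries, expect_index=True))
```

Keeping the index next to the path it belongs to means the consumer never has to guess which index goes with which file.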
Sounds good! I updated the suggestion.
👍 I'll start working on balsamic's side.
@patrikgrenfeldt @barrystokman Do you have any comments before we start working on the code to produce the file?
Discussed last Thursday, good idea. Issue for work in cg regarding MIP-RNA here. Edit: specified pipeline.
I suggest we call the root node (currently called store) something intuitive for the usage, maybe "output-files", "outputs", "files" or "deliverables". What do you say?
I prefer 'output-files' or 'files'. |
Let's do 'files' then. |
I'd prefer deliverables. Easier for me to understand which file to use for storing results if I have to do it manually |
@emiliaol you're talking about the name of the file, which has not been decided yet. You indicated to me that
Aha! It was for inside the file. That I won't care about :) |
Sounds good! Updated suggestion.
I added a new key, "tag", to the suggestion. It is needed to distinguish between files produced in the same step, e.g. the multiqc html and json reports, or the mip_analyse config and log.
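To illustrate why a tag helps, here is a small sketch of looking up one file among several produced by the same step. The `tag` key comes from the suggestion; the `step` key, the function name `find_file`, and the file names are assumptions made for the example.

```python
from typing import Dict, List, Optional

# Hypothetical entries: two files from the same step, told apart only by tag.
files: List[Dict[str, str]] = [
    {"path": "multiqc_report.html", "step": "multiqc", "tag": "html"},
    {"path": "multiqc_data.json", "step": "multiqc", "tag": "json"},
]


def find_file(entries: List[Dict[str, str]], step: str, tag: str) -> Optional[Dict[str, str]]:
    """Return the first entry matching both step and tag, or None if there is none."""
    for entry in entries:
        if entry.get("step") == step and entry.get("tag") == tag:
            return entry
    return None


print(find_file(files, "multiqc", "json"))
```

Without the tag, a lookup on step alone would match both reports.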
As a TA member, I would like to have files from Balsamic runs added to Housekeeper (HK) so that they are easy to find, deliver, etc.
Problem: the Balsamic pipeline is now being set up. The output files from this shall be stored in Housekeeper just as the files from MIP.
Information:
Dir-structure for Balsamic
fastq's: /home/proj/production/cancer/analysis/case-id/fastq/
results: /home/proj/production/cancer/analysis/case-id/analysis/
job logs: /home/proj/production/cancer/analysis/case-id/logs/
job scripts: /home/proj/production/cancer/analysis/case-id/scripts/
Internal BALSAMIC log: /home/proj/production/cancer/analysis/case-id/BALSAMIC_run
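The layout above can be expressed as a small helper that derives every directory from the case id. This is a sketch only; the function name `case_directories` and the dictionary keys are hypothetical, while the paths follow the list above.

```python
from pathlib import PurePosixPath
from typing import Dict

# Root directory taken from the listing above.
ANALYSIS_ROOT = PurePosixPath("/home/proj/production/cancer/analysis")


def case_directories(case_id: str) -> Dict[str, PurePosixPath]:
    """Map each kind of Balsamic output to its directory for one case."""
    case_dir = ANALYSIS_ROOT / case_id
    return {
        "fastq": case_dir / "fastq",
        "results": case_dir / "analysis",
        "job_logs": case_dir / "logs",
        "job_scripts": case_dir / "scripts",
        "balsamic_log": case_dir / "BALSAMIC_run",
    }


for name, directory in case_directories("case-id").items():
    print(f"{name}: {directory}")
```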
How to know when an analysis is finished:
I asked Hassan to make Balsamic create a file when an analysis is completed, for example 'analysis_finish' in the case directory (we need to know whether this is on sample or case level).
He has created an issue on GitHub:
Clinical-Genomics/BALSAMIC#143
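Checking for such a marker file could be as simple as the sketch below. It assumes the marker is written at case level and is named 'analysis_finish' as in the example above; neither has been decided, and the function name `is_analysis_finished` is hypothetical.

```python
import tempfile
from pathlib import Path


def is_analysis_finished(case_dir: Path, marker_name: str = "analysis_finish") -> bool:
    """Return True if the completion marker file exists in the case directory."""
    return (case_dir / marker_name).is_file()


# Demonstration in a temporary directory standing in for a case directory.
with tempfile.TemporaryDirectory() as tmp:
    case_dir = Path(tmp)
    print(is_analysis_finished(case_dir))  # marker not yet created
    (case_dir / "analysis_finish").touch()
    print(is_analysis_finished(case_dir))  # marker created
```

If the marker turns out to be written per sample instead, the same check would need to run against each sample directory.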
Suggested solution:
There will be a delivery_report directory in /home/proj/production/cancer/analysis//analysis/ with a file .hk which is a yaml-file with keys:
Check if tests exist in cg: other US will be made by KB
Q: What do we need to store?
A: We should ask HFA before starting the US
Q2: is the .hk already created?
A2:
DoD:
Files from Balsamic are stored in HK after finished analysis
A pipeline-tag has been added to the files in HK
Verified
How to demo:
Show
$ housekeeper get balsamic case bundle