add Flu NA Nextclade outputs to theiacov_illumina_pe #406
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR closes #405
🗑️ This dev branch should be deleted after merging to main.
🧠 Aim, Context and Functionality
The TheiaCoV_Illumina_PE_PHB and TheiaCoV_ONT_PHB workflows run Nextclade on the NA segment, which provides a clade for this segment, but this output is not populated back to the Terra data table unfortunately. You will have to look into the output files on the step that is specific to running nextclade on the NA segment.
Additionally, the output files produced by nextclade (auspice_input_json, nextclade_json, nextclade_tsv) are not populated back to the Terra data table either.
And lastly, the
nextclade_qc
output string is also not populated to the Terra data table.🛠️ Impacted Workflows/Tasks & Changes Being Made
This will affect the behavior of the workflow(s) even if users don’t change any workflow inputs relative to the last version : No
Running this workflow on different occasions could result in different results, e.g. due to use of a live database, "latest" docker image, or stochastic data processing: No
Impacted workflows:
I don't believe this impacts TheiaCoV_FASTA_PHB workflow, as the user must provide either
HA
orNA
for the optional input calledflu_segment
. TheiaCoV_FASTA_PHB is not currently set up to run both HA and NA segments through in parallel. It only runs nextclade once. So I have skipped making these changes to that workflow because that will be a much more involved change that requires extensive dev and testing.📋 Workflow/Task Step Changes
🔄 Data Processing
Docker/software or software versions changed: N/A
Databases or database versions changed: N/A
Data processing/commands changed: N/A
File processing changed: N/A
Compute resources changed: N/A
➡️ Inputs
N/A
⬅️ Outputs
Added 5 new outputs to the TheiaCoV_Illumina_PE_PHB and TheiaCoV_ONT_PHB workflows:
String nextclade_json_flu_na = select_first([nextclade_flu_na.nextclade_json, ""])
String auspice_json_flu_na = select_first([nextclade_flu_na.auspice_json, ""])
String nextclade_tsv_flu_na = select_first([nextclade_flu_na.nextclade_tsv, ""])
String nextclade_clade_flu_na = select_first([nextclade_output_parser_flu_na.nextclade_clade, ""])
String? nextclade_qc_flu_na = nextclade_output_parser_flu_na.nextclade_qc
🧪 Testing
Test Dataset
I used the validation sets for ILMN PE and ONT workflows. I added 2 new H1N1 ONT datasets since the 5 we have currently available are not the best.
Commandline Testing with MiniWDL or Cromwell (optional)
Skipped testing locally except for TheiaCoV_ONT, which was successful (log not shown)
Terra Testing
See below comments for links and screenshots
Suggested Scenarios for Reviewer to Test
Test the 2 impacted workflows with Flu samples
Theiagen Version Release Testing (optional)
🔬 Final Developer Checklist
🎯 Reviewer Checklist
🗂️ Associated Documentation (to be completed by Theiagen developer)