[ONT] Remove KMC #578
TheiaProk_ONT_PHB Test:
The run succeeded, and I have now compared the assembly lengths produced against the expected genome sizes. The differences are very small, except for Shigella sonnei, where the assemblies were around 10% larger than expected.
The reason for this is beyond the scope of this PR. The BUSCO duplication score is not awful. The results are concordant with those obtained with PHB v2.1.0 (https://app.terra.bio/#workspaces/theiagen-validations/PHB_Validation_v2-1-0/data)
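For readers who want to repeat the length comparison described above, here is a minimal sketch of the arithmetic. The function names and the 5% tolerance are hypothetical, and the numbers in the usage example are illustrative placeholders, not values from this test run.

```python
def percent_difference(assembly_length: int, expected_length: int) -> float:
    """Signed difference of the assembly length from the expected genome size, in percent."""
    if expected_length <= 0:
        raise ValueError("expected_length must be positive")
    return 100.0 * (assembly_length - expected_length) / expected_length


def exceeds_tolerance(assembly_length: int, expected_length: int,
                      tolerance_pct: float = 5.0) -> bool:
    """True when the assembly deviates from the expected size by more than tolerance_pct.

    The 5% default is an assumed threshold for illustration only.
    """
    return abs(percent_difference(assembly_length, expected_length)) > tolerance_pct


if __name__ == "__main__":
    # Placeholder values: an assembly 10% larger than a 5 Mb expected size.
    print(percent_difference(5_500_000, 5_000_000))
    print(exceeds_tolerance(5_500_000, 5_000_000))
```

A signed percentage keeps the direction of the deviation visible, which matters here because the assemblies in question were larger, not smaller, than expected.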
@sage-wright Can you update the documentation while I view the results from TheiaCoV ONT?
TheiaCoV_ONT:
@cimendes, documentation has been updated!
This PR closes #203
🗑️ This dev branch should be deleted after merging to main.
🧠 Summary
This PR removes the KMC module from the TheiaCoV and TheiaProk ONT workflows. In place of the estimated genome length in TheiaProk ONT, I have added a default genome length of 5 Mb, which is about 0.7 Mb larger than the mean genome length of all bacteria included in this file.
⚡ Impacted Workflows/Tasks
TheiaProk_ONT and TheiaCoV_ONT
This PR may lead to different results in pre-existing outputs: Yes, generated assemblies may differ because of changes in how the estimated genome length is used in RASUSA downsampling.
This PR uses an element that could cause duplicate runs to have different results: No
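To illustrate why a fixed genome length can change results: coverage-based downsampling keeps roughly `coverage × genome length` bases, so the coverage a sample actually ends up with depends on how close the assumed length is to the true one. The sketch below is not the workflow code. The function names and the coverage value of 150 are assumptions for illustration; the 5 Mb default and the roughly 4.3 Mb mean come from the summary above.

```python
DEFAULT_GENOME_LENGTH = 5_000_000  # 5 Mb default introduced by this PR


def target_bases(coverage: float, genome_length: int = DEFAULT_GENOME_LENGTH) -> int:
    """Total bases to keep when downsampling to `coverage` for an assumed genome length."""
    return int(coverage * genome_length)


def effective_coverage(coverage: float, assumed_length: int, true_length: int) -> float:
    """Coverage actually obtained when the true genome length differs from the assumed one."""
    if true_length <= 0:
        raise ValueError("true_length must be positive")
    return coverage * assumed_length / true_length


if __name__ == "__main__":
    coverage = 150  # assumed target coverage, for illustration only
    print(target_bases(coverage))
    # A 4.3 Mb genome downsampled as if it were 5 Mb keeps more than the target coverage.
    print(effective_coverage(coverage, DEFAULT_GENOME_LENGTH, 4_300_000))
```

Genomes smaller than the default retain more coverage than requested, and larger genomes retain less, which is one way assemblies can differ from earlier runs.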
🛠️ Changes
⚙️ Algorithm
➡️ Inputs
No
⬅️ Outputs
KMC outputs have been removed.
🧪 Testing
Test on TheiaProk here
Test on TheiaCoV (flu) here
Suggested Scenarios for Reviewer to Test
🔬 Final Developer Checklist
🎯 Reviewer Checklist