[MRG] emit fewer warnings about potential ANI estimation issues #2061
Codecov Report
@@ Coverage Diff @@
## latest #2061 +/- ##
==========================================
+ Coverage 84.28% 91.81% +7.53%
==========================================
Files 130 99 -31
Lines 15255 11047 -4208
Branches 2155 2178 +23
==========================================
- Hits 12857 10143 -2714
+ Misses 2099 606 -1493
+ Partials 299 298 -1
@ctb ready for review
p.s. this looks really great, thank you :)
Question: will any warning show up even when no match is found? I'm seeing that behavior in prefetch on genbank with
Which warnings are you seeing when no matches are found? (All?) The warning I would want to show up is the one about potential false negatives, but I don't know how to have that show up just once with the current design. This PR should make it so no warnings show up at all if you have no matches in search/prefetch/gather.
@ctb ready for re-review. I'm not sure what happened to screw up the wheels though :/. Any suggestions?
no worries, it's almost certainly not PR specific.
addresses #2058 by emitting fewer warnings.
search
prefetch
Notes and Questions:
`compare`/`search`/`prefetch`/`gather` if there were any issues. For these, we do not currently have output columns for these properties, so if users get this warning, there will be no way to know which of the comparisons generated the issue, other than the fact that ANI will not be estimated for these comparisons (ANI gets zeroed for both size accuracy and jaccard error issues).
Potential false negatives are a bit different. We now warn at the end of `compare` if there are any potential false negatives. But most of the time, this won't work for `search`, because if there are no hashes in common, there will just be no match found during initial search, so a `SearchResult`/`PrefetchResult` etc. will never be generated. I do currently store this as a property in `*Result`, but haven't figured out test data to get a `True` value out of it, so maybe this should not be included? Perhaps this is the situation where we want to warn immediately upon comparison, since it will mostly show up when the scaled value is too high/query sketch is too small?