Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Picard vs Samtools queryname sort incompatibility is a common user pain point #8

Open
clintval opened this issue Jan 28, 2025 · 0 comments
Assignees

Comments

@clintval
Copy link
Owner

clintval commented Jan 28, 2025

Picard/fgbio/htsjdk aligned files (such as to a wildtype genome) are of a different sort order to the often-used samtools approach of using samtools fastq and aligning with bwa. Thus this exception is hit when trying to neodisambiguate:

require(
templates.map(_.name).distinct.length <= 1,
"Templates with different names found! This can only occur if your SAM sources are queryname sorted using"
+ " different implementations, such as with Picard tools versus Samtools. If you have encountered this"
+ " exception, then please alert the maintainer!"
)

It might be nice to have a fallback mode that re-sorts the SAM streams if they:

  1. Say they are queryname sorted
  2. And appear out of order (parsimony is different sort implementations)
@clintval clintval self-assigned this Jan 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant