ADAM output is corrupt in S3 #117
Comments
This issue is somewhere in ADAM. I've opened bigdatagenomics/adam#962 in parallel for tracking.
OK, we've got a fix at bigdatagenomics/adam#964. I'm going to update the ADAM Docker container to pull this fix in, then test on the cluster and report back.
No dice. From inside of the GATK:
I'm wondering if we wrote out a bad tag? Time to dig in more...
We are writing record group IDs as integers, not strings:
From the SAM spec:
Should be a simple fix, although I'm a bit perplexed as to how we did that.
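For anyone who wants to check their own output for the same problem, here is a minimal sketch in Python. It relies on the fact that the SAM specification defines the RG optional field with type Z (a printable string), so an RG field carrying any other type code, such as i for integer, is invalid. The function names and the two sample alignment lines are hypothetical and made up for illustration. This is not the ADAM fix itself, and it works on SAM text lines rather than on binary BAM records.

```python
def parse_optional_fields(sam_line):
    """Return {tag: (type_char, value)} for the optional fields of one SAM alignment line."""
    columns = sam_line.rstrip("\n").split("\t")
    fields = {}
    # The first 11 columns are mandatory; optional TAG:TYPE:VALUE fields follow.
    for field in columns[11:]:
        tag, type_char, value = field.split(":", 2)
        fields[tag] = (type_char, value)
    return fields


def read_group_is_string(sam_line):
    """True if the line has no RG field, or if its RG field is typed Z (string)."""
    fields = parse_optional_fields(sam_line)
    if "RG" not in fields:
        return True  # nothing to check on this record
    return fields["RG"][0] == "Z"


# Hypothetical records: same read, RG written as a string versus as an integer.
GOOD_LINE = "read1\t0\tchr1\t100\t60\t4M\t*\t0\t0\tACGT\tIIII\tRG:Z:1"
BAD_LINE = "read1\t0\tchr1\t100\t60\t4M\t*\t0\t0\tACGT\tIIII\tRG:i:1"

if __name__ == "__main__":
    for line in (GOOD_LINE, BAD_LINE):
        print(read_group_is_string(line))
```

Feeding it decoded alignment lines (for example from a SAM text dump of the BAM) would flag any record whose read group tag was not written as a string.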
Fix at fnothaft/adam@14b41d5. Retesting on the cluster...
Brief synopsis is in #116, but the TL;DR is that the BAM written by ADAM is corrupt when downloaded from S3. I'm working to sort out whether something is going wrong when ADAM writes the BAM out, or if something is going wrong when the BAM is uploaded to S3. The header for the file is good (w00t) but the rest of the file can't be read.
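One way to separate the two possibilities (bad write versus bad upload) is to compare a digest of the BAM as written with a digest of the copy downloaded from S3. The sketch below is only an illustration of that check: the helper names are hypothetical, and it assumes both files are available on local disk.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps reading until read() returns b"" at EOF.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_bytes(local_path, downloaded_path):
    """True if the two files have identical contents according to SHA-256."""
    return sha256_of(local_path) == sha256_of(downloaded_path)
```

If the digests match, the upload and download round trip preserved the bytes, which points at the writer. If they differ, the transfer is the more likely culprit.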