-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not running final consensus since no rukki paths provided! #162
Comments
Can you post the error log from 5-untip/untip.err? |
Here it is:
|
Haven't seen that before. Are you able to share the full 5-untip folder? You can see the FAQ here on how to send us data: https://canu.readthedocs.io/en/latest/faq.html#how-can-i-send-data-to-you. I may need some other folders but lets start with that one first. |
the file "5-untip.tar.gz" was uploaded as suggested - thanks! |
I also encountered the same error. I just installed rukki, looking forward to a solution! |
This should be fixed by the commit (which auto-closed the issue). You should be able to just replace the script in your conda install and re-run. However, your assembly graph looked quite fragmented. I doubt the Hi-C phasing, at least with default parameters, will work well here. What type of input data do you have for your genome? |
Thanks @skoren - the data used for verkko testing (hifi + ONT + HiC) were all from GIAB HG002. |
After I reinstalled rukki, the error was exactly the same as before.
|
The error doesn't look the same, there's no error message in the untip.err file. How did you re-install? The fix isn't part of a release, you just need to patch the one python script. What was the full log of the run that didn't finish? |
For my latest testing, the previous 5-unitig error was gone, but now stopped at 8-hicPipeline. Looks like this related to run_mashmap.err:
|
Yes, that's a mashmap issue with a missing math library. Is mashmap also installed from conda? Could you check mashmap --version? |
I believe it was installed through conda verkko installation ( I did not install mashmap specifically).
|
Conda has been a pain recently, in this case it seems it installed an invalid version of mashmap. I'm not even sure why it's got the dependency, that's not a mashmap-included library. Not something we can fix within verkko since we're just relying on conda to solve the environment correctly and install working tools. Try uninstalling or updating the latest mashmap version, 3.0.5 and see if it runs then. |
I confirmed (see linked issue) that this shouldn't be a dependency with mashmap but may have been an issue with mashmap v3.0.4 installations in conda. So updating as I suggested above should fix your issue. |
just installed mashmap (3.0.6) and re-run - but got the following errors:
|
Ah that is a missing dependency in the conda package, if you install networkx with conda it should get past this error. I'll add that to the next verkko build. |
"conda install networkx" should install it, but after I did, for some reason, verkko still complained that networkx could not be found - kind weird ! Finally I used "pip install networkx --user" command to install - now it is running again ( in step of: output: 6-layoutContigs/unitig-popped.layout, 6-layoutContigs/unitig-popped.layout.scfmap, 6-layoutContigs/gaps.txt) Hope this process will go all the way this time! |
Again, verkko stopped with an error (below): The log mentioned by message was actually NOT there (.snakemake/log/2023-07-10T163056.780339.snakemake.log).
|
Are you running this on a cluster? I'd guess this job ran out of time on the cluster. Can you post the end of part016.err? |
just on local (not on a cluster).
|
That log file has no error so no idea why snakemake thinks it failed. It might have been an intermittent I/O issue on the system. What's in the 7-consensus/packages folder? You can try running verkko with the |
the following is from dry run ("--snakeopts --dry-run ") : does this mean both part012 and part016 failed?
|
Either part012 failed or didn't run. Does it have an error log in packages, if so post the end of that log as well and the contents of packages folder. |
part012.err seems not very helpful.
Now I'm re-running the job again (without dry-run) to see if it can run through this time .... |
Neither one of those reported an error. I would have suggested just renaming the WORKING.fasta files to .fasta yourself and resuming from there. I don't think anything failed during those jobs. |
no luck to run to the end of the pipeline with NO obvious error messages in the following 3 error files (part012.err, part016.err , part017.err ):
The 3 fasta files were generated apparently, but the pipeline still complained and terminated.
|
It's strange that no error is being reported and the job is being marked as failed by snakemake when the fasta file is generated. The fasta is only renamed from WORKING to the final name if the consensus command completed with no error and the next thing right after that is
Also, I'm not clear why this is running three consensus jobs, your previous failure and dry-run reported that only two jobs had failed. This seems like some kind of snakemake weirdness/bug (wouldn't be the first time). You should be able to run verkko with |
I would think it's related to #166 but it's strange there is no final error or failure reported. In addition to trying the above, can you also share one of the package files (e.g. |
the latest run with your suggestion failed again (with part12/16/17) - verkko hic-integration seems having some unknowns that caused this run terminated prematurely. Just send package-part12.tar.gz to you as before. Thanks for looking into this. |
I was able to run the partition without error and it returned 0. The number of sequences output seems to match what you got so I think your file is complete as well. I have no idea why snakemake is detecting these jobs as failed but would guess this is a snakemake bug. Have you tried the suggestion to run touch and dry-run to see if it will continue on and use your existing outputs? |
Thanks @skoren. I actually tried before, but not for the latest run - it seemed that verkko could not get out of those jobs as it considered them as "failed" - basically it just re-run and then fail again ! I'm testing touch and dry-run now and seeing it is still generating "part012/16/17.fasta" (even they were actually finished). Also I just noticed that 3 core dumps sit there under the folder of "7-consensus" (below) - not sure if this is of any help?
|
Core would imply the jobs failed, at least at some point, but I would expect that to show up in the logs. In your original consensus error the -EM parameter was set to 0 but the ones you shared were correct so that could have been the source of the original core dump. Your run looks very strange as the folder also has the files combined.fasta, assembly.fasta, and even haplotype-split results which shouldn't be created unless the assembly finished consensus and moved onto the next step. Is this a run that finished that you re-started with changed parameters? Is this the 7-consensus folder under the top-level folder or under 8-hicPipeline/final_contigs/7-consensus (assuming you're running w/hic)? Something is very off with this folder if that is the case because, like I said, many of these files shouldn't exist if the consensus partitions didn't finish. I think the multiple restarts have caused some kind of snakemake tracking issue and I would suggest making a new folder and copying over all the [1-8]-* folders to it but not the 8-hicPipeline/final_contigs/7-consensus folder or any files named |
some update:
eg. In verkko_out/7-consensus/packages/part012.sh (also in part016.sh, part017.sh)
but in verkko_out/8-hicPipeline/final_contigs/7-consensus/packages/part012.sh (in fact, all 29 jobs same here) -EM 0 \
But after I mv *.fasta.WORKING to *.fasta for those 3 jobs (under verkko_out/8-hicPipeline/final_contigs/7-consensus), I re-run the pipeline and it did get into the end finally. |
Ah OK, that explains why the output showed no error. The HiC folder layout is a bit confusing and we plan to standardize it to match the trio/other runs in the near future. I am pretty sure this is the same as #166 since the -EM was not 0 in the initial consensus and became 0 later. I would suggest updating the -EM to match what it was in the top-level 7-consensus folder and re-running the partitions in 8-hicPipeline/final_contigs/7-consensus/packages. After that, snakemake should redo the last steps, you may have to remove the |
These issues should all be addressed by the v1.4.1 release. I've also made a pull request to update conda: bioconda/bioconda-recipes#42411 |
Just installed the latest v1.4 using conda (with no installation issue), and run verkko with --hic1/--hic2 option, but encountered error like below:
I added rukki (~/miniconda3/lib/verkko/bin/rukki) to PATH and re-run, but still failed with same error.
Any suggestion?
Thanks
The text was updated successfully, but these errors were encountered: