-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nanopore support #134
Nanopore support #134
Conversation
Great that you adding Nanopore to the workflow! I will test it today and compare the results with the pipeline developed by nf-core guys(their version is quite basic so I am looking forward to using rnaflow). |
Hi! Cool, would be great if you can give it a try. As I mentioned in the PR it's really basic and there might be more long-read specialized tools out there. And we also have to perform some comparisons on our end - but basically it should work and long-reads should be properly mapped and counted. The subsequent DEG calc part is the same as for Illumina at the moment. Let us know if you run into any issues! |
Hey! First, if you have problem with minimap2 output of stats, maybe you can use samtools flagstat? It provides all of the mapping statistics. Now the pipeline is working, the issue was my annotation file. So I tested it and I get similar result as in nf-core pipeline. No bugs for this moment. Thanks for adapting it to Nanopore! Old comment:
|
@AggresiveHayBale sorry for the late reply!
Yes, this might work, although I don't know if multiQC can take samtools output directly...
Awesome, thanks for testing and replying! The input annotation file is always a bit tricky: and bc/ we started to develop RNAflow with human/mice/rat RNA-Seq data in mind we focused on the integration of "proper" GTF files like the ones from Ensembl that have the full gene-transcript-exon structure. Of course, for bacteria, such a structure is not really necessary but still recommended to work with most tools straight away. I am just curious: what was the problem w/ your initial input annotation file? Missing transcript entries? After @MarieLataretu approves, we will merge this PR and make another release for basic Nanopore support. |
@hoelzer The issue was inconsistent naming scheme between fasta and annotation file. Stupid mistake from my part. After correcting everything started to work. About MultiQC, I don't have that big coding experience so I can't say it for sure but according to their website samtools are fully implemented. The only problematic thing would be Nanoplot then. However as the output is still saved this is not a big problem. Anyway good luck and thanks for the answer! |
@MarieLataretu when you are basically happy w/ this PR we could also merge it and do a rnaflow release w/ (basic!) nanopore support. |
I'll look into it tomorrow! |
@MultiQC the new For CLEAN we already have the full htlm report in MultiQC - looks not that nice, but I think, it's good enough (reminder issue #140) |
@MarieLataretu okay great! I agree regarding MultiQC and NanoPlot... then it is like this. I hoped that now the NanoPlot output can be also summarized in MultiQC... but if not it's nothing to invest loads of time Please feel free to merge and release once you were able to check |
It was actually a problem in the I assume you used the Docker container:
If the error is |
@MarieLataretu ahh so this might be solved then with a newer version of Nanoplot in the container? We use right now
but I can also provide a newer container for latest Nanoplot version if that helps:
|
I tested it with Singularity and it works! |
ah, okay! fine. I mean anyway we could update to the latest version |
In the new @hoelzer Can you fix it easily or should we modify the expected output in Nextflow? |
@MarieLataretu ah strange... I did not touch the other dependencies... but I can easily fix that, one sec |
FROM continuumio/miniconda3
ENV VERSION 1.38.0
ENV TOOL nanoplot
RUN apt-get update && apt install -y procps wget gzip zip && apt-get clean && rm -rf /var/lib/apt/lists/* /tmp/* /var/tmp/*
RUN conda config --add channels conda-forge && \
conda config --add channels bioconda && \
conda config --add channels default
RUN conda install $TOOL=$VERSION && conda clean -a I am rebuilding the container w/ this recipe right now. |
here it is: docker pull nanozoo/nanoplot:1.38.0--11e5b96 I deleted the other one |
Thanks! It works now 👍 I'm merging now |
I added basic nanopore support as described in #131
minimap2
instead ofhisat2
nanoplot
instead offastqc
-L
flag for long reads forfeatureCounts
sortmerna
still active, not sure how well it works w/ long reads (maybe better to deactivate per default when--nanopore
is usedknown issues:
multiqc
reporting does not work bc/minimap2
does not write stats that can be used. Also no support fornanoplot
afaikmultiqc
input channel a bit in case of--nanopore
so that a report is still generatedAlso, this is really a basic hack to get nanopore data working. For sure, there are more specialized tools. But it's also interesting to see how well RNAflow performs w/ that simple changes on long-read nanopore data.
Tested w/
-profile test,local,docker --nanopore
--> works except thenanoplot
s are failing likely because the test data is actually not nanopore long reads. But the general workflow runs through.