nhhaidee/scovtree

Phylogenetic analysis for SARS-CoV-2.

Introduction

nhhaidee/scovtree is a bioinformatics pipeline for SARS-CoV-2 phylogenetic analysis. Given a set of consensus sequences, the workflow outputs a phylogenetic tree and SNP information. The pipeline can also filter GISAID data to find the sequences most closely related to the input. The GISAID filtering workflow outputs the filtered sequences and metadata in the old format (GISAID recently changed its metadata format), so the output can be used with Nextstrain locally.

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with Docker containers, making installation trivial and results highly reproducible.

Quick Start

Install Nextflow.
Install either Docker or Singularity for full pipeline reproducibility (please only use Conda as a last resort; see docs).

Download the pipeline and test it on a minimal dataset with a single command, using any of the test profiles:

nextflow run nhhaidee/scovtree -profile test_gisaid_full,<docker/singularity/conda>
nextflow run nhhaidee/scovtree -profile test_gisaid_drop_columns,<docker/singularity/conda>
nextflow run nhhaidee/scovtree -profile test,<docker/singularity/conda>
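To run all three test profiles with the same engine, the commands above can be generated in a loop. This is a minimal sketch: the function name `scovtree_test_cmds` and the `ENGINE` variable are hypothetical helpers, not part of the pipeline, and the function only prints the commands rather than running them.

```shell
# Hypothetical helper (not part of scovtree): print the three test commands
# for one engine, which must be docker, singularity, or conda.
scovtree_test_cmds() {
    eng="$1"
    for profile in test_gisaid_full test_gisaid_drop_columns test; do
        printf 'nextflow run nhhaidee/scovtree -profile %s,%s\n' "$profile" "$eng"
    done
}

# ENGINE is an assumed environment variable; it defaults to docker here.
scovtree_test_cmds "${ENGINE:-docker}"
```

Piping the printed lines to `sh` would execute them one after another; reviewing the output first is the safer choice.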

Start running your own analysis!

A typical command for phylogenetic analysis is as follows:

nextflow run nhhaidee/scovtree -profile <docker/singularity/conda> \
    --filter_gisaid false \
    --input '/path/to/consensus/consensus_sequences.fasta'

A typical command for phylogenetic analysis with GISAID sequences is as follows:

nextflow run nhhaidee/scovtree -profile <docker/singularity/conda> \
    --filter_gisaid true \
    --gisaid_sequences /path/to/sequences.fasta \
    --gisaid_metadata /path/to/metadata.tsv \
    --input '/path/to/consensus/consensus_sequences.fasta'
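Because both commands depend on user-supplied file paths, a quick check before launching can catch a wrong path early. The sketch below is an assumption-laden convenience, not something the pipeline provides: `check_fasta` is a hypothetical function that only verifies the file is non-empty and that its first line starts with `>`, the FASTA header marker. It does not validate the sequences themselves.

```shell
# Hypothetical pre-flight check (not part of scovtree).
# Returns 0 if the file exists, is non-empty, and begins with a '>' header.
check_fasta() {
    f="$1"
    if [ ! -s "$f" ]; then
        echo "missing or empty: $f" >&2
        return 1
    fi
    first_line=""
    # read may return non-zero if the file lacks a trailing newline;
    # the variable is still filled, so the status is ignored.
    IFS= read -r first_line < "$f" || true
    case "$first_line" in
        '>'*) return 0 ;;
        *) echo "first line is not a FASTA header: $f" >&2; return 1 ;;
    esac
}

# Example usage (paths are the placeholders from the commands above):
# check_fasta /path/to/consensus/consensus_sequences.fasta && \
#     nextflow run nhhaidee/scovtree -profile docker \
#         --filter_gisaid false \
#         --input '/path/to/consensus/consensus_sequences.fasta'
```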

Credits

nhhaidee/scovtree was originally written by Hai Nguyen.

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

For further information or help, don't hesitate to get in touch on the Slack #scovtree channel (you can join with this invite).

Citations

You can cite the nf-core publication as follows:

The nf-core framework for community-curated bioinformatics pipelines.

Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.

Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.

In addition, references of tools and data used in this pipeline are as follows:
