Genomic Origin Through Taxonomic CHAllenge (GOTTCHA)

GOTTCHA is an application of a novel, gene-independent and signature-based metagenomic taxonomic profiling method with significantly smaller false discovery rates (FDR) that is laptop deployable. Our algorithm was tested and validated on twenty synthetic and mock datasets ranging in community composition and complexity, was applied successfully to data generated from spiked environmental and clinical samples, and robustly demonstrates superior performance compared with other available tools.

GOTTCHAv2 is currently under development in BETA stage. Pre-built databases for v1 are incompatible with v2.

DEPENDENCIES

GOTTCHA2 profiler is written in Python3 and leverage minimap2 to map reads to signature sequences. In order to run GOTTCHA2 correctly, your system requires to have following dependencies installed correctly. The YAML file for Conda environment can be found in environment.yml.

Python 3.6+
minimap2 2.17+
pandas
samtools

QUICK START

Install the package:

 via conda `conda install -c bioconda gottcha2`

 OR

 Download or git clone GOTTCHA2 from this repository and run `pip install .`

Download the latest version of the GOTTCHA2 database. (This step may take some time)
```
 https://ref-db.edgebioinformatics.org/gottcha2/RefSeq-r220/
```

Run GOTTCHA2:

 $ gottcha2.py -d RefSeq-r220_BAVxH-cg/gottcha_db.species.fna -t 8 -i <FASTQ>
 
 OR
 
 $ gottcha2 profile -d RefSeq-r220_BAVxH-cg/gottcha_db.species.fna -t 8 -i <FASTQ>

RESULT

GOTTCHA2 can output the profiling results in either CSV, TSV or BIOM format.

summary (.tsv or .csv) - A summary of profiling results (10 columns) in taxonomic ranks breakdown
full (.tsv or .csv) - A full profiling results including unfiltered profiling results and additional columns
lineage (.lineage.tsv or .lineage.tsv) - output lineage and abundance of the profiled taxon per line
extract (.extract[TAXID].fastq) - Extracted reads for a specific taxon.

DOCUMENTATION

Please refer to https://github.com/poeli/GOTTCHA2/wiki for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 181 Commits
.github/workflows		.github/workflows
gottcha		gottcha
test		test
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Genomic Origin Through Taxonomic CHAllenge (GOTTCHA)

DEPENDENCIES

QUICK START

RESULT

DOCUMENTATION

About

Releases 12

Packages

Contributors 2

Languages

License

poeli/GOTTCHA2

Folders and files

Latest commit

History

Repository files navigation

Genomic Origin Through Taxonomic CHAllenge (GOTTCHA)

DEPENDENCIES

QUICK START

RESULT

DOCUMENTATION

About

Resources

License

Stars

Watchers

Forks

Releases 12

Packages 0

Contributors 2

Languages

Packages