CodAn

CodAn (Coding sequence Annotator) is a computational tool designed to characterize the CDS and UTR regions of transcripts from any eukaryote species.

Getting Started

Installation

Decompress the CodAn.tar.gz file:

tar -xf CodAn.tar.gz

Add the bin directory to your PATH:

export PATH="$PATH:/path/to/CodAn/bin/"

Requirements

Python3 and Biopython
- apt-get install python3-biopython
Perl, Bioperl and MCE (libmce-perl)
- apt-get install bioperl libmce-perl
NCBI-BLAST (v2.9.0 or above)
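
As a quick sanity check before running CodAn, the sketch below looks for the required executables on the PATH and for the Biopython package. The executable names (python3, perl, blastx, makeblastdb) are assumptions derived from the requirements above, and the check does not cover the Perl modules (Bioperl, MCE) or the BLAST version.

```python
import importlib.util
import shutil

# Executable names are assumptions based on the requirements list;
# adjust them if your installation uses different names.
REQUIRED_TOOLS = ["python3", "perl", "blastx", "makeblastdb"]


def find_missing_tools(tools=REQUIRED_TOOLS):
    """Return the entries of `tools` that are not found on the PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def has_biopython():
    """Return True if the Biopython package (imported as `Bio`) is installed."""
    return importlib.util.find_spec("Bio") is not None


if __name__ == "__main__":
    missing = find_missing_tools()
    if missing:
        print("Missing from PATH:", ", ".join(missing))
    else:
        print("All required executables were found.")
    print("Biopython installed:", has_biopython())
```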

Predictive models

The predictive models are available in the subfolder "models". The folder contains all models designed for Eukaryote species (i.e., Fungi, Plants and Animals [Invertebrates and Vertebrates]). Models are provided for both Full-Length and Partial transcripts.

Download the model that fits your needs, as described in the "models" folder, decompress the model file (using unzip model.zip), and pass the path of the decompressed model to the -m option.
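
The decompression step can also be scripted. The following is a minimal sketch using Python's standard zipfile module; the function name extract_model is hypothetical, and the internal layout of the model archives is not described here, so the function simply reports whichever top-level entries it finds.

```python
import zipfile
from pathlib import Path


def extract_model(zip_path, dest_dir="."):
    """Extract a model archive and return the top-level entries it contained.

    The archive layout is an assumption: nothing here relies on a
    particular folder name inside the zip file.
    """
    dest = Path(dest_dir)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
        # Collect the first path component of every archive member.
        top_level = sorted({name.split("/")[0] for name in archive.namelist()})
    return [dest / name for name in top_level]
```

If the archive holds a single folder, the one path returned by extract_model("model.zip") is the value to give to the -m option.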

Usage

Usage: codan.py [options]

Options:
  -h, --help            show this help message and exit
  -t file, --transcripts=file
                        Mandatory - input transcripts file (FASTA format),
                        /path/to/transcripts.fa
  -m model, --model=model
                        Mandatory - path to model, /path/to/model
  -s string, --strand=string
                        Optional - strand of sequence to predict genes (plus,
                        minus or both) [default=both]
  -c int, --cpu=int     Optional - number of threads to be used [default=1]
  -o folder, --output=folder
                        Optional - path to output folder,
                        /path/to/output/folder/ if not declared, it will be
                        created at the transcripts input folder
                        [default="CodAn_output"]
  -b proteinDB, --blastdb=proteinDB
                        Optional - path to blastDB of known protein sequences,
                        /path/to/blast/DB/DB_name
  -H int, --HSP=int     Optional - used in the "-qcov_hsp_perc" option of
                        blastx [default=80]

Basic usage (predict CDS):

codan.py -t transcripts.fa -o output_folder -m model
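
When CodAn is called from a pipeline, the command line can be assembled programmatically. The sketch below is one possible wrapper, not part of CodAn itself: the function names build_codan_command and run_codan are hypothetical, while the flags are the ones listed in the help text above. It assumes codan.py is reachable through the PATH.

```python
import subprocess

VALID_STRANDS = ("plus", "minus", "both")


def build_codan_command(transcripts, model, output=None, strand=None,
                        cpu=None, blastdb=None, hsp=None):
    """Build the codan.py argument list from the options in the help text."""
    if strand is not None and strand not in VALID_STRANDS:
        raise ValueError(f"strand must be one of {VALID_STRANDS}")
    # -t and -m are mandatory; every other option is added only when set.
    cmd = ["codan.py", "-t", str(transcripts), "-m", str(model)]
    optional = [("-o", output), ("-s", strand), ("-c", cpu),
                ("-b", blastdb), ("-H", hsp)]
    for flag, value in optional:
        if value is not None:
            cmd += [flag, str(value)]
    return cmd


def run_codan(**options):
    """Run codan.py; raises CalledProcessError on a non-zero exit status."""
    return subprocess.run(build_codan_command(**options), check=True)
```

For example, run_codan(transcripts="transcripts.fa", model="model", output="output_folder") issues the same command as the basic usage line.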

Alternative usage (predict CDS and run a BLAST search against a specific DB to annotate the predicted genes based on similarity):

codan.py -t transcripts.fa -o output_folder -m model -b blast_DB

To run this optional step, pass a protein DB formatted with the makeblastdb program from NCBI-BLAST. Pre-formatted protein DBs, such as swissprot, can also be downloaded (ftp://ftp.ncbi.nlm.nih.gov/blast/db/).
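
For a custom set of protein sequences, a database can be formatted as sketched below. The file name proteins.fasta and the database name my_proteins are placeholders, and the function names are hypothetical; the makeblastdb flags used (-in, -dbtype, -out) are standard NCBI-BLAST options.

```python
import subprocess


def build_makeblastdb_command(fasta, db_name):
    """Argument list that formats a protein FASTA file as a BLAST database."""
    return ["makeblastdb", "-in", str(fasta),
            "-dbtype", "prot", "-out", str(db_name)]


def make_protein_db(fasta, db_name):
    """Run makeblastdb; raises CalledProcessError on a non-zero exit status."""
    return subprocess.run(build_makeblastdb_command(fasta, db_name), check=True)
```

After make_protein_db("proteins.fasta", "my_proteins") completes, the name given as db_name (including its path) is the value to pass to the -b option.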

Tutorial

Follow the instructions in the quick tutorial to learn how to use CodAn and interpret the results.

Reference

If you use or discuss CodAn, please cite the preprint:

Nachtigall et al. CodAn: predictive models for the characterization of mRNA transcripts in Eukaryotes

License

GNU GPLv3

Contact

To report bugs, ask for help, or give feedback, please contact Pedro G. Nachtigall: pedronachtigall@gmail.com
