PUBLICATION

CNIT: a fast and accurate web tool for identifying protein-coding and long non-coding transcripts based on intrinsic sequence composition

Authors

Guo, J.C., Fang, S.S., Wu, Y., Zhang, J.H., Chen, Y., Liu, J., Wu, B., Wu, J.R., Li, E.M., Xu, L.Y., Sun, L., Zhao, Y.

ID

ZDB-PUB-190601-18

Date

2019

Source

Nucleic acids research 47(W1): W516-W522 (Journal)

Registered Authors

Liu, Jing

Keywords

none

MeSH Terms

Animals
High-Throughput Nucleotide Sequencing
Humans
Internet
Mice
Neural Cell Adhesion Molecule L1/genetics
Proteins/genetics*
RNA, Long Noncoding/chemistry*
Sequence Analysis, RNA*
Software*

PubMed

31147700 Full text @ Nucleic Acids Res.

Abstract

As more and more high-throughput data has been produced by next-generation sequencing, it is still a challenge to classify RNA transcripts into protein-coding or non-coding, especially for poorly annotated species. We upgraded our original coding potential calculator, CNCI (Coding-Non-Coding Index), to CNIT (Coding-Non-Coding Identifying Tool), which provides faster and more accurate evaluation of the coding ability of RNA transcripts. CNIT runs ∼200 times faster than CNCI and exhibits more accuracy compared with CNCI (0.98 versus 0.94 for human, 0.95 versus 0.93 for mouse, 0.93 versus 0.92 for zebrafish, 0.93 versus 0.92 for fruit fly, 0.92 versus 0.88 for worm, and 0.98 versus 0.85 for Arabidopsis transcripts). Moreover, the AUC values of 11 animal species and 27 plant species showed that CNIT was capable of obtaining relatively accurate identification results for almost all eukaryotic transcripts. In addition, a mobile-friendly web server is now freely available at http://cnit.noncode.org/CNIT.

Genes / Markers

Figures

Show all Figures

Expression

Phenotype

Mutations / Transgenics

Human Disease / Model

Sequence Targeting Reagents

Fish

Antibodies

Orthology

Engineered Foreign Genes

Mapping

Guo et al., 2019 ZDB-PUB-190601-18 PMID:31147700

CNIT: a fast and accurate web tool for identifying protein-coding and long non-coding transcripts based on intrinsic sequence composition

Guo et al., 2019

ZDB-PUB-190601-18
PMID:31147700