Authors: | Volkova O.A., Kondrakhin Y.V., Kashapov T.A., Sharipov R.N. |
Title: | Comparative analysis of protein-coding and long non-coding transcripts based on RNA sequence features |
Bibliographic reference: | Volkova O.A., Kondrakhin Y.V., Kashapov T.A., Sharipov R.N. Comparative analysis of protein-coding and long non-coding transcripts based on RNA sequence features // Journal of Bioinformatics and Computational Biology. - 2018. - Vol. 16. - Iss. 2. - Art. 1840013. - ISSN 0219-7200. - EISSN 1757-6334. |
External systems: | DOI: 10.1142/S0219720018400139; SCOPUS: 2-s2.0-85046857488; WoS: 000431797900007; |
Abstract: | eng: RNA plays an important role in the life of the cell and of the organism as a whole. Besides the well-established protein-coding RNAs (messenger RNAs, mRNAs), long non-coding RNAs (lncRNAs) have recently gained the attention of researchers. Although lncRNAs are classified as non-coding, some authors have reported the presence of corresponding sequences in ribosome profiling (Ribo-seq) data. Ribo-seq is a powerful experimental technique used to characterize RNA translation in cells, with a focus on initiation (harringtonine, lactimidomycin) and elongation (cycloheximide). Using translation starts obtained from the Ribo-seq experiment, we developed a novel position weight matrix model for the prediction of translation starts. This model allowed us to achieve 96% accuracy in discriminating between human mRNAs and lncRNAs. When the same model was used to predict putative ORFs in RNAs, we discovered that the majority of lncRNAs contained only small ORFs (≤300 nt), in contrast to mRNAs. © 2018 World Scientific Publishing Europe Ltd. |
Keywords: | IPSmatrix algorithm; human mRNAs; small ORFs; discriminant analysis; human lncRNAs; position weight matrix approach; |
Published: | 2018 |
Physical description: | 1840013 |