PseUI: Pseudouridine sites identification based on RNA sequence information
- PMID: 30157750
- PMCID: PMC6114832
- DOI: 10.1186/s12859-018-2321-0
PseUI: Pseudouridine sites identification based on RNA sequence information
Abstract
Background: Pseudouridylation is the most prevalent type of posttranscriptional modification in various stable RNAs of all organisms, which significantly affects many cellular processes that are regulated by RNA. Thus, accurate identification of pseudouridine (Ψ) sites in RNA will be of great benefit for understanding these cellular processes. Due to the low efficiency and high cost of current available experimental methods, it is highly desirable to develop computational methods for accurately and efficiently detecting Ψ sites in RNA sequences. However, the predictive accuracy of existing computational methods is not satisfactory and still needs improvement.
Results: In this study, we developed a new model, PseUI, for Ψ sites identification in three species, which are H. sapiens, S. cerevisiae, and M. musculus. Firstly, five different kinds of features including nucleotide composition (NC), dinucleotide composition (DC), pseudo dinucleotide composition (pseDNC), position-specific nucleotide propensity (PSNP), and position-specific dinucleotide propensity (PSDP) were generated based on RNA segments. Then, a sequential forward feature selection strategy was used to gain an effective feature subset with a compact representation but discriminative prediction power. Based on the selected feature subsets, we built our model by using a support vector machine (SVM). Finally, the generalization of our model was validated by both the jackknife test and independent validation tests on the benchmark datasets. The experimental results showed that our model is more accurate and stable than the previously published models. We have also provided a user-friendly web server for our model at http://zhulab.ahu.edu.cn/PseUI , and a brief instruction for the web server is provided in this paper. By using this instruction, the academic users can conveniently get their desired results without complicated calculations.
Conclusion: In this study, we proposed a new predictor, PseUI, to detect Ψ sites in RNA sequences. It is shown that our model outperformed the existing state-of-art models. It is expected that our model, PseUI, will become a useful tool for accurate identification of RNA Ψ sites.
Keywords: Nucleotide composition; Position specific nucleotide propensity; Pseudouridine site.
Conflict of interest statement
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures
Similar articles
-
TargetM6A: Identifying N6-Methyladenosine Sites From RNA Sequences via Position-Specific Nucleotide Propensities and a Support Vector Machine.IEEE Trans Nanobioscience. 2016 Oct;15(7):674-682. doi: 10.1109/TNB.2016.2599115. Epub 2016 Aug 10. IEEE Trans Nanobioscience. 2016. PMID: 27552763
-
PseU-KeMRF: A Novel Method for Identifying RNA Pseudouridine Sites.IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1423-1435. doi: 10.1109/TCBB.2024.3389094. Epub 2024 Oct 9. IEEE/ACM Trans Comput Biol Bioinform. 2024. PMID: 38625768
-
PPUS: a web server to predict PUS-specific pseudouridine sites.Bioinformatics. 2015 Oct 15;31(20):3362-4. doi: 10.1093/bioinformatics/btv366. Epub 2015 Jun 14. Bioinformatics. 2015. PMID: 26076723
-
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112. Brief Bioinform. 2020. PMID: 31714956 Review.
-
PseUdeep: RNA Pseudouridine Site Identification with Deep Learning Algorithm.Front Genet. 2021 Nov 18;12:773882. doi: 10.3389/fgene.2021.773882. eCollection 2021. Front Genet. 2021. PMID: 34868261 Free PMC article. Review.
Cited by
-
Bioinformatics for Inosine: Tools and Approaches to Trace This Elusive RNA Modification.Genes (Basel). 2024 Jul 29;15(8):996. doi: 10.3390/genes15080996. Genes (Basel). 2024. PMID: 39202357 Free PMC article. Review.
-
PseUpred-ELPSO Is an Ensemble Learning Predictor with Particle Swarm Optimizer for Improving the Prediction of RNA Pseudouridine Sites.Biology (Basel). 2024 Apr 8;13(4):248. doi: 10.3390/biology13040248. Biology (Basel). 2024. PMID: 38666860 Free PMC article.
-
Fuzzy kernel evidence Random Forest for identifying pseudouridine sites.Brief Bioinform. 2024 Mar 27;25(3):bbae169. doi: 10.1093/bib/bbae169. Brief Bioinform. 2024. PMID: 38622357 Free PMC article.
-
Interpretable Multi-Scale Deep Learning for RNA Methylation Analysis across Multiple Species.Int J Mol Sci. 2024 Mar 1;25(5):2869. doi: 10.3390/ijms25052869. Int J Mol Sci. 2024. PMID: 38474116 Free PMC article.
-
PseU-ST: A new stacked ensemble-learning method for identifying RNA pseudouridine sites.Front Genet. 2023 Jan 19;14:1121694. doi: 10.3389/fgene.2023.1121694. eCollection 2023. Front Genet. 2023. PMID: 36741328 Free PMC article.
References
-
- Behmansmant I, Urban A, Ma X, Yu YT, Motorin Y, Branlant C. The Saccharomyces cerevisiae U2 snRNA:pseudouridine-synthase Pus7p is a novel multisite-multisubstrate RNA:psi-synthase also acting on tRNAs. Rna-a Publication of the Rna Society. 2003;9(11):1371. doi: 10.1261/rna.5520403. - DOI - PMC - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases