GP-HTNLoc: A graph prototype head-tail network-based model for multi-label subcellular localization prediction of ncRNAs
- PMID: 38765609
- PMCID: PMC11101938
- DOI: 10.1016/j.csbj.2024.04.052
GP-HTNLoc: A graph prototype head-tail network-based model for multi-label subcellular localization prediction of ncRNAs
Abstract
Numerous research results demonstrated that understanding the subcellular localization of non-coding RNAs (ncRNAs) is pivotal in elucidating their roles and regulatory mechanisms in cells. Despite the existence of over ten computational models dedicated to predicting the subcellular localization of ncRNAs, a majority of these models are designed solely for single-label prediction. In reality, ncRNAs often exhibit localization across multiple subcellular compartments. Furthermore, the existing multi-label localization prediction models are insufficient in addressing the challenges posed by the scarcity of training samples and class imbalance in ncRNA dataset. To address these limitations, this study proposes a novel multi-label localization prediction model for ncRNAs, named GP-HTNLoc. To mitigate class imbalance, GP-HTNLoc adopts separate training approaches for head and tail location labels. Additionally, GP-HTNLoc introduces a pioneering graph prototype module to enhance its performance in small-sample, multi-label scenarios. The experimental results based on 10-fold cross-validation on benchmark datasets demonstrate that GP-HTNLoc achieves competitive predictive performance. The average results from 10 rounds of testing on an independent dataset show that GP-HTNLoc outperforms the best existing models on the human lncRNA, human snoRNA, and human miRNA subsets, with average precision improvements of 31.5%, 14.2%, and 5.6%, respectively, reaching 0.685, 0.632, and 0.704. A user-friendly online GP-HTNLoc server is accessible at https://56s8y85390.goho.co.
Keywords: Class imbalance; Heterogeneous graph representation learning; Multi-label classification; Non-coding RNA subcellular localization prediction.
© 2024 The Authors.
Conflict of interest statement
All authors disclosed no relevant relationships.
Figures
Similar articles
-
ncRNALocate-EL: a multi-label ncRNA subcellular locality prediction model based on ensemble learning.Brief Funct Genomics. 2023 Nov 10;22(5):442-452. doi: 10.1093/bfgp/elad007. Brief Funct Genomics. 2023. PMID: 37122147
-
Identify ncRNA Subcellular Localization via Graph Regularized k-Local Hyperplane Distance Nearest Neighbor Model on Multi-Kernel Learning.IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3517-3529. doi: 10.1109/TCBB.2021.3107621. Epub 2022 Dec 8. IEEE/ACM Trans Comput Biol Bioinform. 2022. PMID: 34432632
-
Shared subspace-based radial basis function neural network for identifying ncRNAs subcellular localization.Neural Netw. 2022 Dec;156:170-178. doi: 10.1016/j.neunet.2022.09.026. Epub 2022 Oct 10. Neural Netw. 2022. PMID: 36274524
-
Advances in Computational Methodologies for Classification and Sub-Cellular Locality Prediction of Non-Coding RNAs.Int J Mol Sci. 2021 Aug 13;22(16):8719. doi: 10.3390/ijms22168719. Int J Mol Sci. 2021. PMID: 34445436 Free PMC article. Review.
-
ncRDeep: Non-coding RNA classification with convolutional neural network.Comput Biol Chem. 2020 Oct;88:107364. doi: 10.1016/j.compbiolchem.2020.107364. Epub 2020 Aug 27. Comput Biol Chem. 2020. PMID: 32890916 Review.
References
-
- Sheng N., Huang L., Lu Y., et al. Data resources and computational methods for lncRNA-disease association prediction. Comput Biol Med. 2023 - PubMed
LinkOut - more resources
Full Text Sources