RNAknot2S: a New RNA Secondary Structure Prediction Method that Combines the Comparative Approach with Genetic Algorithms
(*) Corresponding author
DOI: https://doi.org/10.15866/ireaco.v16i6.24279
Abstract
The RNA secondary structure is one of the extensively explored subjects in bioinformatics due to its pivotal involvement in various biological processes. Several methods have been developed for predicting the optimal secondary structure of a given RNA sequence. However, only a limited number of these methods have produced satisfactory results, especially when dealing with complex secondary structure components such as pseudoknot classes. This paper introduces RNAknot2S, a novel program for RNA secondary structure prediction. RNAknot2S is based on the genetic algorithm and combines two main approaches, the comparative approach and the thermodynamic approach. It takes three inputs, the first is the target RNA sequence (S1) for which the secondary structure is to be determined, the second is another RNA sequence (S2) expected to be homologous to S1, and the third element is the secondary structure (Ss) of the sequence S2 presented in dot-bracket format. RNAknot2S produces as output a secondary structure that includes various components such as stems, hairpin loops, multi-branched loops or multi-loops, bulge loops, internal loops, H-type pseudoknots, and hairpin kissing interactions. In order to evaluate the performance of the proposed algorithm, a comparative study has been performed using some of the most popular methods for RNA secondary structure prediction and the database Rfam. The obtained results clearly demonstrate a significant enhancement in the prediction accuracy of the proposed method when compared to other existing programs. RNAknot2S exhibits the highest average values for specificity (SP) at 55.22%, sensitivity (SN) at 58.29%, and F-measure (F-M) at 55.59% when compared to its counterparts.
Copyright © 2023 Praise Worthy Prize - All rights reserved.
Keywords
Full Text:
PDFReferences
Statello, Luisa, Guo, Chun-Jie, Chen, Ling-Ling, et al. Gene regulation by long non-coding RNAs and its biological functions. Nature reviews Molecular cell biology, 2021, vol. 22, no 2, p. 96-118.
https://doi.org/10.1038/s41580-020-00315-9
Bergkessel, Megan. Regulation of protein biosynthetic activity during growth arrest. Current opinion in microbiology, 2020, vol. 57, p. 62-69.
https://doi.org/10.1016/j.mib.2020.07.010
Wang, Xunxun, Yu, Shixiong, Lou, En, et al. RNA 3D Structure Prediction: Progress and Perspective. Molecules, 2023, vol. 28, no 14, p. 5532.
https://doi.org/10.3390/molecules28145532
Nussinov R, Pieczenik G, Griggs JR, Kleitman DJ, Algorithms for loop matchings, SIAM Journal on Applied mathematics 35(1):68-82, 1978
https://doi.org/10.1137/0135006
Hofacker IL, Fontana W, Stadler PF, Bonhoeffer LS, Tacker M, Schuster P, Fast folding and comparison of rna secondary structures, Monatshefte f¨ur Chemie/Chemical Monthly 125(2):167-188, 1994.
https://doi.org/10.1007/BF00818163
Zuker M, Mfold web server for nucleic acid folding and hybridization prediction, Nucleic acids research 31(13):3406-3415, 2003.
https://doi.org/10.1093/nar/gkg595
Sato, Kengo, Hamada, Michiaki. Recent trends in RNA informatics: a review of machine learning and deep learning for RNA secondary structure prediction and RNA drug discovery. Briefings in Bioinformatics, 2023, p. bbad186.
https://doi.org/10.1093/bib/bbad186
Mathews DH, Sabina J, Zuker M, Turner DH, Expanded sequence dependence of thermodynamic parameters improves prediction of rna secondary structure, Journal of molecular biology 288(5):911-940, 1999.
https://doi.org/10.1006/jmbi.1999.2700
Sato K, Kato Y, Hamada M, Akutsu T, Asai K, Ipknot: fast and accurate prediction of rna secondary structures with pseudoknots using integer programming, Bioinformatics 27(13):i85-i93, 2011.
https://doi.org/10.1093/bioinformatics/btr215
Dawson W, Takai T, Ito N, Shimizu K, Kawai G, A new entropy model for rna: part iii. is the folding free energy landscape of rna funnel shaped?, Journal of Nucleic Acids Investigation 5(1), 2014.
https://doi.org/10.4081/jnai.2014.2652
Xayaphoummine A, Bucher T, Isambert H, Kinefold web server for rna/dna folding path and structure prediction including pseudoknots and knots, Nucleic acids research 33(suppl 2):W605-W610, 2005.
https://doi.org/10.1093/nar/gki447
Bindewald E, Kluth T, Shapiro BA, Cylofold: secondary structure prediction including pseudoknots, Nucleic acids research 38(suppl 2):W368-W372, 2010.
https://doi.org/10.1093/nar/gkq432
Mathews DH, Using an rna secondary structure partition function to determine confidence in base pairs predicted by free energy minimization, Rna 10(8):1178-1190, 2004.
https://doi.org/10.1261/rna.7650904
YU, Bo, LU, Yao, ZHANG, Qiangfeng Cliff, et al. Prediction and differential analysis of RNA secondary structure. Quantitative Biology, 2020, vol. 8, p. 109-118.
https://doi.org/10.1007/s40484-020-0205-6
Han K, Kim HJ, Prediction of common folding structures of homologous rnas, Nucleic Acids Research 21(5):1251-1257, 1993.
https://doi.org/10.1093/nar/21.5.1251
Harmanci AO, Sharma G, Mathews DH, Turbofold: iterative probabilistic estimation of secondary structures for multiple rna sequences, BMC bioinformatics 12(1):1, 2011.
https://doi.org/10.1186/1471-2105-12-108
Bindewald E, Shapiro BA, RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers, RNA 12(3):342-352, 2006.
https://doi.org/10.1261/rna.2164906
Engelen S, Tahi F, Tfold: efficient in silico prediction of non-coding RNA secondary structures, Nucleic acids research 38(7):2453-2466, 2010.
https://doi.org/10.1093/nar/gkp1067
Janssen S, Giegerich R, The RNA shapes studio, Bioinformatics 31(3):423-425, 2015.
https://doi.org/10.1093/bioinformatics/btu649
Bernhart, S. H., Hofacker, I. L., Will, S., Gruber, A. R., & Stadler, P. F. (2008). RNAalifold: improved consensus structure prediction for RNA alignments. BMC Bioinformatics, 9(1), 474.
https://doi.org/10.1186/1471-2105-9-474
Mittal, Shubham, Hasija, Yasha. RNA Secondary Structure Prediction using Machine Learning: A Review. In: 2020 IEEE 5th International Conference on Computing Communication and Automation (ICCCA). IEEE, 2020. p. 714-719.
https://doi.org/10.1109/ICCCA49541.2020.9250877
Fu, Laiyi, Cao, Yingxin, Wu, Jie, et al. UFold: fast and accurate RNA secondary structure prediction with deep learning. Nucleic acids research, 2022, vol. 50, no 3, p. e14-e14.
https://doi.org/10.1093/nar/gkab1074
Jung, Andrew J., Lee, Leo J., Gao, Alice J., et al. RTfold: RNA secondary structure prediction using deep learning with domain inductive bias. In: The 2022 ICML Workshop on Computational Biology. Baltimore, Maryland, USA. 2022.
Chen, Chun-Chi, Chan, Yi-Ming. REDfold: accurate RNA secondary structure prediction using residual encoder-decoder network. BMC bioinformatics, 2023, vol. 24, no 1, p. 1-13.
https://doi.org/10.1186/s12859-023-05238-8
Tan, Cheng, Gao, Zhangyang, Li, Stan Z. Rfold: Towards simple yet effective rna secondary structure prediction. arXiv preprint arXiv:2212.14041, 2022.
Sato, Kengo, Akiyama, Manato, Sakakibara, Yasubumi. RNA secondary structure prediction using deep learning with thermodynamic integration. Nature communications, 2021, vol. 12, no 1, p. 941.
https://doi.org/10.1038/s41467-021-21194-4
Chen, X., Li, Y., Umarov, R., Gao, X. & Song, L. RNA secondary structure prediction by learning unrolled algorithms. In Proceedings of the 8th International Conference on Learning Representations (2020).
https://doi.org/10.6084/m9.figshare.hgv.1920
Katoch, Sourabh, Chauhan, Sumit Singh, Kumar, Vijay. A review on genetic algorithm: past, present, and future. Multimedia tools and applications, 2021, vol. 80, p. 8091-8126.
https://doi.org/10.1007/s11042-020-10139-6
GLsearch Goujon M, McWilliam H, Li W, Valentin F, Squizzato S, Paern J, Lopez R, A new bioinformatics analysis tools framework at embl-ebi, Nucleic acids research 38(suppl 2):W695-W699, 2010.
https://doi.org/10.1093/nar/gkq313
El Fatmi A, Bekri MA, Benhlima S, Rnaknot: A new algorithm for rna secondary structure prediction based on genetic algorithm and grasp method, Journal of bioinformatics and computational biology 17(05):1950031, 2019.
https://doi.org/10.1142/S0219720019500318
Turner DH, Mathews DH, Nndb: the nearest neighbor parameter database for predicting stability of nucleic acid secondary structure, Nucleic acids research 38(suppl 1):D280-D282, 2009.
https://doi.org/10.1093/nar/gkp892
Sperschneider J, Datta A, Dotknot: pseudoknot prediction using the probability dot-plot under a refined energy model, Nucleic acids research 38(7):e103-e103, 2010.
https://doi.org/10.1093/nar/gkq021
Sperschneider J, Datta A, Wise MJ, Heuristic rna pseudoknot prediction including intramolecular kissing hairpins, RNA 17(1):27-38, 2011.
https://doi.org/10.1261/rna.2394511
https://drive.google.com/file/d/1Is-4d2WVpW67Cd6KxmJ9jgpuH3eCwmN/view?usp=sharing
Burset M, Guigo R, Evaluation of gene structure prediction programs, Genomics 34(3):353-367, 1996.
https://doi.org/10.1006/geno.1996.0298
Srikamdee, Supawadee et Chongstitvatana, Prabhas. Collaborative Learning of Estimation of Distribution Algorithm for RNA secondary structure prediction. ECTI Transactions on Computer and Information Technology (ECTI-CIT), 2020, vol. 14, no 1, p. 92-102.
https://doi.org/10.37936/ecti-cit.2020141.239871
Kalvari, Ioanna, Nawrocki, Eric P., Ontiveros-Palacios, Nancy et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids Research, 2021, vol. 49, no D1, p. D192-D200.
https://doi.org/10.1093/nar/gkaa1047
Chan CY, Lawrence CE, Ding Y, Structure clustering features on the sfold web server, Bioinformatics 21(20):3926-3928, 2005.
https://doi.org/10.1093/bioinformatics/bti632
http://rna.urmc.rochester.edu/NNDB/turner04/
Refbacks
- There are currently no refbacks.
Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize