Progressive Multiple Sequence Alignment for COVID-19 Mutation Identification via Deep Reinforcement Learning

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 743))

Included in the following conference series:

International Conference on Practical Applications of Computational Biology & Bioinformatics

204 Accesses

Abstract

COVID-19 can mutate rapidly, resulting in new variants which could be more malignant. To recognize the new variant, we must identify the mutation parts by locating the nucleotide changes in the DNA sequence of COVID-19. The identification is by processing sequence alignment. In this work, we propose a method to perform multiple sequence alignment via deep reinforcement learning effectively. The proposed method integrates a progressive alignment approach by aligning each pairwise sequence center to deep Q networks. We designed the experiment by evaluating the proposed method on five COVID-19 variants: alpha, beta, delta, gamma, and omicron. The experiment results showed that the proposed method was successfully applied to align multiple COVID-19 DNA sequences by demonstrating that pairwise alignment processes can precisely locate the sequence mutation up to $90\%$. Moreover, we effectively identify the mutation in multiple sequence alignments fashion by discovering around $10.8\%$ conserved region of nitrogenous bases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://www.ncbi.nlm.nih.gov/

References

Isaev, A., Deem, M.: Introduction to mathematical methods in bioinformatics. Phys. Today 58, 83 (2005). https://doi.org/10.1063/1.2138428
Jafari, R., Javidi, M.M., Kuchaki Rafsanjani, M.: Using deep reinforcement learning approach for solving the multiple sequence alignment problem. SN Applied Sciences 1(6), 1–12 (2019). https://doi.org/10.1007/s42452-019-0611-4
Article Google Scholar
Lipman, D.J., Altschul, S.F., Kececioglu, J.D.: A tool for multiple sequence alignment. Proc. Natl. Acad. Sci. U.S.A. 86(12), 4412–5 (1989). https://doi.org/10.1073/pnas.86.12.4412
Article Google Scholar
Mircea, I., Bocicor, M., Czibula, G.: Reinforcement learning based approach to multiple sequence alignment. Soft computing applications. Adv. Intell. Syst. Comput. 634, 54–70 (2018). https://doi.org/10.1007/978-3-319-62524-9_6
Mircea, I., Bocicor, M., Dincu, A.: On reinforcement learning based multiple sequence alignment. Studia Universitatis “Babes-Bolyai”, Informatica LIX, pp. 50–56 (2014)
Google Scholar
Naeem, M., Rizvi, S.T.H., Coronato, A.: A gentle introduction to reinforcement learning and its application in different fields. IEEE Access 8(5), 209320–209344 (2020). https://doi.org/10.1109/ACCESS.2020.3038605
Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48(3), 443–453 (1970). https://doi.org/10.1016/0022-2836(70)90057-4
Rashed, A.E.E.D., Amer, H.M., Seddek, M.E., Moustafa, H.E.D.: Sequence alignment using machine learning-based Needleman-Wunsch algorithm. IEEE Access 9, 109522–109535 (2021). https://doi.org/10.1109/ACCESS.2021.3100408
Song, Y.J., Cho, D.H.: Local alignment of DNA sequence based on deep reinforcement learning. IEEE Open J. Eng. Med. Biol. 2, 170–178 (2021). https://doi.org/10.1109/OJEMB.2021.3076156
Article Google Scholar
WHO Coronavirus (COVID-19) Dashboard. https://covid19.who.int/. Accessed 17 Apr 2023
Zou, Q., Hu, Q., Guo, M., Wang, G.: HAlign: fast multiple similar DNA/RNA sequence alignment based on the centre star strategy. Bioinformatics 31(15), 2475–2481 (2015). https://doi.org/10.1093/bioinformatics/btv177
Article Google Scholar

Download references

Acknowledgment

The authors gratefully acknowledge financial support from the Institut Teknologi Sepuluh Nopember for this work, under project scheme of the Publication Writing and IPR Incentive Program (PPHKI) 2023.

Author information

Authors and Affiliations

Department of Mathematics, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia
Zanuba Hilla Qudrotu Chofsoh, Imam Mukhlash, Mohammad Iqbal & Bandung Arry Sanjoyo

Authors

Zanuba Hilla Qudrotu Chofsoh
View author publications
You can also search for this author in PubMed Google Scholar
Imam Mukhlash
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Iqbal
View author publications
You can also search for this author in PubMed Google Scholar
Bandung Arry Sanjoyo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Imam Mukhlash .

Editor information

Editors and Affiliations

Centro de Engenharia Biológica, Universidade de Vigo, Braga, Portugal
Miguel Rocha
Computer Science Department, University of Minho, Vigo, Spain
Florentino Fdez-Riverola
Department of Genetics and Genomics, College of Medicine and Health Sciences, United Arab Emirates University, Al Ain, Abu Dhabi, United Arab Emirates
Mohd Saberi Mohamad
University of Salamanca, Salamanca, Spain
Ana Belén Gil-González

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chofsoh, Z.H.Q., Mukhlash, I., Iqbal, M., Sanjoyo, B.A. (2023). Progressive Multiple Sequence Alignment for COVID-19 Mutation Identification via Deep Reinforcement Learning. In: Rocha, M., Fdez-Riverola, F., Mohamad, M.S., Gil-González, A.B. (eds) Practical Applications of Computational Biology and Bioinformatics, 17th International Conference (PACBB 2023). PACBB 2023. Lecture Notes in Networks and Systems, vol 743. Springer, Cham. https://doi.org/10.1007/978-3-031-38079-2_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-38079-2_8
Published: 11 July 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-38078-5
Online ISBN: 978-3-031-38079-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics