Skip to main content

Progressive Multiple Sequence Alignment for COVID-19 Mutation Identification via Deep Reinforcement Learning

  • Conference paper
  • First Online:
Practical Applications of Computational Biology and Bioinformatics, 17th International Conference (PACBB 2023) (PACBB 2023)

Abstract

COVID-19 can mutate rapidly, resulting in new variants which could be more malignant. To recognize the new variant, we must identify the mutation parts by locating the nucleotide changes in the DNA sequence of COVID-19. The identification is by processing sequence alignment. In this work, we propose a method to perform multiple sequence alignment via deep reinforcement learning effectively. The proposed method integrates a progressive alignment approach by aligning each pairwise sequence center to deep Q networks. We designed the experiment by evaluating the proposed method on five COVID-19 variants: alpha, beta, delta, gamma, and omicron. The experiment results showed that the proposed method was successfully applied to align multiple COVID-19 DNA sequences by demonstrating that pairwise alignment processes can precisely locate the sequence mutation up to \(90\%\). Moreover, we effectively identify the mutation in multiple sequence alignments fashion by discovering around \(10.8\%\) conserved region of nitrogenous bases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
eBook
USD 149.00
Price excludes VAT (USA)
Softcover Book
USD 199.99
Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.ncbi.nlm.nih.gov/

References

  1. Isaev, A., Deem, M.: Introduction to mathematical methods in bioinformatics. Phys. Today 58, 83 (2005). https://doi.org/10.1063/1.2138428

  2. Jafari, R., Javidi, M.M., Kuchaki Rafsanjani, M.: Using deep reinforcement learning approach for solving the multiple sequence alignment problem. SN Applied Sciences 1(6), 1–12 (2019). https://doi.org/10.1007/s42452-019-0611-4

    Article  Google Scholar 

  3. Lipman, D.J., Altschul, S.F., Kececioglu, J.D.: A tool for multiple sequence alignment. Proc. Natl. Acad. Sci. U.S.A. 86(12), 4412–5 (1989). https://doi.org/10.1073/pnas.86.12.4412

    Article  Google Scholar 

  4. Mircea, I., Bocicor, M., Czibula, G.: Reinforcement learning based approach to multiple sequence alignment. Soft computing applications. Adv. Intell. Syst. Comput. 634, 54–70 (2018). https://doi.org/10.1007/978-3-319-62524-9_6

  5. Mircea, I., Bocicor, M., Dincu, A.: On reinforcement learning based multiple sequence alignment. Studia Universitatis “Babes-Bolyai”, Informatica LIX, pp. 50–56 (2014)

    Google Scholar 

  6. Naeem, M., Rizvi, S.T.H., Coronato, A.: A gentle introduction to reinforcement learning and its application in different fields. IEEE Access 8(5), 209320–209344 (2020). https://doi.org/10.1109/ACCESS.2020.3038605

  7. Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48(3), 443–453 (1970). https://doi.org/10.1016/0022-2836(70)90057-4

  8. Rashed, A.E.E.D., Amer, H.M., Seddek, M.E., Moustafa, H.E.D.: Sequence alignment using machine learning-based Needleman-Wunsch algorithm. IEEE Access 9, 109522–109535 (2021). https://doi.org/10.1109/ACCESS.2021.3100408

  9. Song, Y.J., Cho, D.H.: Local alignment of DNA sequence based on deep reinforcement learning. IEEE Open J. Eng. Med. Biol. 2, 170–178 (2021). https://doi.org/10.1109/OJEMB.2021.3076156

    Article  Google Scholar 

  10. WHO Coronavirus (COVID-19) Dashboard. https://covid19.who.int/. Accessed 17 Apr 2023

  11. Zou, Q., Hu, Q., Guo, M., Wang, G.: HAlign: fast multiple similar DNA/RNA sequence alignment based on the centre star strategy. Bioinformatics 31(15), 2475–2481 (2015). https://doi.org/10.1093/bioinformatics/btv177

    Article  Google Scholar 

Download references

Acknowledgment

The authors gratefully acknowledge financial support from the Institut Teknologi Sepuluh Nopember for this work, under project scheme of the Publication Writing and IPR Incentive Program (PPHKI) 2023.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Imam Mukhlash .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chofsoh, Z.H.Q., Mukhlash, I., Iqbal, M., Sanjoyo, B.A. (2023). Progressive Multiple Sequence Alignment for COVID-19 Mutation Identification via Deep Reinforcement Learning. In: Rocha, M., Fdez-Riverola, F., Mohamad, M.S., Gil-González, A.B. (eds) Practical Applications of Computational Biology and Bioinformatics, 17th International Conference (PACBB 2023). PACBB 2023. Lecture Notes in Networks and Systems, vol 743. Springer, Cham. https://doi.org/10.1007/978-3-031-38079-2_8

Download citation

Publish with us

Policies and ethics