skip to main content
research-article
Open access

Optimally resilient codes for list-decoding from insertions and deletions

Published: 22 June 2020 Publication History

Abstract

We give a complete answer to the following basic question: ”What is the maximal fraction of deletions or insertions tolerable by q-ary list-decodable codes with non-vanishing information rate?”
This question has been open even for binary codes, including the restriction to the binary insertion-only setting, where the best-known result was that a γ≤ 0.707 fraction of insertions is tolerable by some binary code family.
For any desired є>0, we construct a family of binary codes of positive rate which can be efficiently list-decoded from any combination of γ fraction of insertions and δ fraction of deletions as long as γ+2δ≤ 1−ε. On the other hand, for any γ, δ with γ+2δ=1 list-decoding is impossible. Our result thus precisely characterizes the feasibility region of binary list-decodable codes for insertions and deletions.
We further generalize our result to codes over any finite alphabet of size q. Surprisingly, our work reveals that the feasibility region for q>2 is not the natural generalization of the binary bound above. We provide tight upper and lower bounds that precisely pin down the feasibility region, which turns out to have a (q−1)-piece-wise linear boundary whose q corner-points lie on a quadratic curve.
The main technical work in our results is proving the existence of code families of sufficiently large size with good list-decoding properties for any combination of δ,γ within the claimed feasibility region. We achieve this via an intricate analysis of codes introduced by [Bukh, Ma; SIAM J. Discrete Math; 2014]. Finally, we give a simple yet powerful concatenation scheme for list-decodable insertion-deletion codes which transforms any such (non-efficient) code family (with information rate zero) into an efficiently decodable code family with constant rate.

References

[1]
Erdal Arikan. 2008. Channel polarization: A method for constructing capacityachieving codes. In 2008 IEEE International Symposium on Information Theory. IEEE, 1173-1177.
[2]
Jaroslaw Blasiok, Venkatesan Guruswami, Preetum Nakkiran, Atri Rudra, and Madhu Sudan. 2018. General strong polarization. In Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing. 485-492.
[3]
Joshua Brakensiek, Venkatesan Guruswami, and Samuel Zbarsky. 2018. Eficient Low-Redundancy Codes for Correcting Multiple Deletions. IEEE Trans. Information Theory 64, 5 ( 2018 ), 3403-3410.
[4]
Boris Bukh and Venkatesan Guruswami. 2016. An improved bound on the fraction of correctable deletions. In Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). 1893-1901.
[5]
Boris Bukh, Venkatesan Guruswami, and Johan Håstad. 2017. An improved bound on the fraction of correctable deletions. IEEE Transactions on Information Theory 63, 1 ( 2017 ), 93-103.
[6]
Boris Bukh and Jie Ma. 2014. Longest common subsequences in sets of words. SIAM Journal on Discrete Mathematics 28, 4 ( 2014 ), 2042-2049.
[7]
Kuan Cheng, Bernhard Haeupler, Xin Li, Amirbehshad Shahrasbi, and Ke Wu. 2019. Synchronization strings: highly eficient deterministic constructions over small alphabets. In Annual ACM-SIAM Symposium on Discrete Algorithms (SODA).
[8]
Kuan Cheng, Zhengzhong Jin, Xin Li, and Ke Wu. 2018. Deterministic document exchange protocols, and almost optimal binary codes for edit errors. In Proceedings of the Annual Symposium on Foundations of Computer Science (FOCS).
[9]
Kuan Cheng, Zhengzhong Jin, Xin Li, and Ke Wu. 2019. Block Edit Errors with Transpositions: Deterministic Document Exchange Protocols and Almost Optimal Binary Codes. In International Colloquium on Automata, Languages, and Programming (ICALP).
[10]
Vladimír Dančík. 1994. Expected length of longest common subsequences. Ph.D. Dissertation. University of Warwick.
[11]
Vlado Dančík and Mike Paterson. 1995. Upper bounds for the expected length of a longest common subsequence of two binary sequences. Random Structures & Algorithms 6, 4 ( 1995 ), 449-458.
[12]
Venkatesan Guruswami and Ray Li. 2016. Eficiently decodable insertion/deletion codes for high-noise and high-rate regimes. In Information Theory (ISIT), 2016 IEEE International Symposium on. IEEE, 620-624.
[13]
Venkatesan Guruswami and Ray Li. 2018. Coding against deletions in oblivious and online models. In Proceedings of the 29th Annual ACM-SIAM Symposium on Discrete Algorithms. 625-643.
[14]
Venkatesan Guruswami and Carol Wang. 2017. Deletion codes in the high-noise and high-rate regimes. IEEE Transactions on Information Theory 63, 4 ( 2017 ), 1961-1970.
[15]
Venkatesan Guruswami and Chaoping Xing. 2013. List decoding Reed-Solomon, Algebraic-Geometric, and Gabidulin subcodes up to the Singleton bound. In Proceedings of the forty-fifth annual ACM symposium on Theory of computing. ACM, 843-852.
[16]
Bernhard Haeupler. 2019. Optimal document exchange and new codes for insertions and deletions. In Proceedings of the Annual Symposium on Foundations of Computer Science (FOCS). 334-347.
[17]
Bernhard Haeupler, Aviad Rubinstein, and Amirbehshad Shahrasbi. 2019. Nearlinear time insertion-deletion codes and (1+)-approximating edit distance via indexing. In Proceedings of the Annual Symposium on Theory of Computing (STOC). 697-708.
[18]
Bernhard Haeupler and Amirbehshad Shahrasbi. 2017. Synchronization Strings: Codes for Insertions and Deletions Approaching the Singleton Bound. In Proceedings of the Annual Symposium on Theory of Computing (STOC). 33-46.
[19]
Bernhard Haeupler and Amirbehshad Shahrasbi. 2018. Synchronization Strings: Explicit Constructions, Local Decoding, and Applications. In Proceedings of the Annual Symposium on Theory of Computing (STOC). 841-854.
[20]
Bernhard Haeupler, Amirbehshad Shahrasbi, and Madhu Sudan. 2018. Synchronization Strings: List Decoding for Insertions and Deletions. In 45th International Colloquium on Automata, Languages, and Programming (ICALP). 76 : 1-76 : 14.
[21]
Bernhard Haeupler, Amirbehshad Shahrasbi, and Ellen Vitercik. 2018. Synchronization Strings: Channel Simulations and Interactive Coding for Insertions and Deletions. In 45th International Colloquium on Automata, Languages, and Programming (ICALP). 75 : 1-75 : 14.
[22]
Tomohiro Hayashi and Kenji Yasunaga. 2018. On the List Decodability of Insertions and Deletions. In 2018 IEEE International Symposium on Information Theory (ISIT). IEEE, 86-90.
[23]
Marcos Kiwi, Martin Loebl, and Jiří Matoušek. 2005. Expected length of the longest common subsequence for large alphabets. Advances in Mathematics 197, 2 ( 2005 ), 480-498.
[24]
Vladimir Levenshtein. 1965. Binary codes capable of correcting deletions, insertions, and reversals. Doklady Akademii Nauk SSSR 163 4 ( 1965 ), 845-848.
[25]
Shu Liu, Ivan Tjuawinata, and Chaoping Xing. 2019. Explicit Constructions of Two-Dimensional Reed-Solomon Codes in High Insertion and Deletion Noise Regime. arXiv preprint arXiv: 1909. 03426 ( 2019 ).
[26]
Shu Liu, Ivan Tjuawinata, and Chaoping Xing. 2019. List Decoding of Insertion and Deletion Codes. arXiv preprint arXiv: 1906. 09705 ( 2019 ).
[27]
George S Lueker. 2009. Improved bounds on the average length of longest common subsequences. Journal of the ACM (JACM) 56, 3 ( 2009 ), 17.
[28]
Hugues Mercier, Vijay K Bhargava, and Vahid Tarokh. 2010. A survey of errorcorrecting codes for channels with symbol synchronization errors. IEEE Communications Surveys & Tutorials 12, 1 ( 2010 ).
[29]
Michael Mitzenmacher. 2009. A survey of results for deletion channels and related synchronization channels. Probability Surveys 6 ( 2009 ), 1-33.
[30]
Leonard J. Schulman and David Zuckerman. 1999. Asymptotically good codes correcting insertions, deletions, and transpositions. IEEE transactions on information theory 45, 7 ( 1999 ), 2552-2557.
[31]
Neil J. A Sloane. 2002. On single-deletion-correcting codes. Codes and designs 10 ( 2002 ), 273-291.
[32]
Antonia Wachter-Zeh. 2018. List Decoding of Insertions and Deletions. IEEE Trans. Information Theory 64, 9 ( 2018 ), 6297-6304.

Cited By

View all
  • (2024)Optimal Two-Dimensional Reed–Solomon Codes Correcting Insertions and DeletionsIEEE Transactions on Information Theory10.1109/TIT.2024.338784870:7(5012-5016)Online publication date: Jul-2024
  • (2024)Multiple Packing: Lower Bounds via Error ExponentsIEEE Transactions on Information Theory10.1109/TIT.2023.333403270:2(1008-1039)Online publication date: Feb-2024
  • (2024)On the Intersection of Multiple Insertion (or Deletion) Balls and its Application to List Decoding Under the Reconstruction ModelIEEE Transactions on Information Theory10.1109/TIT.2023.331960870:5(3262-3297)Online publication date: May-2024
  • Show More Cited By

Index Terms

  1. Optimally resilient codes for list-decoding from insertions and deletions

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    STOC 2020: Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing
    June 2020
    1429 pages
    ISBN:9781450369794
    DOI:10.1145/3357713
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 June 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Coding for Insertions and Deletions
    2. Error Resilience
    3. List Decoding

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    STOC '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)115
    • Downloads (Last 6 weeks)12
    Reflects downloads up to 21 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Optimal Two-Dimensional Reed–Solomon Codes Correcting Insertions and DeletionsIEEE Transactions on Information Theory10.1109/TIT.2024.338784870:7(5012-5016)Online publication date: Jul-2024
    • (2024)Multiple Packing: Lower Bounds via Error ExponentsIEEE Transactions on Information Theory10.1109/TIT.2023.333403270:2(1008-1039)Online publication date: Feb-2024
    • (2024)On the Intersection of Multiple Insertion (or Deletion) Balls and its Application to List Decoding Under the Reconstruction ModelIEEE Transactions on Information Theory10.1109/TIT.2023.331960870:5(3262-3297)Online publication date: May-2024
    • (2023)A Lower Bound on the List-Decodability of Insdel CodesIEEE Transactions on Information Theory10.1109/TIT.2023.330286269:11(6989-7002)Online publication date: 7-Aug-2023
    • (2023)Multiple Packing:Lower Bounds via Infinite ConstellationsIEEE Transactions on Information Theory10.1109/TIT.2023.326095069:7(4513-4527)Online publication date: Jul-2023
    • (2023)Strict Half-Singleton Bound, Strict Direct Upper Bound for Linear Insertion-Deletion Codes and Optimal CodesIEEE Transactions on Information Theory10.1109/TIT.2023.323496769:5(2900-2910)Online publication date: 1-May-2023
    • (2023)The Zero-Rate Threshold for Adversarial Bit-Deletions is Less Than 1/2IEEE Transactions on Information Theory10.1109/TIT.2022.322302369:4(2218-2239)Online publication date: 1-Apr-2023
    • (2023)Beyond Single-Deletion Correcting Codes: Substitutions and TranspositionsIEEE Transactions on Information Theory10.1109/TIT.2022.320285669:1(169-186)Online publication date: Jan-2023
    • (2022)Coordinate-Ordering-Free Upper Bounds for Linear Insertion-Deletion CodesIEEE Transactions on Information Theory10.1109/TIT.2022.316766268:8(5126-5132)Online publication date: 1-Aug-2022
    • (2022)Rate-Distance Trade-offs for List-Decodable Insertion-Deletion Codes2022 IEEE Information Theory Workshop (ITW)10.1109/ITW54588.2022.9965935(470-475)Online publication date: 1-Nov-2022
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media