×

Trace reconstruction: generalized and parameterized. (English) Zbl 07525505

Bender, Michael A. (ed.) et al., 27th annual European symposium on algorithms, ESA 2019, Munich/Garching, Germany, September 9–11, 2019. Proceedings. Wadern: Schloss Dagstuhl – Leibniz-Zentrum für Informatik. LIPIcs – Leibniz Int. Proc. Inform. 144, Article 68, 25 p. (2019).
Summary: In the beautifully simple-to-state problem of trace reconstruction, the goal is to reconstruct an unknown binary string \(x\) given random “traces” of \(x\) where each trace is generated by deleting each coordinate of \(x\) independently with probability \(p<1\). The problem is well studied both when the unknown string is arbitrary and when it is chosen uniformly at random. For both settings, there is still an exponential gap between upper and lower sample complexity bounds and our understanding of the problem is still surprisingly limited. In this paper, we consider natural parameterizations and generalizations of this problem in an effort to attain a deeper and more comprehensive understanding. Perhaps our most surprising results are:
1.
We prove that \(\exp(O(n^{1/4}\sqrt{\log n}))\) traces suffice for reconstructing arbitrary matrices. In the matrix version of the problem, each row and column of an unknown \(\sqrt{n}\times\sqrt{n}\) matrix is deleted independently with probability \(p\). Our results contrasts with the best known results for sequence reconstruction where the best known upper bound is \(\exp(O(n^{1/3}))\).
2.
An optimal result for random matrix reconstruction: we show that \(\Theta(\log n)\) traces are necessary and sufficient. This is in contrast to the problem for random sequences where there is a super-logarithmic lower bound and the best known upper bound is \(\exp(O(\log^{1/3}n))\).
3.
We show that \(\exp(O(k^{1/3}\log^{2/3}n))\) traces suffice to reconstruct \(k\)-sparse strings, providing an improvement over the best known sequence reconstruction results when \(k=o(n/\log^2 n)\).
4.
We show that poly\((n)\) traces suffice if \(x\) is \(k\)-sparse and we additionally have a “separation” promise, specifically that the indices of \(1\)’s in \(x\) all differ by \(\Omega(k\log n)\).

For the entire collection see [Zbl 1423.68016].

MSC:

68Wxx Algorithms in computer science
Full Text: DOI

References:

[1] Dimitris Achlioptas and Frank McSherry. On spectral learning of mixtures of distributions. In Conference on Learning Theory, 2005. · Zbl 1137.68512
[2] Sanjeev Arora and Ravi Kannan. Learning mixtures of arbitrary gaussians. In Symposium on Theory of Computing, 2001. · Zbl 1323.68440
[3] Frank Ban, Xi Chen, Adam Frelich, Rocco A. Servedio, and Sandip Sinha. Beyond trace reconstruction: Population recovery from the deletion channel. arXiv, 2019. arXiv:1904.05532.
[4] Tugkan Batu, Sampath Kannan, Sanjeev Khanna, and Andrew McGregor. Reconstructing strings from random traces. In Symposium on Discrete Algorithms, 2004. · Zbl 1318.68206
[5] Mikhail Belkin and Kaushik Sinha. Polynomial learning of distribution families. In Foundations of Computer Science, 2010. · Zbl 1335.68100
[6] P. Borwein and T. Erdélyi. Littlewood-Type Problems on Subarcs of the Unit Circle. Indiana University Mathematics Journal, 1997. · Zbl 0930.30005
[7] Siu-On Chan, Ilias Diakonikolas, Rocco A Servedio, and Xiaorui Sun. Learning mixtures of structured distributions over discrete domains. In Symposium on Discrete Algorithms, 2013. · Zbl 1421.68144
[8] Mahdi Cheraghchi, Ryan Gabrys, Olgica Milenkovic, and João Ribeiro. Coded trace recon-struction. arXiv e-prints, page arXiv:1903.09992, March 2019. arXiv:1903.09992.
[9] Sanjoy Dasgupta. Learning mixtures of Gaussians. In Foundations of Computer Science, 1999.
[10] Sami Davies, Miklos Z. Racz, and Cyrus Rashtchian. Reconstructing Trees from Traces. arXiv e-prints, page arXiv:1902.05101, February 2019. arXiv:1902.05101.
[11] Anindya De, Ryan O’Donnell, and Rocco A. Servedio. Optimal mean-based algorithms for trace reconstruction. In Symposium on Theory of Computing, 2017. · Zbl 1369.68202
[12] Jon Feldman, Ryan O’Donnell, and Rocco A Servedio. Learning mixtures of product distribu-tions over discrete domains. SIAM Journal on Computing, 2008. · Zbl 1178.68296
[13] Anna C. Gilbert and Piotr Indyk. Sparse Recovery Using Sparse Matrices. Proceedings of the IEEE, 2010. 68:25
[14] Moritz Hardt and Eric Price. Tight bounds for learning a mixture of two gaussians. In Symposium on Theory of Computing, 2015. · Zbl 1321.68405
[15] Lisa Hartung, Nina Holden, and Yuval Peres. Trace reconstruction with varying deletion probabilities. In Workshop on Analytic Algorithmics and Combinatorics, 2018. · Zbl 1430.68101
[16] Nina Holden and Russell Lyons. Lower bounds for trace reconstruction. arXiv, 2018. arXiv: 1808.02336. · Zbl 1445.62014
[17] Nina Holden, Robin Pemantle, and Yuval Peres. Subpolynomial trace reconstruction for random strings and arbitrary deletion probability. In Conference On Learning Theory, COLT 2018, Stockholm, Sweden, 6-9 July 2018., pages 1799-1840, 2018.
[18] Thomas Holenstein, Michael Mitzenmacher, Rina Panigrahy, and Udi Wieder. Trace recon-struction with constant deletion probability and related results. In Symposium on Discrete Algorithms, 2008. · Zbl 1192.94064
[19] Samuel B Hopkins and Jerry Li. Mixture models, robustness, and sum of squares proofs. In Symposium on Theory of Computing, 2018. · Zbl 1428.68240
[20] Adam Tauman Kalai, Ankur Moitra, and Gregory Valiant. Efficiently learning mixtures of two Gaussians. In Symposium on Theory of Computing, 2010. · Zbl 1293.68229
[21] Sampath Kannan and Andrew McGregor. More on Reconstructing Strings from Random Traces: Insertions and Deletions. In International Symposium on Information Theory, 2005.
[22] Géza Kós, Péter Ligeti, and Péter Sziklai. Reconstruction of matrices from submatrices. Mathematics of Computation, 2009. doi:10.1090/S0025-5718-09-02210-8. · Zbl 1198.05023 · doi:10.1090/S0025-5718-09-02210-8
[23] I. Krasikov and Y. Roditty. On a Reconstruction Problem for Sequences. Journal of Combin-atorial Theory, Series A, 1997. · Zbl 0871.05002
[24] Andrew McGregor, Eric Price, and Sofya Vorotnikova. Trace Reconstruction Revisited. In European Symposium on Algorithms, 2014. · Zbl 1425.68470
[25] Ankur Moitra and Gregory Valiant. Settling the polynomial learnability of mixtures of gaussians. In Foundations of Computer Science, 2010. · Zbl 1293.68229
[26] Fedor Nazarov and Yuval Peres. Trace reconstruction with exp(O(n 1/3 ) samples. In Symposium on Theory of Computing, 2017. · Zbl 1370.68087
[27] Yuval Peres and Alex Zhai. Average-Case Reconstruction for the Deletion Channel: Sub-polynomially Many Traces Suffice. In Symposium on Foundations of Computer Science, 2017.
[28] Krishnamurthy Viswanathan and Ram Swaminathan. Improved string reconstruction over insertion-deletion channels. In Symposium on Discrete Algorithms, 2008. · Zbl 1192.94072
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.