August 2024 Statistical limits of correlation detection in trees
Luca Ganassali, Laurent Massoulié, Guilhem Semerjian
Author Affiliations +
Ann. Appl. Probab. 34(4): 3701-3734 (August 2024). DOI: 10.1214/23-AAP2048

Abstract

In this paper we address the problem of testing whether two observed trees (t,t) are sampled either independently or from a joint distribution under which they are correlated. This problem, which we refer to as correlation detection in trees, plays a key role in the study of graph alignment for two correlated random graphs. Motivated by graph alignment, we investigate the conditions of existence of one-sided tests, that is, tests which have vanishing type I error and nonvanishing power in the limit of large tree depth.

For the correlated Galton–Watson model with Poisson offspring of mean λ>0 and correlation parameter s(0,1), we identify a phase transition in the limit of large degrees at s=α, where α0.3383 is Otter’s constant. Namely, we prove that no such test exists for sα, and that such a test exists whenever s>α, for λ large enough.

This result sheds new light on the graph alignment problem in the sparse regime (with O(1) average node degrees) and on the performance of the MPAlign method studied in (Ganassali, Massoulié and Lelarge (2022), J. Stat. Mech. Theory Exp. 2022 (2022)), proving in particular the conjecture of (J. Stat. Mech. Theory Exp. 2022 (2022)) that MPAlign succeeds in the partial recovery task for correlation parameter s>α provided the average node degree λ is large enough.

As a byproduct, we identify a new family of orthogonal polynomials for the Poisson–Galton–Watson measure which enjoy remarkable properties. These polynomials may be of independent interest for a variety of problems involving graphs, trees or branching processes, beyond the scope of graph alignment.

Funding Statement

The first author was supported by the French government under management of Agence Nationale de la Recherche as part of the “Investissements d’avenir” program, reference ANR19-P3IA-0001 (PRAIRIE 3IA Institute).

Acknowledgments

The authors would like to thank Marc Lelarge and Guillaume Barraquand for useful discussions.

Citation

Download Citation

Luca Ganassali. Laurent Massoulié. Guilhem Semerjian. "Statistical limits of correlation detection in trees." Ann. Appl. Probab. 34 (4) 3701 - 3734, August 2024. https://doi.org/10.1214/23-AAP2048

Information

Received: 1 January 2023; Revised: 1 November 2023; Published: August 2024
First available in Project Euclid: 6 August 2024

Digital Object Identifier: 10.1214/23-AAP2048

Subjects:
Primary: 05C80 , 05C85
Secondary: 62F03 , 68R05

Keywords: combinatorics , Hypothesis testing , Random graphs , statistical inference

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.34 • No. 4 • August 2024
Back to Top