×

On the complexity of Newman’s community finding approach for biological and social networks. (English) Zbl 1258.05118

Summary: Given a graph of interactions, a module (also called a community or cluster) is a subset of nodes whose fitness is a function of the statistical significance of the pairwise interactions of nodes in the module. The topic of this paper is a model-based community finding approach, commonly referred to as modularity clustering, that was originally proposed by Newman (E. A. Leicht and M. E. J. Newman [Phys. Rev. Lett. 100, 118703 (2008)]) and has subsequently been extremely popular in practice.
Various heuristic methods are currently employed for finding the optimal solution. However, as observed in G. Agarwal and D. Kempe [Eur. Phys. J. B, Condens. Matter Complex Syst. 66, No. 3, 409–418 (2008; Zbl 1188.90262)], the exact computational complexity of this approach is still largely unknown. To this end, we initiate a systematic study of the computational complexity of modularity clustering. Due to the specific quadratic nature of the modularity function, it is necessary to study its value on sparse graphs and dense graphs separately.
Our main results include a (\(1+\varepsilon \))-inapproximability for dense graphs and a logarithmic approximation for sparse graphs. We make use of several combinatorial properties of modularity to get these results. These are the first non-trivial approximability results beyond the NP-hardness results in U. Brandes et al. [IEEE Trans. Knowl. Data Eng. 20, No. 172–188 (2007)].

MSC:

05C90 Applications of graph theory
91D30 Social networks; opinion dynamics
92C42 Systems biology, networks
68W25 Approximation algorithms
05C82 Small world graphs, complex networks (graph-theoretic aspects)

Citations:

Zbl 1188.90262