×

Equivalence classes and conditional hardness in massively parallel computations. (English) Zbl 1483.68032

Summary: The Massively Parallel Computation (MPC) model serves as a common abstraction of many modern large-scale data processing frameworks, and has been receiving increasingly more attention over the past few years, especially in the context of classical graph problems. So far, the only way to argue lower bounds for this model is to condition on conjectures about the hardness of some specific problems, such as graph connectivity on promise graphs that are either one cycle or two cycles, usually called the one cycle versus two cycles problem. This is unlike the traditional arguments based on conjectures about complexity classes (e.g., \(\mathsf{P}\ne \mathsf{NP}\)), which are often more robust in the sense that refuting them would lead to groundbreaking algorithms for a whole bunch of problems. In this paper we present connections between problems and classes of problems that allow the latter type of arguments. These connections concern the class of problems solvable in a sublogarithmic amount of rounds in the MPC model, denoted by \(\mathsf{MPC}(o(\log N))\), and the standard space complexity classes \(\mathsf{L}\) and \(\mathsf{NL}\), and suggest conjectures that are robust in the sense that refuting them would lead to many surprisingly fast new algorithms in the MPC model. We also obtain new conditional lower bounds, and prove new reductions and equivalences between problems in the MPC model. Specifically, our main results are as follows.
Lower bounds conditioned on the one cycle versus two cycles conjecture can be instead argued under the \(\mathsf{L}\nsubseteq \mathsf{MPC}(o(\log N))\) conjecture: these two assumptions are equivalent, and refuting either of them would lead to \(o(\log N)\)-round MPC algorithms for a large number of challenging problems, including list ranking, minimum cut, and planarity testing. In fact, we show that these problems and many others require asymptotically the same number of rounds as the seemingly much easier problem of distinguishing between a graph being one cycle or two cycles.
Many lower bounds previously argued under the one cycle versus two cycles conjecture can be argued under an even more robust (thus harder to refute) conjecture, namely \(\mathsf{NL}\nsubseteq \mathsf{MPC}(o(\log N))\). Refuting this conjecture would lead to \(o(\log N)\)-round MPC algorithms for an even larger set of problems, including all-pairs shortest paths, betweenness centrality, and all aforementioned ones. Lower bounds under this conjecture hold for problems such as perfect matching and network flow.

MSC:

68M14 Distributed systems
68Q10 Modes of computation (nondeterministic, parallel, interactive, probabilistic, etc.)
68Q17 Computational difficulty of problems (lower bounds, completeness, difficulty of approximation, etc.)
68Q25 Analysis of algorithms and problem complexity
68R10 Graph theory (including graph drawing) in computer science

References:

[1] Abboud, A., Grandoni, F., Vassilevska Williams, V.: Subcubic equivalences between graph centrality problems, APSP and diameter. In: Proceedings of the 26th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1681-1697 (2015) · Zbl 1371.68203
[2] Adler, M. Dittrich, W. Juurlink, B.H.H., Kutylowski, M., Rieping, I.: Communication-optimal parallel minimum spanning tree algorithms. In: Proceedings of the 10th Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), pp. 27-36 (1998)
[3] Afrati, FN; Sarma, AD; Salihoglu, S.; Ullman, JD, Upper and lower bounds on the cost of a Map-Reduce computation, PVLDB, 6, 4, 277-288 (2013)
[4] Ahn, K.J., Guha, S., McGregor, A.: Analyzing graph structure via linear measurements. In: Proceedings of the 23rd Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 459-467 (2012) · Zbl 1422.68176
[5] Allender, E.; Mahajan, M., The complexity of planarity testing, Inf. Comput., 189, 1, 117-134 (2004) · Zbl 1072.68045 · doi:10.1016/j.ic.2003.09.002
[6] Àlvarez, C.; Greenlaw, R., A compendium of problems complete for symmetric logarithmic space, Comput. Complex., 9, 2, 123-145 (2000) · Zbl 0970.68070 · doi:10.1007/PL00001603
[7] Andoni, A., Nikolov, A., Onak, K., Yaroslavtsev, G.: Parallel algorithms for geometric graph problems. In: Proceedings of the 46th Annual ACM SIGACT Symposium on Theory of Computing (STOC), pp. 574-583 (2014) · Zbl 1315.05133
[8] Andoni, A., Stein, C., Song, Z., Wang, Z., Zhong, P.: Parallel graph connectivity in log diameter rounds. In: Proceedings of the 59th Annual IEEE Symposium on Foundations of Computer Science (FOCS), pp. 674-685 (2018) · Zbl 1405.68005
[9] Andoni, A., Stein, C., Zhong, P.: Log diameter rounds algorithms for \(2\)-vertex and \(2\)-edge connectivity. In: Proceedings of the 46th International Colloquium on Automata, Languages, and Programming (ICALP), pp. 14:1-14:16 (2019) · Zbl 07561507
[10] Arora, S.; Barak, B., Computational Complexity: A Modern Approach (2009), Cambridge: Cambridge University Press, Cambridge · Zbl 1193.68112 · doi:10.1017/CBO9780511804090
[11] Assadi, S., Bateni, M., Bernstein, A., Mirrokni, V.S., Stein, C.: Coresets meet EDCS: algorithms for matching and vertex cover on massive graphs. In: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1616-1635 (2019) · Zbl 1431.68143
[12] Assadi, S., Chen, Y., Khanna, Y.: Sublinear algorithms for \(( \Delta +1)\) vertex coloring. In: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 767-786 (2019) · Zbl 1431.68174
[13] Assadi , S., Khanna, S.: Tight bounds on the round complexity of the distributed maximum coverage problem. In: Proceedings of the 29th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 2412-2431 (2018) · Zbl 1403.68325
[14] Assadi, S., Sun, X., Weinstein, O.: Massively parallel algorithms for finding well-connected components in sparse graphs. In: Proceedings of the 38th ACM Symposium on Principles of Distributed Computing (PODC), pp. 461-470 (2019) · Zbl 1542.68123
[15] Beame, P.; Koutris, P.; Suciu, D., Communication steps for parallel query processing, J. ACM, 64, 6, 1-58 (2017) · Zbl 1426.68074 · doi:10.1145/3125644
[16] Beaudry, M.; McKenzie, P., Circuits, matrices, and nonassociative computation, J. Comput. Syst. Sci., 50, 3, 441-455 (1995) · Zbl 0837.68031 · doi:10.1006/jcss.1995.1035
[17] Behnezhad, S., Brandt, S., Derakhshan, M., Fischer, M., Hajiaghayi, M., Karp, R.M., Uitto, J.: Massively parallel computation of matching and MIS in sparse graphs. In: Proceedings of the 38th ACM Symposium on Principles of Distributed Computing (PODC), pp. 481-490 (2019) · Zbl 1542.68125
[18] Behnezhad, S., Derakhshan, M., Hajiaghayi, M.: Semi-MapReduce meets congested clique. CoRR (2018). arXiv:1802.10297 · Zbl 1499.68250
[19] Behnezhad, S., Dhulipala, L., Esfandiari, H., Lacki, J., Mirrokni, V.: Near-optimal massively parallel graph connectivity. In: Proceedings of the 60th IEEE Annual Symposium on Foundations of Computer Science (FOCS), pp. 1615-1636 (2019)
[20] Behnezhad, S.; Dhulipala, L.; Esfandiari, H.; Lacki, J.; Mirrokni, VS; Schudy, W., Parallel graph algorithms in constant adaptive rounds: theory meets practice, Proc. VLDB Endow., 13, 13, 3588-3602 (2020) · doi:10.14778/3424573.3424579
[21] Behnezhad, S., Hajiaghayi, M., Harris, D.G.: Exponentially faster massively parallel maximal matching. In: Proceedings of the 60th IEEE Annual Symposium on Foundations of Computer Science (FOCS), pp. 1637-1649 (2019)
[22] Bilardi, G.; Scquizzato, M.; Silvestri, F., A lower bound technique for communication in BSP, ACM Trans. Parallel Comput., 4, 3, 1-27 (2018) · doi:10.1145/3181776
[23] Borodin, A.; Cook, SA; Dymond, PW; Ruzzo, WL; Tompa, M., Two applications of inductive counting for complementation problems, SIAM J. Comput., 18, 3, 559-578 (1989) · Zbl 0678.68031 · doi:10.1137/0218038
[24] Boroujeni, M., Ehsani, S., Ghodsi, M., Hajiaghayi, M.T. , Seddighin, S.: Approximating edit distance in truly subquadratic time: quantum and MapReduce. In: Proceedings of the 29th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1170-1189 (2018) · Zbl 1403.68367
[25] Buss, S.R.: The Boolean formula value problem is in ALOGTIME. In: Proceedings of the 19th Annual ACM Symposium on Theory of Computing (STOC), pp. 123-131 (1987)
[26] Chandra, AK; Stockmeyer, LJ; Vishkin, U., Constant depth reducibility, SIAM J. Comput., 13, 2, 423-439 (1984) · Zbl 0538.68038 · doi:10.1137/0213028
[27] Chang, Y.-J., Fischer, M., Ghaffari, M., Uitto, J., Zheng, Y.: The complexity of \(( \Delta +1)\) coloring in congested clique, massively parallel computation, and centralized local computation. In: Proceedings of the 38th ACM Symposium on Principles of Distributed Computing (PODC), pp. 471-480 (2019) · Zbl 1530.68198
[28] Chung, K., Ho, K., Sun, X.: On the hardness of massively parallel computation. In: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), pp. 153-162 (2020)
[29] Cook, SA, A taxonomy of problems with fast parallel algorithms, Inf. Control, 64, 1-3, 2-22 (1985) · Zbl 0575.68045 · doi:10.1016/S0019-9958(85)80041-3
[30] Cook, SA; McKenzie, P., Problems complete for deterministic logarithmic space, J. Algorithms, 8, 3, 385-394 (1987) · Zbl 0644.68058 · doi:10.1016/0196-6774(87)90018-6
[31] Czumaj, A., Davies, P., Parter, M.: Component stability in low-space massively parallel computation. In: Proceedings of the 40th ACM Symposium on Principles of Distributed Computing (PODC), pp. 481-491 (2021) · Zbl 07475095
[32] Czumaj, A.; Davies, P.; Parter, M., Graph sparsification for derandomizing massively parallel computation with low space, ACM Trans. Algorithms, 17, 2, 1-27 (2021) · Zbl 07475095 · doi:10.1145/3451992
[33] Czumaj, A.; Lacki, J.; Madry, A.; Mitrovic, S.; Onak, K.; Sankowski, P., Round compression for parallel matching algorithms, SIAM J. Comput., 49, 5, STOC18-1 (2020) · Zbl 1445.68331 · doi:10.1137/18M1197655
[34] Das, B.; Datta, S.; Nimbhorkar, P., Log-space algorithms for paths and matchings in \(k\)-trees, Theory Comput. Syst., 53, 4, 669-689 (2013) · Zbl 1281.68132 · doi:10.1007/s00224-013-9469-9
[35] Dean, J.; Ghemawat, S., MapReduce: simplified data processing on large clusters, Commun. ACM, 51, 1, 107-113 (2008) · doi:10.1145/1327452.1327492
[36] Dhulipala, L., Durfee, D., Kulkarni, J., Peng, R., Sawlani, S., Sun, X.: Parallel batch-dynamic graphs: algorithms and lower bounds. In: Proceedings of the 31st Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1300-1319 (2020) · Zbl 07304102
[37] Drucker, A., Kuhn, F., Oshman, R.: On the power of the congested clique model. In: Proceedings of the 2014 ACM Symposium on Principles of Distributed Computing (PODC), pp. 367-376 (2014) · Zbl 1321.68381
[38] Etessami, K., Counting quantifiers, successor relations, and logarithmic space, J. Comput. Syst. Sci., 54, 3, 400-411 (1997) · Zbl 0882.68074 · doi:10.1006/jcss.1997.1485
[39] Fischer, M.J., Meyer, A.R.: Boolean matrix multiplication and transitive closure. In: Proceedings of the 12th Annual IEEE Symposium on Foundations of Computer Science (FOCS), pp. 129-131 (1971)
[40] Fish, B., Kun, J., Lelkes, Á.D., Reyzin, L., Turán, G.: On the computational complexity of MapReduce. In: Proceedings of the 29th International Symposium on Distributed Computing (DISC), pp. 1-15 (2015) · Zbl 1394.68176
[41] Frei, F., Wada, K.: Efficient circuit simulation in MapReduce. In: Proceedings of the 30th International Symposium on Algorithms and Computation (ISAAC), pp. 52:1-52:21 (2019) · Zbl 07650285
[42] Ghaffari, M., Gouleakis, T., Konrad, C., Mitrovic, S., Rubinfeld, R.: Improved massively parallel computation algorithms for MIS, matching, and vertex cover. In: Proceedings of the 37th ACM Symposium on Principles of Distributed Computing (PODC), pp. 129-138 (2018) · Zbl 1428.68376
[43] Ghaffari, M., Kuhn, F., Uitto, J.: Conditional hardness results for massively parallel computation from distributed lower bounds. In: Proceedings of the 60th IEEE Annual Symposium on Foundations of Computer Science (FOCS), pp. 1650-1663 (2019)
[44] Ghaffari, M., Nowicki, K.: Massively parallel algorithms for minimum cut. In: Proceedings of the 39th ACM Symposium on Principles of Distributed Computing (PODC), pp. 119-128 (2020) · Zbl 07323177
[45] Ghaffari, M., Uitto, J.: Sparsifying distributed algorithms with ramifications in massively parallel computation and centralized local computation. In:Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1636-1653 (2019) · Zbl 1431.68133
[46] Goodrich, MT, Communication-efficient parallel sorting, SIAM J. Comput., 29, 2, 416-432 (1999) · Zbl 0939.68166 · doi:10.1137/S0097539795294141
[47] Goodrich, M.T., Sitchinava, N., Zhang, Q.: Sorting, searching, and simulation in the MapReduce framework. In: Proceedings of the 22nd International Symposium on Algorithms and Computation (ISAAC), pp. 374-383 (2011) · Zbl 1350.68085
[48] Hajiaghayi, M., Seddighin, S., Sun, X.: Massively parallel approximation algorithms for edit distance and longest common subsequence. In: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1654-1672 (2019) · Zbl 1431.68171
[49] Hegeman, J.W., Pandurangan, G., Pemmaraju, S.V., Sardeshmukh, V.B., Scquizzato, M.: Toward optimal bounds in the congested clique: graph connectivity and MST. In: Proceedings of the 34th ACM Symposium on Principles of Distributed Computing (PODC), pp. 91-100 (2015) · Zbl 1333.68211
[50] Hegeman, JW; Pemmaraju, SV, Lessons from the congested clique applied to MapReduce, Theor. Comput. Sci., 608, 268-281 (2015) · Zbl 1333.68277 · doi:10.1016/j.tcs.2015.09.029
[51] Im, S.,Moseley, B.: A conditional lower bound on graph connectivity in MapReduce. CoRR (2019). arXiv: 1904.08954
[52] Im, S., Moseley, B., Sun, X.: Efficient massively parallel methods for dynamic programming. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing (STOC), pp. 798-811 (2017) · Zbl 1370.68316
[53] Immerman, N., Descriptive Complexity (1999), Berlin: Springer, Berlin · Zbl 0918.68031 · doi:10.1007/978-1-4612-0539-5
[54] Isard, M., Budiu, M., Yu, Y., Birrell, A.,Fetterly, D.: Dryad: distributed data-parallel programs from sequential building blocks. In: Proceedings of the 2007 EuroSys Conference, pp. 59-72 (2007)
[55] Jacob, R., Lieber, T., Sitchinava, N.: On the complexity of list ranking in the parallel external memory model. In: Proceedings of the 39th International Symposium on Mathematical Foundations of Computer Science (MFCS), pp. 384-395 (2014) · Zbl 1426.68124
[56] Jones, ND, Space-bounded reducibility among combinatorial problems, J. Comput. Syst. Sci., 11, 1, 68-85 (1975) · Zbl 0317.02039 · doi:10.1016/S0022-0000(75)80050-X
[57] Jones, ND; Lien, YE; Laaser, WT, New problems complete for nondeterministic log space, Math. Syst. Theory, 10, 1-17 (1976) · Zbl 0341.68035 · doi:10.1007/BF01683259
[58] Karger, D.R.: Global min-cuts in RNC, and other ramifications of a simple min-cut algorithm. In: Proceedings of the 4th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 21-30 (1993) · Zbl 0801.68124
[59] Karloff, H.J., Suri, S., Vassilvitskii, S.: A model of computation for MapReduce. In: Proceedings of the 21st annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 938-948 (2010) · Zbl 1288.68247
[60] Kiveris, R., Lattanzi, S., Mirrokni, V.S., Rastogi, V., Vassilvitskii, S.: Connected components in MapReduce and beyond. In: Proceedings of the ACM Symposium on Cloud Computing (SoCC), pp. 18:1-18:13 (2014)
[61] Klauck, H., Nanongkai, D., Pandurangan, G.,Robinson, P.: Distributed computation of large-scale graph problems. In: Proceedings of the 26th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 391-410 (2015) · Zbl 1371.68214
[62] Korhonen, J.H., Suomela, J.: Towards a complexity theory for the congested clique. In: Proceedings of the 30th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), pp. 163-172 (2018)
[63] Kumar, R.; Moseley, B.; Vassilvitskii, S.; Vattani, A., Fast greedy algorithms in MapReduce and streaming, ACM Trans. Parallel Comput., 2, 3, 1-22 (2015) · doi:10.1145/2809814
[64] Lacki, J., Mitrovic, S., Onak, K., Sankowski,P.: Walking randomly, massively, and efficiently. In: Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing (STOC), pp. 364-377 (2020) · Zbl 07298254
[65] Lattanzi, S., Moseley, B., Suri, S., Vassilvitskii, S.: Filtering: a method for solving graph problems in MapReduce. In: Proceedings of the 23rd ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), pp. 85-94 (2011)
[66] Lewis, HR; Papadimitriou, CH, Symmetric space-bounded computation, Theor. Comput. Sci., 19, 161-187 (1982) · Zbl 0491.68045 · doi:10.1016/0304-3975(82)90058-5
[67] MacKenzie, P.D., Ramachandran, V.: Computational bounds for fundamental problems on general-purpose parallel models. In: Proceedings of the 10th Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), pp. 152-163 (1998)
[68] Nanongkai, D., Scquizzato, M.: Equivalence classes and conditional hardness in massively parallel computations. In: Proceedings of the 23rd International Conference on Principles of Distributed Systems (OPODIS), pp. 33:1-33:16 (2019) · Zbl 07650883
[69] Nisan, N., Ta-Shma, A.: Symmetric logspace is closed under complement. In: Proceedings of the 27th Annual ACM Symposium on Theory of Computing (STOC), pp. 140-146 (1995) · Zbl 0978.68525
[70] Onak, K.: Personal communication (2019)
[71] Pandurangan, G.; Robinson, P.; Scquizzato, M., Fast distributed algorithms for connectivity and MST in large graphs, ACM Trans. Parallel Comput., 5, 1, 1-22 (2018) · doi:10.1145/3209689
[72] Pandurangan, G.; Robinson, P.; Scquizzato, M., On the distributed complexity of large-scale graph computations, ACM Trans. Parallel Comput., 8, 2, 1-28 (2021) · doi:10.1145/3460900
[73] Papadimitriou, CH, Computational Complexity (1994), Boston: Addison-Wesley, Boston · Zbl 0833.68049
[74] Pietracaprina, A., Pucci, G., Riondato, M., Silvestri, F., Upfal, E.: Space-round tradeoffs for MapReduce computations. In: Proceedings of the 26th International Conference on Supercomputing (ICS), pp. 235-244 (2012)
[75] Rastogi, V., Machanavajjhala, A., Chitnis, L., Sarma, A.D.: Finding connected components in Map-Reduce in logarithmic rounds. In: Proceedings of the 29th IEEE International Conference on Data Engineering (ICDE), pp. 50-61 (2013)
[76] Reingold, O., Undirected connectivity in log-space, J. ACM, 55, 4, 17:1-17:24 (2008) · Zbl 1315.68156 · doi:10.1145/1391289.1391291
[77] Reingold, O., Trevisan, L., Vadhan, S.P.: Pseudorandom walks on regular digraphs and the RL vs. L problem. In: Proceedings of the 38th Annual ACM Symposium on Theory of Computing (STOC), pp. 457-466 (2006) · Zbl 1301.05317
[78] Roughgarden, T.; Vassilvitskii, S.; Wang, JR, Shuffles and circuits (on lower bounds for modern parallel computation), J. ACM, 65, 6, 1-24 (2018) · Zbl 1426.68105 · doi:10.1145/3232536
[79] Savitch, WJ, Relationships between nondeterministic and deterministic tape complexities, J. Comput. Syst. Sci., 4, 2, 177-192 (1970) · Zbl 0188.33502 · doi:10.1016/S0022-0000(70)80006-X
[80] Scquizzato, M.,Silvestri, F.: Communication lower bounds for distributed-memory computations. In: Proceedings of the 31st International Symposium on Theoretical Aspects of Computer Science (STACS), pp. 627-638 (2014) · Zbl 1359.68027
[81] Sipser, M., Introduction to the Theory of Computation (1997), Boston: PWS Publishing Company, Boston · Zbl 1169.68300
[82] Svensson, O., Tarnawski, J.: The matching problem in general graphs is in quasi-NC. In: Proceedings of the 58th IEEE Annual Symposium on Foundations of Computer Science (FOCS), pp. 696-707 (2017)
[83] Valiant, LG, A bridging model for parallel computation, Commun. ACM, 33, 8, 103-111 (1990) · doi:10.1145/79173.79181
[84] Vassilevska Williams, V.: On some fine-grained questions in algorithms and complexity. In Proceedings of the International Congress of Mathematicians 2018 (ICM 2018), pp. 3431-3472 (2018) · Zbl 1490.68115
[85] Vassilevska Williams, V.; Williams, RR, Subcubic equivalences between path, matrix, and triangle problems, J. ACM, 65, 5, 1-38 (2018) · Zbl 1426.68133 · doi:10.1145/3186893
[86] White, T., Hadoop: The Definitive Guide (2012), Sebastopol: O’Reilly Media, Inc., Sebastopol
[87] Yaroslavtsev, G., Vadapalli, A.: Massively parallel algorithms and hardness for single-linkage clustering under \(\ell_p\)-distances. In: Proceedings of the 34th International Conference on Machine Learning (ICML), pp. 5596-5605 (2018)
[88] Zaharia, M.; Xin, RS; Wendell, P.; Das, T.; Armbrust, M.; Dave, A.; Meng, X.; Rosen, J.; Venkataraman, S.; Franklin, MJ; Ghodsi, A.; Gonzalez, J.; Shenker, S.; Stoica, I., Apache Spark: a unified engine for big data processing, Commun. ACM, 59, 11, 56-65 (2016) · doi:10.1145/2934664
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.