Skip to main content

Showing 1–26 of 26 results for author: Yaroslavtsev, G

  1. arXiv:2410.06878  [pdf, other

    cs.LG

    Noise is All You Need: Private Second-Order Convergence of Noisy SGD

    Authors: Dmitrii Avdiukhin, Michael Dinitz, Chenglin Fan, Grigory Yaroslavtsev

    Abstract: Private optimization is a topic of major interest in machine learning, with differentially private stochastic gradient descent (DP-SGD) playing a key role in both theory and practice. Furthermore, DP-SGD is known to be a powerful tool in contexts beyond privacy, including robustness, machine unlearning, etc. Existing analyses of DP-SGD either make relatively strong assumptions (e.g., Lipschitz con… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 30 pages

  2. arXiv:2312.00379  [pdf, other

    cs.LG stat.ML

    Optimal Sample Complexity of Contrastive Learning

    Authors: Noga Alon, Dmitrii Avdiukhin, Dor Elboim, Orr Fischer, Grigory Yaroslavtsev

    Abstract: Contrastive learning is a highly successful technique for learning representations of data from labeled tuples, specifying the distance relations within the tuple. We study the sample complexity of contrastive learning, i.e. the minimum number of labeled tuples sufficient for getting high generalization accuracy. We give tight bounds on the sample complexity in a variety of settings, focusing on a… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  3. arXiv:2302.04492  [pdf, other

    cs.LG

    Tree Learning: Optimal Algorithms and Sample Complexity

    Authors: Dmitrii Avdiukhin, Grigory Yaroslavtsev, Danny Vainstein, Orr Fischer, Sauman Das, Faraz Mirza

    Abstract: We study the problem of learning a hierarchical tree representation of data from labeled samples, taken from an arbitrary (and possibly adversarial) distribution. Consider a collection of data tuples labeled according to their hierarchical structure. The smallest number of such tuples required in order to be able to accurately label subsequent tuples is of interest for data collection in machine l… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  4. arXiv:2205.13753  [pdf, other

    cs.LG math.OC stat.ML

    HOUDINI: Escaping from Moderately Constrained Saddles

    Authors: Dmitrii Avdiukhin, Grigory Yaroslavtsev

    Abstract: We give the first polynomial time algorithms for escaping from high-dimensional saddle points under a moderate number of constraints. Given gradient access to a smooth function $f \colon \mathbb R^d \to \mathbb R$ we show that (noisy) gradient descent methods can escape from saddle points under a logarithmic number of inequality constraints. This constitutes the first tangible progress (without re… ▽ More

    Submitted 20 April, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

  5. arXiv:2105.10090  [pdf, other

    cs.LG math.OC stat.ML

    Escaping Saddle Points with Compressed SGD

    Authors: Dmitrii Avdiukhin, Grigory Yaroslavtsev

    Abstract: Stochastic gradient descent (SGD) is a prevalent optimization technique for large-scale distributed machine learning. While SGD computation can be efficiently divided between multiple machines, communication typically becomes a bottleneck in the distributed setting. Gradient compression methods can be used to alleviate this problem, and a recent line of work shows that SGD augmented with gradient… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

  6. arXiv:2012.08466  [pdf, ps, other

    cs.LG cs.AI cs.DS stat.ML

    Objective-Based Hierarchical Clustering of Deep Embedding Vectors

    Authors: Stanislav Naumov, Grigory Yaroslavtsev, Dmitrii Avdiukhin

    Abstract: We initiate a comprehensive experimental study of objective-based hierarchical clustering methods on massive datasets consisting of deep embedding vectors from computer vision and NLP applications. This includes a large variety of image embedding (ImageNet, ImageNetV2, NaBirds), word embedding (Twitter, Wikipedia), and sentence embedding (SST-2) vectors from several popular recent models (e.g. Res… ▽ More

    Submitted 8 June, 2022; v1 submitted 15 December, 2020; originally announced December 2020.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence (2021), 35(10), 9055-9063

  7. arXiv:1912.06983  [pdf, other

    cs.DS

    Bisect and Conquer: Hierarchical Clustering via Max-Uncut Bisection

    Authors: Sara Ahmadian, Vaggos Chatziafratis, Alessandro Epasto, Euiwoong Lee, Mohammad Mahdian, Konstantin Makarychev, Grigory Yaroslavtsev

    Abstract: Hierarchical Clustering is an unsupervised data analysis method which has been widely used for decades. Despite its popularity, it had an underdeveloped analytical foundation and to address this, Dasgupta recently introduced an optimization viewpoint of hierarchical clustering with pairwise similarity information that spurred a line of work shedding light on old algorithms (e.g., Average-Linkage),… ▽ More

    Submitted 15 December, 2019; originally announced December 2019.

  8. arXiv:1910.05686  [pdf, ps, other

    cs.DS

    Fast Fourier Sparsity Testing

    Authors: Grigory Yaroslavtsev, Samson Zhou

    Abstract: A function $f : \mathbb{F}_2^n \to \mathbb{R}$ is $s$-sparse if it has at most $s$ non-zero Fourier coefficients. Motivated by applications to fast sparse Fourier transforms over $\mathbb{F}_2^n$, we study efficient algorithms for the problem of approximating the $\ell_2$-distance from a given function to the closest $s$-sparse function. While previous works (e.g., Gopalan et al. SICOMP 2011) stud… ▽ More

    Submitted 13 October, 2019; originally announced October 2019.

  9. arXiv:1910.05646  [pdf, other

    cs.DS cs.LG

    "Bring Your Own Greedy"+Max: Near-Optimal $1/2$-Approximations for Submodular Knapsack

    Authors: Dmitrii Avdiukhin, Grigory Yaroslavtsev, Samson Zhou

    Abstract: The problem of selecting a small-size representative summary of a large dataset is a cornerstone of machine learning, optimization and data science. Motivated by applications to recommendation systems and other scenarios with query-limited access to vast amounts of data, we propose a new rigorous algorithmic framework for a standard formulation of this problem as a submodular maximization subject… ▽ More

    Submitted 12 October, 2019; originally announced October 2019.

  10. arXiv:1907.00524  [pdf, ps, other

    cs.DS

    Approximate $\mathbb{F}_2$-Sketching of Valuation Functions

    Authors: Grigory Yaroslavtsev, Samson Zhou

    Abstract: We study the problem of constructing a linear sketch of minimum dimension that allows approximation of a given real-valued function $f \colon \mathbb{F}_2^n \rightarrow \mathbb R$ with small expected squared error. We develop a general theory of linear sketching for such functions through which we analyze their dimension for most commonly studied types of valuation functions: additive, budget-addi… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

    Comments: To appear in RANDOM 2019

  11. arXiv:1905.02367  [pdf, other

    cs.DS cs.LG

    Adversarially Robust Submodular Maximization under Knapsack Constraints

    Authors: Dmitrii Avdiukhin, Slobodan Mitrović, Grigory Yaroslavtsev, Samson Zhou

    Abstract: We propose the first adversarially robust algorithm for monotone submodular maximization under single and multiple knapsack constraints with scalable implementations in distributed and streaming settings. For a single knapsack constraint, our algorithm outputs a robust summary of almost optimal (up to polylogarithmic factors) size, from which a constant-factor approximation to the optimal solution… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: To appear in KDD 2019

  12. arXiv:1902.03522  [pdf, other

    cs.DS cs.DB cs.DC

    Multi-Dimensional Balanced Graph Partitioning via Projected Gradient Descent

    Authors: Dmitrii Avdiukhin, Sergey Pupyrev, Grigory Yaroslavtsev

    Abstract: Motivated by performance optimization of large-scale graph processing systems that distribute the graph across multiple machines, we consider the balanced graph partitioning problem. Compared to the previous work, we study the multi-dimensional variant when balance according to multiple weight functions is required. As we demonstrate by experimental evaluation, such multi-dimensional balance is im… ▽ More

    Submitted 15 February, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

  13. arXiv:1812.10582  [pdf, other

    cs.DS

    Hierarchical Clustering for Euclidean Data

    Authors: Moses Charikar, Vaggos Chatziafratis, Rad Niazadeh, Grigory Yaroslavtsev

    Abstract: Recent works on Hierarchical Clustering (HC), a well-studied problem in exploratory data analysis, have focused on optimizing various objective functions for this problem under arbitrary similarity measures. In this paper we take the first step and give novel scalable algorithms for this problem tailored to Euclidean data in R^d and under vector-based similarity measures, a prevalent model in seve… ▽ More

    Submitted 26 December, 2018; originally announced December 2018.

  14. arXiv:1809.09063  [pdf, ps, other

    cs.CC

    Optimality of Linear Sketching under Modular Updates

    Authors: Kaave Hosseini, Shachar Lovett, Grigory Yaroslavtsev

    Abstract: We study the relation between streaming algorithms and linear sketching algorithms, in the context of binary updates. We show that for inputs in $n$ dimensions, the existence of efficient streaming algorithms which can process $Ω(n^2)$ updates implies efficient linear sketching algorithms with comparable cost. This improves upon the previous work of Li, Nguyen and Woodruff [LNW14] and Ai, Hu, Li a… ▽ More

    Submitted 24 September, 2018; originally announced September 2018.

  15. arXiv:1710.01431  [pdf, other

    cs.DS cs.DB cs.DC

    Massively Parallel Algorithms and Hardness for Single-Linkage Clustering Under $\ell_p$-Distances

    Authors: Grigory Yaroslavtsev, Adithya Vadapalli

    Abstract: We present massively parallel (MPC) algorithms and hardness of approximation results for computing Single-Linkage Clustering of $n$ input $d$-dimensional vectors under Hamming, $\ell_1, \ell_2$ and $\ell_\infty$ distances. All our algorithms run in $O(\log n)$ rounds of MPC for any fixed $d$ and achieve $(1+ε)$-approximation for all distances (except Hamming for which we show an exact algorithm).… ▽ More

    Submitted 25 March, 2018; v1 submitted 3 October, 2017; originally announced October 2017.

  16. arXiv:1611.01879  [pdf, ps, other

    cs.DS

    Linear Sketching over $\mathbb F_2$

    Authors: Sampath Kannan, Elchanan Mossel, Grigory Yaroslavtsev

    Abstract: We initiate a systematic study of linear sketching over $\mathbb F_2$. For a given Boolean function $f \colon \{0,1\}^n \to \{0,1\}$ a randomized $\mathbb F_2$-sketch is a distribution $\mathcal M$ over $d \times n$ matrices with elements over $\mathbb F_2$ such that $\mathcal Mx$ suffices for computing $f(x)$ with high probability. We study a connection between $\mathbb F_2$-sketching and a two-p… ▽ More

    Submitted 11 November, 2016; v1 submitted 6 November, 2016; originally announced November 2016.

  17. arXiv:1506.00242  [pdf, other

    cs.DS cs.CR cs.CY cs.SI

    Privacy for the Protected (Only)

    Authors: Michael Kearns, Aaron Roth, Zhiwei Steven Wu, Grigory Yaroslavtsev

    Abstract: Motivated by tensions between data privacy for individual citizens, and societal priorities such as counterterrorism and the containment of infectious disease, we introduce a computational model that distinguishes between parties for whom privacy is explicitly protected, and those for whom it is not (the targeted subpopulation). The goal is the development of algorithms that can effectively identi… ▽ More

    Submitted 31 May, 2015; originally announced June 2015.

  18. arXiv:1505.01467  [pdf, ps, other

    cs.DS

    Tight Bounds for Linear Sketches of Approximate Matchings

    Authors: Sepehr Assadi, Sanjeev Khanna, Yang Li, Grigory Yaroslavtsev

    Abstract: We resolve the space complexity of linear sketches for approximating the maximum matching problem in dynamic graph streams where the stream may include both edge insertion and deletion. Specifically, we show that for any $ε> 0$, there exists a one-pass streaming algorithm, which only maintains a linear sketch of size $\tilde{O}(n^{2-3ε})$ bits and recovers an $n^ε$-approximate maximum matching in… ▽ More

    Submitted 6 May, 2015; originally announced May 2015.

  19. arXiv:1412.0681  [pdf, other

    cs.DS

    Near Optimal LP Rounding Algorithm for Correlation Clustering on Complete and Complete k-partite Graphs

    Authors: Shuchi Chawla, Konstantin Makarychev, Tselil Schramm, Grigory Yaroslavtsev

    Abstract: We give new rounding schemes for the standard linear programming relaxation of the correlation clustering problem, achieving approximation factors almost matching the integrality gaps: - For complete graphs our appoximation is $2.06 - \varepsilon$ for a fixed constant $\varepsilon$, which almost matches the previously known integrality gap of $2$. - For complete $k$-partite graphs our approxim… ▽ More

    Submitted 23 June, 2015; v1 submitted 1 December, 2014; originally announced December 2014.

  20. arXiv:1407.7887  [pdf, ps, other

    cs.DS

    Going for Speed: Sublinear Algorithms for Dense r-CSPs

    Authors: Grigory Yaroslavtsev

    Abstract: We give new sublinear and parallel algorithms for the extensively studied problem of approximating n-variable r-CSPs (constraint satisfaction problems with constraints of arity r up to an additive error. The running time of our algorithms is O(n/ε^2) + 2^O(1/ε^2) for Boolean r-CSPs and O(k^4 n / ε^2) + 2^O(log k / ε^2) for r-CSPs with constraints on variables over an alphabet of size k. For any co… ▽ More

    Submitted 29 July, 2014; originally announced July 2014.

  21. arXiv:1403.0486  [pdf, ps, other

    cs.DM cs.DC cs.DS

    Online Algorithms for Machine Minimization

    Authors: Nikhil Devanur, Konstantin Makarychev, Debmalya Panigrahi, Grigory Yaroslavtsev

    Abstract: In this paper, we consider the online version of the machine minimization problem (introduced by Chuzhoy et al., FOCS 2004), where the goal is to schedule a set of jobs with release times, deadlines, and processing lengths on a minimum number of identical machines. Since the online problem has strong lower bounds if all the job parameters are arbitrary, we focus on jobs with uniform length. Our ma… ▽ More

    Submitted 4 March, 2014; v1 submitted 3 March, 2014; originally announced March 2014.

    Comments: After the first version of the manuscript was posted online, it was brought to our attention that Theorem 1.1 also follows from the results of Bansal, Kimbrel and Pruhs, "Speed Scaling to Minimize Energy and Temperature" (JACM'07), who consider energy-minimizing scheduling problems. This result follows from Lemma 4.7 and Lemma 4.8

  22. arXiv:1401.0042  [pdf, other

    cs.DS cs.DC

    Parallel Algorithms for Geometric Graph Problems

    Authors: Alexandr Andoni, Aleksandar Nikolov, Krzysztof Onak, Grigory Yaroslavtsev

    Abstract: We give algorithms for geometric graph problems in the modern parallel models inspired by MapReduce. For example, for the Minimum Spanning Tree (MST) problem over a set of points in the two-dimensional space, our algorithm computes a $(1+ε)$-approximate MST. Our algorithms work in a constant number of rounds of communication, while using total space and communication proportional to the size of th… ▽ More

    Submitted 4 January, 2014; v1 submitted 30 December, 2013; originally announced January 2014.

  23. arXiv:1304.1796   

    cs.CC

    The Round Complexity of Small Set Intersection

    Authors: David P. Woodruff, Grigory Yaroslavtsev

    Abstract: The set disjointness problem is one of the most fundamental and well-studied problems in communication complexity. In this problem Alice and Bob hold sets $S, T \subseteq [n]$, respectively, and the goal is to decide if $S \cap T = \emptyset$. Reductions from set disjointness are a canonical way of proving lower bounds in data stream algorithms, data structures, and distributed computation. In the… ▽ More

    Submitted 9 April, 2013; v1 submitted 5 April, 2013; originally announced April 2013.

    Comments: There is an error in the statement and proof of Lemma A.1, so we have decided to withdraw the current manuscript. For the round / communication tradeoff for small set disjointness, we refer the reader to the independent work: http://arxiv.org/pdf/1304.1217.pdf The other results concerning OR-Index and Augmented-OR-Index are not affected and will appear in a later manuscript

  24. arXiv:1208.2294  [pdf, ps, other

    cs.LG cs.DM cs.DS

    Learning pseudo-Boolean k-DNF and Submodular Functions

    Authors: Sofya Raskhodnikova, Grigory Yaroslavtsev

    Abstract: We prove that any submodular function f: {0,1}^n -> {0,1,...,k} can be represented as a pseudo-Boolean 2k-DNF formula. Pseudo-Boolean DNFs are a natural generalization of DNF representation for functions with integer range. Each term in such a formula has an associated integral constant. We show that an analog of Hastad's switching lemma holds for pseudo-Boolean k-DNFs if all constants associated… ▽ More

    Submitted 10 August, 2012; originally announced August 2012.

  25. arXiv:1207.6096  [pdf, other

    cs.DB

    Accurate and Efficient Private Release of Datacubes and Contingency Tables

    Authors: Graham Cormode, Cecilia M. Procopiuc, Divesh Srivastava, Grigory Yaroslavtsev

    Abstract: A central problem in releasing aggregate information about sensitive data is to do so accurately while providing a privacy guarantee on the output. Recent work focuses on the class of linear queries, which include basic counting queries, data cubes, and contingency tables. The goal is to maximize the utility of their output, while giving a rigorous privacy guarantee. Most results follow a common t… ▽ More

    Submitted 25 July, 2012; originally announced July 2012.

  26. arXiv:1011.6100  [pdf, other

    cs.DS cs.DM

    Steiner Transitive-Closure Spanners of d-Dimensional Posets

    Authors: Piotr Berman, Arnab Bhattacharyya, Elena Grigorescu, Sofya Raskhodnikova, David Woodruff, Grigory Yaroslavtsev

    Abstract: Given a directed graph G and an integer k >= 1, a k-transitive-closure-spanner (k-TCspanner) of G is a directed graph H that has (1) the same transitive-closure as G and (2) diameter at most k. In some applications, the shortcut paths added to the graph in order to obtain small diameter can use Steiner vertices, that is, vertices not in the original graph G. The resulting spanner is called a Stein… ▽ More

    Submitted 28 November, 2010; originally announced November 2010.