Showing 1–11 of 11 results for author: Ghai, U

  1. arXiv:2410.02817  [pdf, other]

    eess.SY cs.LG stat.ML

    Neural Coordination and Capacity Control for Inventory Management

    Authors: Carson Eisenach, Udaya Ghai, Dhruv Madeka, Kari Torkkola, Dean Foster, Sham Kakade

    Abstract: This paper addresses the capacitated periodic review inventory control problem, focusing on a retailer managing multiple products with limited shared resources, such as storage or inbound labor at a facility. Specifically, this paper is motivated by the questions of (1) what does it mean to backtest a capacity control mechanism, (2) can we devise and backtest a capacity control mechanism that is c…

    Submitted 24 September, 2024; originally announced October 2024.

  2. arXiv:2305.17552  [pdf, other]

    cs.LG math.OC

    Online Nonstochastic Model-Free Reinforcement Learning

    Authors: Udaya Ghai, Arushi Gupta, Wenhan Xia, Karan Singh, Elad Hazan

    Abstract: We investigate robust model-free reinforcement learning algorithms designed for environments that may be dynamic or even adversarial. Traditional state-based policies often struggle to accommodate the challenges imposed by the presence of unmodeled disturbances in such settings. Moreover, optimizing linear state-based policies poses an obstacle for efficient optimization, leading to nonconvex objec…

    Submitted 31 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Camera-ready version for NeurIPS 2023

  3. arXiv:2205.15235  [pdf, other]

    cs.LG math.OC

    Non-convex online learning via algorithmic equivalence

    Authors: Udaya Ghai, Zhou Lu, Elad Hazan

    Abstract: We study an algorithmic equivalence technique between non-convex gradient descent and convex mirror descent. We start by looking at a harder problem of regret minimization in online non-convex optimization. We show that under certain geometric and smoothness conditions, online gradient descent applied to non-convex functions is an approximation of online mirror descent applied to convex functions…

    Submitted 12 October, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

  4. arXiv:2201.13288  [pdf, other]

    math.OC cs.LG stat.ML

    A Regret Minimization Approach to Multi-Agent Control

    Authors: Udaya Ghai, Udari Madhushani, Naomi Leonard, Elad Hazan

    Abstract: We study the problem of multi-agent control of a dynamical system with known dynamics and adversarial disturbances. Our study focuses on optimal control without centralized precomputed policies, but rather with adaptive control policies for the different agents that are only equipped with a stabilizing controller. We give a reduction from any (standard) regret minimizing control method to a distri…

    Submitted 25 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:7422-7434, 2022

  5. arXiv:2111.10434  [pdf, other]

    cs.LG

    Machine Learning for Mechanical Ventilation Control (Extended Abstract)

    Authors: Daniel Suo, Naman Agarwal, Wenhan Xia, Xinyi Chen, Udaya Ghai, Alexander Yu, Paula Gradu, Karan Singh, Cyril Zhang, Edgar Minasyan, Julienne LaChance, Tom Zajdel, Manuel Schottdorf, Daniel Cohen, Elad Hazan

    Abstract: Mechanical ventilation is one of the most widely used therapies in the ICU. However, despite broad application from anaesthesia to COVID-related life support, many injurious challenges remain. We frame these as a control problem: ventilators must let air in and out of the patient's lungs according to a prescribed trajectory of airway pressure. Industry-standard controllers, based on the PID method…

    Submitted 23 December, 2021; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2021 - Extended Abstract. arXiv admin note: substantial text overlap with arXiv:2102.06779

  6. arXiv:2107.07732  [pdf, ps, other]

    math.OC cs.LG

    Robust Online Control with Model Misspecification

    Authors: Xinyi Chen, Udaya Ghai, Elad Hazan, Alexandre Megretski

    Abstract: We study online control of an unknown nonlinear dynamical system that is approximated by a time-invariant linear system with model misspecification. Our study focuses on robustness, a measure of how much deviation from the assumed linear approximation can be tolerated by a controller while maintaining finite $\ell_2$-gain. A basic methodology to analyze robustness is via the small gain theorem.…

    Submitted 4 April, 2022; v1 submitted 16 July, 2021; originally announced July 2021.

  7. arXiv:2102.09968  [pdf, other]

    cs.RO cs.LG

    Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking

    Authors: Paula Gradu, John Hallman, Daniel Suo, Alex Yu, Naman Agarwal, Udaya Ghai, Karan Singh, Cyril Zhang, Anirudha Majumdar, Elad Hazan

    Abstract: We present an open-source library of natively differentiable physics and robotics environments, accompanied by gradient-based control methods and a benchmarking suite. The introduced environments allow auto-differentiation through the simulation dynamics, and thereby permit fast training of controllers. The library features several popular environments, including classical control settings from O…

    Submitted 19 February, 2021; originally announced February 2021.

  8. arXiv:2102.06779  [pdf, other]

    cs.LG

    Machine Learning for Mechanical Ventilation Control

    Authors: Daniel Suo, Naman Agarwal, Wenhan Xia, Xinyi Chen, Udaya Ghai, Alexander Yu, Paula Gradu, Karan Singh, Cyril Zhang, Edgar Minasyan, Julienne LaChance, Tom Zajdel, Manuel Schottdorf, Daniel Cohen, Elad Hazan

    Abstract: We consider the problem of controlling an invasive mechanical ventilator for pressure-controlled ventilation: a controller must let air in and out of a sedated patient's lungs according to a trajectory of airway pressures specified by a clinician. Hand-tuned PID controllers and similar variants have comprised the industry standard for decades, yet can behave poorly by over- or under-shooting their…

    Submitted 18 January, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

  9. arXiv:2012.06695  [pdf, other]

    cs.LG eess.SY math.OC stat.ML

    Generating Adversarial Disturbances for Controller Verification

    Authors: Udaya Ghai, David Snyder, Anirudha Majumdar, Elad Hazan

    Abstract: We consider the problem of generating maximally adversarial disturbances for a given controller assuming only blackbox access to it. We propose an online learning approach to this problem that adaptively generates disturbances based on control inputs chosen by the controller. The goal of the disturbance generator is to minimize regret versus a benchmark disturbance-generating policy…

    Submitted 31 January, 2022; v1 submitted 11 December, 2020; originally announced December 2020.

  10. arXiv:2002.02064  [pdf, ps, other]

    cs.LG eess.SY math.OC stat.ML

    No-Regret Prediction in Marginally Stable Systems

    Authors: Udaya Ghai, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang

    Abstract: We consider the problem of online prediction in a marginally stable linear dynamical system subject to bounded adversarial or (non-isotropic) stochastic perturbations. This poses two challenges. Firstly, the system is in general unidentifiable, so recent and classical results on parameter recovery do not apply. Secondly, because we allow the system to be marginally stable, the state can grow polyn…

    Submitted 23 June, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

    Comments: 43 pages. Appears in COLT 2020

  11. arXiv:1902.01903  [pdf, other]

    cs.LG math.OC stat.ML

    Exponentiated Gradient Meets Gradient Descent

    Authors: Udaya Ghai, Elad Hazan, Yoram Singer

    Abstract: The (stochastic) gradient descent and the multiplicative update method are probably the most popular algorithms in machine learning. We introduce and study a new regularization which provides a unification of the additive and multiplicative updates. This regularization is derived from a hyperbolic analogue of the entropy function, which we call hypentropy. It is motivated by a natural extension o…

    Submitted 5 February, 2019; originally announced February 2019.