Showing 1–15 of 15 results for author: Puri, N

  1. arXiv:2306.16503  [pdf, other]

    cs.LG cs.AI

    SARC: Soft Actor Retrospective Critic

    Authors: Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

    Abstract: The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in the literature to learn better gradient estimates to help achieve better convergence.…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted at RLDM 2022

  2. arXiv:2305.09258  [pdf, other]

    cs.IR cs.CL

    HyHTM: Hyperbolic Geometry based Hierarchical Topic Models

    Authors: Simra Shahid, Tanay Anand, Nikitha Srikanth, Sumit Bhatia, Balaji Krishnamurthy, Nikaash Puri

    Abstract: Hierarchical Topic Models (HTMs) are useful for discovering topic hierarchies in a collection of documents. However, traditional HTMs often produce hierarchies where lower-level topics are unrelated and not specific enough to their higher-level topics. Additionally, these methods can be computationally expensive. We present HyHTM - a Hyperbolic geometry based Hierarchical Topic Models - that addres…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: This paper is accepted in Findings of the Association for Computational Linguistics (2023)

  3. arXiv:2112.05969  [pdf, other]

    quant-ph

    Q-means using variational quantum feature embedding

    Authors: Arvind S Menon, Nikaash Puri

    Abstract: This paper proposes a hybrid quantum-classical algorithm that learns a suitable quantum feature map that separates unlabelled data that is originally non-linearly separable in the classical space, using a variational quantum feature map and q-means as a subroutine for unsupervised learning. The objective of the variational circuit is to maximally separate the clusters in the quantum feature Hilbert…

    Submitted 11 December, 2021; originally announced December 2021.

  4. arXiv:2111.11692  [pdf, other]

    cs.MA

    Status-quo policy gradient in Multi-Agent Reinforcement Learning

    Authors: Pinkesh Badjatiya, Mausoom Sarkar, Nikaash Puri, Jayakumar Subramanian, Abhishek Sinha, Siddharth Singh, Balaji Krishnamurthy

    Abstract: Individual rationality, which involves maximizing expected individual returns, does not always lead to high-utility individual or group outcomes in multi-agent problems. For instance, in multi-agent social dilemmas, Reinforcement Learning (RL) agents trained to maximize individual rewards converge to a low-utility mutually harmful equilibrium. In contrast, humans evolve useful strategies in such s…

    Submitted 23 November, 2021; originally announced November 2021.

  5. arXiv:2110.09318  [pdf]

    cs.MM cs.AI cs.CV

    Mixed Reality using Illumination-aware Gradient Mixing in Surgical Telepresence: Enhanced Multi-layer Visualization

    Authors: Nirakar Puri, Abeer Alsadoon, P. W. C. Prasad, Nada Alsalami, Tarik A. Rashid

    Abstract: Background and aim: Surgical telepresence using augmented perception has been applied, but mixed reality is still being researched and is only theoretical. The aim of this work is to propose a solution to improve the visualization in the final merged video by producing globally consistent videos when the intensity of illumination in the input source and target video varies. Methodology: The propos…

    Submitted 21 August, 2021; originally announced October 2021.

    Comments: 24 pages

    Journal ref: Multimedia Tools and Applications, 2021

  6. arXiv:2109.03813  [pdf, other]

    cs.AI

    Video2Skill: Adapting Events in Demonstration Videos to Skills in an Environment using Cyclic MDP Homomorphisms

    Authors: Sumedh A Sontakke, Sumegh Roychowdhury, Mausoom Sarkar, Nikaash Puri, Balaji Krishnamurthy, Laurent Itti

    Abstract: Humans excel at learning long-horizon tasks from demonstrations augmented with textual commentary, as evidenced by the burgeoning popularity of tutorial videos online. Intuitively, this capability can be separated into 2 distinct subtasks - first, dividing a long-horizon demonstration sequence into semantically meaningful events; second, adapting such events into meaningful behaviors in one's own…

    Submitted 9 September, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

  7. arXiv:2105.06956  [pdf, other]

    cs.LG

    Information-theoretic Evolution of Model Agnostic Global Explanations

    Authors: Sukriti Verma, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

    Abstract: Explaining the behavior of black box machine learning models through human interpretable rules is an important research area. Recent work has focused on explaining model behavior locally, i.e., for specific predictions, as well as globally across the fields of vision, natural language, reinforcement learning and data science. We present a novel model-agnostic approach that derives rules to globally e…

    Submitted 14 May, 2021; originally announced May 2021.

  8. arXiv:2010.02556  [pdf, other]

    cs.LG cs.AI cs.CL

    SHERLock: Self-Supervised Hierarchical Event Representation Learning

    Authors: Sumegh Roychowdhury, Sumedh A. Sontakke, Nikaash Puri, Mausoom Sarkar, Milan Aggarwal, Pinkesh Badjatiya, Balaji Krishnamurthy, Laurent Itti

    Abstract: Temporal event representations are an essential aspect of learning among humans. They allow for succinct encoding of the experiences we have through a variety of sensory inputs. Also, they are believed to be arranged hierarchically, allowing for an efficient representation of complex long-horizon experiences. Additionally, these representations are acquired in a self-supervised manner. Analogously…

    Submitted 22 August, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted at ICPR '22

  9. arXiv:2009.01571  [pdf, other]

    cs.LG stat.ML

    MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

    Authors: Anubha Kabra, Ayush Chopra, Nikaash Puri, Pinkesh Badjatiya, Sukriti Verma, Piyush Gupta, Balaji K

    Abstract: Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging problem. Such imbalanced datasets are standard in real-world situations such as fraud detection, medical diagnosis, and computational advertising. We propose an iterative data augmentation method, MixBoost, which intelligently selects (Boost) and then combines (Mix) ins…

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: Work done as part of internship at MDSR

  10. arXiv:2001.05458  [pdf, other]

    cs.AI cs.GT cs.LG

    Inducing Cooperative behaviour in Sequential-Social dilemmas through Multi-Agent Reinforcement Learning using Status-Quo Loss

    Authors: Pinkesh Badjatiya, Mausoom Sarkar, Abhishek Sinha, Siddharth Singh, Nikaash Puri, Jayakumar Subramanian, Balaji Krishnamurthy

    Abstract: In social dilemma situations, individual rationality leads to sub-optimal group outcomes. Several human engagements can be modeled as sequential (multi-step) social dilemmas. However, in contrast to humans, Deep Reinforcement Learning agents trained to optimize individual rewards in sequential social dilemmas converge to selfish, mutually harmful behavior. We introduce a status-quo loss (SQLoss)…

    Submitted 13 February, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

  11. arXiv:1912.12191  [pdf, other]

    cs.CV cs.AI

    Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution

    Authors: Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

    Abstract: As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned agents. Saliency maps explain agent behavior by highlighting the features of the input state that are most relevant for the agent in taking an action. Existing perturbation-based approaches to compute saliency often highlight regions of the input that are not relevant t…

    Submitted 3 April, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

    Comments: Accepted at the International Conference on Learning Representations (ICLR) 2020

  12. arXiv:1909.07806  [pdf, other]

    cs.ET

    OpticalGAN : Generative Adversarial Networks for Continuous Variable Quantum Computation

    Authors: Nilay Shrivastava, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy, Sukriti Verma

    Abstract: We present OpticalGAN, an extension of quantum generative adversarial networks for continuous-variable quantum computation. OpticalGAN consists of photonic variational circuits comprising optical Gaussian and Kerr gates. Photonic quantum computation is a realization of continuous-variable quantum computing which involves encoding and processing information in the continuous quadrature amplitude…

    Submitted 15 September, 2019; originally announced September 2019.

  13. arXiv:1706.07160  [pdf, other]

    cs.AI

    MAGIX: Model Agnostic Globally Interpretable Explanations

    Authors: Nikaash Puri, Piyush Gupta, Pratiksha Agarwal, Sukriti Verma, Balaji Krishnamurthy

    Abstract: Explaining the behavior of a black box machine learning model at the instance level is useful for building trust. However, it is also important to understand how the model behaves globally. Such an understanding provides insight into both the data on which the model was trained and the patterns that it learned. We present here an approach that learns if-then rules to globally explain the behavior…

    Submitted 15 June, 2018; v1 submitted 21 June, 2017; originally announced June 2017.

  14. arXiv:1512.08611  [pdf]

    physics.atom-ph

    Surface wake field model of beam-foil circular Rydberg states

    Authors: Gaurav Sharma, Nitin Kumar Puri, Adya Prasad Mishra, Tapan Nandi

    Abstract: Production of projectile Rydberg states in fast ion-solid collisions in H-like ions exhibits a pronounced target thickness dependence in spite of these states forming at the last layers. This occurs due to the important role of the surface wake field, which varies with the target foil thickness. Further, according to the proposed model Rydberg states with low angular momentum are transformed into a circ…

    Submitted 29 December, 2015; originally announced December 2015.

  15. arXiv:1512.08399  [pdf]

    physics.ins-det physics.atom-ph

    X-ray spectroscopy technique for the pile-up region

    Authors: Gaurav Sharma, Deepak Swami, Basu Kumar, Nitin Kumar Puri, Tapan Nandi

    Abstract: We report a pile-up rejection technique based on the X-ray absorption concept of the Beer-Lambert law for measuring true events in the pile-up region. We have detected a 10^4 times weaker peak in the pile-up region. This technique also enables one to resolve the weak peaks adjacent to an intense peak provided the latter lies on the lower-energy side, and the peaks are at least theoretically resolvable by t…

    Submitted 19 January, 2016; v1 submitted 28 December, 2015; originally announced December 2015.