Skip to main content

Showing 1–19 of 19 results for author: Karande, S

  1. arXiv:2408.15879  [pdf, other

    cs.AI cs.CL

    Persuasion Games using Large Language Models

    Authors: Ganesh Prasath Ramani, Shirish Karande, Santhosh V, Yash Bhatia

    Abstract: Large Language Models (LLMs) have emerged as formidable instruments capable of comprehending and producing human-like text. This paper explores the potential of LLMs, to shape user perspectives and subsequently influence their decisions on particular tasks. This capability finds applications in diverse domains such as Investment, Credit cards and Insurance, wherein they assist users in selecting a… ▽ More

    Submitted 1 September, 2024; v1 submitted 28 August, 2024; originally announced August 2024.

  2. arXiv:2407.18541  [pdf, other

    cs.SD cs.AI eess.AS

    Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models

    Authors: Neil Shah, Shirish Karande, Vineet Gandhi

    Abstract: We propose a novel approach to significantly improve the intelligibility in the Non-Audible Murmur (NAM)-to-speech conversion task, leveraging self-supervision and sequence-to-sequence (Seq2Seq) learning techniques. Unlike conventional methods that explicitly record ground-truth speech, our methodology relies on self-supervision and speech-to-speech synthesis to simulate ground-truth speech. Despi… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: Accepted at Interspeech 2024

  3. arXiv:2312.01398  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Mitigating Perceived Unfairness in Contracts from a Non-Legal Stakeholder's Perspective

    Authors: Anmol Singhal, Preethu Rose Anish, Shirish Karande, Smita Ghaisas

    Abstract: Commercial contracts are known to be a valuable source for deriving project-specific requirements. However, contract negotiations mainly occur among the legal counsel of the parties involved. The participation of non-legal stakeholders, including requirement analysts, engineers, and solution architects, whose primary responsibility lies in ensuring the seamless implementation of contractual terms,… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 9 pages, 2 figures, to be published in Natural Legal Language Processing Workshop at EMNLP 2023

  4. arXiv:2311.13885  [pdf, other

    cs.LG cs.AI math.AP

    Can Physics Informed Neural Operators Self Improve?

    Authors: Ritam Majumdar, Amey Varhade, Shirish Karande, Lovekesh Vig

    Abstract: Self-training techniques have shown remarkable value across many deep learning models and tasks. However, such techniques remain largely unexplored when considered in the context of learning fast solvers for systems of partial differential equations (Eg: Neural Operators). In this work, we explore the use of self-training for Fourier Neural Operators (FNO). Neural Operators emerged as a data drive… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Paper accepted as a Spotlight talk at Symbiosis of Deep Learning and Differential Equations, Neural Information Processing Systems 2023

  5. arXiv:2308.09293  [pdf, other

    cs.LG cs.AI cs.CE math.AP

    How important are specialized transforms in Neural Operators?

    Authors: Ritam Majumdar, Shirish Karande, Lovekesh Vig

    Abstract: Simulating physical systems using Partial Differential Equations (PDEs) has become an indispensible part of modern industrial process optimization. Traditionally, numerical solvers have been used to solve the associated PDEs, however recently Transform-based Neural Operators such as the Fourier Neural Operator and Wavelet Neural Operator have received a lot of attention for their potential to prov… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 8 pages, 3 figures, 4 tables

  6. arXiv:2308.09290  [pdf, other

    cs.LG cs.AI cs.CE math.AP

    HyperLoRA for PDEs

    Authors: Ritam Majumdar, Vishal Jadhav, Anirudh Deodhar, Shirish Karande, Lovekesh Vig, Venkataramana Runkana

    Abstract: Physics-informed neural networks (PINNs) have been widely used to develop neural surrogates for solutions of Partial Differential Equations. A drawback of PINNs is that they have to be retrained with every change in initial-boundary conditions and PDE coefficients. The Hypernetwork, a model-based meta learning technique, takes in a parameterized task embedding as input and predicts the weights of… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 8 pages, 4 figures, 3 Tables

  7. arXiv:2304.03287  [pdf, other

    cs.AI cs.CL cs.LG

    Synthesis of Mathematical programs from Natural Language Specifications

    Authors: Ganesh Prasath, Shirish Karande

    Abstract: Several decision problems that are encountered in various business domains can be modeled as mathematical programs, i.e. optimization problems. The process of conducting such modeling often requires the involvement of experts trained in operations research and advanced algorithms. Surprisingly, despite the significant advances in the methods for program and code synthesis, AutoML, learning to opti… ▽ More

    Submitted 30 March, 2023; originally announced April 2023.

    Comments: Accepted in ICLR 2023 DL4C stream

  8. arXiv:2303.14194  [pdf, other

    cs.LG cs.AI q-bio.QM

    DeepEpiSolver: Unravelling Inverse problems in Covid, HIV, Ebola and Disease Transmission

    Authors: Ritam Majumdar, Shirish Karande, Lovekesh Vig

    Abstract: The spread of many infectious diseases is modeled using variants of the SIR compartmental model, which is a coupled differential equation. The coefficients of the SIR model determine the spread trajectories of disease, on whose basis proactive measures can be taken. Hence, the coefficient estimates must be both fast and accurate. Shaier et al. in the paper "Disease Informed Neural Networks" used P… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: Publication accepted at International Conference for Learning Representations 2023: First Workshop in Machine Learning and Global Health

  9. arXiv:2303.07009  [pdf, other

    cs.LG cs.AI math.AP

    Symbolic Regression for PDEs using Pruned Differentiable Programs

    Authors: Ritam Majumdar, Vishal Jadhav, Anirudh Deodhar, Shirish Karande, Lovekesh Vig, Venkataramana Runkana

    Abstract: Physics-informed Neural Networks (PINNs) have been widely used to obtain accurate neural surrogates for a system of Partial Differential Equations (PDE). One of the major limitations of PINNs is that the neural solutions are challenging to interpret, and are often treated as black-box solvers. While Symbolic Regression (SR) has been studied extensively, very few works exist which generate analytic… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Publication accepted at International Conference for Learning Representations 2023: Physics for Machine Learning

  10. arXiv:2303.02648  [pdf, other

    cs.CV cs.LG

    Comparative study of Transformer and LSTM Network with attention mechanism on Image Captioning

    Authors: Pranav Dandwate, Chaitanya Shahane, Vandana Jagtap, Shridevi C. Karande

    Abstract: In a globalized world at the present epoch of generative intelligence, most of the manual labour tasks are automated with increased efficiency. This can support businesses to save time and money. A crucial component of generative intelligence is the integration of vision and language. Consequently, image captioning become an intriguing area of research. There have been multiple attempts by the res… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

    Comments: 13 pages, 7 figures, 2 tables

  11. arXiv:2212.10032  [pdf, other

    cs.LG cs.AI cs.CE

    Real-time Health Monitoring of Heat Exchangers using Hypernetworks and PINNs

    Authors: Ritam Majumdar, Vishal Jadhav, Anirudh Deodhar, Shirish Karande, Lovekesh Vig, Venkataramana Runkana

    Abstract: We demonstrate a Physics-informed Neural Network (PINN) based model for real-time health monitoring of a heat exchanger, that plays a critical role in improving energy efficiency of thermal power plants. A hypernetwork based approach is used to enable the domain-decomposed PINN learn the thermal behavior of the heat exchanger in response to dynamic boundary conditions, eliminating the need to re-t… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Neural Information Processing Systems 2022: The Machine Learning and the Physical Sciences workshop

  12. arXiv:2207.06240  [pdf, ps, other

    cs.LG cs.AI math.NA

    Physics Informed Symbolic Networks

    Authors: Ritam Majumdar, Vishal Jadhav, Anirudh Deodhar, Shirish Karande, Lovekesh Vig, Venkataramana Runkana

    Abstract: We introduce Physics Informed Symbolic Networks (PISN) which utilize physics-informed loss to obtain a symbolic solution for a system of Partial Differential Equations (PDE). Given a context-free grammar to describe the language of symbolic expressions, we propose to use weighted sum as continuous approximation for selection of a production rule. We use this approximation to define multilayer symb… ▽ More

    Submitted 20 December, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: Neural Information Processing Systems 2022: The Symbiosis of Deep Learning and Differential Equations Workshop

  13. arXiv:2203.10085  [pdf, other

    cs.LG cs.AI

    I Know Therefore I Score: Label-Free Crafting of Scoring Functions using Constraints Based on Domain Expertise

    Authors: Ragja Palakkadavath, Sarath Sivaprasad, Shirish Karande, Niranjan Pedanekar

    Abstract: Several real-life applications require crafting concise, quantitative scoring functions (also called rating systems) from measured observations. For example, an effectiveness score needs to be created for advertising campaigns using a number of engagement metrics. Experts often need to create such scoring functions in the absence of labelled data, where the scores need to reflect business insights… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  14. arXiv:1703.07131  [pdf, other

    cs.CV cs.LG stat.ML

    Knowledge distillation using unlabeled mismatched images

    Authors: Mandar Kulkarni, Kalpesh Patil, Shirish Karande

    Abstract: Current approaches for Knowledge Distillation (KD) either directly use training data or sample from the training data distribution. In this paper, we demonstrate effectiveness of 'mismatched' unlabeled stimulus to perform KD for image classification networks. For illustration, we consider scenarios where this is a complete absence of training data, or mismatched stimulus has to be used for augment… ▽ More

    Submitted 21 March, 2017; originally announced March 2017.

  15. arXiv:1703.07115  [pdf, other

    cs.LG

    Layer-wise training of deep networks using kernel similarity

    Authors: Mandar Kulkarni, Shirish Karande

    Abstract: Deep learning has shown promising results in many machine learning applications. The hierarchical feature representation built by deep networks enable compact and precise encoding of the data. A kernel analysis of the trained deep networks demonstrated that with deeper layers, more simple and more accurate data representations are obtained. In this paper, we propose an approach for layer-wise trai… ▽ More

    Submitted 21 March, 2017; originally announced March 2017.

    Journal ref: Deep Learning for Pattern Recognition (DLPR) workshop at ICPR 2016

  16. arXiv:1609.05001  [pdf, other

    cs.CV

    Stamp processing with examplar features

    Authors: Yash Bhalgat, Mandar Kulkarni, Shirish Karande, Sachin Lodha

    Abstract: Document digitization is becoming increasingly crucial. In this work, we propose a shape based approach for automatic stamp verification/detection in document images using an unsupervised feature learning. Given a small set of training images, our algorithm learns an appropriate shape representation using an unsupervised clustering. Experimental results demonstrate the effectiveness of our framewo… ▽ More

    Submitted 16 September, 2016; originally announced September 2016.

  17. arXiv:1609.02271  [pdf, other

    cs.CV cs.HC

    Ashwin: Plug-and-Play System for Machine-Human Image Annotation

    Authors: Anand Sriraman, Mandar Kulkarni, Rahul Kumar, Kanika Kalra, Purushotam Radadia, Shirish Karande

    Abstract: We present an end-to-end machine-human image annotation system where each component can be attached in a plug-and-play fashion. These components include Feature Extraction, Machine Classifier, Task Sampling and Crowd Consensus.

    Submitted 9 September, 2016; v1 submitted 8 September, 2016; originally announced September 2016.

    Comments: HCOMP 2016 Demonstrations Track

  18. arXiv:1609.02043  [pdf, ps, other

    cs.AI cs.CL

    Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd

    Authors: Purushotam Radadia, Shirish Karande

    Abstract: Manual correction of speech transcription can involve a selection from plausible transcriptions. Recent work has shown the feasibility of employing a mismatched crowd for speech transcription. However, it is yet to be established whether a mismatched worker has sufficiently fine-granular speech perception to choose among the phonetically proximate options that are likely to be generated from the t… ▽ More

    Submitted 7 September, 2016; originally announced September 2016.

    Comments: HCOMP 2016 Works-in-Progress

  19. arXiv:0809.3600  [pdf, ps, other

    cs.IT

    On the Capacity Improvement of Multicast Traffic with Network Coding

    Authors: Zheng Wang, Shirish Karande, Hamid R. Sadjadpour, J. J. Garcia-Luna-Aceves

    Abstract: In this paper, we study the contribution of network coding (NC) in improving the multicast capacity of random wireless ad hoc networks when nodes are endowed with multi-packet transmission (MPT) and multi-packet reception (MPR) capabilities. We show that a per session throughput capacity of $Θ(nT^{3}(n))$, where $n$ is the total number of nodes and T(n) is the communication range, can be achieve… ▽ More

    Submitted 21 September, 2008; originally announced September 2008.