Skip to main content

Showing 1–50 of 130 results for author: Shah, V

  1. arXiv:2410.10900  [pdf, other

    cs.RO

    Oogway: Designing, Implementing, and Testing an AUV for RoboSub 2023

    Authors: Will Denton, Lilly Chiavetta, Michael Bryant, Vedarsh Shah, Rico Zhu, Ricky Weerts, Phillip Xue, Vincent Chen, Hung Le, Maxwell Lin, Austin Camacho, Drew Council, Ethan Horowitz, Jackie Ong, Morgan Chu, Alex Pool

    Abstract: The Duke Robotics Club is proud to present our robot for the 2023 RoboSub Competition: Oogway. Oogway marks one of the largest design overhauls in club history. Beyond a revamped formfactor, some of Oogway's notable features include all-new computer vision software, advanced sonar integration, novel acoustics hardware processing, and upgraded stereoscopic cameras. Oogway was built on the principle… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: text overlap with arXiv:2410.09684

  2. arXiv:2410.10736  [pdf, other

    cs.LG stat.ML

    Towards Calibrated Losses for Adversarial Robust Reject Option Classification

    Authors: Vrund Shah, Tejas Chaudhari, Naresh Manwani

    Abstract: Robustness towards adversarial attacks is a vital property for classifiers in several applications such as autonomous driving, medical diagnosis, etc. Also, in such scenarios, where the cost of misclassification is very high, knowing when to abstain from prediction becomes crucial. A natural question is which surrogates can be used to ensure learning in scenarios where the input points are adversa… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Accepted at Asian Conference on Machine Learning (ACML) , 2024

  3. arXiv:2410.09684  [pdf, other

    cs.RO

    Technical Design Review of Duke Robotics Club's Oogway: An AUV for RoboSub 2024

    Authors: Will Denton, Michael Bryant, Lilly Chiavetta, Vedarsh Shah, Rico Zhu, Philip Xue, Vincent Chen, Maxwell Lin, Hung Le, Austin Camacho, Raul Galvez, Nathan Yang, Nathanael Ren, Tyler Rose, Mathew Chu, Amir Ergashev, Saagar Arya, Kaelyn Pieter, Ethan Horowitz, Maanav Allampallam, Patrick Zheng, Mia Kaarls, June Wood

    Abstract: The Duke Robotics Club is proud to present our robot for the 2024 RoboSub Competition: Oogway. Now in its second year, Oogway has been dramatically upgraded in both its capabilities and reliability. Oogway was built on the principle of independent, well-integrated, and reliable subsystems. Individual components and subsystems were tested and designed separately. Oogway's most advanced capabilities… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

  4. arXiv:2410.07836  [pdf, other

    cs.LG cs.AI

    Masked Generative Priors Improve World Models Sequence Modelling Capabilities

    Authors: Cristian Meo, Mircea Lica, Zarif Ikram, Akihiro Nakano, Vedant Shah, Aniket Rajiv Didolkar, Dianbo Liu, Anirudh Goyal, Justin Dauwels

    Abstract: Deep Reinforcement Learning (RL) has become the leading approach for creating artificial agents in complex environments. Model-based approaches, which are RL methods with world models that predict environment dynamics, are among the most promising directions for improving data efficiency, forming a critical step toward bridging the gap between research and real-world deployment. In particular, wor… ▽ More

    Submitted 13 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

  5. arXiv:2409.10532  [pdf, other

    cs.RO cs.LG

    Slug Mobile: Test-Bench for RL Testing

    Authors: Jonathan Wellington Morris, Vishrut Shah, Alex Besanceney, Daksh Shah, Leilani H. Gilpin

    Abstract: Sim-to real gap in Reinforcement Learning is when a model trained in a simulator does not translate to the real world. This is a problem for Autonomous Vehicles (AVs) as vehicle dynamics can vary from simulation to reality, and also from vehicle to vehicle. Slug Mobile is a one tenth scale autonomous vehicle created to help address the sim-to-real gap for AVs by acting as a test-bench to develop m… ▽ More

    Submitted 30 August, 2024; originally announced September 2024.

    Comments: Submitted to BayLearn 2024

  6. arXiv:2409.06899  [pdf, other

    cs.GT

    Inefficient Alliance Formation in Coalitional Blotto Games

    Authors: Vade Shah, Keith Paarporn, Jason R. Marden

    Abstract: When multiple agents are engaged in a network of conflict, some can advance their competitive positions by forming alliances with each other. However, the costs associated with establishing an alliance may outweigh the potential benefits. This study investigates costly alliance formation in the framework of coalitional Blotto games, in which two players compete separately against a common adversar… ▽ More

    Submitted 17 September, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

  7. arXiv:2409.04669  [pdf, other

    cs.GT eess.SY

    Learning Optimal Stable Matches in Decentralized Markets with Unknown Preferences

    Authors: Vade Shah, Bryce L. Ferguson, Jason R. Marden

    Abstract: Matching algorithms have demonstrated great success in several practical applications, but they often require centralized coordination and plentiful information. In many modern online marketplaces, agents must independently seek out and match with another using little to no information. For these kinds of settings, can we design decentralized, limited-information matching algorithms that preserve… ▽ More

    Submitted 16 October, 2024; v1 submitted 6 September, 2024; originally announced September 2024.

  8. arXiv:2408.10397  [pdf, other

    cs.CV cs.AI cs.MM

    Webcam-based Pupil Diameter Prediction Benefits from Upscaling

    Authors: Vijul Shah, Brian B. Moser, Ko Watanabe, Andreas Dengel

    Abstract: Capturing pupil diameter is essential for assessing psychological and physiological states such as stress levels and cognitive load. However, the low resolution of images in eye datasets often hampers precise measurement. This study evaluates the impact of various upscaling methods, ranging from bicubic interpolation to advanced super-resolution, on pupil diameter predictions. We compare several p… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  9. arXiv:2407.21009  [pdf, other

    cs.AI cs.LG

    AI-Assisted Generation of Difficult Math Questions

    Authors: Vedant Shah, Dingli Yu, Kaifeng Lyu, Simon Park, Jiatong Yu, Yinghui He, Nan Rosemary Ke, Michael Mozer, Yoshua Bengio, Sanjeev Arora, Anirudh Goyal

    Abstract: Current LLM training positions mathematical reasoning as a core capability. With publicly available sources fully tapped, there is unmet demand for diverse and challenging math questions. Relying solely on human experts is both time-consuming and costly, while LLM-generated questions often lack the requisite diversity and difficulty. We present a design framework that combines the strengths of LLM… ▽ More

    Submitted 5 October, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

  10. arXiv:2407.11364  [pdf, ps, other

    cs.DS cs.LG

    Learning-augmented Maximum Independent Set

    Authors: Vladimir Braverman, Prathamesh Dharangutte, Vihan Shah, Chen Wang

    Abstract: We study the Maximum Independent Set (MIS) problem on general graphs within the framework of learning-augmented algorithms. The MIS problem is known to be NP-hard and is also NP-hard to approximate to within a factor of $n^{1-δ}$ for any $δ>0$. We show that we can break this barrier in the presence of an oracle obtained through predictions from a machine learning model that answers vertex membersh… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: APPROX 2024

  11. arXiv:2407.11204  [pdf, other

    cs.CV cs.AI cs.CY cs.HC cs.LG

    EyeDentify: A Dataset for Pupil Diameter Estimation based on Webcam Images

    Authors: Vijul Shah, Ko Watanabe, Brian B. Moser, Andreas Dengel

    Abstract: In this work, we introduce EyeDentify, a dataset specifically designed for pupil diameter estimation based on webcam images. EyeDentify addresses the lack of available datasets for pupil diameter estimation, a crucial domain for understanding physiological and psychological states traditionally dominated by highly specialized sensor systems such as Tobii. Unlike these advanced sensor systems and a… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  12. arXiv:2407.06245  [pdf

    cs.NI cs.AI cs.CL cs.LG

    ORAN-Bench-13K: An Open Source Benchmark for Assessing LLMs in Open Radio Access Networks

    Authors: Pranshav Gajjar, Vijay K. Shah

    Abstract: Large Language Models (LLMs) can revolutionize how we deploy and operate Open Radio Access Networks (O-RAN) by enhancing network analytics, anomaly detection, and code generation and significantly increasing the efficiency and reliability of a plethora of O-RAN tasks. In this paper, we present ORAN-Bench-13K, the first comprehensive benchmark designed to evaluate the performance of Large Language… ▽ More

    Submitted 13 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  13. arXiv:2404.14355  [pdf, other

    cs.CL cs.AI

    Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language Models

    Authors: Vishruth Veerendranath, Vishwa Shah, Kshitish Ghate

    Abstract: Quantitative and numerical comprehension in language is an important task in many fields like education and finance, but still remains a challenging task for language models. While tool and calculator usage has shown to be helpful to improve mathematical reasoning in large pretrained decoder-only language models, this remains unexplored for smaller language models with encoders. In this paper, we… ▽ More

    Submitted 25 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: AI4Math workshop, ICML 2024

  14. arXiv:2404.12464  [pdf, other

    cs.CL

    NormAd: A Framework for Measuring the Cultural Adaptability of Large Language Models

    Authors: Abhinav Rao, Akhila Yerukola, Vishwa Shah, Katharina Reinecke, Maarten Sap

    Abstract: To be effectively and safely deployed to global user populations, large language models (LLMs) must adapt outputs to user values and culture, not just know about them. We introduce NormAd, an evaluation framework to assess LLMs' cultural adaptability, specifically measuring their ability to judge social acceptability across different levels of cultural norm specificity, from abstract values to exp… ▽ More

    Submitted 19 October, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Preprint. In Review

  15. arXiv:2404.10094  [pdf, other

    cs.LG q-bio.QM

    Towards DNA-Encoded Library Generation with GFlowNets

    Authors: Michał Koziarski, Mohammed Abukalam, Vedant Shah, Louis Vaillancourt, Doris Alexandra Schuetz, Moksh Jain, Almer van der Sloot, Mathieu Bourgey, Anne Marinier, Yoshua Bengio

    Abstract: DNA-encoded libraries (DELs) are a powerful approach for rapidly screening large numbers of diverse compounds. One of the key challenges in using DELs is library design, which involves choosing the building blocks that will be combinatorially combined to produce the final library. In this paper we consider the task of protein-protein interaction (PPI) biased DEL design. To this end, we evaluate se… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  16. arXiv:2402.09710  [pdf

    cs.CR cs.LG cs.NI

    Preserving Data Privacy for ML-driven Applications in Open Radio Access Networks

    Authors: Pranshav Gajjar, Azuka Chiejina, Vijay K. Shah

    Abstract: Deep learning offers a promising solution to improve spectrum access techniques by utilizing data-driven approaches to manage and share limited spectrum resources for emerging applications. For several of these applications, the sensitive wireless data (such as spectrograms) are stored in a shared database or multistakeholder cloud environment and are therefore prone to privacy leaks. This paper a… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  17. arXiv:2402.06846  [pdf, other

    cs.CR eess.SY

    System-level Analysis of Adversarial Attacks and Defenses on Intelligence in O-RAN based Cellular Networks

    Authors: Azuka Chiejina, Brian Kim, Kaushik Chowhdury, Vijay K. Shah

    Abstract: While the open architecture, open interfaces, and integration of intelligence within Open Radio Access Network technology hold the promise of transforming 5G and 6G networks, they also introduce cybersecurity vulnerabilities that hinder its widespread adoption. In this paper, we conduct a thorough system-level investigation of cyber threats, with a specific focus on machine learning (ML) intellige… ▽ More

    Submitted 13 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: his paper has been accepted for publication in ACM WiSec 2024

  18. arXiv:2402.04447  [pdf, other

    cs.NI eess.SP

    Context-Aware Spectrum Coexistence of Terrestrial Beyond 5G Networks in Satellite Bands

    Authors: Ta Seen Reaz Niloy, Zoheb Hasan, Rob Smith, Vikram R. Anapana, Vijay K. Shah

    Abstract: Spectrum sharing between terrestrial 5G and incumbent networks in the satellite bands presents a promising avenue to satisfy the ever-increasing bandwidth demand of the next-generation wireless networks. However, protecting incumbent operations from harmful interference poses a fundamental challenge in accommodating terrestrial broadband cellular networks in the satellite bands. State-of-the-art s… ▽ More

    Submitted 14 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  19. arXiv:2402.02593  [pdf, other

    cs.LG

    Leveraging Continuously Differentiable Activation Functions for Learning in Quantized Noisy Environments

    Authors: Vivswan Shah, Nathan Youngblood

    Abstract: Real-world analog systems intrinsically suffer from noise that can impede model convergence and accuracy on a variety of deep learning models. We demonstrate that differentiable activations like GELU and SiLU enable robust propagation of gradients which help to mitigate analog quantization error that is ubiquitous to all analog systems. We perform analysis and training of convolutional, linear, an… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  20. arXiv:2402.01207  [pdf, other

    cs.LG cs.AI stat.ME

    Efficient Causal Graph Discovery Using Large Language Models

    Authors: Thomas Jiralerspong, Xiaoyin Chen, Yash More, Vedant Shah, Yoshua Bengio

    Abstract: We propose a novel framework that leverages LLMs for full causal graph discovery. While previous LLM-based methods have used a pairwise query approach, this requires a quadratic number of queries which quickly becomes impractical for larger causal graphs. In contrast, the proposed framework uses a breadth-first search (BFS) approach which allows it to use only a linear number of queries. We also s… ▽ More

    Submitted 20 July, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  21. arXiv:2401.06378  [pdf, ps, other

    cs.DS

    New Lower Bounds in Merlin-Arthur Communication and Graph Streaming Verification

    Authors: Prantar Ghosh, Vihan Shah

    Abstract: We show new lower bounds in the \emph{Merlin-Arthur} (MA) communication model and the related \emph{annotated streaming} or stream verification model. The MA communication model is an enhancement of the classical communication model, where in addition to the usual players Alice and Bob, there is an all-powerful but untrusted player Merlin who knows their inputs and tries to convince them about the… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: To appear in ITCS 2024

  22. arXiv:2401.03271  [pdf, other

    eess.IV cs.CV cs.IR

    Analysis and Validation of Image Search Engines in Histopathology

    Authors: Isaiah Lahr, Saghir Alfasly, Peyman Nejat, Jibran Khan, Luke Kottom, Vaishnavi Kumbhar, Areej Alsaafin, Abubakr Shafique, Sobhan Hemati, Ghazal Alabtah, Nneka Comfere, Dennis Murphee, Aaron Mangold, Saba Yasir, Chady Meroueh, Lisa Boardman, Vijay H. Shah, Joaquin J. Garcia, H. R. Tizhoosh

    Abstract: Searching for similar images in archives of histology and histopathology images is a crucial task that may aid in patient matching for various purposes, ranging from triaging and diagnosis to prognosis and prediction. Whole slide images (WSIs) are highly detailed digital representations of tissue specimens mounted on glass slides. Matching WSI to WSI can serve as the critical method for patient ma… ▽ More

    Submitted 8 June, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

  23. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  24. arXiv:2311.16094  [pdf, other

    cs.CV cs.GR

    Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images

    Authors: Aiyu Cui, Jay Mahajan, Viraj Shah, Preeti Gomathinayagam, Chang Liu, Svetlana Lazebnik

    Abstract: Most virtual try-on research is motivated to serve the fashion business by generating images to demonstrate garments on studio models at a lower cost. However, virtual try-on should be a broader application that also allows customers to visualize garments on themselves using their own casual photos, known as in-the-wild try-on. Unfortunately, the existing methods, which achieve plausible results f… ▽ More

    Submitted 16 July, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: The abstract and intro are updated. Some typos and some pdf rendering errors have been fixed in the version

  25. arXiv:2311.15268  [pdf, other

    cs.LG cs.AI

    Unlearning via Sparse Representations

    Authors: Vedant Shah, Frederik Träuble, Ashish Malik, Hugo Larochelle, Michael Mozer, Sanjeev Arora, Yoshua Bengio, Anirudh Goyal

    Abstract: Machine \emph{unlearning}, which involves erasing knowledge about a \emph{forget set} from a trained model, can prove to be costly and infeasible by existing techniques. We propose a nearly compute-free zero-shot unlearning technique based on a discrete representational bottleneck. We show that the proposed technique efficiently unlearns the forget set and incurs negligible damage to the model's p… ▽ More

    Submitted 10 October, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

  26. arXiv:2311.13600  [pdf, other

    cs.CV cs.GR cs.LG

    ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs

    Authors: Viraj Shah, Nataniel Ruiz, Forrester Cole, Erika Lu, Svetlana Lazebnik, Yuanzhen Li, Varun Jampani

    Abstract: Methods for finetuning generative models for concept-driven personalization generally achieve strong results for subject-driven or style-driven generation. Recently, low-rank adaptations (LoRA) have been proposed as a parameter-efficient way of achieving concept-driven personalization. While recent work explores the combination of separate LoRAs to achieve joint generation of learned styles and su… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Project page: https://ziplora.github.io

  27. arXiv:2309.11510  [pdf, other

    cs.IR cs.AI cs.CV

    When is a Foundation Model a Foundation Model

    Authors: Saghir Alfasly, Peyman Nejat, Sobhan Hemati, Jibran Khan, Isaiah Lahr, Areej Alsaafin, Abubakr Shafique, Nneka Comfere, Dennis Murphree, Chady Meroueh, Saba Yasir, Aaron Mangold, Lisa Boardman, Vijay Shah, Joaquin J. Garcia, H. R. Tizhoosh

    Abstract: Recently, several studies have reported on the fine-tuning of foundation models for image-text modeling in the field of medicine, utilizing images from online data sources such as Twitter and PubMed. Foundation models are large, deep artificial neural networks capable of learning the context of a specific domain through training on exceptionally extensive datasets. Through validation, we have obse… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  28. arXiv:2309.03844  [pdf, other

    cs.NI eess.SP

    Experimental Study of Adversarial Attacks on ML-based xApps in O-RAN

    Authors: Naveen Naik Sapavath, Brian Kim, Kaushik Chowdhury, Vijay K Shah

    Abstract: Open Radio Access Network (O-RAN) is considered as a major step in the evolution of next-generation cellular networks given its support for open interfaces and utilization of artificial intelligence (AI) into the deployment, operation, and maintenance of RAN. However, due to the openness of the O-RAN architecture, such AI models are inherently vulnerable to various adversarial machine learning (ML… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted for Globecom 2023

  29. arXiv:2308.07071  [pdf, other

    cs.RO

    RL-based Variable Horizon Model Predictive Control of Multi-Robot Systems using Versatile On-Demand Collision Avoidance

    Authors: Shreyash Gupta, Abhinav Kumar, Niladri S. Tripathy, Suril V. Shah

    Abstract: Multi-robot systems have become very popular in recent years because of their wide spectrum of applications, ranging from surveillance to cooperative payload transportation. Model Predictive Control (MPC) is a promising controller for multi-robot control because of its preview capability and ability to handle constraints easily. The performance of the MPC widely depends on many parameters, among w… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  30. arXiv:2308.02053  [pdf, other

    cs.CL cs.AI cs.CY

    The Unequal Opportunities of Large Language Models: Revealing Demographic Bias through Job Recommendations

    Authors: Abel Salinas, Parth Vipul Shah, Yuzhong Huang, Robert McCormack, Fred Morstatter

    Abstract: Large Language Models (LLMs) have seen widespread deployment in various real-world applications. Understanding these biases is crucial to comprehend the potential downstream consequences when using LLMs to make decisions, particularly for historically disadvantaged groups. In this work, we propose a simple method for analyzing and comparing demographic bias in LLMs, through the lens of job recomme… ▽ More

    Submitted 9 January, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted to EAAMO 2023

  31. arXiv:2307.12473  [pdf, other

    cs.IT

    Adaptive RRI Selection Algorithms for Improved Cooperative Awareness in Decentralized NR-V2X

    Authors: Avik Dayal, Vijay K. Shah, Harpreet S. Dhillon, Jeffrey H. Reed

    Abstract: Decentralized vehicle-to-everything (V2X) networks (i.e., C-V2X Mode-4 and NR-V2X Mode-2) utilize sensing-based semi-persistent scheduling (SPS) where vehicles sense and reserve suitable radio resources for Basic Safety Message (BSM) transmissions at prespecified periodic intervals termed as Resource Reservation Interval (RRI). Vehicles rely on these received periodic BSMs to localize nearby (tran… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  32. Machine Vision Using Cellphone Camera: A Comparison of deep networks for classifying three challenging denominations of Indian Coins

    Authors: Keyur D. Joshi, Dhruv Shah, Varshil Shah, Nilay Gandhi, Sanket J. Shah, Sanket B. Shah

    Abstract: Indian currency coins come in a variety of denominations. Off all the varieties Rs.1, RS.2, and Rs.5 have similar diameters. Majority of the coin styles in market circulation for denominations of Rs.1 and Rs.2 coins are nearly the same except for numerals on its reverse side. If a coin is resting on its obverse side, the correct denomination is not distinguishable by humans. Therefore, it was hypo… ▽ More

    Submitted 12 May, 2023; originally announced June 2023.

    Comments: 6 Pages, 4 Figures, 6 Tables, Conference paper

  33. arXiv:2305.13582  [pdf, other

    cs.CL

    Translation and Fusion Improves Zero-shot Cross-lingual Information Extraction

    Authors: Yang Chen, Vedaant Shah, Alan Ritter

    Abstract: Large language models (LLMs) combined with instruction tuning have shown significant progress in information extraction (IE) tasks, exhibiting strong generalization capabilities to unseen datasets by following annotation guidelines. However, their applicability to low-resource languages remains limited due to lack of both labeled data for fine-tuning, and unlabeled text for pre-training. In this p… ▽ More

    Submitted 20 June, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

  34. arXiv:2305.11321  [pdf, other

    cs.CV

    JoIN: Joint GANs Inversion for Intrinsic Image Decomposition

    Authors: Viraj Shah, Svetlana Lazebnik, Julien Philip

    Abstract: In this work, we propose to solve ill-posed inverse imaging problems using a bank of Generative Adversarial Networks (GAN) as a prior and apply our method to the case of Intrinsic Image Decomposition for faces and materials. Our method builds on the demonstrated success of GANs to capture complex image distributions. At the core of our approach is the idea that the latent space of a GAN is a well-… ▽ More

    Submitted 22 January, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Project webpage is available at https://virajshah.com/join

  35. arXiv:2304.14403  [pdf, other

    cs.CV cs.GR cs.LG

    Make It So: Steering StyleGAN for Any Image Inversion and Editing

    Authors: Anand Bhattad, Viraj Shah, Derek Hoiem, D. A. Forsyth

    Abstract: StyleGAN's disentangled style representation enables powerful image editing by manipulating the latent variables, but accurately mapping real-world images to their latent variables (GAN inversion) remains a challenge. Existing GAN inversion methods struggle to maintain editing directions and produce realistic results. To address these limitations, we propose Make It So, a novel GAN inversion met… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: project: https://anandbhattad.github.io/makeitso/

  36. arXiv:2304.02068  [pdf, other

    cs.GT eess.SY

    Battlefield Transfers in Coalitional Blotto Games

    Authors: Vade Shah, Jason R. Marden

    Abstract: In competitive resource allocation environments, agents often choose to form alliances; however, for some agents, doing so may not always be beneficial. Is there a method of forming alliances that always reward each of their members? We study this question using the framework of the coalitional Blotto game, in which two players compete against a common adversary by allocating their budgeted resour… ▽ More

    Submitted 16 October, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

  37. arXiv:2303.06288  [pdf, ps, other

    cs.DS

    Generalizing Greenwald-Khanna Streaming Quantile Summaries for Weighted Inputs

    Authors: Sepehr Assadi, Nirmit Joshi, Milind Prabhu, Vihan Shah

    Abstract: Estimating quantiles, like the median or percentiles, is a fundamental task in data mining and data science. A (streaming) quantile summary is a data structure that can process a set S of n elements in a streaming fashion and at the end, for any phi in (0,1], return a phi-quantile of S up to an eps error, i.e., return a phi'-quantile with phi'=phi +- eps. We are particularly interested in comparis… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 33 pages, 7 figures, International Conference on Database Theory 2023

  38. Keep It Simple: CNN Model Complexity Studies for Interference Classification Tasks

    Authors: Taiwo Oyedare, Vijay K. Shah, Daniel J. Jakubisin, Jeffrey H. Reed

    Abstract: The growing number of devices using the wireless spectrum makes it important to find ways to minimize interference and optimize the use of the spectrum. Deep learning models, such as convolutional neural networks (CNNs), have been widely utilized to identify, classify, or mitigate interference due to their ability to learn from the data directly. However, there have been limited research on the co… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 6 pages, 7 figures, 3 tables

  39. Learning Vision-based Robotic Manipulation Tasks Sequentially in Offline Reinforcement Learning Settings

    Authors: Sudhir Pratap Yadav, Rajendra Nagar, Suril V. Shah

    Abstract: With the rise of deep reinforcement learning (RL) methods, many complex robotic manipulation tasks are being solved. However, harnessing the full power of deep learning requires large datasets. Online-RL does not suit itself readily into this paradigm due to costly and time-taking agent environment interaction. Therefore recently, many offline-RL algorithms have been proposed to learn robotic task… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: 7 pages, 5 Figures

  40. arXiv:2211.16047  [pdf, other

    cs.AI cs.LG cs.LO

    Neural Feature-Adaptation for Symbolic Predictions Using Pre-Training and Semantic Loss

    Authors: Vedant Shah, Aditya Agrawal, Lovekesh Vig, Ashwin Srinivasan, Gautam Shroff, Tanmay Verlekar

    Abstract: We are interested in neurosymbolic systems consisting of a high-level symbolic layer for explainable prediction in terms of human-intelligible concepts; and a low-level neural layer for extracting symbols required to generate the symbolic explanation. Real data is often imperfect meaning that even if the symbolic theory remains unchanged, we may still need to address the problem of mapping raw dat… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  41. arXiv:2211.04685  [pdf, ps, other

    cs.DS

    Tight Bounds for Vertex Connectivity in Dynamic Streams

    Authors: Sepehr Assadi, Vihan Shah

    Abstract: We present a streaming algorithm for the vertex connectivity problem in dynamic streams with a (nearly) optimal space bound: for any $n$-vertex graph $G$ and any integer $k \geq 1$, our algorithm with high probability outputs whether or not $G$ is $k$-vertex-connected in a single pass using $\widetilde{O}(k n)$ space. Our upper bound matches the known $Ω(k n)$ lower bound for this problem even i… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Full version of the paper accepted to SOSA 2023. 15 pages, 3 Figures

  42. arXiv:2210.10048  [pdf

    cs.LG cs.ET physics.optics

    AnalogVNN: A fully modular framework for modeling and optimizing photonic neural networks

    Authors: Vivswan Shah, Nathan Youngblood

    Abstract: AnalogVNN, a simulation framework built on PyTorch which can simulate the effects of optoelectronic noise, limited precision, and signal normalization present in photonic neural network accelerators. We use this framework to train and optimize linear and convolutional neural networks with up to 9 layers and ~1.7 million parameters, while gaining insights into how normalization, activation function… ▽ More

    Submitted 6 June, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: 28 pages; replace figure 6C; better format; updated links

    Report number: 026116

    Journal ref: APL Mach. Learn. 1 June 2023; 1 (2): 026116

  43. arXiv:2210.08016  [pdf, other

    q-bio.QM cs.LG q-bio.BM

    Prediction of drug effectiveness in rheumatoid arthritis patients based on machine learning algorithms

    Authors: Shengjia Chen, Nikunj Gupta, Woodward B. Galbraith, Valay Shah, Jacopo Cirrone

    Abstract: Rheumatoid arthritis (RA) is an autoimmune condition caused when patients' immune system mistakenly targets their own tissue. Machine learning (ML) has the potential to identify patterns in patient electronic health records (EHR) to forecast the best clinical treatment to improve patient outcomes. This study introduced a Drug Response Prediction (DRP) framework with two main goals: 1) design a dat… ▽ More

    Submitted 21 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: 13 pages, 5 figures, to be published in ICBBE 2022

  44. arXiv:2210.04120  [pdf, other

    cs.CV

    MultiStyleGAN: Multiple One-shot Image Stylizations using a Single GAN

    Authors: Viraj Shah, Ayush Sarkar, Sudharsan Krishnakumar Anitha, Svetlana Lazebnik

    Abstract: Image stylization aims at applying a reference style to arbitrary input images. A common scenario is one-shot stylization, where only one example is available for each reference style. Recent approaches for one-shot stylization such as JoJoGAN fine-tune a pre-trained StyleGAN2 generator on a single style reference image. However, such methods cannot generate multiple stylizations without fine-tuni… ▽ More

    Submitted 20 April, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: Project webpage available at https://virajshah.com/multistyle

  45. arXiv:2210.03022  [pdf, other

    cs.AI cs.LG

    Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning

    Authors: Dianbo Liu, Vedant Shah, Oussama Boussif, Cristian Meo, Anirudh Goyal, Tianmin Shu, Michael Mozer, Nicolas Heess, Yoshua Bengio

    Abstract: In cooperative multi-agent reinforcement learning, a team of agents works together to achieve a common goal. Different environments or tasks may require varying degrees of coordination among agents in order to achieve the goal in an optimal way. The nature of coordination will depend on the properties of the environment -- its spatial layout, distribution of obstacles, dynamics, etc. We term this… ▽ More

    Submitted 6 October, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Published at ICLR 2023

  46. arXiv:2209.08750  [pdf, other

    cs.AI cs.LG

    Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces

    Authors: Vishwa Shah, Aditya Sharma, Gautam Shroff, Lovekesh Vig, Tirtharaj Dash, Ashwin Srinivasan

    Abstract: Analogical Reasoning problems challenge both connectionist and symbolic AI systems as these entail a combination of background knowledge, reasoning and pattern recognition. While symbolic systems ingest explicit domain knowledge and perform deductive reasoning, they are sensitive to noise and require inputs be mapped to preset symbolic features. Connectionist systems on the other hand can directly… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 13 pages, 4 figures, Accepted at 16th International Workshop on Neural-Symbolic Learning and Reasoning as part of the 2nd International Joint Conference on Learning & Reasoning (IJCLR 2022)

  47. arXiv:2209.05623  [pdf, ps, other

    cs.DS

    Space Optimal Vertex Cover in Dynamic Streams

    Authors: Kheeran K. Naidu, Vihan Shah

    Abstract: We optimally resolve the space complexity for the problem of finding an $α$-approximate minimum vertex cover ($α$MVC) in dynamic graph streams. We give a randomised algorithm for $α$MVC which uses $O(n^2/α^2)$ bits of space matching Dark and Konrad's lower bound [CCC 2020] up to constant factors. By computing a random greedy matching, we identify `easy' instances of the problem which can trivially… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  48. arXiv:2205.13178  [pdf, other

    cs.NI cs.DC

    Prototyping Next-Generation O-RAN Research Testbeds with SDRs

    Authors: Pratheek S. Upadhyaya, Aly S. Abdalla, Vuk Marojevic, Jeffrey H. Reed, Vijay K. Shah

    Abstract: Open RAN (O-RAN) defines an emerging cellular radio access network (RAN) architecture for future 6G wireless networks, emphasizing openness and intelligence which are considered the foundations of future 6G wireless networks. While the inherent complexity and flexibility of the RAN give rise to many new research problems, progress in developing solutions is hampered due to the lack of end-to-end,… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: This manuscript has been submitted to IEEE Vehicular Technology Magazine for possible publication

  49. arXiv:2205.10607  [pdf, other

    cs.AI

    Coordinating Policies Among Multiple Agents via an Intelligent Communication Channel

    Authors: Dianbo Liu, Vedant Shah, Oussama Boussif, Cristian Meo, Anirudh Goyal, Tianmin Shu, Michael Mozer, Nicolas Heess, Yoshua Bengio

    Abstract: In Multi-Agent Reinforcement Learning (MARL), specialized channels are often introduced that allow agents to communicate directly with one another. In this paper, we propose an alternative approach whereby agents communicate through an intelligent facilitator that learns to sift through and interpret signals provided by all agents to improve the agents' collective performance. To ensure that this… ▽ More

    Submitted 25 May, 2022; v1 submitted 21 May, 2022; originally announced May 2022.

  50. arXiv:2204.10743  [pdf, other

    cs.DB

    An Evaluation of Intra-Transaction Parallelism in Actor-Relational Database Systems

    Authors: Vivek Shah, Marcos Antonio Vaz Salles

    Abstract: Over the past decade, we have witnessed a dramatic evolution in main-memory capacity and multi-core parallelism of server hardware. To leverage this hardware potential, multi-core in-memory OLTP database systems have been extensively re-designed. The core objective of this re-design has been scaling up sequential execution of OLTP transactions, wherein alternative database architectures have been… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.