Showing 1–50 of 105 results for author: Arushi

  1. arXiv:2410.12109  [pdf, other]

    cs.CL cs.CV

    OMCAT: Omni Context Aware Transformer

    Authors: Arushi Goel, Karan Sapra, Matthieu Le, Rafael Valle, Andrew Tao, Bryan Catanzaro

    Abstract: Large Language Models (LLMs) have made significant strides in text generation and comprehension, with recent advancements extending into multimodal LLMs that integrate visual and audio inputs. However, these models continue to struggle with fine-grained, cross-modal temporal understanding, particularly when correlating events across audio and video streams. We address these challenges with two key…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: Demo page: https://om-cat.github.io

  2. arXiv:2409.09222  [pdf]

    cs.HC

    Dark Patterns in the Opt-Out Process and Compliance with the California Consumer Privacy Act (CCPA)

    Authors: Van Hong Tran, Aarushi Mehrotra, Ranya Sharma, Marshini Chetty, Nick Feamster, Jens Frankenreiter, Lior Strahilevitz

    Abstract: To protect consumer privacy, the California Consumer Privacy Act (CCPA) mandates that businesses provide consumers with a straightforward way to opt out of the sale and sharing of their personal information. However, the control that businesses enjoy over the opt-out process allows them to impose hurdles on consumers aiming to opt out, including by employing dark patterns. Motivated by the enactme…

    Submitted 13 September, 2024; originally announced September 2024.

  3. Enumeration of groups in some special varieties of $A$-groups

    Authors: Arushi, Geetha Venkataraman

    Abstract: We find an upper bound for the number of groups of order $n$ up to isomorphism in the variety $G = A_pA_qA_r$, where $p$, $q$ and $r$ are distinct primes. We also find a bound on the orders and on the number of conjugacy classes of subgroups that are maximal amongst the subgroups of the general linear group that are also in the variety $A_qA_r$.

    Submitted 13 September, 2024; originally announced September 2024.

    MSC Class: 20B15; 20B35; 20D10; 20E10; 20E45; 20H30

    Journal ref: Bulletin of the Australian Mathematical Society, First View, pp. 1-11, 27 August 2024

  4. arXiv:2409.07524  [pdf, other]

    hep-ph astro-ph.CO

    Grand Unification at the Cosmological Collider with Chemical Potential

    Authors: Arushi Bodas, Edward Broadberry, Raman Sundrum

    Abstract: We introduce a tree-level chemical potential mechanism for spin-1 particles within cosmological collider physics, allowing them to be detected in primordial non-Gaussianities for masses above the inflationary Hubble scale. We apply this mechanism to orbifold grand unification and the massive unification partners of the standard model gauge bosons. Our mechanism requires at least a pair of massive…

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 33 pages, 7 figures

  5. arXiv:2408.14795  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Electron ptychography reveals a ferroelectricity dominated by anion displacements

    Authors: Harikrishnan K. P., Ruijuan Xu, Kinnary Patel, Kevin J. Crust, Aarushi Khandelwal, Chenyu Zhang, Sergey Prosandeev, Hua Zhou, Yu-Tsun Shao, Laurent Bellaiche, Harold Y. Hwang, David A. Muller

    Abstract: Sodium niobate, a lead-free ferroic material, hosts delicately-balanced, competing order parameters, including ferroelectric states that can be stabilized by epitaxial strain. Here, we show that the resulting macroscopic ferroelectricity exhibits an unconventional microscopic structure using multislice electron ptychography. This technique overcomes multiple scattering artifacts limiting conventio…

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 63 pages, 5 figures, 17 supplementary figures

  6. arXiv:2407.03451  [pdf, other]

    cs.CR cs.HC

    The Role of Privacy Guarantees in Voluntary Donation of Private Data for Altruistic Goals

    Authors: Ruizhe Wang, Roberta De Viti, Aarushi Dubey, Elissa M. Redmiles

    Abstract: Voluntary donation of private information for altruistic purposes, such as advancing research, is common. However, concerns about data misuse and leakage may deter individuals from donating their information. While prior research has indicated that Privacy Enhancement Technologies (PETs) can alleviate these concerns, the extent to which these techniques influence willingness to donate data remains…

    Submitted 3 July, 2024; originally announced July 2024.

  7. arXiv:2406.00314  [pdf, other]

    cs.CL cs.AI cs.LG

    CASE: Efficient Curricular Data Pre-training for Building Assistive Psychology Expert Models

    Authors: Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, TK Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit Sodhi

    Abstract: The limited availability of psychologists necessitates efficient identification of individuals requiring urgent mental healthcare. This study explores the use of Natural Language Processing (NLP) pipelines to analyze text data from online mental health forums used for consultations. By analyzing forum posts, these pipelines can flag users who may require immediate professional attention. A crucial…

    Submitted 2 October, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  8. arXiv:2405.15152  [pdf, other]

    cs.CL cs.AI

    Machine Unlearning in Large Language Models

    Authors: Saaketh Koundinya Gundavarapu, Shreya Agarwal, Arushi Arora, Chandana Thimmalapura Jagadeeshaiah

    Abstract: Machine unlearning, a novel area within artificial intelligence, focuses on addressing the challenge of selectively forgetting or reducing undesirable knowledge or behaviors in machine learning models, particularly in the context of large language models (LLMs). This paper introduces a methodology to align LLMs, such as Open Pre-trained Transformer Language Models, with ethical, privacy, and safet…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 10 pages

  9. arXiv:2405.12842  [pdf, other]

    cs.RO cs.CV

    SmartFlow: Robotic Process Automation using LLMs

    Authors: Arushi Jain, Shubham Paliwal, Monika Sharma, Lovekesh Vig, Gautam Shroff

    Abstract: Robotic Process Automation (RPA) systems face challenges in handling complex processes and diverse screen layouts that require advanced human-like decision-making capabilities. These systems typically rely on pixel-level encoding through drag-and-drop or automation frameworks such as Selenium to create navigation workflows, rather than visual understanding of screen elements. In this context, we p…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 32nd ACM International Conference on Information and Knowledge Management

  10. arXiv:2405.12742  [pdf, other]

    cs.CV

    Multi-Subject Personalization

    Authors: Arushi Jain, Shubham Paliwal, Monika Sharma, Vikram Jamwal, Lovekesh Vig

    Abstract: Creative story illustration requires a consistent interplay of multiple characters or objects. However, conventional text-to-image models face significant challenges while producing images featuring multiple personalized subjects. For example, they distort the subject rendering, or the text descriptions fail to render coherent subject interactions. We present Multi-Subject Personalization (MSP) to…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 2023 Conference on Neural Information Processing Systems

  11. arXiv:2405.12531  [pdf, other]

    cs.CV cs.LG

    CustomText: Customized Textual Image Generation using Diffusion Models

    Authors: Shubham Paliwal, Arushi Jain, Monika Sharma, Vikram Jamwal, Lovekesh Vig

    Abstract: Textual image generation spans diverse fields like advertising, education, product packaging, social media, information visualization, and branding. Despite recent strides in language-guided image synthesis using diffusion models, current models excel in image generation but struggle with accurate text rendering and offer limited control over font attributes. In this paper, we aim to enhance the s…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted by AI for Content Creation (AI4CC) workshop at CVPR 2024

  12. arXiv:2405.07838  [pdf, other]

    cs.LG cs.AI

    Adaptive Exploration for Data-Efficient General Value Function Evaluations

    Authors: Arushi Jain, Josiah P. Hanna, Doina Precup

    Abstract: General Value Functions (GVFs) (Sutton et al., 2011) represent predictive knowledge in reinforcement learning. Each GVF computes the expected return for a given policy, based on a unique reward. Existing methods relying on fixed behavior policies or pre-collected data often face data efficiency issues when learning multiple GVFs in parallel using off-policy methods. To address this, we introduce G…

    Submitted 13 October, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 26 pages, 16 figures, Accepted in NeurIPS 2024 Conference

  13. arXiv:2405.07284  [pdf]

    cs.CV cs.AI

    Zero Shot Context-Based Object Segmentation using SLIP (SAM+CLIP)

    Authors: Saaketh Koundinya Gundavarapu, Arushi Arora, Shreya Agarwal

    Abstract: We present SLIP (SAM+CLIP), an enhanced architecture for zero-shot object segmentation. SLIP combines the Segment Anything Model (SAM) \cite{kirillov2023segment} with the Contrastive Language-Image Pretraining (CLIP) \cite{radford2021learning}. By incorporating text prompts into SAM using CLIP, SLIP enables object segmentation without prior training on specific classes or categories. We fine-tune…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 figures

  14. arXiv:2404.08156  [pdf, other]

    cs.CL

    Multimodal Contextual Dialogue Breakdown Detection for Conversational AI Models

    Authors: Md Messal Monem Miah, Ulie Schnaithmann, Arushi Raghuvanshi, Youngseo Son

    Abstract: Detecting dialogue breakdown in real time is critical for conversational AI systems, because it enables taking corrective action to successfully complete a task. In spoken dialog systems, this breakdown can be caused by a variety of unexpected situations including high levels of background noise, causing STT mistranscriptions, or unexpected user flows. In particular, industry settings like healthc…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Published in NAACL 2024 Industry Track

  15. arXiv:2404.08155  [pdf, other]

    cs.CL

    Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls

    Authors: Amin Hosseiny Marani, Ulie Schnaithmann, Youngseo Son, Akil Iyer, Manas Paldhe, Arushi Raghuvanshi

    Abstract: Current Conversational AI systems employ different machine learning pipelines, as well as external knowledge sources and business logic to predict the next action. Maintaining various components in dialogue managers' pipeline adds complexity in expansion and updates, increases processing time, and causes additive noise through the pipeline that can lead to incorrect next action prediction. This pa…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Published in NAACL 2024 Industry Track

  16. arXiv:2404.07616  [pdf, other]

    cs.CL cs.SD eess.AS

    Audio Dialogues: Dialogues dataset for audio and music understanding

    Authors: Arushi Goel, Zhifeng Kong, Rafael Valle, Bryan Catanzaro

    Abstract: Existing datasets for audio understanding primarily focus on single-turn interactions (i.e. audio captioning, audio question answering) for describing audio in natural language, thus limiting understanding audio via interactive dialogue. To address this gap, we introduce Audio Dialogues: a multi-turn dialogue dataset containing 163.8k samples for general audio sounds and music. In addition to dial…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Demo website: https://audiodialogues.github.io/

  17. arXiv:2404.06318  [pdf, other]

    nucl-th

    Variational Optimization for Constructing Inverse Potentials of Proton-Proton Scattering: A Phase Function Method Study

    Authors: Lalit Kumar, Arushi Sharma, Anil Khachi, Ayushi Awasthi, O. S. K. S. Sastri

    Abstract: Background: The phase-shift analysis for proton-proton scattering has been studied by various research groups using the realistic potentials to be comprised of various internal interactions based on an exchange of pions and mesons, involving a large number of parameters. Purpose: The goal of the research is to construct inverse potentials for various l-channels of proton-proton (pp) elastic scatte…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 13 pages, 5 figures, 5 Tables

  18. arXiv:2403.17225  [pdf]

    cs.HC cs.CR cs.CY

    Measuring Compliance with the California Consumer Privacy Act Over Space and Time

    Authors: Van Tran, Aarushi Mehrotra, Marshini Chetty, Nick Feamster, Jens Frankenreiter, Lior Strahilevitz

    Abstract: The widespread sharing of consumers' personal information with third parties raises significant privacy concerns. The California Consumer Privacy Act (CCPA) mandates that online businesses offer consumers the option to opt out of the sale and sharing of personal information. Our study automatically tracks the presence of the opt-out link longitudinally across multiple states after the California Pr…

    Submitted 25 March, 2024; originally announced March 2024.

  19. arXiv:2402.01831  [pdf, other]

    cs.SD cs.LG eess.AS

    Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

    Authors: Zhifeng Kong, Arushi Goel, Rohan Badlani, Wei Ping, Rafael Valle, Bryan Catanzaro

    Abstract: Augmenting large language models (LLMs) to understand audio -- including non-speech sounds and non-verbal speech -- is critically important for diverse real-world applications of LLMs. In this paper, we propose Audio Flamingo, a novel audio language model with 1) strong audio understanding abilities, 2) the ability to quickly adapt to unseen tasks via in-context learning and retrieval, and 3) stro…

    Submitted 28 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  20. arXiv:2401.12286  [pdf, other]

    hep-ph astro-ph.CO

    A Closer Look in the Mirror: Reflections on the Matter/Dark Matter Coincidence

    Authors: Arushi Bodas, Manuel A. Buen-Abad, Anson Hook, Raman Sundrum

    Abstract: We argue that the striking similarity between the cosmic abundances of baryons and dark matter, despite their very different astrophysical behavior, strongly motivates the scenario in which dark matter resides within a rich dark sector parallel in structure to that of the standard model. The near cosmic coincidence is then explained by an approximate $\mathbb{Z}_2$ exchange symmetry between the tw…

    Submitted 12 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: Published version. 29 pages + references, 6 figures, 2 sectors

    Report number: FERMILAB-PUB-24-0023-T-V

  21. arXiv:2401.07614  [pdf, other]

    cond-mat.supr-con

    Broken time-reversal symmetry in a new non-centrosymmetric superconductor Re8NbTa

    Authors: R. K. Kushwaha, Arushi, S. Sharma, S. Srivastava, P. K. Meena, M. Pula, J. Beare, J. Gautreau, A. D. Hillier, G. M. Luke, R. P. Singh

    Abstract: Re-based superconductors provide a rich platform for the study of unconventional superconductivity. We have investigated the superconducting properties of Re$_{8}$NbTa, a new noncentrosymmetric cubic ($α$-Mn structure) rhenium-based ternary superconductor using transport, magnetization, specific heat, and muon spin rotation/relaxation ($μ$SR) measurements. Specific heat and transverse field $μ$SR…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 9 pages, 7 figures

  22. arXiv:2312.10991  [pdf, other]

    cond-mat.mtrl-sci cond-mat.other

    Defect-driven tunable electronic and optical properties of two-dimensional silicon carbide

    Authors: Arushi Singh, Vikram Mahamiya, Alok Shukla

    Abstract: Recently, an atomic-scale two-dimensional silicon carbide monolayer has been synthesized [Polley et al., Phys. Rev. Lett. 130, 076203 (2023)] which opens up new possibilities for developing next-generation electronic and optoelectronic devices. Our study predicts the pristine SiC monolayer to have an "indirect" band gap of 3.38 eV $(K\rightarrow M)$ and a "direct" band gap o…

    Submitted 21 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 34 pages, 13 figures (manuscript 31 pages, 10 figures + supplemental material 3 pages, 3 figures)

    Journal ref: Phys. Rev. B 108, 235311 (2023)

  23. arXiv:2311.18507  [pdf, other]

    cond-mat.str-el cond-mat.supr-con

    Ce$_{2}$Ir$_{3}$Ga$_{5}$ : a new locally non-centrosymmetric heavy fermion system

    Authors: Arushi, Raul Cardoso-Gil, Christoph Geibel

    Abstract: Recently, a new type of unconventional superconductivity with a field-induced transition between two different superconducting (SC) states was discovered in the heavy fermion system CeRh$_{2}$As$_{2}$. This unusual SC state was proposed to be based on specific symmetries of the underlying structure, i.e., a globally centrosymmetric layered structure, but where the Ce-layers themselves lack inversi…

    Submitted 1 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 9 pages, 4 figures

  24. arXiv:2311.05779  [pdf, other]

    cs.RO cs.CV

    Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in Clutter

    Authors: Georgios Tziafas, Yucheng Xu, Arushi Goel, Mohammadreza Kasaei, Zhibin Li, Hamidreza Kasaei

    Abstract: Robots operating in human-centric environments require the integration of visual grounding and grasping capabilities to effectively manipulate objects based on user instructions. This work focuses on the task of referring grasp synthesis, which predicts a grasp pose for an object referred through natural language in cluttered scenes. Existing approaches often employ multi-stage pipelines that firs…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Poster CoRL 2023. Dataset and code available here: https://github.com/gtziafas/OCID-VLG

  25. arXiv:2311.01971  [pdf, other]

    astro-ph.EP

    Photometry of the Didymos system across the DART impact apparition

    Authors: Nicholas Moskovitz, Cristina Thomas, Petr Pravec, Tim Lister, Tom Polakis, David Osip, Theodore Kareta, Agata Rożek, Steven R. Chesley, Shantanu P. Naidu, Peter Scheirich, William Ryan, Eileen Ryan, Brian Skiff, Colin Snodgrass, Matthew M. Knight, Andrew S. Rivkin, Nancy L. Chabot, Vova Ayvazian, Irina Belskaya, Zouhair Benkhaldoun, Daniel N. Berteşteanu, Mariangela Bonavita, Terrence H. Bressi, Melissa J. Brucker, et al. (56 additional authors not shown)

    Abstract: On 26 September 2022, the Double Asteroid Redirection Test (DART) spacecraft impacted Dimorphos, the satellite of binary near-Earth asteroid (65803) Didymos. This demonstrated the efficacy of a kinetic impactor for planetary defense by changing the orbital period of Dimorphos by 33 minutes (Thomas et al. 2023). Measuring the period change relied heavily on a coordinated campaign of lightcurve phot…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 52 pages, 5 tables, 9 figures, accepted to PSJ

  26. arXiv:2311.01019  [pdf, other]

    cond-mat.supr-con

    Superconducting Properties of Topological Semimetal 1$T$-RhSeTe

    Authors: C. Patra, T. Agarwal, Arushi, P. Manna, N. Bhatt, R. S. Singh, R. P. Singh

    Abstract: Platinum-group transition-metal dichalcogenides have emerged as a subject of considerable interest in condensed matter physics due to their remarkable topological properties and unconventional superconducting behavior. In this study, we report the synthesis and superconducting characteristics of a new Dirac-type topological semimetallic compound 1$T$-RhSeTe. It shows type-II superconductivity with…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 8 pages, 7 figures

  27. arXiv:2310.17567  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models

    Authors: Dingli Yu, Simran Kaur, Arushi Gupta, Jonah Brown-Cohen, Anirudh Goyal, Sanjeev Arora

    Abstract: With LLMs shifting their role from statistical modeling of language to serving as general-purpose AI agents, how should LLM evaluations change? Arguably, a key ability of an AI agent is to flexibly combine, as needed, the basic skills it has learned. The capability to combine skills plays an important role in (human) pedagogy and also in a paper on emergence phenomena (Arora & Goyal, 2023). This…

    Submitted 26 October, 2023; originally announced October 2023.

  28. arXiv:2310.13619  [pdf, other]

    cs.CL cs.CV

    Semi-supervised multimodal coreference resolution in image narrations

    Authors: Arushi Goel, Basura Fernando, Frank Keller, Hakan Bilen

    Abstract: In this paper, we study multimodal coreference resolution, specifically where a longer descriptive text, i.e., a narration is paired with an image. This poses significant challenges due to fine-grained image-text alignment, inherent ambiguity present in narrative language, and unavailability of large annotated training sets. To tackle these challenges, we present a data efficient semi-supervised a…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Long paper at EMNLP'23-Main

  29. arXiv:2310.07093  [pdf, other]

    cs.CL

    Argumentative Stance Prediction: An Exploratory Study on Multimodality and Few-Shot Learning

    Authors: Arushi Sharma, Abhibha Gupta, Maneesh Bilalpur

    Abstract: To advance argumentative stance prediction as a multimodal problem, the First Shared Task in Multimodal Argument Mining hosted stance prediction in crucial social topics of gun control and abortion. Our exploratory study attempts to evaluate the necessity of images for stance prediction in tweets and compare out-of-the-box text-based large-language models (LLM) in few-shot settings against fine-tu…

    Submitted 10 October, 2023; originally announced October 2023.

  30. arXiv:2309.11580  [pdf, other]

    cs.RO

    A real-time, hardware agnostic framework for close-up branch reconstruction using RGB data

    Authors: Alexander You, Aarushi Mehta, Luke Strohbehn, Jochen Hemming, Cindy Grimm, Joseph R. Davidson

    Abstract: Creating accurate 3D models of tree topology is an important task for tree pruning. The 3D model is used to decide which branches to prune and then to execute the pruning cuts. Previous methods for creating 3D tree models have typically relied on point clouds, which are often computationally expensive to process and can suffer from data defects, especially with thin branches. In this paper, we pro…

    Submitted 18 June, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  31. arXiv:2309.08687  [pdf, other]

    cs.DC physics.plasm-ph

    Speeding up charge exchange recombination spectroscopy analysis in support of NERSC/DIII-D realtime workflow

    Authors: Aarushi Jain, Laurie Stephey, Erik Linsenmayer, Colin Chrystal, Jonathan Dursi, Hannah Ross

    Abstract: We report optimization work made in support of the development of a realtime Superfacility workflow between DIII-D and NERSC. At DIII-D, the ion properties measured by charge exchange recombination (CER) spectroscopy are required inputs for a Superfacility realtime workflow that computes the full plasma kinetic equilibrium. In this workflow, minutes matter since the results must be ready during th…

    Submitted 18 September, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures. Not a preprint: this work was rejected from a conference proceedings, so arXiv will hopefully be the final home. Updated to add arXiv link/DOI to header of paper.

  32. arXiv:2308.11266  [pdf, other]

    nucl-th

    Constructing Inverse Scattering Potentials for α-α System using Reference Potential Approach

    Authors: O. S. K. S. Sastri, Arushi Sharma, Ayushi Awasthi

    Abstract: Background: An accurate way to incorporate long range Coulomb interaction alongside short-range nuclear interaction has been a challenge for theoretical physicists. Purpose: In this paper, we propose a methodology based on the reference potential approach for constructing inverse potentials of alpha-alpha scattering. Methods: Two smoothly joined Morse potentials, regular for short-range nuclear in…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 10 pages, 4 figures, 2 Tables

  33. arXiv:2308.02748  [pdf]

    cs.CV

    Discrimination of Radiologists Utilizing Eye-Tracking Technology and Machine Learning: A Case Study

    Authors: Stanford Martinez, Carolina Ramirez-Tamayo, Syed Hasib Akhter Faruqui, Kal L. Clark, Adel Alaeddini, Nicholas Czarnek, Aarushi Aggarwal, Sahra Emamzadeh, Jeffrey R. Mock, Edward J. Golob

    Abstract: Perception-related errors comprise most diagnostic mistakes in radiology. To mitigate this problem, radiologists employ personalized and high-dimensional visual search strategies, otherwise known as search patterns. Qualitative descriptions of these search patterns, which involve the physician verbalizing or annotating the order he/she analyzes the image, can be unreliable due to discrepancies in…

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Submitting for Review in "IEEE Journal of Biomedical and Health Informatics"

  34. arXiv:2307.16382  [pdf, other]

    cs.LG cs.CL

    Does fine-tuning GPT-3 with the OpenAI API leak personally-identifiable information?

    Authors: Albert Yu Sun, Eliott Zemour, Arushi Saxena, Udith Vaidyanathan, Eric Lin, Christian Lau, Vaikkunth Mugunthan

    Abstract: Machine learning practitioners often fine-tune generative pre-trained models like GPT-3 to improve model performance at specific tasks. Previous works, however, suggest that fine-tuned machine learning models memorize and emit sensitive information from the original fine-tuning dataset. Companies such as OpenAI offer fine-tuning services for their models, but no prior work has conducted a memoriza…

    Submitted 15 April, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

  35. A Coronal Mass Ejection Source Region Catalogue and their Associated Properties

    Authors: Satabdwa Majumdar, Ritesh Patel, Vaibhav Pant, Dipankar Banerjee, Aarushi Rawat, Abhas Pradhan, Paritosh Singh

    Abstract: The primary objective of this study is to connect the coronal mass ejections (CMEs) to their source regions, primarily creating a CME source region (CSR) catalogue, and secondly probing into the influence the source regions have on different statistical properties of CMEs. We create a source region catalogue for 3327 CMEs from 1998 to 2017, thus capturing the different phases of cycle 23 and 24. T…

    Submitted 26 October, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 29 Pages, 18 Figures. Accepted in The Astrophysical Journal Supplement Series (APJS)

  36. arXiv:2306.09224  [pdf, other]

    cs.CV

    Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

    Authors: Thomas Mensink, Jasper Uijlings, Lluis Castrejon, Arushi Goel, Felipe Cadar, Howard Zhou, Fei Sha, André Araujo, Vittorio Ferrari

    Abstract: We propose Encyclopedic-VQA, a large scale visual question answering (VQA) dataset featuring visual questions about detailed properties of fine-grained categories and instances. It contains 221k unique question+answer pairs each matched with (up to) 5 images, resulting in a total of 1M VQA samples. Moreover, our dataset comes with a controlled knowledge base derived from Wikipedia, marking the evi…

    Submitted 24 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ICCV'23

  37. arXiv:2306.03959  [pdf, other]

    cs.CL cs.IR

    Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction

    Authors: Julia White, Arushi Raghuvanshi, Yada Pruksachatkun

    Abstract: Task-oriented dialogues often require agents to enact complex, multi-step procedures in order to meet user requests. While large language models have found success automating these dialogues in constrained environments, their widespread deployment is limited by the substantial quantities of task-specific data required for training. The following paper presents a data-efficient solution to construc…

    Submitted 6 June, 2023; originally announced June 2023.

  38. arXiv:2305.17552  [pdf, other]

    cs.LG math.OC

    Online Nonstochastic Model-Free Reinforcement Learning

    Authors: Udaya Ghai, Arushi Gupta, Wenhan Xia, Karan Singh, Elad Hazan

    Abstract: We investigate robust model-free reinforcement learning algorithms designed for environments that may be dynamic or even adversarial. Traditional state-based policies often struggle to accommodate the challenges imposed by the presence of unmodeled disturbances in such settings. Moreover, optimizing linear state-based policies poses an obstacle for efficient optimization, leading to nonconvex objec…

    Submitted 31 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Camera-ready version for NeurIPS 2023

  39. arXiv:2305.08577  [pdf, other]

    cond-mat.supr-con

    Superconducting properties of new hexagonal and noncentrosymmetric cubic high entropy alloys

    Authors: K. Motla, Arushi, S. Jangid, P. Meena, R. K. Kushwaha, R. P. Singh

    Abstract: Superconducting high-entropy alloys (HEAs) are a newly burgeoning field of unconventional superconductors and raise intriguing questions about the presence of superconductivity in highly disordered systems, which lack regular phonon modes. In our study, we have synthesized and investigated the superconducting characteristics of two new transition elements based HEAs Re$_{0.35} $Os$_{0.35} $Mo…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 7 pages, 6 figures

  40. arXiv:2305.00875  [pdf, other]

    cs.SE cs.AI cs.LG

    Redundancy and Concept Analysis for Code-trained Language Models

    Authors: Arushi Sharma, Zefu Hu, Christopher Quinn, Ali Jannesari

    Abstract: Code-trained language models have proven to be highly effective for various code intelligence tasks. However, they can be challenging to train and deploy for many software engineering applications due to computational bottlenecks and memory constraints. Implementing effective strategies to address these issues requires a better understanding of these 'black box' models. In this paper, we perform t…

    Submitted 15 February, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: 4 figures, 6 tables

  41. Phase Stability of Hexagonal/cubic Boron Nitride Nanocomposites

    Authors: Abhijit Biswas, Rui Xu, Joyce Christiansen-Salameh, Eugene Jeong, Gustavo A. Alvarez, Chenxi Li, Anand B. Puthirath, Bin Gao, Arushi Garg, Tia Gray, Harikishan Kannan, Xiang Zhang, Jacob Elkins, Tymofii S. Pieshkov, Robert Vajtai, A. Glen Birdwell, Mahesh R. Neupane, Bradford B. Pate, Tony Ivanov, Elias J. Garratt, Pengcheng Dai, Hanyu Zhu, Zhiting Tian, Pulickel M. Ajayan

    Abstract: Boron nitride (BN) is an exceptional material and among its polymorphs, two-dimensional (2D) hexagonal and three-dimensional (3D) cubic BN (h-BN and c-BN) phases are most common. The phase stability regimes of these BN phases are still under debate and phase transformations of h-BN/c-BN remain a topic of interest. Here, we investigate the phase stability of 2D/3D h-BN/c-BN nanocomposites and show…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 29 pages, 5 figures

    Journal ref: Nano Lett. 2023, 23, 15, 6927

  42. arXiv:2303.09608  [pdf, other]

    cs.CV

    VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection

    Authors: Arushi Rai, Adriana Kovashka

    Abstract: The use of large-scale vision-language datasets is limited for object detection due to the negative impact of label noise on localization. Prior methods have shown how such large-scale datasets can be used for pretraining, which can provide initial signal for localization, but is insufficient without clean bounding-box data for at least some categories. We propose a technique to "vet" labels extra…

    Submitted 10 March, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL) 2024 camera-ready

  43. arXiv:2303.05323  [pdf, other]

    cs.CV

    Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE

    Authors: Yucheng Xu, Li Nanbo, Arushi Goel, Zijian Guo, Zonghai Yao, Hamidreza Kasaei, Mohammadreze Kasaei, Zhibin Li

    Abstract: Videos depict the change of complex dynamical systems over time in the form of discrete image sequences. Generating controllable videos by learning the dynamical system is an important yet underexplored topic in the computer vision community. This paper presents a novel framework, TiV-ODE, to generate highly controllable videos from a static image and a text caption. Specifically, our framework le…

    Submitted 4 April, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  44. Superconducting ground state study of valence skip compound AgSnSe$_2$

    Authors: A. Kataria, Arushi, S. Sharma, T. Agarwal, M. Pula, J. Beare, S. Yoon, Y. Cai, K. M. Kojima, G. M. Luke, R. P. Singh

    Abstract: The valence-skipped superconductors are natural candidates for unconventional superconductivity, as they can exhibit a negative effective, attractive interaction for electron-pairing. This work reports comprehensive XRD, magnetization, specific heat and muon spin rotation and relaxation measurements ($μ$SR) on a valence-skipped compound: AgSnSe$_2$. The temperature dependence of the electronic spe…

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 8 pages, 6 figures

  45. arXiv:2301.02242  [pdf, other]

    q-bio.GN cs.LG

    Graph Contrastive Learning for Multi-omics Data

    Authors: Nishant Rajadhyaksha, Aarushi Chitkara

    Abstract: Advancements in technologies related to working with omics data require novel computation methods to fully leverage information and help develop a better understanding of human diseases. This paper studies the effects of introducing graph contrastive learning to help leverage graph structure and information to produce better representations for downstream classification tasks for multi-omics datas…

    Submitted 3 January, 2023; originally announced January 2023.

  46. arXiv:2211.14563  [pdf, other]

    cs.CV cs.CL

    Who are you referring to? Coreference resolution in image narrations

    Authors: Arushi Goel, Basura Fernando, Frank Keller, Hakan Bilen

    Abstract: Coreference resolution aims to identify words and phrases which refer to the same entity in a text, a core task in natural language processing. In this paper, we extend this task to resolving coreferences in long-form narrations of visual scenes. First, we introduce a new dataset with annotated coreference chains and their bounding boxes, as most existing image-text datasets only contain short sentence…

    Submitted 17 March, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: 15 pages

  47. arXiv:2211.09301  [pdf, other]

    hep-ph astro-ph.CO

    Large Primordial Fluctuations in Gravitational Waves from Phase Transitions

    Authors: Arushi Bodas, Raman Sundrum

    Abstract: It is well-known that first order phase transitions in the early universe can be a powerful source of observable stochastic gravitational wave backgrounds. Any such gravitational wave background must exhibit large-scale anisotropies at least as large as those seen in the CMB $\sim 10^{-5}$, providing a valuable new window onto the (inflationary) origins of primordial fluctuations. While significan…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 21 pages, 6 figures

  48. arXiv:2211.02912  [pdf, other]

    stat.ML cs.LG

    New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound

    Authors: Arushi Gupta, Nikunj Saunshi, Dingli Yu, Kaifeng Lyu, Sanjeev Arora

    Abstract: Saliency methods compute heat maps that highlight portions of an input that were most important for the label assigned to it by a deep net. Evaluations of saliency methods convert this heat map into a new masked input by retaining the $k$ highest-ranked pixels of the original input and replacing the rest with "uninformative" pixels, and checking if th…

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 (Oral)

  49. arXiv:2210.06257  [pdf, other]

    cs.CV cs.LG eess.IV

    What can we learn about a generated image corrupting its latent representation?

    Authors: Agnieszka Tomczak, Aarushi Gupta, Slobodan Ilic, Nassir Navab, Shadi Albarqouni

    Abstract: Generative adversarial networks (GANs) offer an effective solution to the image-to-image translation problem, thereby allowing for new possibilities in medical imaging. They can translate images from one imaging modality to another at a low cost. For unpaired datasets, they rely mostly on cycle loss. Despite its effectiveness in learning the underlying data distribution, it can lead to a discrepan…

    Submitted 12 October, 2022; originally announced October 2022.

  50. arXiv:2210.01072  [pdf, other]

    cs.LG cs.AI

    Understanding Influence Functions and Datamodels via Harmonic Analysis

    Authors: Nikunj Saunshi, Arushi Gupta, Mark Braverman, Sanjeev Arora

    Abstract: Influence functions estimate the effect of individual data points on predictions of the model on test data and were adapted to deep learning in Koh and Liang [2017]. They have been used for detecting data poisoning, detecting helpful and harmful examples, estimating the influence of groups of datapoints, etc. Recently, Ilyas et al. [2022] introduced a linear regression method they termed datamodels to predict the ef…

    Submitted 3 October, 2022; originally announced October 2022.