Skip to main content

Showing 1–50 of 464 results for author: Shah, R

  1. arXiv:2410.15012  [pdf

    eess.IV cs.AI cs.CV

    Pathologist-like explainable AI for interpretable Gleason grading in prostate cancer

    Authors: Gesa Mittmann, Sara Laiouar-Pedari, Hendrik A. Mehrtens, Sarah Haggenmüller, Tabea-Clara Bucher, Tirtha Chanda, Nadine T. Gaisa, Mathias Wagner, Gilbert Georg Klamminger, Tilman T. Rau, Christina Neppl, Eva Maria Compérat, Andreas Gocht, Monika Hämmerle, Niels J. Rupp, Jula Westhoff, Irene Krücken, Maximillian Seidl, Christian M. Schürch, Marcus Bauer, Wiebke Solass, Yu Chun Tam, Florian Weber, Rainer Grobholz, Jaroslaw Augustyniak , et al. (41 additional authors not shown)

    Abstract: The aggressiveness of prostate cancer, the most common cancer in men worldwide, is primarily assessed based on histopathological data using the Gleason scoring system. While artificial intelligence (AI) has shown promise in accurately predicting Gleason scores, these predictions often lack inherent explainability, potentially leading to distrust in human-machine interactions. To address this issue… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 58 pages, 15 figures (incl. supplementary)

  2. arXiv:2410.13331  [pdf, other

    cs.LG cs.AI

    Improving Discrete Optimisation Via Decoupled Straight-Through Gumbel-Softmax

    Authors: Rushi Shah, Mingyuan Yan, Michael Curtis Mozer, Dianbo Liu

    Abstract: Discrete representations play a crucial role in many deep learning architectures, yet their non-differentiable nature poses significant challenges for gradient-based optimization. To address this issue, various gradient estimators have been developed, including the Straight-Through Gumbel-Softmax (ST-GS) estimator, which combines the Straight-Through Estimator (STE) and the Gumbel-based reparamete… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  3. arXiv:2410.11086  [pdf, other

    cs.CL

    JOOCI: a Framework for Learning Comprehensive Speech Representations

    Authors: Hemant Yadav, Rajiv Ratn Shah, Sunayana Sitaram

    Abstract: Information in speech can be divided into two categories: what is being said (content) and how it is expressed (other). Current state-of-the-art (SOTA) techniques model speech at fixed segments, usually 10-25 ms, using a single embedding. Given the orthogonal nature of other and content information, attempting to optimize both within a single embedding results in suboptimal solutions. This approac… ▽ More

    Submitted 16 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: Submitted to ICLR 2025

  4. arXiv:2410.10180  [pdf, other

    cs.LG stat.ML

    Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior

    Authors: Mingyuan Yan, Jiawei Wu, Rushi Shah, Dianbo Liu

    Abstract: The vector quantization is a widely used method to map continuous representation to discrete space and has important application in tokenization for generative mode, bottlenecking information and many other tasks in machine learning. Vector Quantized Variational Autoencoder (VQ-VAE) is a type of variational autoencoder using discrete embedding as latent. We generalize the technique further, enrich… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  5. arXiv:2410.06237  [pdf, other

    cs.RO cs.AI

    BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation

    Authors: Rutav Shah, Albert Yu, Yifeng Zhu, Yuke Zhu, Roberto Martín-Martín

    Abstract: To operate at a building scale, service robots must perform very long-horizon mobile manipulation tasks by navigating to different rooms, accessing different floors, and interacting with a wide and unseen range of everyday objects. We refer to these tasks as Building-wide Mobile Manipulation. To tackle these inherently long-horizon tasks, we introduce BUMBLE, a unified Vision-Language Model (VLM)-… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 7 Figures, 2 Tables, 11 Pages

  6. arXiv:2410.05719  [pdf, other

    hep-ex astro-ph.HE

    Charge ratio of cosmic ray muons in momentum range ~ 1 to 3 GeV/c

    Authors: Raj Shah, J. M. John, Suryanarayan Mondal, S. Pethuraj, G. Majumder, P. Shukla

    Abstract: This work presents the measurements of the cosmic muon charge ratio as a function of full azimuthal angle and momentum within the range of 0.8 to 3.0 GeV/c, using the mini-ICAL detector. The detector, comprising 10 layers of RPCs, has collected cosmic muon data since August 2018 till recent time, at an altitude of 160 m above sea level at the Inter-Institutional Center for High Energy Physics in M… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 19 pages, 15 figures

  7. arXiv:2410.03471  [pdf, other

    math.ST

    ROSE Random Forests for Robust Semiparametric Efficient Estimation

    Authors: Elliot H. Young, Rajen D. Shah

    Abstract: It is widely recognised that semiparametric efficient estimation can be hard to achieve in practice: estimators that are in theory efficient may require unattainable levels of accuracy for the estimation of complex nuisance functions. As a consequence, estimators deployed on real datasets are often chosen in a somewhat ad hoc fashion, and may suffer high variance. We study this gap between theory… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  8. arXiv:2410.03273  [pdf, ps, other

    cond-mat.soft nlin.PS physics.comp-ph

    Probing 3D Velocity Distributions Insights from a Vibrated Dual-Species Granular System

    Authors: Rameez Farooq Shah, Syed Rashid Ahmad

    Abstract: We explore the velocity distributions in a vibrated binary granular gas system, focusing on how these distributions are influenced by the coefficient of restitution (CoR) and the inelasticity of particle collisions. Through molecular dynamics simulations, we examine the system's behavior for a range of CoR values below unity: specifically, 0.80, 0.85, 0.90, and 0.95. We track the evolution of velo… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  9. arXiv:2409.02266  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement

    Authors: Arnav Jain, Jasmer Singh Sanjotra, Harshvardhan Choudhary, Krish Agrawal, Rupal Shah, Rohan Jha, M. Sajid, Amir Hussain, M. Tanveer

    Abstract: In this paper, we propose long short term memory speech enhancement network (LSTMSE-Net), an audio-visual speech enhancement (AVSE) method. This innovative method leverages the complementary nature of visual and audio information to boost the quality of speech signals. Visual features are extracted with VisualFeatNet (VFN), and audio features are processed through an encoder and decoder. The syste… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Journal ref: INTERSPEECH 2024

  10. arXiv:2409.01535  [pdf, other

    math.OC

    A proximal splitting algorithm for generalized DC programming with applications in signal recovery

    Authors: Tan Nhat Pham, Minh N. Dao, Nima Amjady, Rakibuzzaman Shah

    Abstract: The difference-of-convex (DC) program is a crucial approach to nonconvex optimization problems due to its structure, which encompasses a wide ranges of practical applications. In this paper, we aim to tackle a generalized class of DC programs, where the objective function is formed by summing a possibly nonsmooth nonconvex function and a differentiable nonconvex function with Lipschitz continuous… ▽ More

    Submitted 3 September, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    MSC Class: 90C26; 49M27; 65K05

  11. arXiv:2408.11526  [pdf, other

    cs.AI

    RConE: Rough Cone Embedding for Multi-Hop Logical Query Answering on Multi-Modal Knowledge Graphs

    Authors: Mayank Kharbanda, Rajiv Ratn Shah, Raghava Mutharaju

    Abstract: Multi-hop query answering over a Knowledge Graph (KG) involves traversing one or more hops from the start node to answer a query. Path-based and logic-based methods are state-of-the-art for multi-hop question answering. The former is used in link prediction tasks. The latter is for answering complex logical queries. The logical multi-hop querying technique embeds the KG and queries in the same emb… ▽ More

    Submitted 26 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

  12. arXiv:2408.10604  [pdf

    cs.CL cs.AI cs.IR cs.LG

    Multilingual Non-Factoid Question Answering with Silver Answers

    Authors: Ritwik Mishra, Sreeram Vennam, Rajiv Ratn Shah, Ponnurangam Kumaraguru

    Abstract: Most existing Question Answering Datasets (QuADs) primarily focus on factoid-based short-context Question Answering (QA) in high-resource languages. However, the scope of such datasets for low-resource languages remains limited, with only a few works centered on factoid-based QuADs and none on non-factoid QuADs. Therefore, this work presents MuNfQuAD, a multilingual QuAD with non-factoid questions… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  13. arXiv:2408.10557  [pdf, other

    cs.CL

    Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation

    Authors: Hemant Yadav, Sunayana Sitaram, Rajiv Ratn Shah

    Abstract: Speech modeling methods learn one embedding for a fixed segment of speech, typically in between 10-25 ms. The information present in speech can be divided into two categories: "what is being said" (content) and "how it is expressed" (other) and these two are orthogonal in nature causing the optimization algorithm to find a sub-optimal solution if forced to optimize together. This leads to sub-opti… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  14. arXiv:2408.09880  [pdf, other

    math.NA

    Fast Hermitian Diagonalization with Nearly Optimal Precision

    Authors: Rikhav Shah

    Abstract: Algorithms for numerical tasks in finite precision simultaneously seek to minimize the number of floating point operations performed, and also the number of bits of precision required by each floating point operation. This paper presents an algorithm for Hermitian diagonalization requiring only $\lg(1/\varepsilon)+O(\log(n)+\log\log(1/\varepsilon))$ bits of precision where $n$ is the size of the i… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  15. arXiv:2408.05147  [pdf, other

    cs.LG cs.AI cs.CL

    Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

    Authors: Tom Lieberum, Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Nicolas Sonnerat, Vikrant Varma, János Kramár, Anca Dragan, Rohin Shah, Neel Nanda

    Abstract: Sparse autoencoders (SAEs) are an unsupervised method for learning a sparse decomposition of a neural network's latent representations into seemingly interpretable features. Despite recent excitement about their potential, research applications outside of industry are limited by the high cost of training a comprehensive suite of SAEs. In this work, we introduce Gemma Scope, an open suite of JumpRe… ▽ More

    Submitted 19 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

    Comments: 12 main text pages, and 14 pages of acknowledgements, references and appendices

  16. arXiv:2407.17765  [pdf, other

    cs.CR

    Utilizing Blockchain and Smart Contracts for Enhanced Fraud Prevention and Minimization in Health Insurance through Multi-Signature Claim Processing

    Authors: Md Al Amin, Rushabh Shah, Hemanth Tummala, Indrajit Ray

    Abstract: Healthcare insurance provides financial support to access medical services for patients while ensuring timely and guaranteed payment for providers. Insurance fraud poses a significant challenge to insurance companies and policyholders, leading to increased costs and compromised healthcare treatment and service delivery. Most frauds, like phantom billing, upcoding, and unbundling, happen due to the… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 2024 IEEE 4th International Conference on Emerging Trends in Networks and Computer Communications (ETNCC 2024

  17. arXiv:2407.11914  [pdf, ps, other

    math.PR

    Introduction to Martingales

    Authors: Rohan Shah

    Abstract: This paper introduces Martingales by covering introductory measure theory concepts and the Lebesgue Integration and Conditional Expectation. It follows up with proofs of Kolomorgov's Theorem on conditional expectations, the Martingale Property, and the Pythagorean Theorem on Martingales. Finally, it ends with Martingales' applications in finance.

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 18 pages

  18. arXiv:2407.11149  [pdf

    cs.NE

    BMR and BWR: Two simple metaphor-free optimization algorithms for solving real-life non-convex constrained and unconstrained problems

    Authors: Ravipudi Venkata Rao, Ravikumar shah

    Abstract: Two simple yet powerful optimization algorithms, named the Best-Mean-Random (BMR) and Best-Worst-Random (BWR) algorithms, are developed and presented in this paper to handle both constrained and unconstrained optimization problems. These algorithms are free of metaphors and algorithm-specific parameters. The BMR algorithm is based on the best, mean, and random solutions of the population generated… ▽ More

    Submitted 8 September, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 37 pages, 6 figures, improved version of the own original paper

    ACM Class: C.1.3; I.2.6; I.5

  19. arXiv:2407.06125  [pdf, other

    cs.HC cs.AI

    Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities

    Authors: Avinash Anand, Chayan Tank, Sarthak Pol, Vinayak Katoch, Shaina Mehta, Rajiv Ratn Shah

    Abstract: Depression has proven to be a significant public health issue, profoundly affecting the psychological well-being of individuals. If it remains undiagnosed, depression can lead to severe health issues, which can manifest physically and even lead to suicide. Generally, Diagnosing depression or any other mental disorder involves conducting semi-structured interviews alongside supplementary questionna… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 12 pages, 9 figures, 9 tables

  20. arXiv:2407.04622  [pdf, other

    cs.LG

    On scalable oversight with weak LLMs judging strong LLMs

    Authors: Zachary Kenton, Noah Y. Siegel, János Kramár, Jonah Brown-Cohen, Samuel Albanie, Jannis Bulian, Rishabh Agarwal, David Lindner, Yunhao Tang, Noah D. Goodman, Rohin Shah

    Abstract: Scalable oversight protocols aim to enable humans to accurately supervise superhuman AI. In this paper we study debate, where two AI's compete to convince a judge; consultancy, where a single AI tries to convince a judge that asks questions; and compare to a baseline of direct question-answering, where the judge just answers outright without the AI. We use large language models (LLMs) as both AI a… ▽ More

    Submitted 12 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 15 pages (53 including appendices). V2: minor correction to Figure 3; add Figure A.9 comparing open vs assigned consultancy; add a reference

  21. arXiv:2407.04577  [pdf, other

    cs.IR

    Optimizing Nepali PDF Extraction: A Comparative Study of Parser and OCR Technologies

    Authors: Prabin Paudel, Supriya Khadka, Ranju G. C., Rahul Shah

    Abstract: This research compares PDF parsing and Optical Character Recognition (OCR) methods for extracting Nepali content from PDFs. PDF parsing offers fast and accurate extraction but faces challenges with non-Unicode Nepali fonts. OCR, specifically PyTesseract, overcomes these challenges, providing versatility for both digital and scanned PDFs. The study reveals that while PDF parsers are faster, their a… ▽ More

    Submitted 9 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  22. arXiv:2407.02766  [pdf, other

    cs.CR

    Balancing Patient Privacy and Health Data Security: The Role of Compliance in Protected Health Information (PHI) Sharing

    Authors: Md Al Amin, Hemanth Tummala, Rushabh Shah, Indrajit Ray

    Abstract: Protected Health Information (PHI) sharing significantly enhances patient care quality and coordination, contributing to more accurate diagnoses, efficient treatment plans, and a comprehensive understanding of patient history. Compliance with strict privacy and security policies, such as those required by laws like HIPAA, is critical to protect PHI. Blockchain technology, which offers a decentrali… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: The 21st International Conference on Security and Cryptography (SECRYPT 2024)

  23. arXiv:2407.01351  [pdf, other

    astro-ph.HE

    Probing the connection between IceCube neutrinos and MOJAVE AGN

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well establi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 14 Pages 7 Figures

  24. Search for a light sterile neutrino with 7.5 years of IceCube DeepCore data

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: We present a search for an eV-scale sterile neutrino using 7.5 years of data from the IceCube DeepCore detector. The analysis uses a sample of 21,914 events with energies between 5 and 150 GeV to search for sterile neutrinos through atmospheric muon neutrino disappearance. Improvements in event selection and treatment of systematic uncertainties provide greater statistical power compared to previo… ▽ More

    Submitted 9 September, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 11 pages, 5 figures. Version accepted by Physical Review D for publication

  25. arXiv:2407.01047  [pdf, other

    cs.CL

    Development of Cognitive Intelligence in Pre-trained Language Models

    Authors: Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma

    Abstract: Recent studies show evidence for emergent cognitive abilities in Large Pre-trained Language Models (PLMs). The increasing cognitive alignment of these models has made them candidates for cognitive science theories. Prior research into the emergent cognitive abilities of PLMs has largely been path-independent to model training, i.e., has focused on the final model weights and not the intermediate s… ▽ More

    Submitted 12 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  26. arXiv:2406.16253  [pdf, other

    cs.CL

    LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

    Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo , et al. (15 additional authors not shown)

    Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th… ▽ More

    Submitted 2 October, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: Accepted by EMNLP 2024 main conference

  27. arXiv:2406.15335  [pdf, other

    cs.CV cs.CY

    Keystroke Dynamics Against Academic Dishonesty in the Age of LLMs

    Authors: Debnath Kundu, Atharva Mehta, Rajesh Kumar, Naman Lal, Avinash Anand, Apoorv Singh, Rajiv Ratn Shah

    Abstract: The transition to online examinations and assignments raises significant concerns about academic integrity. Traditional plagiarism detection systems often struggle to identify instances of intelligent cheating, particularly when students utilize advanced generative AI tools to craft their responses. This study proposes a keystroke dynamics-based method to differentiate between bona fide and assist… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted for publication at The IEEE International Joint Conference on Biometrics (IJCB2024), contains 9 pages, 3 figures, 3 tables

    ACM Class: I.5.4

  28. arXiv:2406.11106  [pdf, other

    cs.CL cs.AI

    From Intentions to Techniques: A Comprehensive Taxonomy and Challenges in Text Watermarking for Large Language Models

    Authors: Harsh Nishant Lalai, Aashish Anantha Ramakrishnan, Raj Sanjay Shah, Dongwon Lee

    Abstract: With the rapid growth of Large Language Models (LLMs), safeguarding textual content against unauthorized use is crucial. Text watermarking offers a vital solution, protecting both - LLM-generated and plain text sources. This paper presents a unified overview of different perspectives behind designing watermarking techniques, through a comprehensive survey of the research literature. Our work has t… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  29. arXiv:2406.09395  [pdf, other

    cs.CV

    Modeling Ambient Scene Dynamics for Free-view Synthesis

    Authors: Meng-Li Shih, Jia-Bin Huang, Changil Kim, Rajvi Shah, Johannes Kopf, Chen Gao

    Abstract: We introduce a novel method for dynamic free-view synthesis of an ambient scenes from a monocular capture bringing a immersive quality to the viewing experience. Our method builds upon the recent advancements in 3D Gaussian Splatting (3DGS) that can faithfully reconstruct complex static scenes. Previous attempts to extend 3DGS to represent dynamics have been confined to bounded scenes or require m… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  30. arXiv:2406.08802  [pdf, other

    eess.AS cs.SD

    DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing

    Authors: Neha Sahipjohn, Ashishkumar Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Rajiv Ratn Shah

    Abstract: Audio-visual alignment after dubbing is a challenging research problem. To this end, we propose a novel method, DubWise Multi-modal Large Language Model (LLM)-based Text-to-Speech (TTS), which can control the speech duration of synthesized speech in such a way that it aligns well with the speakers lip movements given in the reference video even when the spoken text is different or in a different l… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at INTERSPEECH 2024

  31. arXiv:2406.08076  [pdf, other

    eess.AS cs.SD

    VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech

    Authors: Ashishkumar Gudmalwar, Nirmesh Shah, Sai Akarsh, Pankaj Wasnik, Rajiv Ratn Shah

    Abstract: Despite the significant advancements in Text-to-Speech (TTS) systems, their full utilization in automatic dubbing remains limited. This task necessitates the extraction of voice identity and emotional style from a reference speech in a source language and subsequently transferring them to a target language using cross-lingual TTS techniques. While previous approaches have mainly concentrated on co… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted at INTERSPEECH 2024

  32. arXiv:2406.07601  [pdf, other

    astro-ph.HE hep-ex

    IceCube Search for Neutrino Emission from X-ray Bright Seyfert Galaxies

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (400 additional authors not shown)

    Abstract: The recent IceCube detection of TeV neutrino emission from the nearby active galaxy NGC 1068 suggests that active galactic nuclei (AGN) could make a sizable contribution to the diffuse flux of astrophysical neutrinos. The absence of TeV $γ$-rays from NGC 1068 indicates neutrino production in the vicinity of the supermassive black hole, where the high radiation density leads to $γ$-ray attenuation.… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 17 pages, 9 figures

  33. arXiv:2406.06684  [pdf, other

    astro-ph.HE

    Search for neutrino emission from hard X-ray AGN with IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (401 additional authors not shown)

    Abstract: Active Galactic Nuclei (AGN) are promising candidate sources of high-energy astrophysical neutrinos since they provide environments rich in matter and photon targets where cosmic ray interactions may lead to the production of gamma rays and neutrinos. We searched for high-energy neutrino emission from AGN using the $\textit{Swift}$-BAT Spectroscopic Survey (BASS) catalog of hard X-ray sources and… ▽ More

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  34. arXiv:2406.05661  [pdf, other

    cs.CL

    MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations

    Authors: Hemant Yadav, Sunayana Sitaram, Rajiv Ratn Shah

    Abstract: In recent years, self-supervised pre-training methods have gained significant traction in learning high-level information from raw speech. Among these methods, HuBERT has demonstrated SOTA performance in automatic speech recognition (ASR). However, HuBERT's performance lags behind data2vec due to disparities in pre-training strategies. In this paper, we propose (i) a Swap method to address pre-tra… ▽ More

    Submitted 15 August, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 4 pages, submitted to interspeech2024

  35. arXiv:2406.05578  [pdf, other

    cs.IR

    Prioritizing Potential Wetland Areas via Region-to-Region Knowledge Transfer and Adaptive Propagation

    Authors: Yoonhyuk Choi, Reepal Shah, John Sabo, K. Selcuk Candan, Huan Liu

    Abstract: Wetlands are important to communities, offering benefits ranging from water purification, and flood protection to recreation and tourism. Therefore, identifying and prioritizing potential wetland areas is a critical decision problem. While data-driven solutions are feasible, this is complicated by significant data sparsity due to the low proportion of wetlands (3-6\%) in many areas of interest in… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  36. arXiv:2406.01237  [pdf, ps, other

    math.DS math.GR

    Charactarisation of distal actions of automorphisms on the space of one-parameter subgroups of Lie groups

    Authors: Debamita Chatterjee, Riddhi Shah

    Abstract: For a connected Lie group $G$ and an automorphism $T$ of $G$, we consider the action of $T$ on Sub$_G$, the compact space of closed subgroups of $G$ endowed with the Chabauty topology. We study the action of $T$ on Sub$^p_G$, the closure in Sub$_G$ of the set of closed one-parameter subgroups of $G$. We relate the distality of the $T$-action on Sub$^p_G$ with that of the $T$-action on $G$ and char… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  37. arXiv:2406.00905  [pdf, other

    hep-ex

    Exploration of mass splitting and muon/tau mixing parameters for an eV-scale sterile neutrino with IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (400 additional authors not shown)

    Abstract: We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $ν_μ+\overlineν_μ$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($Δm_{41}^2$), the mixing matrix element connecting muon flavor to the fourth… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  38. arXiv:2405.16128  [pdf, other

    cs.AI cs.CL

    How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect

    Authors: Siddhartha K. Vemuri, Raj Sanjay Shah, Sashank Varma

    Abstract: How well do representations learned by ML models align with those of humans? Here, we consider concept representations learned by deep learning models and evaluate whether they show a fundamental behavioral signature of human concepts, the typicality effect. This is the finding that people judge some instances (e.g., robin) of a category (e.g., Bird) to be more typical than others (e.g., penguin).… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: To appear at CogSci 2024

  39. arXiv:2405.16042  [pdf, other

    cs.CL

    Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention

    Authors: Andrew Li, Xianle Feng, Siddhant Narang, Austin Peng, Tianle Cai, Raj Sanjay Shah, Sashank Varma

    Abstract: When reading temporarily ambiguous garden-path sentences, misinterpretations sometimes linger past the point of disambiguation. This phenomenon has traditionally been studied in psycholinguistic experiments using online measures such as reading times and offline measures such as comprehension questions. Here, we investigate the processing of garden-path sentences and the fate of lingering misinter… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by CogSci-24

  40. arXiv:2405.08077  [pdf, other

    hep-ex hep-ph

    Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube

    Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (398 additional authors not shown)

    Abstract: We provide supporting details for the search for a 3+1 sterile neutrino using data collected over eleven years at the IceCube Neutrino Observatory. The analysis uses atmospheric muon-flavored neutrinos from 0.5 to 100\, TeV that traverse the Earth to reach the IceCube detector, and finds a best-fit point at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$ with a goodness-of-fit p-value of 1… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 18 pages, 17 figures, 2 tables. This long-form paper is a companion to the letter "A search for an eV-scale sterile neutrino using improved high-energy νμ event reconstruction in IceCube."

  41. arXiv:2405.08070  [pdf, other

    hep-ex hep-ph

    A search for an eV-scale sterile neutrino using improved high-energy $ν_μ$ event reconstruction in IceCube

    Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (398 additional authors not shown)

    Abstract: This Letter presents the result of a 3+1 sterile neutrino search using 10.7 years of IceCube data. We analyze atmospheric muon neutrinos that traverse the Earth with energies ranging from 0.5 to 100 TeV, incorporating significant improvements in modeling neutrino flux and detector response compared to earlier studies. Notably, for the first time, we categorize data into starting and through-going… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 9 pages, 3 figures. This letter is supported by the long-form paper "Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube," also appearing on arXiv

  42. arXiv:2405.03817  [pdf, other

    astro-ph.HE

    Search for joint multimessenger signals from potential Galactic PeVatrons with HAWC and IceCube

    Authors: R. Alfaro, C. Alvarez, J. C. Arteaga-Velázquez, D. Avila Rojas, H. A. Ayala Solares, R. Babu, E. Belmont-Moreno, K. S. Caballero-Mora, T. Capistrán, A. Carramiñana, S. Casanova, U. Cotti, J. Cotzomi, S. Coutiño de León, E. De la Fuente, D. Depaoli, N. Di Lalla, R. Diaz Hernandez, J. C. Díaz-Vélez, K. Engel, T. Ergin, K. L. Fan, K. Fang, N. Fraija, S. Fraija , et al. (469 additional authors not shown)

    Abstract: Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  43. arXiv:2405.00942  [pdf, other

    cs.CV cs.CL

    Teaching Human Behavior Improves Content Understanding Abilities Of LLMs

    Authors: Somesh Singh, Harini S I, Yaman K Singla, Veeky Baths, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy

    Abstract: Communication is defined as "Who says what to whom with what effect". A message from a communicator generates downstream receiver effects, also known as behavior. Receiver behavior, being a downstream effect of the message, carries rich signals about it. Even after carrying signals about the message, the behavior data is often ignored while training large language models. We show that training LLM… ▽ More

    Submitted 10 October, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  44. arXiv:2404.19589  [pdf, other

    astro-ph.IM hep-ex physics.ins-det

    Acceptance Tests of more than 10 000 Photomultiplier Tubes for the multi-PMT Digital Optical Modules of the IceCube Upgrade

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: More than 10,000 photomultiplier tubes (PMTs) with a diameter of 80 mm will be installed in multi-PMT Digital Optical Modules (mDOMs) of the IceCube Upgrade. These have been tested and pre-calibrated at two sites. A throughput of more than 1000 PMTs per week with both sites was achieved with a modular design of the testing facilities and highly automated testing procedures. The testing facilities… ▽ More

    Submitted 20 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: 24 pages, 19 figures, 2 tables, submitted to JINST

  45. arXiv:2404.19108  [pdf, other

    cs.CV astro-ph.IM eess.IV

    Real-Time Convolutional Neural Network-Based Star Detection and Centroiding Method for CubeSat Star Tracker

    Authors: Hongrui Zhao, Michael F. Lembeck, Adrian Zhuang, Riya Shah, Jesse Wei

    Abstract: Star trackers are one of the most accurate celestial sensors used for absolute attitude determination. The devices detect stars in captured images and accurately compute their projected centroids on an imaging focal plane with subpixel precision. Traditional algorithms for star detection and centroiding often rely on threshold adjustments for star pixel detection and pixel brightness weighting for… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  46. arXiv:2404.16014  [pdf, other

    cs.LG cs.AI

    Improving Dictionary Learning with Gated Sparse Autoencoders

    Authors: Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Tom Lieberum, Vikrant Varma, János Kramár, Rohin Shah, Neel Nanda

    Abstract: Recent work has found that sparse autoencoders (SAEs) are an effective technique for unsupervised discovery of interpretable features in language models' (LMs) activations, by finding sparse, linear reconstructions of LM activations. We introduce the Gated Sparse Autoencoder (Gated SAE), which achieves a Pareto improvement over training with prevailing methods. In SAEs, the L1 penalty used to enco… ▽ More

    Submitted 30 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 15 main text pages, 22 appendix pages

  47. Context-Enhanced Language Models for Generating Multi-Paper Citations

    Authors: Avinash Anand, Kritarth Prasad, Ujjwal Goel, Mohit Gupta, Naman Lal, Astha Verma, Rajiv Ratn Shah

    Abstract: Citation text plays a pivotal role in elucidating the connection between scientific documents, demanding an in-depth comprehension of the cited paper. Constructing citations is often time-consuming, requiring researchers to delve into extensive literature and grapple with articulating relevant content. To address this challenge, the field of citation text generation (CTG) has emerged. However, whi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 14 pages, 7 figures, 11th International Conference, BDA 2023, Delhi, India

    Journal ref: Big Data and Artificial Intelligence 2023, Delhi, India, December 7, 80 94

  48. arXiv:2404.13099  [pdf, other

    cs.CL cs.AI

    Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks

    Authors: Avinash Anand, Mohit Gupta, Kritarth Prasad, Navya Singla, Sanjana Sanjeev, Jatin Kumar, Adarsh Raj Shivam, Rajiv Ratn Shah

    Abstract: The rapid progress in the field of natural language processing (NLP) systems and the expansion of large language models (LLMs) have opened up numerous opportunities in the field of education and instructional methods. These advancements offer the potential for tailored learning experiences and immediate feedback, all delivered through accessible and cost-effective services. One notable application… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 10 pages, 3 figures, NeurIPS 2023 Workshop on Generative AI for Education (GAIED)

    Journal ref: NeurIPS 2023 Workshop on Generative AI for Education (GAIED)

  49. arXiv:2404.12926  [pdf, other

    cs.AI

    MM-PhyRLHF: Reinforcement Learning Framework for Multimodal Physics Question-Answering

    Authors: Avinash Anand, Janak Kapuriya, Chhavi Kirtani, Apoorv Singh, Jay Saraf, Naman Lal, Jatin Kumar, Adarsh Raj Shivam, Astha Verma, Rajiv Ratn Shah, Roger Zimmermann

    Abstract: Recent advancements in LLMs have shown their significant potential in tasks like text summarization and generation. Yet, they often encounter difficulty while solving complex physics problems that require arithmetic calculation and a good understanding of concepts. Moreover, many physics problems include images that contain important details required to understand the problem's context. We propose… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  50. arXiv:2404.10545  [pdf

    q-bio.GN

    A Systematic Overview of Single-Cell Transcriptomics Databases, their Use cases, and Limitations

    Authors: Mahnoor N. Gondal, Saad Ur Rehman Shah, Arul M. Chinnaiyan, Marcin Cieslik

    Abstract: Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq) technologies and experimental protocols have led to the generation of vast amounts of genomic data that populates several online databases and repositories. Here, we systematically examined large-scale scRNA-seq databases, categorizing them based on their scope and purpose such as general, tissue-specific databases, disease-spec… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 17 pages, 1 figure, 1 table