-
Fourier Synthetic Aperture-based Time-resolved Terahertz Imaging
Authors:
Vivek Kumar,
Pitambar Mukherjee,
Lorenzo Valzania,
Amaury Badon,
Patrick Mounaix,
Sylvain Gigan
Abstract:
Terahertz microscopy has attracted attention owing to distinctive characteristics of the THz frequency region, particularly non-ionizing photon energy, spectral fingerprint, and transparency to most nonpolar materials. Nevertheless, the well-known Rayleigh diffraction limit imposed on THz waves commonly constrains the resultant imaging resolution to values beyond the millimeter scale, consequently…
▽ More
Terahertz microscopy has attracted attention owing to distinctive characteristics of the THz frequency region, particularly non-ionizing photon energy, spectral fingerprint, and transparency to most nonpolar materials. Nevertheless, the well-known Rayleigh diffraction limit imposed on THz waves commonly constrains the resultant imaging resolution to values beyond the millimeter scale, consequently limiting the applicability in numerous emerging applications for chemical sensing and complex media imaging. In this theoretical and numerical work, we address this challenge by introducing a new imaging approach, based on acquiring high-spatial frequencies by adapting the Fourier synthetic aperture approach to the terahertz spectral range, thus surpassing the diffraction-limited resolution. Our methodology combines multi-angle terahertz pulsed illumination with time-resolved field measurements, as enabled by the state-of-the-art time-domain spectroscopy technique. We demonstrate the potential of the approach for hyperspectral terahertz imaging of semi-transparent samples and show that the technique can reconstruct spatial and temporal features of complex inhomogeneous samples with subwavelength resolution.
△ Less
Submitted 4 October, 2024;
originally announced October 2024.
-
Hierarchical Conditional Multi-Task Learning for Streamflow Modeling
Authors:
Shaoming Xu,
Arvind Renganathan,
Ankush Khandelwal,
Rahul Ghosh,
Xiang Li,
Licheng Liu,
Kshitij Tayal,
Peter Harrington,
Xiaowei Jia,
Zhenong Jin,
Jonh Nieber,
Vipin Kumar
Abstract:
Streamflow, vital for water resource management, is governed by complex hydrological systems involving intermediate processes driven by meteorological forces. While deep learning models have achieved state-of-the-art results of streamflow prediction, their end-to-end single-task learning approach often fails to capture the causal relationships within these systems. To address this, we propose Hier…
▽ More
Streamflow, vital for water resource management, is governed by complex hydrological systems involving intermediate processes driven by meteorological forces. While deep learning models have achieved state-of-the-art results of streamflow prediction, their end-to-end single-task learning approach often fails to capture the causal relationships within these systems. To address this, we propose Hierarchical Conditional Multi-Task Learning (HCMTL), a hierarchical approach that jointly models soil water and snowpack processes based on their causal connections to streamflow. HCMTL utilizes task embeddings to connect network modules, enhancing flexibility and expressiveness while capturing unobserved processes beyond soil water and snowpack. It also incorporates the Conditional Mini-Batch strategy to improve long time series modeling. We compare HCMTL with five baselines on a global dataset. HCMTL's superior performance across hundreds of drainage basins over extended periods shows that integrating domain-specific causal knowledge into deep learning enhances both prediction accuracy and interpretability. This is essential for advancing our understanding of complex hydrological systems and supporting efficient water resource management to mitigate natural disasters like droughts and floods.
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
Jailbreaking LLM-Controlled Robots
Authors:
Alexander Robey,
Zachary Ravichandran,
Vijay Kumar,
Hamed Hassani,
George J. Pappas
Abstract:
The recent introduction of large language models (LLMs) has revolutionized the field of robotics by enabling contextual reasoning and intuitive human-robot interaction in domains as varied as manipulation, locomotion, and self-driving vehicles. When viewed as a stand-alone technology, LLMs are known to be vulnerable to jailbreaking attacks, wherein malicious prompters elicit harmful text by bypass…
▽ More
The recent introduction of large language models (LLMs) has revolutionized the field of robotics by enabling contextual reasoning and intuitive human-robot interaction in domains as varied as manipulation, locomotion, and self-driving vehicles. When viewed as a stand-alone technology, LLMs are known to be vulnerable to jailbreaking attacks, wherein malicious prompters elicit harmful text by bypassing LLM safety guardrails. To assess the risks of deploying LLMs in robotics, in this paper, we introduce RoboPAIR, the first algorithm designed to jailbreak LLM-controlled robots. Unlike existing, textual attacks on LLM chatbots, RoboPAIR elicits harmful physical actions from LLM-controlled robots, a phenomenon we experimentally demonstrate in three scenarios: (i) a white-box setting, wherein the attacker has full access to the NVIDIA Dolphins self-driving LLM, (ii) a gray-box setting, wherein the attacker has partial access to a Clearpath Robotics Jackal UGV robot equipped with a GPT-4o planner, and (iii) a black-box setting, wherein the attacker has only query access to the GPT-3.5-integrated Unitree Robotics Go2 robot dog. In each scenario and across three new datasets of harmful robotic actions, we demonstrate that RoboPAIR, as well as several static baselines, finds jailbreaks quickly and effectively, often achieving 100% attack success rates. Our results reveal, for the first time, that the risks of jailbroken LLMs extend far beyond text generation, given the distinct possibility that jailbroken robots could cause physical damage in the real world. Indeed, our results on the Unitree Go2 represent the first successful jailbreak of a deployed commercial robotic system. Addressing this emerging vulnerability is critical for ensuring the safe deployment of LLMs in robotics. Additional media is available at: https://robopair.org
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
Optimal Beamforming Design for ISAC with Sensor-Aided Active RIS
Authors:
Ahmed Magbool,
Vaibhav Kumar,
Mark F. Flanagan
Abstract:
Active reconfigurable intelligent surfaces (RISs) can improve the performance of integrated sensing and communication (ISAC), and therefore enable simultaneous data transmission and target sensing. However, when the line-of-sight (LoS) link between the base station and the sensing target is blocked, the sensing signals suffer from severe path loss, resulting in an inferior sensing performance. To…
▽ More
Active reconfigurable intelligent surfaces (RISs) can improve the performance of integrated sensing and communication (ISAC), and therefore enable simultaneous data transmission and target sensing. However, when the line-of-sight (LoS) link between the base station and the sensing target is blocked, the sensing signals suffer from severe path loss, resulting in an inferior sensing performance. To address this issue, this paper employs a sensor-aided active RIS to enhance ISAC system performance. The goal is to maximize the signal-to-noise ratio of the echo signal from the target at the sensor-array while meeting constraints on communication signal quality, power budgets, and RIS amplification limits. The optimization problem is challenging due to its non-convex nature and the coupling between the optimization variables. We propose a closed-form solution for receive beamforming, and a successive convex approximation based iterative method for transmit and reflection beamforming design. Simulation results demonstrate the advantage of the proposed sensor-aided active RIS-assisted system model over its non-sensor-aided counterpart.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
ExoTST: Exogenous-Aware Temporal Sequence Transformer for Time Series Prediction
Authors:
Kshitij Tayal,
Arvind Renganathan,
Xiaowei Jia,
Vipin Kumar,
Dan Lu
Abstract:
Accurate long-term predictions are the foundations for many machine learning applications and decision-making processes. Traditional time series approaches for prediction often focus on either autoregressive modeling, which relies solely on past observations of the target ``endogenous variables'', or forward modeling, which considers only current covariate drivers ``exogenous variables''. However,…
▽ More
Accurate long-term predictions are the foundations for many machine learning applications and decision-making processes. Traditional time series approaches for prediction often focus on either autoregressive modeling, which relies solely on past observations of the target ``endogenous variables'', or forward modeling, which considers only current covariate drivers ``exogenous variables''. However, effectively integrating past endogenous and past exogenous with current exogenous variables remains a significant challenge. In this paper, we propose ExoTST, a novel transformer-based framework that effectively incorporates current exogenous variables alongside past context for improved time series prediction. To integrate exogenous information efficiently, ExoTST leverages the strengths of attention mechanisms and introduces a novel cross-temporal modality fusion module. This module enables the model to jointly learn from both past and current exogenous series, treating them as distinct modalities. By considering these series separately, ExoTST provides robustness and flexibility in handling data uncertainties that arise from the inherent distribution shift between historical and current exogenous variables. Extensive experiments on real-world carbon flux datasets and time series benchmarks demonstrate ExoTST's superior performance compared to state-of-the-art baselines, with improvements of up to 10\% in prediction accuracy. Moreover, ExoTST exhibits strong robustness against missing values and noise in exogenous drivers, maintaining consistent performance in real-world situations where these imperfections are common.
△ Less
Submitted 15 October, 2024;
originally announced October 2024.
-
Multi-Functional RIS for a Multi-Functional System: Integrating Sensing, Communication, and Wireless Power Transfer
Authors:
Ahmed Magbool,
Vaibhav Kumar,
Ahmad Bazzi,
Mark F. Flanagan,
Marwa Chafii
Abstract:
Communication networks are evolving from solely emphasizing communication to facilitating multiple functionalities. In this regard, integrated sensing, communication, and powering (ISCAP) provides an efficient way of enabling data transmission, radar sensing, and wireless power transfer simultaneously. Such a multi-functional network requires a multi-functional architectural solution. Toward this…
▽ More
Communication networks are evolving from solely emphasizing communication to facilitating multiple functionalities. In this regard, integrated sensing, communication, and powering (ISCAP) provides an efficient way of enabling data transmission, radar sensing, and wireless power transfer simultaneously. Such a multi-functional network requires a multi-functional architectural solution. Toward this end, sensor-aided zero-energy reconfigurable intelligent surfaces (SAZE-RISs) offer an energy-efficient solution for ISCAP by meeting the requirements of the end users as well as supplying power for the RIS. This paper explores the use of SAZE-RIS within the ISCAP framework. First, we present the general system architecture, operational protocols, and main application scenarios for employing SAZE-RIS in ISCAP. Next, we discuss methods for managing the conflicting requirements of communication, sensing, and powering within ISCAP and the role of SAZE-RIS in this process. We then provide a detailed case study complete with simulation results, offering valuable insights into the design choices and tradeoffs that come into play when adopting this technology. Furthermore, we discuss the related challenges and open research avenues, highlighting areas that require further exploration to fully realize the potential of SAZE-RIS within this ISCAP framework.
△ Less
Submitted 11 October, 2024;
originally announced October 2024.
-
Quantifying the Jet Energy Loss in Pb+Pb collisions at LHC
Authors:
Vineet Kumar,
Prashant Shukla
Abstract:
In this work, we give a method to study the energy loss of jets in the medium using a variety of jet energy loss observables such as nuclear modification factor and transverse momentum asymmetry in dijets and $γ$-jets in heavy ion collisions. The energy loss of jets in the medium depends mainly on the size and the properties of medium viz. temperature and is a function of energy of the jets as pre…
▽ More
In this work, we give a method to study the energy loss of jets in the medium using a variety of jet energy loss observables such as nuclear modification factor and transverse momentum asymmetry in dijets and $γ$-jets in heavy ion collisions. The energy loss of jets in the medium depends mainly on the size and the properties of medium viz. temperature and is a function of energy of the jets as predicted by various models. A Monte Carlo (MC) method is employed to generate the transverse momentum and path-lengths of the initial jets that undergo energy loss. Using different scenarios of energy loss, the transverse momentum and system size dependence of nuclear modification factors and different measures of dijet momentum imbalance at energies $sqrt{s_{\rm NN}}$ = 2.76 TeV and 5.02 TeV and $γ$-jet asymmetry at $\sqrt{s_{\rm NN}}$ =2.76 TeV in Pb+Pb collisions are simulated. The results are compared with the measurements by ATLAS and CMS experiments as a function of transverse momentum and centrality. The study demonstrates how the system size and energy dependence of jet energy loss can be quantified using various experimental observables.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Study of $β^+$/EC-decay properties of $sd$ shell nuclei using nuclear shell model
Authors:
Surender,
Vikas Kumar,
Praveen C. Srivastava
Abstract:
Our study employs the nuclear shell model to systematically compute the half-lives of $β$ -decay for nuclei in the mass range of $A = 18-39$, encompassing the majority of $sd$ shell nuclei. This analysis utilizes the USDB and SDNN Hamiltonians. The theoretical outcomes contain calculations of various parameters such as $Q$ -values, half-lives, excitation energy, log$ft$ values, and branching ratio…
▽ More
Our study employs the nuclear shell model to systematically compute the half-lives of $β$ -decay for nuclei in the mass range of $A = 18-39$, encompassing the majority of $sd$ shell nuclei. This analysis utilizes the USDB and SDNN Hamiltonians. The theoretical outcomes contain calculations of various parameters such as $Q$ -values, half-lives, excitation energy, log$ft$ values, and branching ratios. We explore these results with axial-vector coupling constant for weak interactions, denoted as $g_A$$(= 1.27)$, and $κ$ value $(= 6289)$. We perform calculations of Gamow Teller matrix elements for 116 decay processes to calculate the quenching factor; we found a quenching factor of $q = 0.794\pm0.05 $ for the USDB interaction and $q = 0.815\pm0.04 $ for the SDNN interaction. We have also calculated superallowed transitions $0^+ \rightarrow 0^+$ for seven nuclei. Further, we have also included the electron capture phase space factor for the required nuclei to calculate the half-lives. This inclusion leads to small contribution in results, particularly for nuclei where electron capture (EC) plays a significant role. The overall results are in agreement with the experimental data.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
DDES Study of Confined and Unconfined NACA Wing Sections Using Spectral Elements
Authors:
Vishal Kumar,
Ananias Tomboulides,
Paul Fischer,
Misun Min
Abstract:
We develop hybrid RANS-LES strategies within the spectral element code Nek5000 based on the $k-τ$ class of turbulence models. We chose airfoil sections at small flight configurations as our target problem to comprehensively test the solver accuracy and performance. We present verification and validation results of an unconfined NACA0012 wing section in a pure RANS and in a hybrid RANS-LES setup fo…
▽ More
We develop hybrid RANS-LES strategies within the spectral element code Nek5000 based on the $k-τ$ class of turbulence models. We chose airfoil sections at small flight configurations as our target problem to comprehensively test the solver accuracy and performance. We present verification and validation results of an unconfined NACA0012 wing section in a pure RANS and in a hybrid RANS-LES setup for an angle of attack ranging from 0 to 90 degrees. The RANS results shows good corroboration with existing experimental and numerical datasets for low incoming flow angles. A small discrepancy appears at higher angle in comparison with the experiments, which is in line with our expectations from a RANS formulation. On the other hand, DDES captures both the attached and separated flow dynamics well when compared with available numerical datasets. We demonstrate that for the hybrid turbulence modeling approach a high-order spectral element discretization converges faster (i.e., with less resolution) and captures the flow dynamics more accurately than representative low-order finite-volume and finite-difference approaches. We also revise some of the guidelines on sample size requirements for statistics convergence. Furthermore, we analyze some of the observed discrepancies of our unconfined DDES at higher angles with the experiments by evaluating the side wall "blocking" effect. We carry out additional simulations in a confined 'numerical wind tunnel' and assess the observed differences as a function of Reynolds number.
△ Less
Submitted 7 October, 2024;
originally announced October 2024.
-
A Digital Twin Framework for Liquid-cooled Supercomputers as Demonstrated at Exascale
Authors:
Wesley Brewer,
Matthias Maiterth,
Vineet Kumar,
Rafal Wojda,
Sedrick Bouknight,
Jesse Hines,
Woong Shin,
Scott Greenwood,
David Grant,
Wesley Williams,
Feiyi Wang
Abstract:
We present ExaDigiT, an open-source framework for developing comprehensive digital twins of liquid-cooled supercomputers. It integrates three main modules: (1) a resource allocator and power simulator, (2) a transient thermo-fluidic cooling model, and (3) an augmented reality model of the supercomputer and central energy plant. The framework enables the study of "what-if" scenarios, system optimiz…
▽ More
We present ExaDigiT, an open-source framework for developing comprehensive digital twins of liquid-cooled supercomputers. It integrates three main modules: (1) a resource allocator and power simulator, (2) a transient thermo-fluidic cooling model, and (3) an augmented reality model of the supercomputer and central energy plant. The framework enables the study of "what-if" scenarios, system optimizations, and virtual prototyping of future systems. Using Frontier as a case study, we demonstrate the framework's capabilities by replaying six months of system telemetry for systematic verification and validation. Such a comprehensive analysis of a liquid-cooled exascale supercomputer is the first of its kind. ExaDigiT elucidates complex transient cooling system dynamics, runs synthetic or real workloads, and predicts energy losses due to rectification and voltage conversion. Throughout our paper, we present lessons learned to benefit HPC practitioners developing similar digital twins. We envision the digital twin will be a key enabler for sustainable, energy-efficient supercomputing.
△ Less
Submitted 7 October, 2024;
originally announced October 2024.
-
Learning Code Preference via Synthetic Evolution
Authors:
Jiawei Liu,
Thanh Nguyen,
Mingyue Shang,
Hantian Ding,
Xiaopeng Li,
Yu Yu,
Varun Kumar,
Zijian Wang
Abstract:
Large Language Models (LLMs) have recently demonstrated remarkable coding capabilities. However, assessing code generation based on well-formed properties and aligning it with developer preferences remains challenging. In this paper, we explore two key questions under the new challenge of code preference learning: (i) How do we train models to predict meaningful preferences for code? and (ii) How…
▽ More
Large Language Models (LLMs) have recently demonstrated remarkable coding capabilities. However, assessing code generation based on well-formed properties and aligning it with developer preferences remains challenging. In this paper, we explore two key questions under the new challenge of code preference learning: (i) How do we train models to predict meaningful preferences for code? and (ii) How do human and LLM preferences align with verifiable code properties and developer code tastes? To this end, we propose CodeFavor, a framework for training pairwise code preference models from synthetic evolution data, including code commits and code critiques. To evaluate code preferences, we introduce CodePrefBench, a benchmark comprising 1364 rigorously curated code preference tasks to cover three verifiable properties-correctness, efficiency, and security-along with human preference. Our evaluation shows that CodeFavor holistically improves the accuracy of model-based code preferences by up to 28.8%. Meanwhile, CodeFavor models can match the performance of models with 6-9x more parameters while being 34x more cost-effective. We also rigorously validate the design choices in CodeFavor via a comprehensive set of controlled experiments. Furthermore, we discover the prohibitive costs and limitations of human-based code preference: despite spending 23.4 person-minutes on each task, 15.1-40.3% of tasks remain unsolved. Compared to model-based preference, human preference tends to be more accurate under the objective of code correctness, while being sub-optimal for non-functional objectives.
△ Less
Submitted 4 October, 2024;
originally announced October 2024.
-
Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning
Authors:
Yifeng Ding,
Hantian Ding,
Shiqi Wang,
Qing Sun,
Varun Kumar,
Zijian Wang
Abstract:
Fill-in-the-Middle (FIM) has become integral to code language models, enabling generation of missing code given both left and right contexts. However, the current FIM training paradigm, which reorders original training sequences and then performs regular next-token prediction (NTP), often leads to models struggling to generate content that aligns smoothly with the surrounding context. Crucially, w…
▽ More
Fill-in-the-Middle (FIM) has become integral to code language models, enabling generation of missing code given both left and right contexts. However, the current FIM training paradigm, which reorders original training sequences and then performs regular next-token prediction (NTP), often leads to models struggling to generate content that aligns smoothly with the surrounding context. Crucially, while existing works rely on rule-based post-processing to circumvent this weakness, such methods are not practically usable in open-domain code completion tasks as they depend on restrictive, dataset-specific assumptions (e.g., generating the same number of lines as in the ground truth). Moreover, model performance on FIM tasks deteriorates significantly without these unrealistic assumptions.
We hypothesize that NTP alone is insufficient for models to learn effective planning conditioned on the distant right context, a critical factor for successful code infilling. To overcome this, we propose Horizon-Length Prediction (HLP), a novel training objective that teaches models to predict the number of remaining middle tokens (i.e., horizon length) at each step. HLP advances FIM with lookahead planning, enabling models to inherently learn infilling boundaries for arbitrary left and right contexts without relying on dataset-specific post-processing. Our evaluation across different models and sizes shows that HLP significantly improves FIM performance by up to 24% relatively on diverse benchmarks, across file-level and repository-level, and without resorting to unrealistic post-processing methods. Furthermore, the enhanced planning capability gained through HLP boosts model performance on code reasoning. Importantly, HLP only incurs negligible training overhead and no additional inference cost, ensuring its practicality for real-world scenarios.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
SPINE: Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments
Authors:
Zachary Ravichandran,
Varun Murali,
Mariliza Tzes,
George J. Pappas,
Vijay Kumar
Abstract:
As robots become increasingly capable, users will want to describe high-level missions and have robots fill in the gaps. In many realistic settings, pre-built maps are difficult to obtain, so execution requires exploration and mapping that are necessary and specific to the mission. Consider an emergency response scenario where a user commands a robot, "triage impacted regions." The robot must infe…
▽ More
As robots become increasingly capable, users will want to describe high-level missions and have robots fill in the gaps. In many realistic settings, pre-built maps are difficult to obtain, so execution requires exploration and mapping that are necessary and specific to the mission. Consider an emergency response scenario where a user commands a robot, "triage impacted regions." The robot must infer relevant semantics (victims, etc.) and exploration targets (damaged regions) based on priors or other context, then explore and refine its plan online. These missions are incompletely specified, meaning they imply subtasks and semantics. While many semantic planning methods operate online, they are typically designed for well specified tasks such as object search or exploration. Recently, Large Language Models (LLMs) have demonstrated powerful contextual reasoning over a range of robotic tasks described in natural language. However, existing LLM planners typically do not consider online planning or complex missions; rather, relevant subtasks are provided by a pre-built map or a user. We address these limitations via SPINE (online Semantic Planner for missions with Incomplete Natural language specifications in unstructured Environments). SPINE uses an LLM to reason about subtasks implied by the mission then realizes these subtasks in a receding horizon framework. Tasks are automatically validated for safety and refined online with new observations. We evaluate SPINE in simulation and real-world settings. Evaluation missions require multiple steps of semantic reasoning and exploration in cluttered outdoor environments of over 20,000m$^2$ area. We evaluate SPINE against competitive baselines in single-agent and air-ground teaming applications. Please find videos and software on our project page: https://zacravichandran.github.io/SPINE
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
Beamsplitter-free, high bit-rate, quantum random number generator based on temporal and spatial correlations of heralded single-photons
Authors:
Ayan Kumar Nai,
Amritash Sharma,
Vimlesh Kumar,
Sandeep Singh,
Shreya Mishra,
C. M. Chandrashekar,
G. K. Samanta
Abstract:
The spontaneous parametric down-conversion (SPDC), an inherently random quantum process, produces a non-deterministic photon-pair with strong temporal and spatial correlations owing to both energy and momentum conservation. Therefore, the SPDC-based photon pairs are used for quantum random number generation (QRNG). Typically, temporal correlation in association with an ideal unbiased beam splitter…
▽ More
The spontaneous parametric down-conversion (SPDC), an inherently random quantum process, produces a non-deterministic photon-pair with strong temporal and spatial correlations owing to both energy and momentum conservation. Therefore, the SPDC-based photon pairs are used for quantum random number generation (QRNG). Typically, temporal correlation in association with an ideal unbiased beam splitter is used for QRNG without fully exploring the spatial correction. As a result, SPDC-based QRNG has a low bit rate. On the other hand, due to the spatial correlation, the photon pairs in non-collinear phase-matched geometry are generated randomly in diametrically opposite points over an annular ring spatial distribution. Therefore, exploring the temporal correlation between photon pairs from different sections of the annual ring can lead to multi-bit QRNG at a high rate, avoiding the need for a beam splitter. As a proof-of-concept, we report on high-bit-rate QRNG by using spatial correlation of photon-pairs by sectioning the SPDC ring of a non-collinear, degenerate, high-brightness source and temporal correlation between the diametrically opposite sections. Dividing the annular ring of the high-brightness photon-pair source based on a 20 mm long, type-0 phase-matched, periodically-poled KTP crystal into four sections, recording the timestamp of the coincidences (widow of 1 ns) between photons from diametrically opposite sections and assigning bits (0 and 1), we extracted 90 million raw bits over 27.7 s at a pump power of 17 mW. We determined the extraction ratio using the minimum entropy evaluation of more than 95% in our case. Using Toeplitz matrix-based post-processing, we achieved a QRNG with a bit-rate of 3 Mbps, passing all NIST 800-22 and TestU01 test suites. The generic scheme shows the possibility of further enhancement of the bit rate through more sectioning of the SPDC ring.
△ Less
Submitted 1 October, 2024;
originally announced October 2024.
-
Computational Investigation of Roughness Effects on Boundary Layer Transition for Stetson's Blunt Cone at Mach 6
Authors:
Arturo Rodriguez,
Piyush Kumar,
Cesar Diaz-Caraveo,
Richard O. Adansi,
Luis F. Rodriguez,
Vinod Kumar
Abstract:
In this aerothermal study, we performed a two-dimensional steady-state Computational Fluid Dynamics (CFD) and heat conduction simulation at Mach 6. The key to our methodology was a one-way coupling between CFD surface temperature as a boundary condition and the calculation of the heat transfer flux and temperatures inside the solid stainless-steel body of a nose geometry. This approach allowed us…
▽ More
In this aerothermal study, we performed a two-dimensional steady-state Computational Fluid Dynamics (CFD) and heat conduction simulation at Mach 6. The key to our methodology was a one-way coupling between CFD surface temperature as a boundary condition and the calculation of the heat transfer flux and temperatures inside the solid stainless-steel body of a nose geometry. This approach allowed us to gain insight into surface heat transfer signatures with corresponding fluid flow regimes, such as the one experienced in laminar fluid flow. We have also examined this heat transfer under roughness values encountered in Stetson's studies at the Wright-Patterson Air Force Base Ludwig tube. To validate our findings, we have performed this type of work on a blunt cone, specifically for the U.S. Air Force.
△ Less
Submitted 30 September, 2024;
originally announced October 2024.
-
Explain in Plain Language Questions with Indic Languages: Drawbacks, Affordances, and Opportunities
Authors:
David H. Smith IV,
Viraj Kumar,
Paul Denny
Abstract:
Background: Introductory computer science courses use ``Explain in Plain English'' (EiPE) activities to develop and assess students' code comprehension skills, but creating effective autograders for these questions is challenging and limited to English. This is a particular challenge in linguistically diverse countries like India where students may have limited proficiency in English.
Methods: W…
▽ More
Background: Introductory computer science courses use ``Explain in Plain English'' (EiPE) activities to develop and assess students' code comprehension skills, but creating effective autograders for these questions is challenging and limited to English. This is a particular challenge in linguistically diverse countries like India where students may have limited proficiency in English.
Methods: We evaluate the efficacy of a recently introduced approach called Code Generation Based Grading (CGBG) in enabling language agnostic ``Explain in Plain Language'' (EiPL) activities. Here students' EiPL responses generate code that is tested for functional equivalence to the original which was being described.
Objectives: We initially evaluate the correctness of code generated from correct EiPL responses provided in 10 of India's most commonly spoken languages. To evaluate the effectiveness of the approach in practice, we assess student success and perceptions of EiPL questions in a NPTEL (National Programme on Technology Enhanced Learning) course.
Results: We find promising results for the correctness of code generated from translations of correct EiPL responses, with most languages achieving a correctness rate of 75% or higher. However, in practice, many students preferred to respond in English due to greater familiarity with English as a technical language, difficulties writing in their native language, and perceptions of the grader being less capable of generating code from prompts in their mother tongue.
△ Less
Submitted 30 September, 2024;
originally announced September 2024.
-
4D Metric-Semantic Mapping for Persistent Orchard Monitoring: Method and Dataset
Authors:
Jiuzhou Lei,
Ankit Prabhu,
Xu Liu,
Fernando Cladera,
Mehrad Mortazavi,
Reza Ehsani,
Pratik Chaudhari,
Vijay Kumar
Abstract:
Automated persistent and fine-grained monitoring of orchards at the individual tree or fruit level helps maximize crop yield and optimize resources such as water, fertilizers, and pesticides while preventing agricultural waste. Towards this goal, we present a 4D spatio-temporal metric-semantic mapping method that fuses data from multiple sensors, including LiDAR, RGB camera, and IMU, to monitor th…
▽ More
Automated persistent and fine-grained monitoring of orchards at the individual tree or fruit level helps maximize crop yield and optimize resources such as water, fertilizers, and pesticides while preventing agricultural waste. Towards this goal, we present a 4D spatio-temporal metric-semantic mapping method that fuses data from multiple sensors, including LiDAR, RGB camera, and IMU, to monitor the fruits in an orchard across their growth season. A LiDAR-RGB fusion module is designed for 3D fruit tracking and localization, which first segments fruits using a deep neural network and then tracks them using the Hungarian Assignment algorithm. Additionally, the 4D data association module aligns data from different growth stages into a common reference frame and tracks fruits spatio-temporally, providing information such as fruit counts, sizes, and positions. We demonstrate our method's accuracy in 4D metric-semantic mapping using data collected from a real orchard under natural, uncontrolled conditions with seasonal variations. We achieve a 3.1 percent error in total fruit count estimation for over 1790 fruits across 60 apple trees, along with accurate size estimation results with a mean error of 1.1 cm. The datasets, consisting of LiDAR, RGB, and IMU data of five fruit species captured across their growth seasons, along with corresponding ground truth data, will be made publicly available at: https://4d-metric-semantic-mapping.org/
△ Less
Submitted 29 September, 2024;
originally announced September 2024.
-
RT-GuIDE: Real-Time Gaussian splatting for Information-Driven Exploration
Authors:
Yuezhan Tao,
Dexter Ong,
Varun Murali,
Igor Spasojevic,
Pratik Chaudhari,
Vijay Kumar
Abstract:
We propose a framework for active mapping and exploration that leverages Gaussian splatting for constructing information-rich maps. Further, we develop a parallelized motion planning algorithm that can exploit the Gaussian map for real-time navigation. The Gaussian map constructed onboard the robot is optimized for both photometric and geometric quality while enabling real-time situational awarene…
▽ More
We propose a framework for active mapping and exploration that leverages Gaussian splatting for constructing information-rich maps. Further, we develop a parallelized motion planning algorithm that can exploit the Gaussian map for real-time navigation. The Gaussian map constructed onboard the robot is optimized for both photometric and geometric quality while enabling real-time situational awareness for autonomy. We show through simulation experiments that our method is competitive with approaches that use alternate information gain metrics, while being orders of magnitude faster to compute. In real-world experiments, our algorithm achieves better map quality (10% higher Peak Signal-to-Noise Ratio (PSNR) and 30% higher geometric reconstruction accuracy) than Gaussian maps constructed by traditional exploration baselines. Experiment videos and more details can be found on our project page: https://tyuezhan.github.io/RT_GuIDE/
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
EvMAPPER: High Altitude Orthomapping with Event Cameras
Authors:
Fernando Cladera,
Kenneth Chaney,
M. Ani Hsieh,
Camillo J. Taylor,
Vijay Kumar
Abstract:
Traditionally, unmanned aerial vehicles (UAVs) rely on CMOS-based cameras to collect images about the world below. One of the most successful applications of UAVs is to generate orthomosaics or orthomaps, in which a series of images are integrated together to develop a larger map. However, the use of CMOS-based cameras with global or rolling shutters mean that orthomaps are vulnerable to challengi…
▽ More
Traditionally, unmanned aerial vehicles (UAVs) rely on CMOS-based cameras to collect images about the world below. One of the most successful applications of UAVs is to generate orthomosaics or orthomaps, in which a series of images are integrated together to develop a larger map. However, the use of CMOS-based cameras with global or rolling shutters mean that orthomaps are vulnerable to challenging light conditions, motion blur, and high-speed motion of independently moving objects under the camera. Event cameras are less sensitive to these issues, as their pixels are able to trigger asynchronously on brightness changes. This work introduces the first orthomosaic approach using event cameras. In contrast to existing methods relying only on CMOS cameras, our approach enables map generation even in challenging light conditions, including direct sunlight and after sunset.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
Classical inequalities for all Fourier matrix coefficients of $\mathrm{SL}(2,\mathbb{R})$ and their applications
Authors:
Vishvesh Kumar,
Tapendu Rana,
Michael Ruzhansky
Abstract:
In this article, we establish three fundamental Fourier inequalities: the Hausdorff-Young inequality, the Paley inequality, and the Hausdorff-Young-Paley inequality for $(l, n)$-type functions on $\mathrm{SL}(2,\mathbb{R})$. Utilizing these inequalities, we demonstrate the $L^p$-$L^q$ boundedness of $(l, n)$-type Fourier multipliers on $\mathrm{SL}(2,\mathbb{R})$. Furthermore, we explore applicati…
▽ More
In this article, we establish three fundamental Fourier inequalities: the Hausdorff-Young inequality, the Paley inequality, and the Hausdorff-Young-Paley inequality for $(l, n)$-type functions on $\mathrm{SL}(2,\mathbb{R})$. Utilizing these inequalities, we demonstrate the $L^p$-$L^q$ boundedness of $(l, n)$-type Fourier multipliers on $\mathrm{SL}(2,\mathbb{R})$. Furthermore, we explore applications related to the $L^p$-$L^q$ estimates of the heat kernel of the Casimir element on $\mathrm{SL}(2,\mathbb{R})$ and address the global well-posedness of certain parabolic and hyperbolic nonlinear equations.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
The global flow state in a precessing cylinder
Authors:
André Giesecke,
Tobias Vogt,
Federico Pizzi,
Vivaswat Kumar,
Fernando Garcia Gonzalez,
Thomas Gundrum,
Frank Stefani
Abstract:
We examine the fluid flow forced by precession of a rotating cylindrical container using numerical simulations and experimental flow measurements with ultrasonic Doppler velocimetry (UDV). The analysis is based on the decomposition of the flow field into contributions with distinct azimuthal symmetry or analytically known inertial modes and the corresponding calculation of their amplitudes. We sho…
▽ More
We examine the fluid flow forced by precession of a rotating cylindrical container using numerical simulations and experimental flow measurements with ultrasonic Doppler velocimetry (UDV). The analysis is based on the decomposition of the flow field into contributions with distinct azimuthal symmetry or analytically known inertial modes and the corresponding calculation of their amplitudes. We show that the predominant fraction of the kinetic energy of the precession-driven fluid flow is contained only within a few large-scale modes.
The most striking observation shown by simulations and experiments is the transition from a flow dominated by large-scale structures to a more turbulent behaviour with the small-scale fluctuations becoming increasingly important. At a fixed rotation frequency (parametrized by the Reynolds number, ${\rm{Re}}$) this transition occurs when a critical precession ratio is exceeded and consists of a two-stage collapse of the directly driven flow going along with a massive modification of the azimuthal circulation (the zonal flow) and the appearance of an axisymmetric double-roll mode limited to a narrow range of precession ratios. A similar behaviour is found in experiments which make it possible to follow the transition up to Reynolds numbers of ${\rm{Re}}\approx 2\times 10^6$. We find that the critical precession ratio decreases with rotation, initially showing a particular scaling $\propto {\rm{Re}}^{-\frac{1}{5}}$ but developing an asymptotic behaviour for ${\rm{Re}}\gtrsim 10^5$ which might be explained by the onset of turbulence in boundary layers.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
Collision-free time-optimal path parameterization for multi-robot teams
Authors:
Katherine Mao,
Igor Spasojevic,
Malakhi Hopkins,
M. Ani Hsieh,
Vijay Kumar
Abstract:
Coordinating the motion of multiple robots in cluttered environments remains a computationally challenging task. We study the problem of minimizing the execution time of a set of geometric paths by a team of robots with state-dependent actuation constraints. We propose a Time-Optimal Path Parameterization (TOPP) algorithm for multiple car-like agents, where the modulation of the timing of every ro…
▽ More
Coordinating the motion of multiple robots in cluttered environments remains a computationally challenging task. We study the problem of minimizing the execution time of a set of geometric paths by a team of robots with state-dependent actuation constraints. We propose a Time-Optimal Path Parameterization (TOPP) algorithm for multiple car-like agents, where the modulation of the timing of every robot along its assigned path is employed to ensure collision avoidance and dynamic feasibility. This is achieved through the use of a priority queue to determine the order of trajectory execution for each robot while taking into account all possible collisions with higher priority robots in a spatiotemporal graph. We show a 10-20% reduction in makespan against existing state-of-the-art methods and validate our approach through simulations and hardware experiments.
△ Less
Submitted 25 September, 2024;
originally announced September 2024.
-
A Novel MOSFET based Single Event Latchup Detection, Current Limiting & Self Power Cycling circuit for Spacecraft systems
Authors:
Ishan Pandey,
Kinshuk Gupta,
Vinod Kumar,
A. R. Khan,
Sandhya V. Kamat
Abstract:
Single Event Latch-up (SEL) is one of the prime concerns for CMOS ICs used in space systems. Galactic Cosmic Rays or Solar Energetic Particles (SEP) may trigger the parasitic latch up circuit in CMOS ICs and cause increase in current beyond the safe limits thereby presenting a threat of permanent failure of the IC. Mitigation of the SEL is always a challenging task. The conventional mitigation app…
▽ More
Single Event Latch-up (SEL) is one of the prime concerns for CMOS ICs used in space systems. Galactic Cosmic Rays or Solar Energetic Particles (SEP) may trigger the parasitic latch up circuit in CMOS ICs and cause increase in current beyond the safe limits thereby presenting a threat of permanent failure of the IC. Mitigation of the SEL is always a challenging task. The conventional mitigation approaches inherently introduce some response time which presents an uncertainty because during this response time the current may exceed the safe current limits. This paper presents a novel circuit based on MOSFETs which provides end-to-end complete solution of detecting SEL, limiting the current below the set threshold and executing power cycling to restore the normal functioning of the CMOS IC. The proposed circuit has been simulated in MULTISIM and the simulation results match very well with the expected behavior of (i)current limiting and (ii) the total time duration taken in power cycling to bring the SEL sensitive device back to its normal operational state. This circuit can be harnessed by spacecraft system designers to overcome the catastrophic threat of SEL posed by space radiation environment.
△ Less
Submitted 25 September, 2024;
originally announced September 2024.
-
Optothermal Revolution: Colloids in an Optical Ring Trap
Authors:
Rahul Chand,
Ashutosh Shukla,
G V Pavan Kumar
Abstract:
Directional motion is commonly observed in various living active systems, such as bacterial colonies moving through confined environments. In these systems, the dynamics arise from the collective effects of mutual interactions between individual elements, as well as their interactions with obstacles or boundaries. In this study, we turn our focus to an artificial system and experimentally investig…
▽ More
Directional motion is commonly observed in various living active systems, such as bacterial colonies moving through confined environments. In these systems, the dynamics arise from the collective effects of mutual interactions between individual elements, as well as their interactions with obstacles or boundaries. In this study, we turn our focus to an artificial system and experimentally investigate the emergence of directional revolution in dimer and trimer structures composed of colloidal particles in ring-shaped optical illumination. In this case, the movement of these colloidal structures is exclusively facilitated by optothermal interactions without any direct mechanical force applied from the external optical field. Depending on the optical absorption properties of the colloidal particles, these optothermal interactions can exhibit both attractive and repulsive characteristics. The attractive interactions provide the necessary driving force that propels the motion, while the repulsive interactions serve to control the structural parameters of the system. The arrangement and interaction of the colloidal particles within these dimer and trimer structures fuel the controlled, directional revolution, with the optical gradient force acting as a confining factor, guiding the movement along a specific path. Notably, the dynamics of these systems can be tuned by altering the intensity of the optical field. This study can be useful as a model for understanding insights into biological systems where group dynamics and environmental interactions are key to coordinated movement.
△ Less
Submitted 25 September, 2024;
originally announced September 2024.
-
Quasielastic $\overrightarrow{^{3}\mathrm{He}}(\overrightarrow{e},{e'})$ Asymmetry in the Threshold Region
Authors:
M. Nycz,
W. Armstrong,
T. Averett,
C. Ayerbe Gayoso,
X. Bai,
J. Bane,
S. Barcus,
J. Benesch,
H. Bhatt,
D. Bhetuwal,
D. Biswas,
A. Camsonne,
G. Cates,
J-P. Chen,
J. Chen,
M. Chen,
C. Cotton,
M-M. Dalton,
A. Deltuva,
A. Deur,
B. Dhital,
B. Duran,
S. C. Dusa,
I. Fernando,
E. Fuchey
, et al. (75 additional authors not shown)
Abstract:
A measurement of the double-spin asymmetry from electron-$^{3}$He scattering in the threshold region of two- and three-body breakup of $^{3}$He was performed at Jefferson Lab, for Q$^{2}$ values of 0.1 and 0.2 (GeV/$c$)$^{2}$. The results of this measurement serve as a stringent test of our understanding of few-body systems. When compared with calculations from plane wave impulse approximation and…
▽ More
A measurement of the double-spin asymmetry from electron-$^{3}$He scattering in the threshold region of two- and three-body breakup of $^{3}$He was performed at Jefferson Lab, for Q$^{2}$ values of 0.1 and 0.2 (GeV/$c$)$^{2}$. The results of this measurement serve as a stringent test of our understanding of few-body systems. When compared with calculations from plane wave impulse approximation and Faddeev theory, we found that the Faddeev calculations, which use modern nuclear potentials and prescriptions for meson-exchange currents, demonstrate an overall good agreement with data.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions
Authors:
Samarth Chopra,
Fernando Cladera,
Varun Murali,
Vijay Kumar
Abstract:
Neural Radiance Fields (NeRFs) have shown significant promise in 3D scene reconstruction and novel view synthesis. In agricultural settings, NeRFs can serve as digital twins, providing critical information about fruit detection for yield estimation and other important metrics for farmers. However, traditional NeRFs are not robust to challenging lighting conditions, such as low-light, extreme brigh…
▽ More
Neural Radiance Fields (NeRFs) have shown significant promise in 3D scene reconstruction and novel view synthesis. In agricultural settings, NeRFs can serve as digital twins, providing critical information about fruit detection for yield estimation and other important metrics for farmers. However, traditional NeRFs are not robust to challenging lighting conditions, such as low-light, extreme bright light and varying lighting. To address these issues, this work leverages three different sensors: an RGB camera, an event camera and a thermal camera. Our RGB scene reconstruction shows an improvement in PSNR and SSIM by +2.06 dB and +8.3% respectively. Our cross-spectral scene reconstruction enhances downstream fruit detection by +43.0% in mAP50 and +61.1% increase in mAP50-95. The integration of additional sensors leads to a more robust and informative NeRF. We demonstrate that our multi-modal system yields high quality photo-realistic reconstructions under various tree canopy covers and at different times of the day. This work results in the development of a resilient NeRF, capable of performing well in visibly degraded scenarios, as well as a learnt cross-spectral representation, that is used for automated fruit detection.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
Exact mean and variance of the squared Hellinger distance for random density matrices
Authors:
Vinay Kumar,
Kaushik Vasan,
Santosh Kumar
Abstract:
The Hellinger distance between quantum states is a significant measure in quantum information theory, known for its Riemannian and monotonic properties. It is also easier to compute than the Bures distance, another measure that shares these properties. In this work, we derive the mean and variance of the Hellinger distance between pairs of density matrices, where one or both matrices are random. A…
▽ More
The Hellinger distance between quantum states is a significant measure in quantum information theory, known for its Riemannian and monotonic properties. It is also easier to compute than the Bures distance, another measure that shares these properties. In this work, we derive the mean and variance of the Hellinger distance between pairs of density matrices, where one or both matrices are random. Along the way, we also obtain exact results for the mean affinity and mean square affinity. The first two cumulants of the Hellinger distance allow us to propose an approximation for the corresponding probability density function based on the gamma distribution. Our analytical results are corroborated through Monte Carlo simulations, showing excellent agreement.
△ Less
Submitted 22 September, 2024;
originally announced September 2024.
-
Hierarchical LLMs In-the-loop Optimization for Real-time Multi-Robot Target Tracking under Unknown Hazards
Authors:
Yuwei Wu,
Yuezhan Tao,
Peihan Li,
Guangyao Shi,
Gaurav S. Sukhatmem,
Vijay Kumar,
Lifeng Zhou
Abstract:
In this paper, we propose a hierarchical Large Language Models (LLMs) in-the-loop optimization framework for real-time multi-robot task allocation and target tracking in an unknown hazardous environment subject to sensing and communication attacks. We formulate multi-robot coordination for tracking tasks as a bi-level optimization problem, with LLMs to reason about potential hazards in the environ…
▽ More
In this paper, we propose a hierarchical Large Language Models (LLMs) in-the-loop optimization framework for real-time multi-robot task allocation and target tracking in an unknown hazardous environment subject to sensing and communication attacks. We formulate multi-robot coordination for tracking tasks as a bi-level optimization problem, with LLMs to reason about potential hazards in the environment and the status of the robot team and modify both the inner and outer levels of the optimization. The inner LLM adjusts parameters to prioritize various objectives, including performance, safety, and energy efficiency, while the outer LLM handles online variable completion for team reconfiguration. This hierarchical approach enables real-time adjustments to the robots' behavior. Additionally, a human supervisor can offer broad guidance and assessments to address unexpected dangers, model mismatches, and performance issues arising from local minima. We validate our proposed framework in both simulation and real-world experiments with comprehensive evaluations, which provide the potential for safe LLM integration for multi-robot problems.
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
Empowering Abilities: Increasing Representation of Students with Disabilities in the STEM Field
Authors:
Esperanza Moreno,
Piyush Kumar,
Richard O Adansi,
Dorothy Moreno,
Demy Rodriguez,
Raul Baez Ramirez,
Audrey R Kapsa,
Arturo Rodriguez,
Neelam Agarwal,
Vinod Kumar,
Beverley A Calvo,
Vivek Tandon
Abstract:
The ExploreSTEM Summer Camps 2023 were designed to deliver inclusive STEM education to students aged 14 to 22 years with disabilities. This paper presents a thorough examination of the 2023 camp program, emphasizing the pivotal role of inclusive STEM education in potentially shaping students' personal and academic trajectories. The curriculum encompassed four weeklong fundamental STEM domains: Int…
▽ More
The ExploreSTEM Summer Camps 2023 were designed to deliver inclusive STEM education to students aged 14 to 22 years with disabilities. This paper presents a thorough examination of the 2023 camp program, emphasizing the pivotal role of inclusive STEM education in potentially shaping students' personal and academic trajectories. The curriculum encompassed four weeklong fundamental STEM domains: Internet of Things (IoT), Computational Engineering, Artificial Intelligence (AI), and Augmented and Virtual Reality (AR/VR). Within Camp 1, students actively engaged with Dash robots, employing dedicated programming environments to command actions and gather sensor data, fostering interactions with the IoT platform and facilitating seamless data transmission. Camp 2 was dedicated to acquainting students with foundational computational engineering principles, establishing a robust framework for comprehending intricate engineering concepts. Camp 3 commenced with insightful presentations elucidating AI applications across multifaceted industries, including engineering, healthcare, and education, illuminating AI's pervasive influence on contemporary society. The primary aim of Camp 4 was to introduce students to the immersive domains of AR and VR, showcasing their applications beyond conventional STEM disciplines into everyday life experiences. The amalgamation of informative presentations, interactive activities, and a nurturing learning environment cultivated an engaging and enriching experience for all participants. By embracing inclusivity and harnessing innovative pedagogical approaches, the ExploreSTEM Summer Camps empowered students to explore, innovate, and excel within the dynamic realm of STEM education.
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
Closed-loop Analysis of ADMM-based Suboptimal Linear Model Predictive Control
Authors:
Anusha Srikanthan,
Aren Karapetyan,
Vijay Kumar,
Nikolai Matni
Abstract:
Many practical applications of optimal control are subject to real-time computational constraints. When applying model predictive control (MPC) in these settings, respecting timing constraints is achieved by limiting the number of iterations of the optimization algorithm used to compute control actions at each time step, resulting in so-called suboptimal MPC. This paper proposes a suboptimal MPC s…
▽ More
Many practical applications of optimal control are subject to real-time computational constraints. When applying model predictive control (MPC) in these settings, respecting timing constraints is achieved by limiting the number of iterations of the optimization algorithm used to compute control actions at each time step, resulting in so-called suboptimal MPC. This paper proposes a suboptimal MPC scheme based on the alternating direction method of multipliers (ADMM). With a focus on the linear quadratic regulator problem with state and input constraints, we show how ADMM can be used to split the MPC problem into iterative updates of an unconstrained optimal control problem (with an analytical solution), and a dynamics-free feasibility step. We show that using a warm-start approach combined with enough iterations per time-step, yields an ADMM-based suboptimal MPC scheme which asymptotically stabilizes the system and maintains recursive feasibility.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Constrained Learning for Decentralized Multi-Objective Coverage Control
Authors:
Juan Cervino,
Saurav Agarwal,
Vijay Kumar,
Alejandro Ribeiro
Abstract:
The multi-objective coverage control problem requires a robot swarm to collaboratively provide sensor coverage to multiple heterogeneous importance density fields (IDFs) simultaneously. We pose this as an optimization problem with constraints and study two different formulations: (1) Fair coverage, where we minimize the maximum coverage cost for any field, promoting equitable resource distribution…
▽ More
The multi-objective coverage control problem requires a robot swarm to collaboratively provide sensor coverage to multiple heterogeneous importance density fields (IDFs) simultaneously. We pose this as an optimization problem with constraints and study two different formulations: (1) Fair coverage, where we minimize the maximum coverage cost for any field, promoting equitable resource distribution among all fields; and (2) Constrained coverage, where each field must be covered below a certain cost threshold, ensuring that critical areas receive adequate coverage according to predefined importance levels. We study the decentralized setting where robots have limited communication and local sensing capabilities, making the system more realistic, scalable, and robust. Given the complexity, we propose a novel decentralized constrained learning approach that combines primal-dual optimization with a Learnable Perception-Action-Communication (LPAC) neural network architecture. We show that the Lagrangian of the dual problem can be reformulated as a linear combination of the IDFs, enabling the LPAC policy to serve as a primal solver. We empirically demonstrate that the proposed method (i) significantly outperforms existing state-of-the-art decentralized controllers by 30% on average in terms of coverage cost, (ii) transfers well to larger environments with more robots and (iii) is scalable in the number of fields and robots in the swarm.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Leveraging Symmetry to Accelerate Learning of Trajectory Tracking Controllers for Free-Flying Robotic Systems
Authors:
Jake Welde,
Nishanth Rao,
Pratik Kunapuli,
Dinesh Jayaraman,
Vijay Kumar
Abstract:
Tracking controllers enable robotic systems to accurately follow planned reference trajectories. In particular, reinforcement learning (RL) has shown promise in the synthesis of controllers for systems with complex dynamics and modest online compute budgets. However, the poor sample efficiency of RL and the challenges of reward design make training slow and sometimes unstable, especially for high-…
▽ More
Tracking controllers enable robotic systems to accurately follow planned reference trajectories. In particular, reinforcement learning (RL) has shown promise in the synthesis of controllers for systems with complex dynamics and modest online compute budgets. However, the poor sample efficiency of RL and the challenges of reward design make training slow and sometimes unstable, especially for high-dimensional systems. In this work, we leverage the inherent Lie group symmetries of robotic systems with a floating base to mitigate these challenges when learning tracking controllers. We model a general tracking problem as a Markov decision process (MDP) that captures the evolution of both the physical and reference states. Next, we prove that symmetry in the underlying dynamics and running costs leads to an MDP homomorphism, a mapping that allows a policy trained on a lower-dimensional "quotient" MDP to be lifted to an optimal tracking controller for the original system. We compare this symmetry-informed approach to an unstructured baseline, using Proximal Policy Optimization (PPO) to learn tracking controllers for three systems: the Particle (a forced point mass), the Astrobee (a fullyactuated space robot), and the Quadrotor (an underactuated system). Results show that a symmetry-aware approach both accelerates training and reduces tracking error after the same number of training steps.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Resilient and Adaptive Replanning for Multi-Robot Target Tracking with Sensing and Communication Danger Zones
Authors:
Peihan Li,
Yuwei Wu,
Jiazhen Liu,
Gaurav S. Sukhatme,
Vijay Kumar,
Lifeng Zhou
Abstract:
Multi-robot collaboration for target tracking presents significant challenges in hazardous environments, including addressing robot failures, dynamic priority changes, and other unpredictable factors. Moreover, these challenges are increased in adversarial settings if the environment is unknown. In this paper, we propose a resilient and adaptive framework for multi-robot, multi-target tracking in…
▽ More
Multi-robot collaboration for target tracking presents significant challenges in hazardous environments, including addressing robot failures, dynamic priority changes, and other unpredictable factors. Moreover, these challenges are increased in adversarial settings if the environment is unknown. In this paper, we propose a resilient and adaptive framework for multi-robot, multi-target tracking in environments with unknown sensing and communication danger zones. The damages posed by these zones are temporary, allowing robots to track targets while accepting the risk of entering dangerous areas. We formulate the problem as an optimization with soft chance constraints, enabling real-time adjustments to robot behavior based on varying types of dangers and failures. An adaptive replanning strategy is introduced, featuring different triggers to improve group performance. This approach allows for dynamic prioritization of target tracking and risk aversion or resilience, depending on evolving resources and real-time conditions. To validate the effectiveness of the proposed method, we benchmark and evaluate it across multiple scenarios in simulation and conduct several real-world experiments.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Safe Interval Motion Planning for Quadrotors in Dynamic Environments
Authors:
Songhao Huang,
Yuwei Wu,
Yuezhan Tao,
Vijay Kumar
Abstract:
Trajectory generation in dynamic environments presents a significant challenge for quadrotors, particularly due to the non-convexity in the spatial-temporal domain. Many existing methods either assume simplified static environments or struggle to produce optimal solutions in real-time. In this work, we propose an efficient safe interval motion planning framework for navigation in dynamic environme…
▽ More
Trajectory generation in dynamic environments presents a significant challenge for quadrotors, particularly due to the non-convexity in the spatial-temporal domain. Many existing methods either assume simplified static environments or struggle to produce optimal solutions in real-time. In this work, we propose an efficient safe interval motion planning framework for navigation in dynamic environments. A safe interval refers to a time window during which a specific configuration is safe. Our approach addresses trajectory generation through a two-stage process: a front-end graph search step followed by a back-end gradient-based optimization. We ensure completeness and optimality by constructing a dynamic connected visibility graph and incorporating low-order dynamic bounds within safe intervals and temporal corridors. To avoid local minima, we propose a Uniform Temporal Visibility Deformation (UTVD) for the complete evaluation of spatial-temporal topological equivalence. We represent trajectories with B-Spline curves and apply gradient-based optimization to navigate around static and moving obstacles within spatial-temporal corridors. Through simulation and real-world experiments, we show that our method can achieve a success rate of over 95% in environments with different density levels, exceeding the performance of other approaches, demonstrating its potential for practical deployment in highly dynamic environments.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
NLLB-E5: A Scalable Multilingual Retrieval Model
Authors:
Arkadeep Acharya,
Rudra Murthy,
Vishwajeet Kumar,
Jaydeep Sen
Abstract:
Despite significant progress in multilingual information retrieval, the lack of models capable of effectively supporting multiple languages, particularly low-resource like Indic languages, remains a critical challenge. This paper presents NLLB-E5: A Scalable Multilingual Retrieval Model. NLLB-E5 leverages the in-built multilingual capabilities in the NLLB encoder for translation tasks. It proposes…
▽ More
Despite significant progress in multilingual information retrieval, the lack of models capable of effectively supporting multiple languages, particularly low-resource like Indic languages, remains a critical challenge. This paper presents NLLB-E5: A Scalable Multilingual Retrieval Model. NLLB-E5 leverages the in-built multilingual capabilities in the NLLB encoder for translation tasks. It proposes a distillation approach from multilingual retriever E5 to provide a zero-shot retrieval approach handling multiple languages, including all major Indic languages, without requiring multilingual training data. We evaluate the model on a comprehensive suite of existing benchmarks, including Hindi-BEIR, highlighting its robust performance across diverse languages and tasks. Our findings uncover task and domain-specific challenges, providing valuable insights into the retrieval performance, especially for low-resource languages. NLLB-E5 addresses the urgent need for an inclusive, scalable, and language-agnostic text retrieval model, advancing the field of multilingual information access and promoting digital inclusivity for millions of users globally.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Subelliptic Nonlocal Brezis-Nirenberg Problems on Stratified Lie Groups
Authors:
Sekhar Ghosh,
Vishvesh Kumar,
Michael Ruzhansky
Abstract:
In this paper, we investigate the subelliptic nonlocal Brezis-Nirenberg problem on stratified Lie groups involving critical nonlinearities, namely, \begin{align*}
(-Δ_{\mathbb{G}, p})^s u&= μ|u|^{p_s^*-2}u+λh(x, u) \quad \text{in}\quad Ω, \\ u&=0\quad \text{in}\quad \mathbb{G}\backslash Ω, \end{align*} where $(-Δ_{\mathbb{G}, p})^s$ is the fractional $p$-sub-Laplacian on a stratified Lie group…
▽ More
In this paper, we investigate the subelliptic nonlocal Brezis-Nirenberg problem on stratified Lie groups involving critical nonlinearities, namely, \begin{align*}
(-Δ_{\mathbb{G}, p})^s u&= μ|u|^{p_s^*-2}u+λh(x, u) \quad \text{in}\quad Ω, \\ u&=0\quad \text{in}\quad \mathbb{G}\backslash Ω, \end{align*} where $(-Δ_{\mathbb{G}, p})^s$ is the fractional $p$-sub-Laplacian on a stratified Lie group $\mathbb{G}$ with homogeneous dimension $Q,$ $Ω$ is an open bounded subset of $\mathbb{G},$ $s \in (0,1)$, $\frac{Q}{s}>p\geq2,$ $p_s^*:=\frac{pQ}{Q-ps}$ is subelliptic fractional Sobolev critical exponent, $μ, λ>0$ are real parameters and $h$ is a lower order perturbation of the critical power $|u|^{p_s^*-2}u$. Utilising direct methods of the calculus of variation, we establish the existence of at least one weak solution for the above problem under the condition that the real parameter $λ$ is sufficiently small. Additionally, we examine the problem for $μ= 0$, representing subelliptic nonlocal equations on stratified Lie groups depending on one real positive parameter and involving a subcritical nonlinearity. We demonstrate the existence of at least one solution in this scenario as well. We emphasize that the results obtained here are also novel for $p=2$ even for the Heisenberg group.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
Pion electroproduction measurements in the nucleon resonance region
Authors:
R. Li,
N. Sparveris,
H. Atac,
M. K. Jones,
M. Paolone,
Z. Akbar,
M. Ali,
C. Ayerbe Gayoso,
V. Berdnikov,
D. Biswas,
M. Boer,
A. Camsonne,
J. -P. Chen,
M. Diefenthaler,
B. Duran,
D. Dutta,
D. Gaskell,
O. Hansen,
F. Hauenstein,
N. Heinrich,
W. Henry,
T. Horn,
G. M. Huber,
S. Jia,
S. Joosten
, et al. (24 additional authors not shown)
Abstract:
We report new pion electroproduction measurements in the $Δ(1232)$ resonance, utilizing the SHMS - HMS magnetic spectrometers of Hall C at Jefferson Lab. The data focus on a region that exhibits a strong and rapidly changing interplay of the mesonic cloud and quark-gluon dynamics in the nucleon. The results are in reasonable agreement with models that employ pion cloud effects and chiral effective…
▽ More
We report new pion electroproduction measurements in the $Δ(1232)$ resonance, utilizing the SHMS - HMS magnetic spectrometers of Hall C at Jefferson Lab. The data focus on a region that exhibits a strong and rapidly changing interplay of the mesonic cloud and quark-gluon dynamics in the nucleon. The results are in reasonable agreement with models that employ pion cloud effects and chiral effective field theory calculations, but at the same time they suggest that an improvement is required to the theoretical calculations and provide valuable input that will allow their refinements. The data illustrate the potential of the magnetic spectrometers setup in Hall C towards the study the $Δ(1232)$ resonance. These first reported results will be followed by a series of measurements in Hall C, that will expand the studies of the $Δ(1232)$ resonance offering a high precision insight within a wide kinematic range from low to high momentum transfers.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
Semantically Controllable Augmentations for Generalizable Robot Learning
Authors:
Zoey Chen,
Zhao Mandi,
Homanga Bharadhwaj,
Mohit Sharma,
Shuran Song,
Abhishek Gupta,
Vikash Kumar
Abstract:
Generalization to unseen real-world scenarios for robot manipulation requires exposure to diverse datasets during training. However, collecting large real-world datasets is intractable due to high operational costs. For robot learning to generalize despite these challenges, it is essential to leverage sources of data or priors beyond the robot's direct experience. In this work, we posit that image…
▽ More
Generalization to unseen real-world scenarios for robot manipulation requires exposure to diverse datasets during training. However, collecting large real-world datasets is intractable due to high operational costs. For robot learning to generalize despite these challenges, it is essential to leverage sources of data or priors beyond the robot's direct experience. In this work, we posit that image-text generative models, which are pre-trained on large corpora of web-scraped data, can serve as such a data source. These generative models encompass a broad range of real-world scenarios beyond a robot's direct experience and can synthesize novel synthetic experiences that expose robotic agents to additional world priors aiding real-world generalization at no extra cost.
In particular, our approach leverages pre-trained generative models as an effective tool for data augmentation. We propose a generative augmentation framework for semantically controllable augmentations and rapidly multiplying robot datasets while inducing rich variations that enable real-world generalization. Based on diverse augmentations of robot data, we show how scalable robot manipulation policies can be trained and deployed both in simulation and in unseen real-world environments such as kitchens and table-tops. By demonstrating the effectiveness of image-text generative models in diverse real-world robotic applications, our generative augmentation framework provides a scalable and efficient path for boosting generalization in robot learning at no extra human cost.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Constraint-Aware Intent Estimation for Dynamic Human-Robot Object Co-Manipulation
Authors:
Yifei Simon Shao,
Tianyu Li,
Shafagh Keyvanian,
Pratik Chaudhari,
Vijay Kumar,
Nadia Figueroa
Abstract:
Constraint-aware estimation of human intent is essential for robots to physically collaborate and interact with humans. Further, to achieve fluid collaboration in dynamic tasks intent estimation should be achieved in real-time. In this paper, we present a framework that combines online estimation and control to facilitate robots in interpreting human intentions, and dynamically adjust their action…
▽ More
Constraint-aware estimation of human intent is essential for robots to physically collaborate and interact with humans. Further, to achieve fluid collaboration in dynamic tasks intent estimation should be achieved in real-time. In this paper, we present a framework that combines online estimation and control to facilitate robots in interpreting human intentions, and dynamically adjust their actions to assist in dynamic object co-manipulation tasks while considering both robot and human constraints. Central to our approach is the adoption of a Dynamic Systems (DS) model to represent human intent. Such a low-dimensional parameterized model, along with human manipulability and robot kinematic constraints, enables us to predict intent using a particle filter solely based on past motion data and tracking errors. For safe assistive control, we propose a variable impedance controller that adapts the robot's impedance to offer assistance based on the intent estimation confidence from the DS particle filter. We validate our framework on a challenging real-world human-robot co-manipulation task and present promising results over baselines. Our framework represents a significant step forward in physical human-robot collaboration (pHRC), ensuring that robot cooperative interactions with humans are both feasible and effective.
△ Less
Submitted 30 August, 2024;
originally announced September 2024.
-
A Prototype Model of Zero-Trust Architecture Blockchain with EigenTrust-Based Practical Byzantine Fault Tolerance Protocol to Manage Decentralized Clinical Trials
Authors:
Ashok Kumar Peepliwall,
Hari Mohan Pandey,
Surya Prakash,
Anand A Mahajan,
Sudhinder Singh Chowhan,
Vinesh Kumar,
Rahul Sharma
Abstract:
The COVID-19 pandemic necessitated the emergence of decentralized Clinical Trials (DCTs) due to patient retention, accelerate trials, improve data accessibility, enable virtual care, and facilitate seamless communication through integrated systems. However, integrating systems in DCTs exposes clinical data to potential security threats, making them susceptible to theft at any stage, a high risk of…
▽ More
The COVID-19 pandemic necessitated the emergence of decentralized Clinical Trials (DCTs) due to patient retention, accelerate trials, improve data accessibility, enable virtual care, and facilitate seamless communication through integrated systems. However, integrating systems in DCTs exposes clinical data to potential security threats, making them susceptible to theft at any stage, a high risk of protocol deviations, and monitoring issues. To mitigate these challenges, blockchain technology serves as a secure framework, acting as a decentralized ledger, creating an immutable environment by establishing a zero-trust architecture, where data are deemed untrusted until verified. In combination with Internet of Things (IoT)-enabled wearable devices, blockchain secures the transfer of clinical trial data on private blockchains during DCT automation and operations. This paper proposes a prototype model of the Zero-Trust Architecture Blockchain (z-TAB) to integrate patient-generated clinical trial data during DCT operation management. The EigenTrust-based Practical Byzantine Fault Tolerance (T-PBFT) algorithm has been incorporated as a consensus protocol, leveraging Hyperledger Fabric. Furthermore, the Internet of Things (IoT) has been integrated to streamline data processing among stakeholders within the blockchain platforms. Rigorous evaluation has been done to evaluate the quality of the system.
△ Less
Submitted 29 August, 2024;
originally announced August 2024.
-
Flavor Dependence of Charged Pion Fragmentation Functions
Authors:
H. Bhatt,
P. Bosted,
S. Jia,
W. Armstrong,
D. Dutta,
R. Ent,
D. Gaskell,
E. Kinney,
H. Mkrtchyan,
S. Ali,
R. Ambrose,
D. Androic,
C. Ayerbe Gayoso,
A. Bandari,
V. Berdnikov,
D. Bhetuwal,
D. Biswas,
M. Boer,
E. Brash,
A. Camsonne,
J. P. Chen,
J. Chen,
M. Chen,
E. M. Christy,
S. Covrig
, et al. (45 additional authors not shown)
Abstract:
We have measured the flavor dependence of multiplicities for pi^+ and pi^- production in semi-inclusive deep-inelastic scattering (SIDIS) on proton and deuteron targets to explore a possible charge symmetry violation in fragmentation functions. The experiment used an electron beam with energies of 10.2 and 10.6 GeV at Jefferson Lab and the Hall-C spectrometers. The electron kinematics spanned the…
▽ More
We have measured the flavor dependence of multiplicities for pi^+ and pi^- production in semi-inclusive deep-inelastic scattering (SIDIS) on proton and deuteron targets to explore a possible charge symmetry violation in fragmentation functions. The experiment used an electron beam with energies of 10.2 and 10.6 GeV at Jefferson Lab and the Hall-C spectrometers. The electron kinematics spanned the range 0.3<x<0.6, 2<Q^2<5.5 GeV^2, and 4<W^2<11 GeV^2. The pion fractional momentum range was 0.3< z <0.7, and the transverse momentum range was 0<p_T<0.25 GeV/c. Assuming factorization at low p_T and allowing for isospin breaking, we find that the results can be described by two "favored" and two "un-favored" effective low $p_T$ fragmentation functions that are flavor-dependent. However, they converge to a common flavor-independent value at the lowest x or highest W of this experiment.
△ Less
Submitted 5 September, 2024; v1 submitted 29 August, 2024;
originally announced August 2024.
-
Improved Circuit Lower Bounds With Applications to Exponential Separations Between Quantum and Classical Circuits
Authors:
Sabee Grewal,
Vinayak M. Kumar
Abstract:
Kumar used a switching lemma to prove exponential-size lower bounds for a circuit class GC^0 that not only contains AC^0 but can--with a single gate--compute functions that require exponential-size TC^0 circuits. His main result was that switching-lemma lower bounds for AC^0 lift to GC^0 with no loss in parameters, even though GC^0 requires exponential-size TC^0 circuits. Informally, GC^0 is AC^0…
▽ More
Kumar used a switching lemma to prove exponential-size lower bounds for a circuit class GC^0 that not only contains AC^0 but can--with a single gate--compute functions that require exponential-size TC^0 circuits. His main result was that switching-lemma lower bounds for AC^0 lift to GC^0 with no loss in parameters, even though GC^0 requires exponential-size TC^0 circuits. Informally, GC^0 is AC^0 with unbounded-fan-in gates that behave arbitrarily inside a sufficiently small Hamming ball but must be constant outside it.
We show an analogous result for GC^0[p] (GC^0 with MODp gates) and the polynomial method. Specifically, we show that polynomial-method lower bounds for AC^0[p] lift to GC^0[p] with no loss in parameters. As an application, we prove Majority requires depth-d GC^0[p] circuits of size $2^{Ω(n^{1/2(d-1)})}$, matching the state-of-the-art lower bounds for AC^0[p]. We also show that E^NP requires exponential-size GCC^0 circuits (the union of GC^0[m] for all m), extending the result of Williams.
It is striking that the switching lemma, polynomial method, and algorithmic method all generalize to GC^0-related classes, with the first two methods doing so without any loss.
We also establish the strongest known unconditional separations between quantum and classical circuits:
1. There's an oracle relative to which BQP is not contained in the class of languages decidable by uniform families of size-$2^{n^{O(1)}}$ GC^0 circuits, generalizing Raz and Tal's relativized separation of BQP from the polynomial hierarchy.
2. There's a search problem that QNC^0 circuits can solve but average-case hard for exponential-size GC^0 circuits.
3. There's a search problem that QNC^0/qpoly circuits can solve but average-case hard for exponential-size GC^0[p] circuits.
4. There's an interactive problem that QNC^0 circuits can solve but exponential-size GC^0[p] circuits cannot.
△ Less
Submitted 29 August, 2024;
originally announced August 2024.
-
Brownian Colloids in Optothermal Field: An Experimental Perspective
Authors:
G. V. Pavan Kumar
Abstract:
Colloidal matter undergoing Brownian motion serves as a model system to study various physical phenomena. Understanding the effect of external perturbation on the assembly and dynamics of Brownian colloids has emerged as a relevant research issue in soft matter and biological physics. Optical perturbation in the form of photonic forces and torques has added impetus to this exploration. In recent y…
▽ More
Colloidal matter undergoing Brownian motion serves as a model system to study various physical phenomena. Understanding the effect of external perturbation on the assembly and dynamics of Brownian colloids has emerged as a relevant research issue in soft matter and biological physics. Optical perturbation in the form of photonic forces and torques has added impetus to this exploration. In recent years, optothermal effects arising due to optical excitation of mesoscale matter have expanded the toolbox of light-colloidal matter interactions. In this perspective, we present an experimental viewpoint on some of the developments related to the assembly and dynamics of Brownian colloids driven by the optothermal field. Furthermore, we discuss some interesting prospects on driven colloidal matter that can have implications on soft matter physics and soft photonics.
△ Less
Submitted 28 August, 2024;
originally announced August 2024.
-
Mistral-SPLADE: LLMs for better Learned Sparse Retrieval
Authors:
Meet Doshi,
Vishwajeet Kumar,
Rudra Murthy,
Vignesh P,
Jaydeep Sen
Abstract:
Learned Sparse Retrievers (LSR) have evolved into an effective retrieval strategy that can bridge the gap between traditional keyword-based sparse retrievers and embedding-based dense retrievers. At its core, learned sparse retrievers try to learn the most important semantic keyword expansions from a query and/or document which can facilitate better retrieval with overlapping keyword expansions. L…
▽ More
Learned Sparse Retrievers (LSR) have evolved into an effective retrieval strategy that can bridge the gap between traditional keyword-based sparse retrievers and embedding-based dense retrievers. At its core, learned sparse retrievers try to learn the most important semantic keyword expansions from a query and/or document which can facilitate better retrieval with overlapping keyword expansions. LSR like SPLADE has typically been using encoder only models with MLM (masked language modeling) style objective in conjunction with known ways of retrieval performance improvement such as hard negative mining, distillation, etc. In this work, we propose to use decoder-only model for learning semantic keyword expansion. We posit, decoder only models that have seen much higher magnitudes of data are better equipped to learn keyword expansions needed for improved retrieval. We use Mistral as the backbone to develop our Learned Sparse Retriever similar to SPLADE and train it on a subset of sentence-transformer data which is often used for training text embedding models. Our experiments support the hypothesis that a sparse retrieval model based on decoder only large language model (LLM) surpasses the performance of existing LSR systems, including SPLADE and all its variants. The LLM based model (Echo-Mistral-SPLADE) now stands as a state-of-the-art learned sparse retrieval model on the BEIR text retrieval benchmark.
△ Less
Submitted 21 August, 2024; v1 submitted 20 August, 2024;
originally announced August 2024.
-
Hindi-BEIR : A Large Scale Retrieval Benchmark in Hindi
Authors:
Arkadeep Acharya,
Rudra Murthy,
Vishwajeet Kumar,
Jaydeep Sen
Abstract:
Given the large number of Hindi speakers worldwide, there is a pressing need for robust and efficient information retrieval systems for Hindi. Despite ongoing research, there is a lack of comprehensive benchmark for evaluating retrieval models in Hindi. To address this gap, we introduce the Hindi version of the BEIR benchmark, which includes a subset of English BEIR datasets translated to Hindi, e…
▽ More
Given the large number of Hindi speakers worldwide, there is a pressing need for robust and efficient information retrieval systems for Hindi. Despite ongoing research, there is a lack of comprehensive benchmark for evaluating retrieval models in Hindi. To address this gap, we introduce the Hindi version of the BEIR benchmark, which includes a subset of English BEIR datasets translated to Hindi, existing Hindi retrieval datasets, and synthetically created datasets for retrieval. The benchmark is comprised of $15$ datasets spanning across $8$ distinct tasks. We evaluate state-of-the-art multilingual retrieval models on this benchmark to identify task and domain-specific challenges and their impact on retrieval performance. By releasing this benchmark and a set of relevant baselines, we enable researchers to understand the limitations and capabilities of current Hindi retrieval models, promoting advancements in this critical area. The datasets from Hindi-BEIR are publicly available.
△ Less
Submitted 18 August, 2024;
originally announced August 2024.
-
A Theory-Based Explainable Deep Learning Architecture for Music Emotion
Authors:
Hortense Fong,
Vineet Kumar,
K. Sudhir
Abstract:
This paper paper develops a theory-based, explainable deep learning convolutional neural network (CNN) classifier to predict the time-varying emotional response to music. We design novel CNN filters that leverage the frequency harmonics structure from acoustic physics known to impact the perception of musical features. Our theory-based model is more parsimonious, but provides comparable predictive…
▽ More
This paper paper develops a theory-based, explainable deep learning convolutional neural network (CNN) classifier to predict the time-varying emotional response to music. We design novel CNN filters that leverage the frequency harmonics structure from acoustic physics known to impact the perception of musical features. Our theory-based model is more parsimonious, but provides comparable predictive performance to atheoretical deep learning models, while performing better than models using handcrafted features. Our model can be complemented with handcrafted features, but the performance improvement is marginal. Importantly, the harmonics-based structure placed on the CNN filters provides better explainability for how the model predicts emotional response (valence and arousal), because emotion is closely related to consonance--a perceptual feature defined by the alignment of harmonics. Finally, we illustrate the utility of our model with an application involving digital advertising. Motivated by YouTube mid-roll ads, we conduct a lab experiment in which we exogenously insert ads at different times within videos. We find that ads placed in emotionally similar contexts increase ad engagement (lower skip rates, higher brand recall rates). Ad insertion based on emotional similarity metrics predicted by our theory-based, explainable model produces comparable or better engagement relative to atheoretical models.
△ Less
Submitted 13 August, 2024;
originally announced August 2024.
-
Higher order hypoelliptic damped wave equations on graded Lie groups with data from negative order Sobolev spaces: the critical case
Authors:
Vishvesh Kumar,
Shyam Swarup Mondal,
Michael Ruzhansky,
Berikbol T. Torebek
Abstract:
Let $\mathbb G$ be a graded Lie group with homogeneous dimension $Q$. In this paper, we study the Cauchy problem for a semilinear hypoelliptic damped wave equation involving a positive Rockland operator $\mathcal{R}$ of homogeneous degree $ν\geq 2$ on $\mathbb G$ with power type nonlinearity $|u|^p$ and initial data taken from negative order homogeneous Sobolev space…
▽ More
Let $\mathbb G$ be a graded Lie group with homogeneous dimension $Q$. In this paper, we study the Cauchy problem for a semilinear hypoelliptic damped wave equation involving a positive Rockland operator $\mathcal{R}$ of homogeneous degree $ν\geq 2$ on $\mathbb G$ with power type nonlinearity $|u|^p$ and initial data taken from negative order homogeneous Sobolev space $\dot H^{-γ}(\mathbb G), γ>0,$ for the critical exponent case $p=1+\frac{2ν}{Q+2γ}.$ We also explore the diffusion phenomenon of the higher-order hypoelliptic damped wave equations on graded Lie groups with initial data belonging to Sobolev spaces of negative order. We emphasize that our results are also new, even in the setting of higher-order differential operators on $\mathbb{R}^n$, and more generally, on stratified Lie groups.
△ Less
Submitted 11 September, 2024; v1 submitted 10 August, 2024;
originally announced August 2024.
-
Tangent Space of the Stable And Unstable Manifold of Anosov Diffeomorphism on 2-Torus
Authors:
Federico Bonneto,
Jack Wang,
Vishal Kumar
Abstract:
In this paper we describe the tangent vectors of the stable and unstable manifold of a class of Anosov diffeomorphisms on the torus $\mathbb{T}^2$ using the method of formal series and derivative trees. We start with linear automorphism that is hyperbolic and whose eigenvectors are orthogonal. Then we study the perturbation of such maps by trigonometric polynomial. It is known that there exist a (…
▽ More
In this paper we describe the tangent vectors of the stable and unstable manifold of a class of Anosov diffeomorphisms on the torus $\mathbb{T}^2$ using the method of formal series and derivative trees. We start with linear automorphism that is hyperbolic and whose eigenvectors are orthogonal. Then we study the perturbation of such maps by trigonometric polynomial. It is known that there exist a (continuous) map $H$ which acts as a change of coordinate between the perturbed and unperturbed system, but such a map is in general, not differentiable. By "re-scaling" the parametrization $H$, we will be able to obtain the explicit formula for the tangent vectors of these maps.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
Synergistic Learning with Multi-Task DeepONet for Efficient PDE Problem Solving
Authors:
Varun Kumar,
Somdatta Goswami,
Katiana Kontolati,
Michael D. Shields,
George Em Karniadakis
Abstract:
Multi-task learning (MTL) is an inductive transfer mechanism designed to leverage useful information from multiple tasks to improve generalization performance compared to single-task learning. It has been extensively explored in traditional machine learning to address issues such as data sparsity and overfitting in neural networks. In this work, we apply MTL to problems in science and engineering…
▽ More
Multi-task learning (MTL) is an inductive transfer mechanism designed to leverage useful information from multiple tasks to improve generalization performance compared to single-task learning. It has been extensively explored in traditional machine learning to address issues such as data sparsity and overfitting in neural networks. In this work, we apply MTL to problems in science and engineering governed by partial differential equations (PDEs). However, implementing MTL in this context is complex, as it requires task-specific modifications to accommodate various scenarios representing different physical processes. To this end, we present a multi-task deep operator network (MT-DeepONet) to learn solutions across various functional forms of source terms in a PDE and multiple geometries in a single concurrent training session. We introduce modifications in the branch network of the vanilla DeepONet to account for various functional forms of a parameterized coefficient in a PDE. Additionally, we handle parameterized geometries by introducing a binary mask in the branch network and incorporating it into the loss term to improve convergence and generalization to new geometry tasks. Our approach is demonstrated on three benchmark problems: (1) learning different functional forms of the source term in the Fisher equation; (2) learning multiple geometries in a 2D Darcy Flow problem and showcasing better transfer learning capabilities to new geometries; and (3) learning 3D parameterized geometries for a heat transfer problem and demonstrate the ability to predict on new but similar geometries. Our MT-DeepONet framework offers a novel approach to solving PDE problems in engineering and science under a unified umbrella based on synergistic learning that reduces the overall training cost for neural operators.
△ Less
Submitted 4 August, 2024;
originally announced August 2024.
-
No Size Fits All: The Perils and Pitfalls of Leveraging LLMs Vary with Company Size
Authors:
Ashok Urlana,
Charaka Vinayak Kumar,
Bala Mallikarjunarao Garlapati,
Ajeet Kumar Singh,
Rahul Mishra
Abstract:
Large language models (LLMs) are playing a pivotal role in deploying strategic use cases across a range of organizations, from large pan-continental companies to emerging startups. The issues and challenges involved in the successful utilization of LLMs can vary significantly depending on the size of the organization. It is important to study and discuss these pertinent issues of LLM adaptation wi…
▽ More
Large language models (LLMs) are playing a pivotal role in deploying strategic use cases across a range of organizations, from large pan-continental companies to emerging startups. The issues and challenges involved in the successful utilization of LLMs can vary significantly depending on the size of the organization. It is important to study and discuss these pertinent issues of LLM adaptation with a focus on the scale of the industrial concerns and brainstorm possible solutions and prospective directions. Such a study has not been prominently featured in the current research literature. In this study, we adopt a threefold strategy: first, we conduct a case study with industry practitioners to formulate the key research questions; second, we examine existing industrial publications to address these questions; and finally, we provide a practical guide for industries to utilize LLMs more efficiently.
△ Less
Submitted 21 July, 2024;
originally announced August 2024.