subscribe to arXiv mailings

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

Authors: Shreya Shankar, Aditya G. Parameswaran, Eugene Wu

Abstract: Analyzing unstructured data, such as complex documents, has been a persistent challenge in data processing. Large Language Models (LLMs) have shown promise in this regard, leading to recent proposals for declarative frameworks for LLM-powered unstructured data processing. However, these frameworks focus on reducing cost when executing user-specified operations using LLMs, rather than improving acc… ▽ More Analyzing unstructured data, such as complex documents, has been a persistent challenge in data processing. Large Language Models (LLMs) have shown promise in this regard, leading to recent proposals for declarative frameworks for LLM-powered unstructured data processing. However, these frameworks focus on reducing cost when executing user-specified operations using LLMs, rather than improving accuracy, executing most operations as-is. This is problematic for complex tasks and data, where LLM outputs for user-defined operations are often inaccurate, even with optimized prompts. We present DocETL, a system that optimizes complex document processing pipelines, while accounting for LLM shortcomings. DocETL offers a declarative interface for users to define such pipelines and uses an agent-based framework to automatically optimize them, leveraging novel agent-based rewrites (that we call {\em rewrite directives}) and an optimization and evaluation framework that we introduce. We introduce {\em (i)} logical rewriting of pipelines, tailored for LLM-based tasks, {\em (ii)} an agent-guided plan evaluation mechanism that synthesizes and orchestrates task-specific validation prompts, and {\em (iii)} an optimization algorithm that efficiently finds promising plans, considering the time constraints of LLM-based plan generation and evaluation. Our evaluation on three different unstructured document analysis tasks demonstrates that DocETL finds plans with outputs that are $1.34$ to $4.6\times$ higher quality (e.g., more accurate, comprehensive) than well-engineered baselines, addressing a critical gap in existing declarative frameworks for unstructured data analysis. DocETL is open-source at \ttt{docetl.org}, and as of October 2024, has amassed over 800 GitHub Stars, with users spanning a variety of domains. △ Less

Submitted 15 October, 2024; originally announced October 2024.

Comments: 21 pages, 7 figures, 3 tables

arXiv:2408.16784 [pdf, other]

Turbulent mixing controls fixation of growing antagonistic populations

Authors: Jonathan Bauermann, Roberto Benzi, David R. Nelson, Suraj Shankar, Federico Toschi

Abstract: Unlike coffee and cream that homogenize when stirred, growing micro-organisms (e.g., bacteria, baker's yeast) can actively kill each other and avoid mixing. How do such antagonistic interactions impact the growth and survival of competing strains, while being spatially advected by turbulent flows? By using numerical simulations of a continuum model, we study the dynamics of two antagonistic strain… ▽ More Unlike coffee and cream that homogenize when stirred, growing micro-organisms (e.g., bacteria, baker's yeast) can actively kill each other and avoid mixing. How do such antagonistic interactions impact the growth and survival of competing strains, while being spatially advected by turbulent flows? By using numerical simulations of a continuum model, we study the dynamics of two antagonistic strains that are dispersed by incompressible turbulent flows in two spatial dimensions. A key parameter is the ratio of the fluid transport time to that of biological reproduction, which determines the winning strain that ultimately takes over the whole population from an initial heterogeneous state. By quantifying the probability and mean time for fixation along with the spatial structure of concentration fluctuations, we demonstrate how turbulence raises the threshold for biological nucleation and antagonism suppresses flow-induced mixing by depleting the population at interfaces. Our work highlights the unusual biological consequences of the interplay of turbulent fluid flows with antagonistic population dynamics, with potential implications for marine microbial ecology and origins of biological chirality. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Comments: 7 pages, 4 figures

arXiv:2407.21783 [pdf, other]

The Llama 3 Herd of Models

Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang , et al. (510 additional authors not shown)

Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical… ▽ More Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development. △ Less

Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

arXiv:2407.20946 [pdf, other]

General Relativistic Magneto-Hydrodynamic Simulations with BAM: Implementation and Code Comparison

Authors: Anna Neuweiler, Tim Dietrich, Bernd Brügmann, Edoardo Giangrandi, Kenta Kiuchi, Federico Schianchi, Philipp Mösta, Swapnil Shankar, Bruno Giacomazzo, Masaru Shibata

Abstract: Binary neutron star mergers are among the most energetic events in our Universe, with magnetic fields significantly impacting their dynamics, particularly after the merger. While numerical-relativity simulations that correctly describe the physics are essential to model their rich phenomenology, the inclusion of magnetic fields is crucial for realistic simulations. For this reason, we have extende… ▽ More Binary neutron star mergers are among the most energetic events in our Universe, with magnetic fields significantly impacting their dynamics, particularly after the merger. While numerical-relativity simulations that correctly describe the physics are essential to model their rich phenomenology, the inclusion of magnetic fields is crucial for realistic simulations. For this reason, we have extended the BAM code to enable general relativistic magneto-hydrodynamic (GRMHD) simulations employing a hyperbolic `divergence cleaning' scheme. We present a large set of standard GRMHD tests and compare the BAM code to other GRMHD codes, SPRITZ, GRaM-X, and SACRA$_{\rm KK22}$, which employ different schemes for the evolution of the magnetic fields. Overall, we find that the BAM code shows a good performance in simple special-relativistic tests. In addition, we find good agreement and consistent results when comparing GRMHD simulation results between BAM and SACRA$_{\rm KK22}$. △ Less

Submitted 30 July, 2024; originally announced July 2024.

Comments: 21 pages, 21 figures

arXiv:2406.17910 [pdf]

Transforming Software Development: Evaluating the Efficiency and Challenges of GitHub Copilot in Real-World Projects

Authors: Ruchika Pandey, Prabhat Singh, Raymond Wei, Shaila Shankar

Abstract: Generative AI technologies promise to transform the product development lifecycle. This study evaluates the efficiency gains, areas for improvement, and emerging challenges of using GitHub Copilot, an AI-powered coding assistant. We identified 15 software development tasks and assessed Copilot's benefits through real-world projects on large proprietary code bases. Our findings indicate significant… ▽ More Generative AI technologies promise to transform the product development lifecycle. This study evaluates the efficiency gains, areas for improvement, and emerging challenges of using GitHub Copilot, an AI-powered coding assistant. We identified 15 software development tasks and assessed Copilot's benefits through real-world projects on large proprietary code bases. Our findings indicate significant reductions in developer toil, with up to 50% time saved in code documentation and autocompletion, and 30-40% in repetitive coding tasks, unit test generation, debugging, and pair programming. However, Copilot struggles with complex tasks, large functions, multiple files, and proprietary contexts, particularly with C/C++ code. We project a 33-36% time reduction for coding-related tasks in a cloud-first software development lifecycle. This study aims to quantify productivity improvements, identify underperforming scenarios, examine practical benefits and challenges, investigate performance variations across programming languages, and discuss emerging issues related to code quality, security, and developer experience. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 13 pages, 8 figures

arXiv:2406.05224 [pdf, other]

ON-OFF Neuromorphic ISING Machines using Fowler-Nordheim Annealers

Authors: Zihao Chen, Zhili Xiao, Mahmoud Akl, Johannes Leugring, Omowuyi Olajide, Adil Malik, Nik Dennler, Chad Harper, Subhankar Bose, Hector A. Gonzalez, Jason Eshraghian, Riccardo Pignari, Gianvito Urgese, Andreas G. Andreou, Sadasivan Shankar, Christian Mayr, Gert Cauwenberghs, Shantanu Chakrabartty

Abstract: We introduce NeuroSA, a neuromorphic architecture specifically designed to ensure asymptotic convergence to the ground state of an Ising problem using an annealing process that is governed by the physics of quantum mechanical tunneling using Fowler-Nordheim (FN). The core component of NeuroSA consists of a pair of asynchronous ON-OFF neurons, which effectively map classical simulated annealing (SA… ▽ More We introduce NeuroSA, a neuromorphic architecture specifically designed to ensure asymptotic convergence to the ground state of an Ising problem using an annealing process that is governed by the physics of quantum mechanical tunneling using Fowler-Nordheim (FN). The core component of NeuroSA consists of a pair of asynchronous ON-OFF neurons, which effectively map classical simulated annealing (SA) dynamics onto a network of integrate-and-fire (IF) neurons. The threshold of each ON-OFF neuron pair is adaptively adjusted by an FN annealer which replicates the optimal escape mechanism and convergence of SA, particularly at low temperatures. To validate the effectiveness of our neuromorphic Ising machine, we systematically solved various benchmark MAX-CUT combinatorial optimization problems. Across multiple runs, NeuroSA consistently generates solutions that approach the state-of-the-art level with high accuracy (greater than 99%), and without any graph-specific hyperparameter tuning. For practical illustration, we present results from an implementation of NeuroSA on the SpiNNaker2 platform, highlighting the feasibility of mapping our proposed architecture onto a standard neuromorphic accelerator platform. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: 36 pages, 8 figures

arXiv:2405.19297 [pdf, other]

Genuine Retrieval of the AGN Host Stellar Population (GRAHSP)

Authors: Johannes Buchner, Hattie Starck, Mara Salvato, Hagai Netzer, Zsofi Igo, Brivael Laloux, Antonis Georgakakis, Isabelle Gauger, Anna Olechowska, Nicolas Lopez, Suraj D Shankar, Junyao Li, Kirpal Nandra, Andrea Merloni

Abstract: The assembly and co-evolution of supermassive black holes (SMBH) and their host galaxy stellar population is a key open questions in galaxy evolution. Stellar mass ($M_\star$) and star formation rate (SFR), are inferred by modeling the spectral energy distribution (SED). For galaxies triggering SMBH activity, the active galactic nucleus (AGN) contaminates the light at all wavelengths, hampering th… ▽ More The assembly and co-evolution of supermassive black holes (SMBH) and their host galaxy stellar population is a key open questions in galaxy evolution. Stellar mass ($M_\star$) and star formation rate (SFR), are inferred by modeling the spectral energy distribution (SED). For galaxies triggering SMBH activity, the active galactic nucleus (AGN) contaminates the light at all wavelengths, hampering the inference of galaxy parameters. Incomplete AGN templates can lead to systematic overestimates of the stellar mass, biasing our understanding of AGN-galaxy co-evolution. This challenge has gained further impetus with the advent of sensitive wide-area surveys with millions of luminous AGN, including by eROSITA, Euclid and LSST. We aim to estimate the accuracy and bias of AGN host galaxy parameters and improve upon existing techniques. This work makes two contributions: 1) a new SED fitting code, GRAHSP, with a flexible, empirically motivated AGN model including a power law continuum emission lines, a FeII forest and a flexible infrared torus. We verify that our model reproduces published X-ray to infrared SEDs of AGN to better than 20\% accuracy. A fully Bayesian fit with nested sampling includes uncertainties in the model and the data, making the inference highly robust. 2) we created a benchmark photometric dataset where pure quasars are merged with non-AGN pure galaxies into a hybrid (Chimera) object but with known galaxy and AGN properties. Comparing the true and retrieved $M_\star$, SFR and AGN luminosities shows that previous codes systematically over-estimate $M_\star$ and SFR by 0.5 dex with a wide scatter of 0.7 dex, at AGN luminosities above 10^44 erg/s. In contrast, GRAHSP shows no bias on $M_\star$ and SFR. GRAHSP also estimates more realistic uncertainties. GRAHSP enables characterization of the environmental conditions conducive to black hole growth. (abridged) △ Less

Submitted 12 September, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

Comments: accepted in A&A

arXiv:2405.04674 [pdf, other]

Towards Accurate and Efficient Document Analytics with Large Language Models

Authors: Yiming Lin, Madelon Hulsebos, Ruiying Ma, Shreya Shankar, Sepanta Zeigham, Aditya G. Parameswaran, Eugene Wu

Abstract: Unstructured data formats account for over 80% of the data currently stored, and extracting value from such formats remains a considerable challenge. In particular, current approaches for managing unstructured documents do not support ad-hoc analytical queries on document collections. Moreover, Large Language Models (LLMs) directly applied to the documents themselves, or on portions of documents t… ▽ More Unstructured data formats account for over 80% of the data currently stored, and extracting value from such formats remains a considerable challenge. In particular, current approaches for managing unstructured documents do not support ad-hoc analytical queries on document collections. Moreover, Large Language Models (LLMs) directly applied to the documents themselves, or on portions of documents through a process of Retrieval-Augmented Generation (RAG), fail to provide high accuracy query results, and in the LLM-only case, additionally incur high costs. Since many unstructured documents in a collection often follow similar templates that impart a common semantic structure, we introduce ZenDB, a document analytics system that leverages this semantic structure, coupled with LLMs, to answer ad-hoc SQL queries on document collections. ZenDB efficiently extracts semantic hierarchical structures from such templatized documents, and introduces a novel query engine that leverages these structures for accurate and cost-effective query execution. Users can impose a schema on their documents, and query it, all via SQL. Extensive experiments on three real-world document collections demonstrate ZenDB's benefits, achieving up to 30% cost savings compared to LLM-based baselines, while maintaining or improving accuracy, and surpassing RAG-based baselines by up to 61% in precision and 80% in recall, at a marginally higher cost. △ Less

Submitted 7 May, 2024; originally announced May 2024.

arXiv:2404.12272 [pdf, other]

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Authors: Shreya Shankar, J. D. Zamfirescu-Pereira, Björn Hartmann, Aditya G. Parameswaran, Ian Arawjo

Abstract: Due to the cumbersome nature of human evaluation and limitations of code-based evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in evaluating LLM outputs. Yet LLM-generated evaluators simply inherit all the problems of the LLMs they evaluate, requiring further human validation. We present a mixed-initiative approach to ``validate the validators'' -- aligning LL… ▽ More Due to the cumbersome nature of human evaluation and limitations of code-based evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in evaluating LLM outputs. Yet LLM-generated evaluators simply inherit all the problems of the LLMs they evaluate, requiring further human validation. We present a mixed-initiative approach to ``validate the validators'' -- aligning LLM-generated evaluation functions (be it prompts or code) with human requirements. Our interface, EvalGen, provides automated assistance to users in generating evaluation criteria and implementing assertions. While generating candidate implementations (Python functions, LLM grader prompts), EvalGen asks humans to grade a subset of LLM outputs; this feedback is used to select implementations that better align with user grades. A qualitative study finds overall support for EvalGen but underscores the subjectivity and iterative process of alignment. In particular, we identify a phenomenon we dub \emph{criteria drift}: users need criteria to grade outputs, but grading outputs helps users define criteria. What is more, some criteria appears \emph{dependent} on the specific LLM outputs observed (rather than independent criteria that can be defined \emph{a priori}), raising serious questions for approaches that assume the independence of evaluation from observation of model outputs. We present our interface and implementation details, a comparison of our algorithm with a baseline approach, and implications for the design of future LLM evaluation assistants. △ Less

Submitted 18 April, 2024; originally announced April 2024.

Comments: 16 pages, 4 figures, 2 tables

arXiv:2404.10547 [pdf, other]

A/B testing under Interference with Partial Network Information

Authors: Shiv Shankar, Ritwik Sinha, Yash Chandak, Saayan Mitra, Madalina Fiterau

Abstract: A/B tests are often required to be conducted on subjects that might have social connections. For e.g., experiments on social media, or medical and social interventions to control the spread of an epidemic. In such settings, the SUTVA assumption for randomized-controlled trials is violated due to network interference, or spill-over effects, as treatments to group A can potentially also affect the c… ▽ More A/B tests are often required to be conducted on subjects that might have social connections. For e.g., experiments on social media, or medical and social interventions to control the spread of an epidemic. In such settings, the SUTVA assumption for randomized-controlled trials is violated due to network interference, or spill-over effects, as treatments to group A can potentially also affect the control group B. When the underlying social network is known exactly, prior works have demonstrated how to conduct A/B tests adequately to estimate the global average treatment effect (GATE). However, in practice, it is often impossible to obtain knowledge about the exact underlying network. In this paper, we present UNITE: a novel estimator that relax this assumption and can identify GATE while only relying on knowledge of the superset of neighbors for any subject in the graph. Through theoretical analysis and extensive experiments, we show that the proposed approach performs better in comparison to standard estimators. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: AISTATS 2024

arXiv:2403.16795 [pdf, other]

doi 10.1145/3653697

"We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning

Authors: Shreya Shankar, Rolando Garcia, Joseph M Hellerstein, Aditya G Parameswaran

Abstract: Organizations rely on machine learning engineers (MLEs) to deploy models and maintain ML pipelines in production. Due to models' extensive reliance on fresh data, the operationalization of machine learning, or MLOps, requires MLEs to have proficiency in data science and engineering. When considered holistically, the job seems staggering -- how do MLEs do MLOps, and what are their unaddressed chall… ▽ More Organizations rely on machine learning engineers (MLEs) to deploy models and maintain ML pipelines in production. Due to models' extensive reliance on fresh data, the operationalization of machine learning, or MLOps, requires MLEs to have proficiency in data science and engineering. When considered holistically, the job seems staggering -- how do MLEs do MLOps, and what are their unaddressed challenges? To address these questions, we conducted semi-structured ethnographic interviews with 18 MLEs working on various applications, including chatbots, autonomous vehicles, and finance. We find that MLEs engage in a workflow of (i) data preparation, (ii) experimentation, (iii) evaluation throughout a multi-staged deployment, and (iv) continual monitoring and response. Throughout this workflow, MLEs collaborate extensively with data scientists, product stakeholders, and one another, supplementing routine verbal exchanges with communication tools ranging from Slack to organization-wide ticketing and reporting systems. We introduce the 3Vs of MLOps: velocity, visibility, and versioning -- three virtues of successful ML deployments that MLEs learn to balance and grow as they mature. Finally, we discuss design implications and opportunities for future work. △ Less

Submitted 25 March, 2024; originally announced March 2024.

Comments: arXiv admin note: text overlap with arXiv:2209.09125

Journal ref: Proc. ACM Hum.-Comput. Interact. 8, CSCW1, Article 206 (April 2024)

arXiv:2402.15968 [pdf, other]

CoDream: Exchanging dreams instead of models for federated aggregation with heterogeneous models

Authors: Abhishek Singh, Gauri Gupta, Ritvik Kapila, Yichuan Shi, Alex Dang, Sheshank Shankar, Mohammed Ehab, Ramesh Raskar

Abstract: Federated Learning (FL) enables collaborative optimization of machine learning models across decentralized data by aggregating model parameters. Our approach extends this concept by aggregating "knowledge" derived from models, instead of model parameters. We present a novel framework called CoDream, where clients collaboratively optimize randomly initialized data using federated optimization in th… ▽ More Federated Learning (FL) enables collaborative optimization of machine learning models across decentralized data by aggregating model parameters. Our approach extends this concept by aggregating "knowledge" derived from models, instead of model parameters. We present a novel framework called CoDream, where clients collaboratively optimize randomly initialized data using federated optimization in the input data space, similar to how randomly initialized model parameters are optimized in FL. Our key insight is that jointly optimizing this data can effectively capture the properties of the global data distribution. Sharing knowledge in data space offers numerous benefits: (1) model-agnostic collaborative learning, i.e., different clients can have different model architectures; (2) communication that is independent of the model size, eliminating scalability concerns with model parameters; (3) compatibility with secure aggregation, thus preserving the privacy benefits of federated learning; (4) allowing of adaptive optimization of knowledge shared for personalized learning. We empirically validate CoDream on standard FL tasks, demonstrating competitive performance despite not sharing model parameters. Our code: https://mitmedialab.github.io/codream.github.io/ △ Less

Submitted 27 February, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

Comments: 16 pages, 12 figures, 5 tables

arXiv:2402.11085 [pdf, other]

doi 10.1063/5.0205053

Kerr nonlinearity and parametric amplification with an Al-InAs superconductor-semiconductor Josephson junction

Authors: Z. Hao, T. Shaw, M. Hatefipour, W. M. Strickland, B. H. Elfeky, D. Langone, J. Shabani, S. Shankar

Abstract: Nearly quantum limited Josephson parametric amplifiers (JPAs) are essential components in superconducting quantum circuits. However, higher order nonlinearities of the Josephson cosine potential are known to cause gain compression, therefore limiting scalability. In an effort to reduce the fourth order, or Kerr nonlinearity, we realize a parametric amplifier with an Al-InAs superconductor-semicond… ▽ More Nearly quantum limited Josephson parametric amplifiers (JPAs) are essential components in superconducting quantum circuits. However, higher order nonlinearities of the Josephson cosine potential are known to cause gain compression, therefore limiting scalability. In an effort to reduce the fourth order, or Kerr nonlinearity, we realize a parametric amplifier with an Al-InAs superconductor-semiconductor hybrid Josephson junction (JJ). We extract the Kerr nonlinearity of the Al-InAs JJ from two different devices and show that it is three orders of magnitude lower compared to an Al-$\text{AlO}_\text{X}$ junction with identical Josephson inductance. We then demonstrate a four-wave-mixing (4WM) parametric amplifier made with an Al-InAs junction that achieves more than 20 dB of gain and -119 dBm of compression power, that outperforms single resonant JPAs based on Al junctions. △ Less

Submitted 16 August, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

Comments: 7 pages, 5 figures, v3 - final submission version, added data repository DOI

Journal ref: Appl. Phys. Lett. 124, 254003 (2024)

arXiv:2401.03038 [pdf, other]

SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines

Authors: Shreya Shankar, Haotian Li, Parth Asawa, Madelon Hulsebos, Yiming Lin, J. D. Zamfirescu-Pereira, Harrison Chase, Will Fu-Hinthorn, Aditya G. Parameswaran, Eugene Wu

Abstract: Large language models (LLMs) are being increasingly deployed as part of pipelines that repeatedly process or generate data of some sort. However, a common barrier to deployment are the frequent and often unpredictable errors that plague LLMs. Acknowledging the inevitability of these errors, we propose {\em data quality assertions} to identify when LLMs may be making mistakes. We present SPADE, a m… ▽ More Large language models (LLMs) are being increasingly deployed as part of pipelines that repeatedly process or generate data of some sort. However, a common barrier to deployment are the frequent and often unpredictable errors that plague LLMs. Acknowledging the inevitability of these errors, we propose {\em data quality assertions} to identify when LLMs may be making mistakes. We present SPADE, a method for automatically synthesizing data quality assertions that identify bad LLM outputs. We make the observation that developers often identify data quality issues during prototyping prior to deployment, and attempt to address them by adding instructions to the LLM prompt over time. SPADE therefore analyzes histories of prompt versions over time to create candidate assertion functions and then selects a minimal set that fulfills both coverage and accuracy requirements. In testing across nine different real-world LLM pipelines, SPADE efficiently reduces the number of assertions by 14\% and decreases false failures by 21\% when compared to simpler baselines. SPADE has been deployed as an offering within LangSmith, LangChain's LLM pipeline hub, and has been used to generate data quality assertions for over 2000 pipelines across a spectrum of industries. △ Less

Submitted 31 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

Comments: 17 pages, 6 figures

arXiv:2312.02438 [pdf, other]

Adaptive Instrument Design for Indirect Experiments

Authors: Yash Chandak, Shiv Shankar, Vasilis Syrgkanis, Emma Brunskill

Abstract: Indirect experiments provide a valuable framework for estimating treatment effects in situations where conducting randomized control trials (RCTs) is impractical or unethical. Unlike RCTs, indirect experiments estimate treatment effects by leveraging (conditional) instrumental variables, enabling estimation through encouragement and recommendation rather than strict treatment assignment. However,… ▽ More Indirect experiments provide a valuable framework for estimating treatment effects in situations where conducting randomized control trials (RCTs) is impractical or unethical. Unlike RCTs, indirect experiments estimate treatment effects by leveraging (conditional) instrumental variables, enabling estimation through encouragement and recommendation rather than strict treatment assignment. However, the sample efficiency of such estimators depends not only on the inherent variability in outcomes but also on the varying compliance levels of users with the instrumental variables and the choice of estimator being used, especially when dealing with numerous instrumental variables. While adaptive experiment design has a rich literature for direct experiments, in this paper we take the initial steps towards enhancing sample efficiency for indirect experiments by adaptively designing a data collection policy over instrumental variables. Our main contribution is a practical computational procedure that utilizes influence functions to search for an optimal data collection policy, minimizing the mean-squared error of the desired (non-linear) estimator. Through experiments conducted in various domains inspired by real-world applications, we showcase how our method can significantly improve the sample efficiency of indirect experiments. △ Less

Submitted 4 December, 2023; originally announced December 2023.

arXiv:2311.14641 [pdf, other]

doi 10.1038/s41467-024-52259-9

Neuromorphic Intermediate Representation: A Unified Instruction Set for Interoperable Brain-Inspired Computing

Authors: Jens E. Pedersen, Steven Abreu, Matthias Jobst, Gregor Lenz, Vittorio Fra, Felix C. Bauer, Dylan R. Muir, Peng Zhou, Bernhard Vogginger, Kade Heckel, Gianvito Urgese, Sadasivan Shankar, Terrence C. Stewart, Sadique Sheik, Jason K. Eshraghian

Abstract: Spiking neural networks and neuromorphic hardware platforms that simulate neuronal dynamics are getting wide attention and are being applied to many relevant problems using Machine Learning. Despite a well-established mathematical foundation for neural dynamics, there exists numerous software and hardware solutions and stacks whose variability makes it difficult to reproduce findings. Here, we est… ▽ More Spiking neural networks and neuromorphic hardware platforms that simulate neuronal dynamics are getting wide attention and are being applied to many relevant problems using Machine Learning. Despite a well-established mathematical foundation for neural dynamics, there exists numerous software and hardware solutions and stacks whose variability makes it difficult to reproduce findings. Here, we establish a common reference frame for computations in digital neuromorphic systems, titled Neuromorphic Intermediate Representation (NIR). NIR defines a set of computational and composable model primitives as hybrid systems combining continuous-time dynamics and discrete events. By abstracting away assumptions around discretization and hardware constraints, NIR faithfully captures the computational model, while bridging differences between the evaluated implementation and the underlying mathematical formalism. NIR supports an unprecedented number of neuromorphic systems, which we demonstrate by reproducing three spiking neural network models of different complexity across 7 neuromorphic simulators and 4 digital hardware platforms. NIR decouples the development of neuromorphic hardware and software, enabling interoperability between platforms and improving accessibility to multiple neuromorphic technologies. We believe that NIR is a key next step in brain-inspired hardware-software co-evolution, enabling research towards the implementation of energy efficient computational principles of nervous systems. NIR is available at neuroir.org △ Less

Submitted 30 September, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

Comments: NIR is available at https://neuroir.org

Journal ref: Nat Commun 15, 8122 (2024)

arXiv:2310.07516 [pdf]

Energy Estimates Across Layers of Computing: From Devices to Large-Scale Applications in Machine Learning for Natural Language Processing, Scientific Computing, and Cryptocurrency Mining

Authors: Sadasivan Shankar

Abstract: Estimates of energy usage in layers of computing from devices to algorithms have been determined and analyzed. Building on the previous analysis [3], energy needed from single devices and systems including three large-scale computing applications such as Artificial Intelligence (AI)/Machine Learning for Natural Language Processing, Scientific Simulations, and Cryptocurrency Mining have been estima… ▽ More Estimates of energy usage in layers of computing from devices to algorithms have been determined and analyzed. Building on the previous analysis [3], energy needed from single devices and systems including three large-scale computing applications such as Artificial Intelligence (AI)/Machine Learning for Natural Language Processing, Scientific Simulations, and Cryptocurrency Mining have been estimated. In contrast to the bit-level switching, in which transistors achieved energy efficiency due to geometrical scaling, higher energy is expended both at the at the instructions and simulations levels of an application. Additionally, the analysis based on AI/ML Accelerators indicate that changes in architectures using an older semiconductor technology node have comparable energy efficiency with a different architecture using a newer technology. Further comparisons of the energy in computing systems with the thermodynamic and biological limits, indicate that there is a 27-36 orders of magnitude higher energy requirements for total simulation of an application. These energy estimates underscore the need for serious considerations of energy efficiency in computing by including energy as a design parameter, enabling growing needs of compute-intensive applications in a digital world. △ Less

Submitted 11 October, 2023; originally announced October 2023.

Comments: 6 pages, 5 figures

ACM Class: C.3; C.4; I.2; J.2

arXiv:2310.07000 [pdf]

CarDS-Plus ECG Platform: Development and Feasibility Evaluation of a Multiplatform Artificial Intelligence Toolkit for Portable and Wearable Device Electrocardiograms

Authors: Sumukh Vasisht Shankar, Evangelos K Oikonomou, Rohan Khera

Abstract: In the rapidly evolving landscape of modern healthcare, the integration of wearable & portable technology provides a unique opportunity for personalized health monitoring in the community. Devices like the Apple Watch, FitBit, and AliveCor KardiaMobile have revolutionized the acquisition and processing of intricate health data streams. Amidst the variety of data collected by these gadgets, single-… ▽ More In the rapidly evolving landscape of modern healthcare, the integration of wearable & portable technology provides a unique opportunity for personalized health monitoring in the community. Devices like the Apple Watch, FitBit, and AliveCor KardiaMobile have revolutionized the acquisition and processing of intricate health data streams. Amidst the variety of data collected by these gadgets, single-lead electrocardiogram (ECG) recordings have emerged as a crucial source of information for monitoring cardiovascular health. There has been significant advances in artificial intelligence capable of interpreting these 1-lead ECGs, facilitating clinical diagnosis as well as the detection of rare cardiac disorders. This design study describes the development of an innovative multiplatform system aimed at the rapid deployment of AI-based ECG solutions for clinical investigation & care delivery. The study examines design considerations, aligning them with specific applications, develops data flows to maximize efficiency for research & clinical use. This process encompasses the reception of single-lead ECGs from diverse wearable devices, channeling this data into a centralized data lake & facilitating real-time inference through AI models for ECG interpretation. An evaluation of the platform demonstrates a mean duration from acquisition to reporting of results of 33.0 to 35.7 seconds, after a standard 30 second acquisition. There were no substantial differences in acquisition to reporting across two commercially available devices (Apple Watch and KardiaMobile). These results demonstrate the succcessful translation of design principles into a fully integrated & efficient strategy for leveraging 1-lead ECGs across platforms & interpretation by AI-ECG algorithms. Such a platform is critical to translating AI discoveries for wearable and portable ECG devices to clinical impact through rapid deployment. △ Less

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.00903 [pdf, ps, other]

Symmetric Solutions to Symmetric Partial Difference Equations

Authors: Shiva Shankar

Abstract: This paper studies systems of linear difference equations on the lattice $\Z^n$ that are invariant under a finite group of symmetries, and shows that there exist solutions to such systems that are also invariant under this group of symmetries. This paper studies systems of linear difference equations on the lattice $\Z^n$ that are invariant under a finite group of symmetries, and shows that there exist solutions to such systems that are also invariant under this group of symmetries. △ Less

Submitted 2 October, 2023; originally announced October 2023.

MSC Class: 39A14; 93A30

arXiv:2309.11943 [pdf]

Multi-contrast x-ray identification of inhomogeneous materials and their discrimination through deep learning approaches

Authors: Thomas Partridge, Sukrit S. Shankar, Ian Buchanan, Peter Modregger, Alberto Astolfo, David Bate, Alessandro Olivo

Abstract: Recent innovations in x-ray technology (namely phase-based and energy-resolved imaging) offer unprecedented opportunities for material discrimination, however they are often used in isolation or in limited combinations. Here we show that the optimized combination of contrast channels (attenuation at three x-ray energies, ultra-small angle scattering at two, standard deviation of refraction) signif… ▽ More Recent innovations in x-ray technology (namely phase-based and energy-resolved imaging) offer unprecedented opportunities for material discrimination, however they are often used in isolation or in limited combinations. Here we show that the optimized combination of contrast channels (attenuation at three x-ray energies, ultra-small angle scattering at two, standard deviation of refraction) significantly enhances material identification abilities compared to dual-energy x-ray imaging alone, and that a combination of off-the-shelf machine learning approaches can effectively discriminate e.g., threat materials in complex datasets. The methodology is validated on a range of materials and image dataset that are both an order of magnitude larger than those used in previous studies. Our results can provide an effective methodology to discriminate, and in some cases identify, different materials in complex imaging scenarios, with prospective applications across the life and physical sciences. While the detection of threat materials is used as a demonstrator here, the methodology could be equally applied to e.g., the distinction between diseased and healthy tissues or degraded vs. pristine materials. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: 23 pages - 13 main text, 10 supplementary 11 figures - 5 main text, 6 supplementary

arXiv:2308.16229 [pdf, other]

doi 10.1103/PhysRevA.109.022606

Sequential quantum simulation of spin chains with a single circuit QED device

Authors: Yuxuan Zhang, Shahin Jahanbani, Ameya Riswadkar, S. Shankar, Andrew C. Potter

Abstract: Quantum simulation of many-body systems in materials science and chemistry are promising application areas for quantum computers. However, the limited scale and coherence of near-term quantum processors pose a significant obstacle to realizing this potential. Here, we theoretically outline how a single-circuit quantum electrodynamics (cQED) device, consisting of a transmon qubit coupled to a long-… ▽ More Quantum simulation of many-body systems in materials science and chemistry are promising application areas for quantum computers. However, the limited scale and coherence of near-term quantum processors pose a significant obstacle to realizing this potential. Here, we theoretically outline how a single-circuit quantum electrodynamics (cQED) device, consisting of a transmon qubit coupled to a long-lived cavity mode, can be used to simulate the ground state of a highly-entangled quantum many-body spin chain. We exploit recently developed methods for implementing quantum operations to sequentially build up a matrix product state (MPS) representation of a many-body state. This approach re-uses the transmon qubit to read out the state of each spin in the chain and exploits the large state space of the cavity as a quantum memory encoding inter-site correlations and entanglement. We show, through simulation, that analog (pulse-level) control schemes can accurately prepare a known MPS representation of a quantum critical spin chain in significantly less time than digital (gate-based) methods, thereby reducing the exposure to decoherence. We then explore this analog-control approach for the variational preparation of an unknown ground state. We demonstrate that the large state space of the cavity can be used to replace multiple qubits in a qubit-only architecture, and could therefore simplify the design of quantum processors for materials simulation. We explore the practical limitations of realistic noise and decoherence and discuss avenues for scaling this approach to more complex problems that challenge classical computational methods. △ Less

Submitted 30 August, 2023; originally announced August 2023.

Comments: 9 pages, 4 figures

Journal ref: Phys. Rev. A 109, 022606 (2024)

arXiv:2308.03854 [pdf, ps, other]

Revisiting Prompt Engineering via Declarative Crowdsourcing

Authors: Aditya G. Parameswaran, Shreya Shankar, Parth Asawa, Naman Jain, Yujie Wang

Abstract: Large language models (LLMs) are incredibly powerful at comprehending and generating data in the form of text, but are brittle and error-prone. There has been an advent of toolkits and recipes centered around so-called prompt engineering-the process of asking an LLM to do something via a series of prompts. However, for LLM-powered data processing workflows, in particular, optimizing for quality, w… ▽ More Large language models (LLMs) are incredibly powerful at comprehending and generating data in the form of text, but are brittle and error-prone. There has been an advent of toolkits and recipes centered around so-called prompt engineering-the process of asking an LLM to do something via a series of prompts. However, for LLM-powered data processing workflows, in particular, optimizing for quality, while keeping cost bounded, is a tedious, manual process. We put forth a vision for declarative prompt engineering. We view LLMs like crowd workers and leverage ideas from the declarative crowdsourcing literature-including leveraging multiple prompting strategies, ensuring internal consistency, and exploring hybrid-LLM-non-LLM approaches-to make prompt engineering a more principled process. Preliminary case studies on sorting, entity resolution, and imputation demonstrate the promise of our approach △ Less

Submitted 7 August, 2023; originally announced August 2023.

arXiv:2305.04184 [pdf, other]

doi 10.1103/PhysRevApplied.21.014021

Fully Directional Quantum-limited Phase-Preserving Amplifier

Authors: Gangqiang Liu, Andrew Lingenfelter, Vidul R. Joshi, Nicholas E. Frattini, Volodymyr V. Sivak, Shyam Shankar, Michel H. Devoret

Abstract: We present a way to achieve fully directional, quantum-limited phase-preserving amplification in a four-port, four-mode superconducting Josephson circuit by utilizing interference between six parametric processes that couple all four modes. Full directionality, defined as the reverse isolation surpassing forward gain between the matched input and output ports of the amplifier, ensures its robustne… ▽ More We present a way to achieve fully directional, quantum-limited phase-preserving amplification in a four-port, four-mode superconducting Josephson circuit by utilizing interference between six parametric processes that couple all four modes. Full directionality, defined as the reverse isolation surpassing forward gain between the matched input and output ports of the amplifier, ensures its robustness against impedance mismatch that might be present at its output port during applications. Unlike existing directional phase-preserving amplifiers, both the minimal back-action and the quantum-limited added noise of this amplifier remains unaffected by noise incident on its output port. In addition, the matched input and output ports allow direct on-chip integration of these amplifiers with other circuit QED components, facilitating scaling up of superconducting quantum processors. △ Less

Submitted 13 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

Journal ref: Phys. Rev. Applied 21, 014021 (2024)

arXiv:2303.06094 [pdf, other]

Moving Fast With Broken Data

Authors: Shreya Shankar, Labib Fawaz, Karl Gyllstrom, Aditya G. Parameswaran

Abstract: Machine learning (ML) models in production pipelines are frequently retrained on the latest partitions of large, continually-growing datasets. Due to engineering bugs, partitions in such datasets almost always have some corrupted features; thus, it's critical to detect data issues and block retraining before downstream ML model accuracy decreases. However, it's difficult to identify when a partiti… ▽ More Machine learning (ML) models in production pipelines are frequently retrained on the latest partitions of large, continually-growing datasets. Due to engineering bugs, partitions in such datasets almost always have some corrupted features; thus, it's critical to detect data issues and block retraining before downstream ML model accuracy decreases. However, it's difficult to identify when a partition is corrupted enough to block retraining. Blocking too often yields stale model snapshots in production; blocking too little yields broken model snapshots in production. In this paper, we present an automatic data validation system for ML pipelines implemented at Meta. We employ what we call a Partition Summarization (PS) approach to data validation: each timestamp-based partition of data is summarized with data quality metrics, and summaries are compared to detect corrupted partitions. We describe how we can adapt PS for several data validation methods and compare their pros and cons. Since none of the methods by themselves met our requirements for high precision and recall in detecting corruptions, we devised GATE, our high-precision and recall data validation method. GATE gave a 2.1x average improvement in precision over the baseline on a case study with Instagram's data. Finally, we discuss lessons learned from implementing data validation for Meta's production ML pipelines. △ Less

Submitted 10 March, 2023; originally announced March 2023.

Comments: 14 pages, 4 figures

arXiv:2303.02492 [pdf, other]

doi 10.1103/PhysRevB.107.245102

Kondo effect in twisted bilayer graphene

Authors: A. S. Shankar, D. O. Oriekhov, Andrew K. Mitchell, L. Fritz

Abstract: The emergence of flat bands in twisted bilayer graphene at the magic angle can be understood in terms of a vanishing Fermi velocity of the Dirac cone. This is associated with van Hove singularities approaching the Fermi energy and becoming higher-order. In the density of states this is reflected by flanking logarithmic van Hove divergences pinching off the central Dirac cone in energy space. The l… ▽ More The emergence of flat bands in twisted bilayer graphene at the magic angle can be understood in terms of a vanishing Fermi velocity of the Dirac cone. This is associated with van Hove singularities approaching the Fermi energy and becoming higher-order. In the density of states this is reflected by flanking logarithmic van Hove divergences pinching off the central Dirac cone in energy space. The low-energy pseudogap of the Dirac cone away from the magic angle is replaced by a power-law divergence due to the higher-order van Hove singularity at the magic angle. This plays an important role in the exotic phenomena observed in this material, such as superconductivity and magnetism, by amplifying electronic correlation effects. Here we investigate one such correlation effect -- the Kondo effect due to a magnetic impurity embedded in twisted bilayer graphene. We use the Bistritzer-MacDonald model to extract the low-energy density of states of the material as a function of twist angle, and study the resulting quantum impurity physics using perturbative and numerical renormalization group methods. Although at zero temperature the impurity is only Kondo screened precisely at the magic angle, we find highly nontrivial behavior at finite temperatures relevant to experiment, due to the complex interplay between Dirac, van Hove, and Kondo physics. △ Less

Submitted 2 June, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

Comments: 16 pages, 8 figures

Journal ref: Phys. Rev. B 107,245102 (2023)

arXiv:2303.02374 [pdf, other]

Social Media COVID-19 Contact Tracing Using Mobile Social Payments and Facebook Data

Authors: Shrivu Shankar, Dhiraj Murthy, Hassan Dashtian

Abstract: Many in the US were reluctant to report their COVID-19 cases at the height of the pandemic (e.g., for fear of missing work or other obligations due to quarantine mandates). Other methods such as using public social media data can therefore help augment current approaches to surveilling pandemics. This study evaluated the effectiveness of using social media data as a data source for tracking public… ▽ More Many in the US were reluctant to report their COVID-19 cases at the height of the pandemic (e.g., for fear of missing work or other obligations due to quarantine mandates). Other methods such as using public social media data can therefore help augment current approaches to surveilling pandemics. This study evaluated the effectiveness of using social media data as a data source for tracking public health pandemics. There have been several attempts at using social media data from platforms like Twitter for analyzing the COVID-19 pandemic. While these provide a multitude of useful insights, new platforms like Venmo, a popular U.S. mobile social payment app often used during in-person activities, remain understudied. We developed unique computational methods (combining Venmo- and Facebook- derived data) to classify post content, including the location where the content was likely posted. This approach enabled geotemporal COVID-19-related infoveillance. By examining 135M publicly available Venmo transactions from 22.1M unique users, we found significant spikes in the use of COVID-19 related keywords in March 2020. Using Facebook-based geotags for 9K users along with transaction geo-parsing (i.e., parsing text to detect place names), we identified 38K location-based clusters. Within these groups, we found a strong correlation (0.81) between the use of COVID-19 keywords in a region and the number of reported COVID-19 cases as well as an aggregate decrease in transactions during lockdowns and an increase when lockdowns are lifted. Surprisingly, we saw a weak negative correlation between the number of transactions and reported cases over time (-0.49). Our results indicate that using non-Twitter social media trace data can aid pandemic- and other health-related infoveillance. △ Less

Submitted 4 March, 2023; originally announced March 2023.

arXiv:2302.08876 [pdf, other]

doi 10.1103/PhysRevD.108.094039

Lyapunov exponents in a Sachdev-Ye-Kitaev-type model with population imbalance in the conformal limit and beyond

Authors: A. S. Shankar, M. Fremling, S. Plugge, L. Fritz

Abstract: The Sachdev-Ye-Kitaev (SYK) model shows chaotic behavior with a maximal Lyapunov exponent. In this paper, we investigate the four-point function of a SYK-type model numerically, which gives us access to its Lyapunov exponent. The model consists of two sets of Majorana fermions, called A and B, and the interactions are restricted to being exclusively pairwise between the two sets, not within the se… ▽ More The Sachdev-Ye-Kitaev (SYK) model shows chaotic behavior with a maximal Lyapunov exponent. In this paper, we investigate the four-point function of a SYK-type model numerically, which gives us access to its Lyapunov exponent. The model consists of two sets of Majorana fermions, called A and B, and the interactions are restricted to being exclusively pairwise between the two sets, not within the sets. We find that the Lyapunov exponent is still maximal at strong coupling. Furthermore, we show that even though the conformal dimensions of the A and B fermions change with the population ratio, the Lyapunov exponent remains constant, not just in the conformal limit where it is maximal, but also in the intermediate and weak coupling regimes. △ Less

Submitted 13 October, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

Comments: 12 pages, 8 figures. Comments welcome

Journal ref: Phys. Rev. D 108, 094039, (2023)

arXiv:2302.03161 [pdf, other]

Optimization using Parallel Gradient Evaluations on Multiple Parameters

Authors: Yash Chandak, Shiv Shankar, Venkata Gandikota, Philip S. Thomas, Arya Mazumdar

Abstract: We propose a first-order method for convex optimization, where instead of being restricted to the gradient from a single parameter, gradients from multiple parameters can be used during each step of gradient descent. This setup is particularly useful when a few processors are available that can be used in parallel for optimization. Our method uses gradients from multiple parameters in synergy to u… ▽ More We propose a first-order method for convex optimization, where instead of being restricted to the gradient from a single parameter, gradients from multiple parameters can be used during each step of gradient descent. This setup is particularly useful when a few processors are available that can be used in parallel for optimization. Our method uses gradients from multiple parameters in synergy to update these parameters together towards the optima. While doing so, it is ensured that the computational and memory complexity is of the same order as that of gradient descent. Empirical results demonstrate that even using gradients from as low as \textit{two} parameters, our method can often obtain significant acceleration and provide robustness to hyper-parameter settings. We remark that the primary goal of this work is less theoretical, and is instead aimed at exploring the understudied case of using multiple gradients during each step of optimization. △ Less

Submitted 6 February, 2023; originally announced February 2023.

Comments: Accepted at OPT workshop @ Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

arXiv:2301.10330 [pdf, other]

Off-Policy Evaluation for Action-Dependent Non-Stationary Environments

Authors: Yash Chandak, Shiv Shankar, Nathaniel D. Bastian, Bruno Castro da Silva, Emma Brunskil, Philip S. Thomas

Abstract: Methods for sequential decision-making are often built upon a foundational assumption that the underlying decision process is stationary. This limits the application of such methods because real-world problems are often subject to changes due to external factors (passive non-stationarity), changes induced by interactions with the system itself (active non-stationarity), or both (hybrid non-station… ▽ More Methods for sequential decision-making are often built upon a foundational assumption that the underlying decision process is stationary. This limits the application of such methods because real-world problems are often subject to changes due to external factors (passive non-stationarity), changes induced by interactions with the system itself (active non-stationarity), or both (hybrid non-stationarity). In this work, we take the first steps towards the fundamental challenge of on-policy and off-policy evaluation amidst structured changes due to active, passive, or hybrid non-stationarity. Towards this goal, we make a higher-order stationarity assumption such that non-stationarity results in changes over time, but the way changes happen is fixed. We propose, OPEN, an algorithm that uses a double application of counterfactual reasoning and a novel importance-weighted instrument-variable regression to obtain both a lower bias and a lower variance estimate of the structure in the changes of a policy's past performances. Finally, we show promising results on how OPEN can be used to predict future performances for several domains inspired by real-world applications that exhibit non-stationarity. △ Less

Submitted 24 January, 2023; originally announced January 2023.

Comments: Accepted at Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

arXiv:2212.00666 [pdf, other]

doi 10.1073/pnas.2400933121

Design rules for controlling active topological defects

Authors: Suraj Shankar, Luca V. D. Scharrer, Mark J. Bowick, M. Cristina Marchetti

Abstract: Topological defects play a central role in the physics of many materials, including magnets, superconductors and liquid crystals. In active fluids, defects become autonomous particles that spontaneously propel from internal active stresses and drive chaotic flows stirring the fluid. The intimate connection between defect textures and active flow suggests that properties of active materials can be… ▽ More Topological defects play a central role in the physics of many materials, including magnets, superconductors and liquid crystals. In active fluids, defects become autonomous particles that spontaneously propel from internal active stresses and drive chaotic flows stirring the fluid. The intimate connection between defect textures and active flow suggests that properties of active materials can be engineered by controlling defects, but design principles for their spatiotemporal control remain elusive. Here we propose a symmetry-based additive strategy for using elementary activity patterns, as active topological tweezers, to create, move and braid such defects. By combining theory and simulations, we demonstrate how, at the collective level, spatial activity gradients act like electric fields which, when strong enough, induce an inverted topological polarization of defects, akin to a negative susceptibility dielectric. We harness this feature in a dynamic setting to collectively pattern and transport interacting active defects. Our work establishes an additive framework to sculpt flows and manipulate active defects in both space and time, paving the way to design programmable active and living materials for transport, memory and logic. △ Less

Submitted 2 May, 2024; v1 submitted 1 December, 2022; originally announced December 2022.

Comments: 11 pages (including Methods), 5 figures. Changed title and format, final version

Report number: 2212.00666

Journal ref: Proc. Nat. Acad. Sci. 121 (21) e2400933121 (2024)

arXiv:2211.11649 [pdf, other]

Implicit Training of Energy Model for Structure Prediction

Authors: Shiv Shankar, Vihari Piratla

Abstract: Most deep learning research has focused on developing new model and training procedures. On the other hand the training objective has usually been restricted to combinations of standard losses. When the objective aligns well with the evaluation metric, this is not a major issue. However when dealing with complex structured outputs, the ideal objective can be hard to optimize and the efficacy of us… ▽ More Most deep learning research has focused on developing new model and training procedures. On the other hand the training objective has usually been restricted to combinations of standard losses. When the objective aligns well with the evaluation metric, this is not a major issue. However when dealing with complex structured outputs, the ideal objective can be hard to optimize and the efficacy of usual objectives as a proxy for the true objective can be questionable. In this work, we argue that the existing inference network based structure prediction methods ( Tu and Gimpel 2018; Tu, Pang, and Gimpel 2020) are indirectly learning to optimize a dynamic loss objective parameterized by the energy model. We then explore using implicit-gradient based technique to learn the corresponding dynamic objectives. Our experiments show that implicitly learning a dynamic loss landscape is an effective method for improving model performance in structure prediction. △ Less

Submitted 21 November, 2022; originally announced November 2022.

Comments: AAAI

arXiv:2211.03758 [pdf, other]

Privacy Aware Experiments without Cookies

Authors: Shiv Shankar, Ritwik Sinha, Saayan Mitra, Viswanathan Swaminathan, Sridhar Mahadevan, Moumita Sinha

Abstract: Consider two brands that want to jointly test alternate web experiences for their customers with an A/B test. Such collaborative tests are today enabled using \textit{third-party cookies}, where each brand has information on the identity of visitors to another website. With the imminent elimination of third-party cookies, such A/B tests will become untenable. We propose a two-stage experimental de… ▽ More Consider two brands that want to jointly test alternate web experiences for their customers with an A/B test. Such collaborative tests are today enabled using \textit{third-party cookies}, where each brand has information on the identity of visitors to another website. With the imminent elimination of third-party cookies, such A/B tests will become untenable. We propose a two-stage experimental design, where the two brands only need to agree on high-level aggregate parameters of the experiment to test the alternate experiences. Our design respects the privacy of customers. We propose an estimater of the Average Treatment Effect (ATE), show that it is unbiased and theoretically compute its variance. Our demonstration describes how a marketer for a brand can design such an experiment and analyze the results. On real and simulated data, we show that the approach provides valid estimate of the ATE with low variance and is robust to the proportion of visitors overlapping across the brands. △ Less

Submitted 6 February, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

Comments: Technical report supplementing paper accepted to WSDM 23

arXiv:2210.17509 [pdf, other]

GRaM-X: A new GPU-accelerated dynamical spacetime GRMHD code for Exascale computing with the Einstein Toolkit

Authors: Swapnil Shankar, Philipp Mösta, Steven R. Brandt, Roland Haas, Erik Schnetter, Yannick de Graaf

Abstract: We present GRaM-X (General Relativistic accelerated Magnetohydrodynamics on AMReX), a new GPU-accelerated dynamical-spacetime general relativistic magnetohydrodynamics (GRMHD) code which extends the GRMHD capability of Einstein Toolkit to GPU-based exascale systems. GRaM-X supports 3D adaptive mesh refinement (AMR) on GPUs via a new AMR driver for the Einstein Toolkit called CarpetX which in turn… ▽ More We present GRaM-X (General Relativistic accelerated Magnetohydrodynamics on AMReX), a new GPU-accelerated dynamical-spacetime general relativistic magnetohydrodynamics (GRMHD) code which extends the GRMHD capability of Einstein Toolkit to GPU-based exascale systems. GRaM-X supports 3D adaptive mesh refinement (AMR) on GPUs via a new AMR driver for the Einstein Toolkit called CarpetX which in turn leverages AMReX, an AMR library developed for use by the United States DOE's Exascale Computing Project (ECP). We use the Z4c formalism to evolve the equations of GR and the Valencia formulation to evolve the equations of GRMHD. GRaM-X supports both analytic as well as tabulated equations of state. We implement TVD and WENO reconstruction methods as well as the HLLE Riemann solver. We test the accuracy of the code using a range of tests on static spacetime, e.g. 1D MHD shocktubes, the 2D magnetic rotor and a cylindrical explosion, as well as on dynamical spacetimes, i.e. the oscillations of a 3D TOV star. We find excellent agreement with analytic results and results of other codes reported in literature. We also perform scaling tests and find that GRaM-X shows a weak scaling efficiency of $\sim 40-50\%$ on 2304 nodes (13824 NVIDIA V100 GPUs) with respect to single-node performance on OLCF's supercomputer Summit. △ Less

Submitted 21 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

Comments: 22 pages, 8 figures, to be submitted to Classical and Quantum Gravity

arXiv:2210.17331 [pdf]

doi 10.1109/HPEC55821.2022.9926296

Trends in Energy Estimates for Computing in AI/Machine Learning Accelerators, Supercomputers, and Compute-Intensive Applications

Authors: Sadasivan Shankar, Albert Reuther

Abstract: We examine the computational energy requirements of different systems driven by the geometrical scaling law, and increasing use of Artificial Intelligence or Machine Learning (AI-ML) over the last decade. With more scientific and technology applications based on data-driven discovery, machine learning methods, especially deep neural networks, have become widely used. In order to enable such applic… ▽ More We examine the computational energy requirements of different systems driven by the geometrical scaling law, and increasing use of Artificial Intelligence or Machine Learning (AI-ML) over the last decade. With more scientific and technology applications based on data-driven discovery, machine learning methods, especially deep neural networks, have become widely used. In order to enable such applications, both hardware accelerators and advanced AI-ML methods have led to the introduction of new architectures, system designs, algorithms, and software. Our analysis of energy trends indicates three important observations: 1) Energy efficiency due to geometrical scaling is slowing down; 2) The energy efficiency at the bit-level does not translate into efficiency at the instruction-level, or at the system-level for a variety of systems, especially for large-scale AI-ML accelerators or supercomputers; 3) At the application level, general-purpose AI-ML methods can be computationally energy intensive, off-setting the gains in energy from geometrical scaling and special purpose accelerators. Further, our analysis provides specific pointers for integrating energy efficiency with performance analysis for enabling high-performance and sustainable computing in the future. △ Less

Submitted 12 October, 2022; originally announced October 2022.

Comments: 8 pages, 9 figures, Submitted to Proceedings of IEEE Conference on High Performance Extreme Computing (HPEC) 2022

MSC Class: 68U01 ACM Class: C.4; I.2

arXiv:2209.09125 [pdf, other]

Operationalizing Machine Learning: An Interview Study

Authors: Shreya Shankar, Rolando Garcia, Joseph M. Hellerstein, Aditya G. Parameswaran

Abstract: Organizations rely on machine learning engineers (MLEs) to operationalize ML, i.e., deploy and maintain ML pipelines in production. The process of operationalizing ML, or MLOps, consists of a continual loop of (i) data collection and labeling, (ii) experimentation to improve ML performance, (iii) evaluation throughout a multi-staged deployment process, and (iv) monitoring of performance drops in p… ▽ More Organizations rely on machine learning engineers (MLEs) to operationalize ML, i.e., deploy and maintain ML pipelines in production. The process of operationalizing ML, or MLOps, consists of a continual loop of (i) data collection and labeling, (ii) experimentation to improve ML performance, (iii) evaluation throughout a multi-staged deployment process, and (iv) monitoring of performance drops in production. When considered together, these responsibilities seem staggering -- how does anyone do MLOps, what are the unaddressed challenges, and what are the implications for tool builders? We conducted semi-structured ethnographic interviews with 18 MLEs working across many applications, including chatbots, autonomous vehicles, and finance. Our interviews expose three variables that govern success for a production ML deployment: Velocity, Validation, and Versioning. We summarize common practices for successful ML experimentation, deployment, and sustaining production performance. Finally, we discuss interviewees' pain points and anti-patterns, with implications for tool design. △ Less

Submitted 16 September, 2022; originally announced September 2022.

Comments: 20 pages, 4 figures

arXiv:2209.03056 [pdf]

Parallel and Streaming Wavelet Neural Networks for Classification and Regression under Apache Spark

Authors: Eduru Harindra Venkatesh, Yelleti Vivek, Vadlamani Ravi, Orsu Shiva Shankar

Abstract: Wavelet neural networks (WNN) have been applied in many fields to solve regression as well as classification problems. After the advent of big data, as data gets generated at a brisk pace, it is imperative to analyze it as soon as it is generated owing to the fact that the nature of the data may change dramatically in short time intervals. This is necessitated by the fact that big data is all perv… ▽ More Wavelet neural networks (WNN) have been applied in many fields to solve regression as well as classification problems. After the advent of big data, as data gets generated at a brisk pace, it is imperative to analyze it as soon as it is generated owing to the fact that the nature of the data may change dramatically in short time intervals. This is necessitated by the fact that big data is all pervasive and throws computational challenges for data scientists. Therefore, in this paper, we built an efficient Scalable, Parallelized Wavelet Neural Network (SPWNN) which employs the parallel stochastic gradient algorithm (SGD) algorithm. SPWNN is designed and developed under both static and streaming environments in the horizontal parallelization framework. SPWNN is implemented by using Morlet and Gaussian functions as activation functions. This study is conducted on big datasets like gas sensor data which has more than 4 million samples and medical research data which has more than 10,000 features, which are high dimensional in nature. The experimental analysis indicates that in the static environment, SPWNN with Morlet activation function outperformed SPWNN with Gaussian on the classification datasets. However, in the case of regression, the opposite was observed. In contrast, in the streaming environment i.e., Gaussian outperformed Morlet on the classification and Morlet outperformed Gaussian on the regression datasets. Overall, the proposed SPWNN architecture achieved a speedup of 1.32-1.40. △ Less

Submitted 7 September, 2022; originally announced September 2022.

Comments: 25 pages; 2 Tables; 7 Figures

MSC Class: 68T09; 68Txx ACM Class: I.2

arXiv:2209.00302 [pdf, other]

Progressive Fusion for Multimodal Integration

Authors: Shiv Shankar, Laure Thompson, Madalina Fiterau

Abstract: Integration of multimodal information from various sources has been shown to boost the performance of machine learning models and thus has received increased attention in recent years. Often such models use deep modality-specific networks to obtain unimodal features which are combined to obtain "late-fusion" representations. However, these designs run the risk of information loss in the respective… ▽ More Integration of multimodal information from various sources has been shown to boost the performance of machine learning models and thus has received increased attention in recent years. Often such models use deep modality-specific networks to obtain unimodal features which are combined to obtain "late-fusion" representations. However, these designs run the risk of information loss in the respective unimodal pipelines. On the other hand, "early-fusion" methodologies, which combine features early, suffer from the problems associated with feature heterogeneity and high sample complexity. In this work, we present an iterative representation refinement approach, called Progressive Fusion, which mitigates the issues with late fusion representations. Our model-agnostic technique introduces backward connections that make late stage fused representations available to early layers, improving the expressiveness of the representations at those stages, while retaining the advantages of late fusion designs. We test Progressive Fusion on tasks including affective sentiment detection, multimedia analysis, and time series fusion with different models, demonstrating its versatility. We show that our approach consistently improves performance, for instance attaining a 5% reduction in MSE and 40% improvement in robustness on multimodal time series prediction. △ Less

Submitted 20 November, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

arXiv:2205.11473 [pdf, other]

Rethinking Streaming Machine Learning Evaluation

Authors: Shreya Shankar, Bernease Herman, Aditya G. Parameswaran

Abstract: While most work on evaluating machine learning (ML) models focuses on computing accuracy on batches of data, tracking accuracy alone in a streaming setting (i.e., unbounded, timestamp-ordered datasets) fails to appropriately identify when models are performing unexpectedly. In this position paper, we discuss how the nature of streaming ML problems introduces new real-world challenges (e.g., delaye… ▽ More While most work on evaluating machine learning (ML) models focuses on computing accuracy on batches of data, tracking accuracy alone in a streaming setting (i.e., unbounded, timestamp-ordered datasets) fails to appropriately identify when models are performing unexpectedly. In this position paper, we discuss how the nature of streaming ML problems introduces new real-world challenges (e.g., delayed arrival of labels) and recommend additional metrics to assess streaming ML performance. △ Less

Submitted 23 May, 2022; originally announced May 2022.

Comments: ML Evaluation Standards Workshop (ICLR 2022)

arXiv:2205.08636 [pdf, other]

doi 10.3389/fphy.2022.948415

Boundaries control active channel flows

Authors: Paarth Gulati, Suraj Shankar, M. Cristina Marchetti

Abstract: Boundary conditions dictate how fluids, including liquid crystals, flow when pumped through a channel. Can boundary conditions also be used to control internally driven active fluids that generate flows spontaneously? By using numerical simulations and stability analysis we explore how surface anchoring of active agents at the boundaries and substrate drag can be used to rectify coherent flow of a… ▽ More Boundary conditions dictate how fluids, including liquid crystals, flow when pumped through a channel. Can boundary conditions also be used to control internally driven active fluids that generate flows spontaneously? By using numerical simulations and stability analysis we explore how surface anchoring of active agents at the boundaries and substrate drag can be used to rectify coherent flow of an active polar fluid in a 2D channel. Upon increasing activity, a succession of dynamical states is obtained, from laminar flow to vortex arrays to eventual turbulence, that are controlled by the interplay between the hydrodynamic screening length and the extrapolation length quantifying the anchoring strength of the orientational order parameter. We highlight the key role of symmetry in both flow and order and show that coherent laminar flow with net throughput is only possible for weak anchoring and intermediate activity. Our work demonstrates the possibility of controlling the nature and properties of active flows in a channel simply by patterning the confining boundaries. △ Less

Submitted 17 May, 2022; originally announced May 2022.

Comments: 12 pages, 9 figures

arXiv:2201.00247 [pdf]

Now is the time to build a national data ecosystem for materials science and chemistry research data

Authors: E. M. Campo, S. Shankar, A. S. Szalay, R. J. Hanisch

Abstract: A call for coordinated action from government, academia, and industry. A call for coordinated action from government, academia, and industry. △ Less

Submitted 1 January, 2022; originally announced January 2022.

arXiv:2112.09079 [pdf, other]

doi 10.1088/1361-6633/ac8231

Spatial population genetics with fluid flow

Authors: Roberto Benzi, David R. Nelson, Suraj Shankar, Federico Toschi, Xiaojue Zhu

Abstract: The growth and evolution of microbial populations is often subjected to advection by fluid flows in spatially extended environments, with immediate consequences for questions of spatial population genetics in marine ecology, planktonic diversity and origin of life scenarios. Here, we review recent progress made in understanding this rich problem in the simplified setting of two competing genetic m… ▽ More The growth and evolution of microbial populations is often subjected to advection by fluid flows in spatially extended environments, with immediate consequences for questions of spatial population genetics in marine ecology, planktonic diversity and origin of life scenarios. Here, we review recent progress made in understanding this rich problem in the simplified setting of two competing genetic microbial strains subjected to fluid flows. As a pedagogical example we focus on antagonsim, i.e., two killer microorganism strains, each secreting toxins that impede the growth of their competitors (competitive exclusion), in the presence of stationary fluid flows. By solving two coupled reaction-diffusion equations that include advection by simple steady cellular flows composed of characteristic flow motifs in two dimensions (2d), we show how local flow shear and compressibility effects can interact with selective advantage to have a dramatic influence on genetic competition and fixation in spatially distributed populations. We analyze several 1d and 2d flow geometries including sources, sinks, vortices and saddles, and show how simple analytical models of the dynamics of the genetic interface can be used to shed light on the nucleation, coexistence and flow-driven instabilities of genetic drops. By exploiting an analogy with phase separation with nonconserved order parameters, we uncover how these genetic drops harness fluid flows for novel evolutionary strategies, even in the presence of number fluctuations, as confirmed by agent-based simulations as well. △ Less

Submitted 30 June, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

Comments: 29 pages, 22 figures

Journal ref: Rep. Prog. Phys. 85 096601, 2022

arXiv:2112.05676 [pdf, ps, other]

doi 10.1073/pnas.2121985119

Optimal transport and control of active drops

Authors: Suraj Shankar, Vidya Raju, L. Mahadevan

Abstract: Understanding the complex patterns in space-time exhibited by active systems has been the subject of much interest in recent times. Complementing this forward problem is the inverse problem of controlling active matter. Here we use optimal control theory to pose the problem of transporting a slender drop of an active fluid and determine the dynamical profile of the active stresses to move it with… ▽ More Understanding the complex patterns in space-time exhibited by active systems has been the subject of much interest in recent times. Complementing this forward problem is the inverse problem of controlling active matter. Here we use optimal control theory to pose the problem of transporting a slender drop of an active fluid and determine the dynamical profile of the active stresses to move it with minimal viscous dissipation. By parametrizing the position and size of the drop using a low-order description based on lubrication theory, we uncover a natural ''gather-move-spread'' strategy that leads to an optimal bound on the maximum achievable displacement of the drop relative to its size. In the continuum setting, the competition between passive surface tension, and active controls generates richer behaviour with futile oscillations and complex drop morphologies that trade internal dissipation against the transport cost to select optimal strategies. Our work combines active hydrodynamics and optimal control in a tractable and interpretable framework, and begins to pave the way for the spatiotemporal manipulation of active matter. △ Less

Submitted 10 December, 2021; originally announced December 2021.

Comments: 8 pages, 4 figures, SI available upon request

Journal ref: PNAS 119 (35) e2121985119, 2022

arXiv:2110.00385 [pdf, ps, other]

Neural Dependency Coding inspired Multimodal Fusion

Authors: Shiv Shankar

Abstract: Information integration from different modalities is an active area of research. Human beings and, in general, biological neural systems are quite adept at using a multitude of signals from different sensory perceptive fields to interact with the environment and each other. Recent work in deep fusion models via neural networks has led to substantial improvements over unimodal approaches in areas l… ▽ More Information integration from different modalities is an active area of research. Human beings and, in general, biological neural systems are quite adept at using a multitude of signals from different sensory perceptive fields to interact with the environment and each other. Recent work in deep fusion models via neural networks has led to substantial improvements over unimodal approaches in areas like speech recognition, emotion recognition and analysis, captioning and image description. However, such research has mostly focused on architectural changes allowing for fusion of different modalities while keeping the model complexity manageable. Inspired by recent neuroscience ideas about multisensory integration and processing, we investigate the effect of synergy maximizing loss functions. Experiments on multimodal sentiment analysis tasks: CMU-MOSI and CMU-MOSEI with different models show that our approach provides a consistent performance boost. △ Less

Submitted 4 October, 2021; v1 submitted 28 September, 2021; originally announced October 2021.

arXiv:2108.13557 [pdf, other]

Towards Observability for Production Machine Learning Pipelines

Authors: Shreya Shankar, Aditya Parameswaran

Abstract: Software organizations are increasingly incorporating machine learning (ML) into their product offerings, driving a need for new data management tools. Many of these tools facilitate the initial development of ML applications, but sustaining these applications post-deployment is difficult due to lack of real-time feedback (i.e., labels) for predictions and silent failures that could occur at any c… ▽ More Software organizations are increasingly incorporating machine learning (ML) into their product offerings, driving a need for new data management tools. Many of these tools facilitate the initial development of ML applications, but sustaining these applications post-deployment is difficult due to lack of real-time feedback (i.e., labels) for predictions and silent failures that could occur at any component of the ML pipeline (e.g., data distribution shift or anomalous features). We propose a new type of data management system that offers end-to-end observability, or visibility into complex system behavior, for deployed ML pipelines through assisted (1) detection, (2) diagnosis, and (3) reaction to ML-related bugs. We describe new research challenges and suggest preliminary solution ideas in all three aspects. Finally, we introduce an example architecture for a "bolt-on" ML observability system, or one that wraps around existing tools in the stack. △ Less

Submitted 15 July, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

Comments: 11 pages, 6 figures

arXiv:2108.12982 [pdf, other]

Adversarial Stein Training for Graph Energy Models

Authors: Shiv Shankar

Abstract: Learning distributions over graph-structured data is a challenging task with many applications in biology and chemistry. In this work we use an energy-based model (EBM) based on multi-channel graph neural networks (GNN) to learn permutation invariant unnormalized density functions on graphs. Unlike standard EBM training methods our approach is to learn the model via minimizing adversarial stein di… ▽ More Learning distributions over graph-structured data is a challenging task with many applications in biology and chemistry. In this work we use an energy-based model (EBM) based on multi-channel graph neural networks (GNN) to learn permutation invariant unnormalized density functions on graphs. Unlike standard EBM training methods our approach is to learn the model via minimizing adversarial stein discrepancy. Samples from the model can be obtained via Langevin dynamics based MCMC. We find that this approach achieves competitive results on graph generation compared to benchmark models. △ Less

Submitted 29 August, 2021; originally announced August 2021.

Comments: Appeared at Machine Learning for Molecules Workshop at NeurIPS 2020.https://ml4molecules.github.io

arXiv:2108.10875 [pdf, other]

doi 10.1073/pnas.2117241119

Geometric control of topological dynamics in a singing saw

Authors: Suraj Shankar, Petur Bryde, L. Mahadevan

Abstract: The common handsaw can be converted into a bowed musical instrument capable of producing exquisitely sustained notes when its blade is appropriately bent. Acoustic modes localized at an inflection point are known to underlie the saw's sonorous quality, yet the origin of localization has remained mysterious. Here we uncover a topological basis for the existence of localized modes, that relies on an… ▽ More The common handsaw can be converted into a bowed musical instrument capable of producing exquisitely sustained notes when its blade is appropriately bent. Acoustic modes localized at an inflection point are known to underlie the saw's sonorous quality, yet the origin of localization has remained mysterious. Here we uncover a topological basis for the existence of localized modes, that relies on and is protected by spatial curvature. By combining experimental demonstrations, theory and computation, we show how spatial variations in blade curvature control the localization of these trapped states, allowing the saw to function as a geometrically tunable high quality oscillator. Our work establishes an unexpected connection between the dynamics of thin shells and topological insulators, and offers a robust principle to design high quality resonators across scales, from macroscopic instruments to nanoscale devices, simply through geometry. △ Less

Submitted 24 August, 2021; originally announced August 2021.

Comments: 17 pages, 3 figures, SI available upon request

Journal ref: PNAS, 119 (17) e2117241119, 2022

arXiv:2108.09569 [pdf, other]

doi 10.1109/CONECCT52877.2021.9622534

Wireless Sensor Networks for Optimisation of Search and Rescue Management in Floods

Authors: Harshil Bhatt, Pranesh G, Samarth Shankar, Shriyash Haralikar

Abstract: We propose a novel search-and-rescue management method that relies on the aerial deployment of Wireless Sensor Network (WSN) for locating victims after floods. The sensor nodes will collect vital information such as heat signatures for detecting human presence and location, the flow of flood. The sensor modules are packed in a portable floating buoy with a user interface to convey emergency messag… ▽ More We propose a novel search-and-rescue management method that relies on the aerial deployment of Wireless Sensor Network (WSN) for locating victims after floods. The sensor nodes will collect vital information such as heat signatures for detecting human presence and location, the flow of flood. The sensor modules are packed in a portable floating buoy with a user interface to convey emergency messages to the base station. Sensor nodes are designed based on disaster conditions, cost-effectiveness and deployed in the affected region by a centrifugal dispersion system from a helicopter. A mobile ad-hoc network is set up by modifying the Low Energy Adaptive Cluster Hierarchy (LEACH) protocol for greater efficiency and adoption of multi-hop of Cluster Heads for long-distance communication to Base Station. The model metrics have been defined considering previous rural floods in India. The efficiency and power characteristics of the network are compared to other protocols via simulations. The sensor data from the network makes resource management, rescue planning and emergency priority more efficient, thus saving more lives from floods. △ Less

Submitted 21 August, 2021; originally announced August 2021.

arXiv:2108.09047 [pdf, other]

doi 10.1109/IROS45743.2020.9341724

AutoLay: Benchmarking amodal layout estimation for autonomous driving

Authors: Kaustubh Mani, N. Sai Shankar, Krishna Murthy Jatavallabhula, K. Madhava Krishna

Abstract: Given an image or a video captured from a monocular camera, amodal layout estimation is the task of predicting semantics and occupancy in bird's eye view. The term amodal implies we also reason about entities in the scene that are occluded or truncated in image space. While several recent efforts have tackled this problem, there is a lack of standardization in task specification, datasets, and eva… ▽ More Given an image or a video captured from a monocular camera, amodal layout estimation is the task of predicting semantics and occupancy in bird's eye view. The term amodal implies we also reason about entities in the scene that are occluded or truncated in image space. While several recent efforts have tackled this problem, there is a lack of standardization in task specification, datasets, and evaluation protocols. We address these gaps with AutoLay, a dataset and benchmark for amodal layout estimation from monocular images. AutoLay encompasses driving imagery from two popular datasets: KITTI and Argoverse. In addition to fine-grained attributes such as lanes, sidewalks, and vehicles, we also provide semantically annotated 3D point clouds. We implement several baselines and bleeding edge approaches, and release our data and code. △ Less

Submitted 20 August, 2021; originally announced August 2021.

Comments: published in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

arXiv:2107.14139 [pdf, other]

Vaccination Worldwide: Strategies, Distribution and Challenges

Authors: Chirag Samal, Kasia Jakimowicz, Krishnendu Dasgupta, Aniket Vashishtha, Francisco O., Arunakiry Natarajan, Haris Nazir, Alluri Siddhartha Varma, Tejal Dahake, Amitesh Anand Pandey, Ishaan Singh, John Sangyeob Kim, Mehrab Singh Gill, Saurish Srivastava, Orna Mukhopadhyay, Parth Patwa, Qamil Mirza, Sualeha Irshad, Sheshank Shankar, Rohan Iyer, Rohan Sukumaran, Ashley Mehra, Anshuman Sharma, Abhishek Singh, Maurizio Arseni , et al. (4 additional authors not shown)

Abstract: The Coronavirus 2019 (Covid-19) pandemic caused by the SARS-CoV-2 virus represents an unprecedented crisis for our planet. It is a bane of the über connected world that we live in that this virus has affected almost all countries and caused mortality and economic upheaval at a scale whose effects are going to be felt for generations to come. While we can all be buoyed at the pace at which vaccines… ▽ More The Coronavirus 2019 (Covid-19) pandemic caused by the SARS-CoV-2 virus represents an unprecedented crisis for our planet. It is a bane of the über connected world that we live in that this virus has affected almost all countries and caused mortality and economic upheaval at a scale whose effects are going to be felt for generations to come. While we can all be buoyed at the pace at which vaccines have been developed and brought to market, there are still challenges ahead for all countries to get their populations vaccinated equitably and effectively. This paper provides an overview of ongoing immunization efforts in various countries. In this early draft, we have identified a few key factors that we use to review different countries' current COVID-19 immunization strategies and their strengths and draw conclusions so that policymakers worldwide can learn from them. Our paper focuses on processes related to vaccine approval, allocation and prioritization, distribution strategies, population to vaccine ratio, vaccination governance, accessibility and use of digital solutions, and government policies. The statistics and numbers are dated as per the draft date [June 24th, 2021]. △ Less

Submitted 21 July, 2021; originally announced July 2021.

arXiv:2107.01368 [pdf, ps, other]

The coarsest lattice that determines a discrete multidimensional system

Authors: Debasattam Pal, Shiva Shankar

Abstract: A discrete multidimensional system is the set of solutions to a system of linear partial difference equations defined on the lattice $\Z^n$. This paper shows that it is determined by a unique coarsest sublattice, in the sense that the solutions of the system on this sublattice determine the solutions on $\Z^n$; it is therefore the correct domain of definition of the discrete system. In turn, the d… ▽ More A discrete multidimensional system is the set of solutions to a system of linear partial difference equations defined on the lattice $\Z^n$. This paper shows that it is determined by a unique coarsest sublattice, in the sense that the solutions of the system on this sublattice determine the solutions on $\Z^n$; it is therefore the correct domain of definition of the discrete system. In turn, the defining sublattice is determined by a Galois group of symmetries that leave invariant the equations defining the system. These results find application in understanding properties of the system such as controllability and autonomy, and in its order reduction. △ Less

Submitted 23 January, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

Comments: To appear in Mathematics of Control Signals and Systems

MSC Class: 39A14; 93B25; 13B05

Showing 1–50 of 164 results for author: Shankar, S