Showing 1–50 of 125 results for author: Rao, N

  1. arXiv:2410.08980  [pdf, other]

    quant-ph cs.NI

    Leveraging Internet Principles to Build a Quantum Network

    Authors: Leonardo Bacciottini, Aparimit Chandra, Matheus Guedes De Andrade, Nitish K. Panigrahy, Shahrooz Pouryousef, Nageswara S. V. Rao, Emily Van Milligen, Gayane Vardoyan, Don Towsley

    Abstract: Designing an operational architecture for the Quantum Internet is a challenging task in light of both fundamental limitations imposed by the laws of physics and technological constraints. Here, we propose a method to abstract away most of the quantum-specific elements and formulate a best-effort quantum network architecture based on packet-switching, akin to that of the classical Internet. Such re…

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 9 pages, 5 figures

  2. arXiv:2410.04249  [pdf, other]

    cs.SE

    DiffSpec: Differential Testing with LLMs using Natural Language Specifications and Code Artifacts

    Authors: Nikitha Rao, Elizabeth Gilbert, Tahina Ramananandro, Nikhil Swamy, Claire Le Goues, Sarah Fakhoury

    Abstract: Differential testing can be an effective way to find bugs in software systems with multiple implementations that conform to the same specification, like compilers, network protocol parsers, and language runtimes. Specifications for such systems are often standardized in natural language documents, like Instruction Set Architecture (ISA) specifications, Wasm specifications or IETF RFCs. Large Lang…

    Submitted 5 October, 2024; originally announced October 2024.

  3. arXiv:2409.12447  [pdf, other]

    cs.SE cs.AI cs.HC

    Prompts Are Programs Too! Understanding How Developers Build Software Containing Prompts

    Authors: Jenny T. Liang, Melissa Lin, Nikitha Rao, Brad A. Myers

    Abstract: The introduction of generative pre-trained models, like GPT-4, has given rise to a phenomenon known as prompt engineering, whereby model users repeatedly write and revise prompts while trying to achieve a task. Using these AI models for intelligent features in software applications requires using APIs that are controlled through developer-written prompts. These prompts have powered AI experiences in p…

    Submitted 18 September, 2024; originally announced September 2024.

  4. arXiv:2409.11238  [pdf, other]

    cs.RO cs.LG eess.SY

    Leveraging Symmetry to Accelerate Learning of Trajectory Tracking Controllers for Free-Flying Robotic Systems

    Authors: Jake Welde, Nishanth Rao, Pratik Kunapuli, Dinesh Jayaraman, Vijay Kumar

    Abstract: Tracking controllers enable robotic systems to accurately follow planned reference trajectories. In particular, reinforcement learning (RL) has shown promise in the synthesis of controllers for systems with complex dynamics and modest online compute budgets. However, the poor sample efficiency of RL and the challenges of reward design make training slow and sometimes unstable, especially for high-…

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: The first three authors contributed equally to this work

  5. Data Collectives as a means to Improve Accountability, Combat Surveillance and Reduce Inequalities

    Authors: Jane Hsieh, Angie Zhang, Seyun Kim, Varun Nagaraj Rao, Samantha Dalal, Alexandra Mateescu, Rafael Do Nascimento Grohmann, Motahhare Eslami, Min Kyung Lee, Haiyi Zhu

    Abstract: Platform-based laborers face unprecedented challenges and working conditions that result from algorithmic opacity, insufficient data transparency, and unclear policies and regulations. The CSCW and HCI communities increasingly turn to worker data collectives as a means to advance related policy and regulation, hold platforms accountable for data transparency and disclosure, and empower the collect…

    Submitted 1 September, 2024; originally announced September 2024.

  6. arXiv:2407.18919  [pdf]

    cs.LG q-bio.QM

    Accelerating Drug Safety Assessment using Bidirectional-LSTM for SMILES Data

    Authors: K. Venkateswara Rao, Kunjam Nageswara Rao, G. Sita Ratnam

    Abstract: Computational methods are useful in accelerating the pace of drug discovery. Drug discovery involves several steps, such as target identification and validation, lead discovery, and lead optimisation. In the lead optimisation phase, the absorption, distribution, metabolism, excretion, and toxicity properties of lead compounds are assessed. To address the issue of predicting toxicity and solu…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 10 pages

  7. arXiv:2406.19150  [pdf, other]

    cs.CV cs.AI cs.IR

    RAVEN: Multitask Retrieval Augmented Vision-Language Learning

    Authors: Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar Satzoda, Srikar Appalaraju

    Abstract: The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to vision-language models (VLMs) is underexplored. Existing methods focus on models designed for single tasks. Furthermore, they're limited by the need for resour…

    Submitted 27 June, 2024; originally announced June 2024.

  8. arXiv:2406.10768  [pdf, other]

    cs.CY cs.HC

    Rideshare Transparency: Translating Gig Worker Insights on AI Platform Design to Policy

    Authors: Varun Nagaraj Rao, Samantha Dalal, Eesha Agarwal, Dana Calacci, Andrés Monroy-Hernández

    Abstract: Rideshare platforms exert significant control over workers through algorithmic systems that can result in financial, emotional, and physical harm. What steps can platforms, designers, and practitioners take to mitigate these negative impacts and meet worker needs? In this paper, through a novel mixed methods study combining an LLM-based analysis of over 1 million comments posted to online platform…

    Submitted 19 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  9. arXiv:2406.07667  [pdf, other]

    cs.CV

    PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow

    Authors: Joshua Tokarsky, Ibrahim Abdulhafiz, Satya Ayyalasomayajula, Mostafa Mohsen, Navya G. Rao, Adam Forbes

    Abstract: Autonomous driving has experienced remarkable progress, bolstered by innovations in computational hardware and sophisticated deep learning methodologies. The foundation of these advancements rests on the availability and quality of datasets, which are crucial for the development and refinement of dependable and versatile autonomous driving algorithms. While numerous datasets have been developed to…

    Submitted 11 June, 2024; originally announced June 2024.

  10. arXiv:2405.10391  [pdf, other]

    cs.RO cs.AI eess.IV

    Vision Transformers for End-to-End Vision-Based Quadrotor Obstacle Avoidance

    Authors: Anish Bhattacharya, Nishanth Rao, Dhruv Parikh, Pratik Kunapuli, Yuwei Wu, Yuezhan Tao, Nikolai Matni, Vijay Kumar

    Abstract: We demonstrate the capabilities of an attention-based end-to-end approach for high-speed vision-based quadrotor obstacle avoidance in dense, cluttered environments, with comparison to various state-of-the-art learning architectures. Quadrotor unmanned aerial vehicles (UAVs) have tremendous maneuverability when flown fast; however, as flight speed increases, traditional model-based approaches to na…

    Submitted 27 September, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 11 pages, 18 figures, 3 tables (with supplementary)

  11. MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

    Authors: Qi Chen, Xiubo Geng, Corby Rosset, Carolyn Buractaon, Jingwen Lu, Tao Shen, Kun Zhou, Chenyan Xiong, Yeyun Gong, Paul Bennett, Nick Craswell, Xing Xie, Fan Yang, Bryan Tower, Nikhil Rao, Anlei Dong, Wenqi Jiang, Zheng Liu, Mingqin Li, Chuanjie Liu, Zengzhong Li, Rangan Majumder, Jennifer Neville, Andy Oakley, Knut Magne Risvik, et al. (6 additional authors not shown)

    Abstract: Recent breakthroughs in large models have highlighted the critical significance of data scale, labels and modalities. In this paper, we introduce MS MARCO Web Search, the first large-scale information-rich web dataset, featuring millions of real clicked query-document labels. This dataset closely mimics real-world web document and query distribution, provides rich information for various kinds of down…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures, for associated dataset, see http://github.com/microsoft/MS-MARCO-Web-Search

  12. arXiv:2405.05345  [pdf, other]

    cs.CL cs.HC

    QuaLLM: An LLM-based Framework to Extract Quantitative Insights from Online Forums

    Authors: Varun Nagaraj Rao, Eesha Agarwal, Samantha Dalal, Dan Calacci, Andrés Monroy-Hernández

    Abstract: Online discussion forums provide crucial data to understand the concerns of a wide range of real-world communities. However, the typical qualitative and quantitative methods used to analyze those data, such as thematic analysis and topic modeling, are infeasible to scale or require significant human effort to translate outputs to human readable forms. This study introduces QuaLLM, a novel LLM-base…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted to CHI LLM as Research Tools Workshop (2024)

  13. arXiv:2405.00820  [pdf, other]

    cs.AR cs.LG

    HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond

    Authors: Stefan Abi-Karam, Rishov Sarkar, Allison Seigler, Sean Lowe, Zhigang Wei, Hanqiu Chen, Nanditha Rao, Lizy John, Aman Arora, Cong Hao

    Abstract: Machine learning (ML) techniques have been applied to high-level synthesis (HLS) flows for quality-of-result (QoR) prediction and design space exploration (DSE). Nevertheless, the scarcity of accessible high-quality HLS datasets and the complexity of building such datasets present challenges. Existing datasets have limitations in terms of benchmark coverage, design space enumeration, vendor extens…

    Submitted 17 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: Edit to "Section V.E" for proper attribution of open-source HLSyn, AutoDSE, and the Merlin compiler

  14. arXiv:2402.17896  [pdf, other]

    cs.CL cs.AI

    Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents

    Authors: Corby Rosset, Ho-Lam Chung, Guanghui Qin, Ethan C. Chau, Zhuo Feng, Ahmed Awadallah, Jennifer Neville, Nikhil Rao

    Abstract: Existing question answering (QA) datasets are no longer challenging for the most powerful Large Language Models (LLMs). Traditional QA benchmarks like TriviaQA, NaturalQuestions, ELI5 and HotpotQA mainly study "known unknowns" with clear indications of both what information is missing, and how to find it to answer the question. Hence, good performance on these benchmarks provides a false sense of sec…

    Submitted 27 February, 2024; originally announced February 2024.

  15. arXiv:2312.17479  [pdf, other]

    cs.AI cs.CY cs.HC cs.LG

    Culturally-Attuned Moral Machines: Implicit Learning of Human Value Systems by AI through Inverse Reinforcement Learning

    Authors: Nigini Oliveira, Jasmine Li, Koosha Khalvati, Rodolfo Cortes Barragan, Katharina Reinecke, Andrew N. Meltzoff, Rajesh P. N. Rao

    Abstract: Constructing a universal moral code for artificial intelligence (AI) is difficult or even impossible, given that different human cultures have different definitions of morality and different societal norms. We therefore argue that the value system of an AI should be culturally attuned: just as a child raised in a particular culture learns the specific values and norms of that culture, we propose t…

    Submitted 29 December, 2023; originally announced December 2023.

  16. arXiv:2312.10049  [pdf]

    cs.IR

    Knowledge Graph Reasoning Based on Attention GCN

    Authors: Meera Gupta, Ravi Khanna, Divya Choudhary, Nandini Rao

    Abstract: We propose a novel technique to enhance Knowledge Graph Reasoning by combining Graph Convolution Neural Network (GCN) with the Attention Mechanism. This approach utilizes the Attention Mechanism to examine the relationships between entities and their neighboring nodes, which helps to develop detailed feature vectors for each entity. The GCN uses shared parameters to effectively represent the chara…

    Submitted 27 January, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  17. arXiv:2310.18918  [pdf, other]

    cs.LG cs.SI

    Hyperbolic Graph Neural Networks at Scale: A Meta Learning Approach

    Authors: Nurendra Choudhary, Nikhil Rao, Chandan K. Reddy

    Abstract: The progress in hyperbolic neural networks (HNNs) research is hindered by their absence of inductive bias mechanisms, which are essential for generalizing to new tasks and facilitating scalable learning over large datasets. In this paper, we aim to alleviate these issues by learning generalizable inductive biases from the nodes' local subgraph and transfer them for faster learning over new subgrap…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023. 14 pages of main paper, 5 pages of supplementary

  18. arXiv:2310.05972  [pdf, other]

    cs.ET

    Normality of I-V Measurements Using ML

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Craig A. Bridges, Sheng Dai

    Abstract: Electrochemistry ecosystems are promising for accelerating the design and discovery of electrochemical systems for energy storage and conversion, by automating significant parts of workflows that combine synthesis and characterization experiments with computations. They require the integration of flow controllers, solvent containers, pumps, fraction collectors, and potentiostats, all connected to…

    Submitted 28 September, 2023; originally announced October 2023.

    Comments: published at eScience 2023

    Journal ref: in 2023 IEEE 19th International Conference on e-Science (e-Science), Limassol, Cyprus, 2023 pp. 1-2

  19. arXiv:2310.02409  [pdf, other]

    cs.CL cs.AI cs.LG

    Dodo: Dynamic Contextual Compression for Decoder-only LMs

    Authors: Guanghui Qin, Corby Rosset, Ethan C. Chau, Nikhil Rao, Benjamin Van Durme

    Abstract: Transformer-based language models (LMs) are inefficient in long contexts. We propose Dodo, a solution for context compression. Instead of one vector per token in a standard transformer model, Dodo represents text with a dynamic number of hidden states at each layer, reducing the cost of self-attention to a fraction of typical time and space. Moreover, off-the-shelf models such as LLaMA can be adap…

    Submitted 13 June, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ACL 2024 camera-ready. 15 pages and 7 figures

    ACM Class: I.2.7; I.2.6

  20. arXiv:2310.02263  [pdf, other]

    cs.CL cs.AI cs.LG

    Automatic Pair Construction for Contrastive Post-training

    Authors: Canwen Xu, Corby Rosset, Ethan C. Chau, Luciano Del Corro, Shweti Mahajan, Julian McAuley, Jennifer Neville, Ahmed Hassan Awadallah, Nikhil Rao

    Abstract: Alignment serves as an important step to steer large language models (LLMs) towards human preferences. In this paper, we propose an automatic way to construct contrastive data for LLMs, using preference pairs from multiple models of varying strengths (e.g., InstructGPT, ChatGPT and GPT-4). We compare the contrastive techniques of SLiC and DPO to SFT baselines and find that DPO provides a step-funct…

    Submitted 2 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: NAACL 2024 (Findings)

  21. arXiv:2310.01602  [pdf, other]

    cs.SE cs.AI

    CAT-LM: Training Language Models on Aligned Code And Tests

    Authors: Nikitha Rao, Kush Jain, Uri Alon, Claire Le Goues, Vincent J. Hellendoorn

    Abstract: Testing is an integral part of the software development process. Yet, writing tests is time-consuming and therefore often neglected. Classical test generation tools such as EvoSuite generate behavioral test suites by optimizing for coverage, but tend to produce tests that are hard to understand. Language models trained on code can generate code that is highly similar to that written by humans, but…

    Submitted 2 October, 2023; originally announced October 2023.

  22. arXiv:2309.11512  [pdf, other]

    stat.AP cs.LG

    Multidimensional well-being of US households at a fine spatial scale using fused household surveys: fusionACS

    Authors: Kevin Ummel, Miguel Poblete-Cazenave, Karthik Akkiraju, Nick Graetz, Hero Ashman, Cora Kingdon, Steven Herrera Tenorio, Aaryaman "Sunny" Singhal, Daniel Aldana Cohen, Narasimha D. Rao

    Abstract: Social science often relies on surveys of households and individuals. Dozens of such surveys are regularly administered by the U.S. government. However, they field independent, unconnected samples with specialized questions, limiting research questions to those that can be answered by a single survey. The fusionACS project seeks to integrate data from multiple U.S. household surveys by statistical…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 35 pages, 6 figures

  23. arXiv:2308.11809  [pdf, other]

    q-bio.NC cs.AI cs.NE

    Expressive probabilistic sampling in recurrent neural networks

    Authors: Shirui Chen, Linxing Preston Jiang, Rajesh P. N. Rao, Eric Shea-Brown

    Abstract: In sampling-based Bayesian models of brain function, neural activities are assumed to be samples from probability distributions that the brain uses for probabilistic computation. However, a comprehensive understanding of how mechanistic models of neural dynamics can sample from arbitrary distributions is still lacking. We use tools from functional analysis and stochastic differential equations to…

    Submitted 14 November, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  24. arXiv:2308.07870  [pdf, other]

    cs.AI cs.LG cs.NE

    Brain-Inspired Computational Intelligence via Predictive Coding

    Authors: Tommaso Salvatori, Ankur Mali, Christopher L. Buckley, Thomas Lukasiewicz, Rajesh P. N. Rao, Karl Friston, Alexander Ororbia

    Abstract: Artificial intelligence (AI) is rapidly becoming one of the key technologies of this century. The majority of results in AI thus far have been achieved using deep neural networks trained with the error backpropagation learning algorithm. However, the ubiquitous adoption of this approach has highlighted some important limitations such as substantial computational cost, difficulty in quantifying unc…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 37 Pages, 9 Figures

  25. arXiv:2307.06883  [pdf, other]

    cs.OH physics.ins-det

    Cyber Framework for Steering and Measurements Collection Over Instrument-Computing Ecosystems

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Helia Zandi, Debangshu Mukherjee, Maxim Ziatdinov, Craig Bridges

    Abstract: We propose a framework to develop cyber solutions to support the remote steering of science instruments and measurements collection over instrument-computing ecosystems. It is based on provisioning separate data and control connections at the network level, and developing software modules consisting of Python wrappers for instrument commands and Pyro server-client codes that make them available ac…

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Paper accepted for presentation at IEEE SMARTCOMP 2023

  26. Discrimination through Image Selection by Job Advertisers on Facebook

    Authors: Varun Nagaraj Rao, Aleksandra Korolova

    Abstract: Targeted advertising platforms are widely used by job advertisers to reach potential employees; thus issues of discrimination due to targeting that have surfaced have received widespread attention. Advertisers could misuse targeting tools to exclude people based on gender, race, location and other protected attributes from seeing their job ads. In response to legal actions, Facebook disabled the a…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Published in FAccT 2023

  27. arXiv:2306.05912  [pdf, other]

    eess.IV cs.CV

    Single-Image-Based Deep Learning for Segmentation of Early Esophageal Cancer Lesions

    Authors: Haipeng Li, Dingrui Liu, Yu Zeng, Shuaicheng Liu, Tao Gan, Nini Rao, Jinlin Yang, Bing Zeng

    Abstract: Accurate segmentation of lesions is crucial for diagnosis and treatment of early esophageal cancer (EEC). However, neither traditional nor deep learning-based methods to date can meet the clinical requirements, with the mean Dice score - the most important metric in medical image analysis - hardly exceeding 0.75. In this paper, we present a novel deep learning approach for segmenting EEC lesio…

    Submitted 9 June, 2023; originally announced June 2023.

  28. arXiv:2305.20015  [pdf, other]

    cs.SE cs.AI

    AI for Low-Code for AI

    Authors: Nikitha Rao, Jason Tsay, Kiran Kate, Vincent J. Hellendoorn, Martin Hirzel

    Abstract: Low-code programming allows citizen developers to create programs with minimal coding effort, typically via visual (e.g. drag-and-drop) interfaces. In parallel, recent AI-powered tools such as Copilot and ChatGPT generate programs from natural language instructions. We argue that these modalities are complementary: tools like ChatGPT greatly reduce the need to memorize large APIs but still require…

    Submitted 31 May, 2023; originally announced May 2023.

  29. arXiv:2305.09887  [pdf, other]

    cs.LG cs.DC

    Simplifying Distributed Neural Network Training on Massive Graphs: Randomized Partitions Improve Model Aggregation

    Authors: Jiong Zhu, Aishwarya Reganti, Edward Huang, Charles Dickens, Nikhil Rao, Karthik Subbian, Danai Koutra

    Abstract: Distributed training of GNNs enables learning on massive graphs (e.g., social and e-commerce networks) that exceed the storage and computational capacity of a single machine. To reach performance comparable to centralized training, distributed frameworks focus on maximally recovering cross-instance node dependencies with either communication across instances or periodic fallback to centralized tra…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 14 pages, 3 figures

  30. arXiv:2304.02048  [pdf]

    cond-mat.mtrl-sci cs.LG

    Deep Learning for Automated Experimentation in Scanning Transmission Electron Microscopy

    Authors: Sergei V. Kalinin, Debangshu Mukherjee, Kevin M. Roccapriore, Ben Blaiszik, Ayana Ghosh, Maxim A. Ziatdinov, A. Al-Najjar, Christina Doty, Sarah Akers, Nageswara S. Rao, Joshua C. Agar, Steven R. Spurgeon

    Abstract: Machine learning (ML) has become critical for post-acquisition data analysis in (scanning) transmission electron microscopy, (S)TEM, imaging and spectroscopy. An emerging trend is the transition to real-time analysis and closed-loop microscope operation. The effective use of ML in electron microscopy now requires the development of strategies for microscopy-centered experiment workflow design and…

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Review Article

  31. arXiv:2302.14189  [pdf, other]

    cs.LG cs.AI cs.SI

    You Only Transfer What You Share: Intersection-Induced Graph Transfer Learning for Link Prediction

    Authors: Wenqing Zheng, Edward W Huang, Nikhil Rao, Zhangyang Wang, Karthik Subbian

    Abstract: Link prediction is central to many real-world applications, but its performance may be hampered when the graph of interest is sparse. To alleviate issues caused by sparsity, we investigate a previously overlooked phenomenon: in many cases, a densely connected, complementary graph can be found for the original graph. The denser graph may share nodes with the original graph, which offers a natural b…

    Submitted 18 June, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted in TMLR (https://openreview.net/forum?id=Nn71AdKyYH)

  32. arXiv:2211.14261  [pdf, ps, other]

    cs.RO

    Temporal Waypoint Navigation of Multi-UAV Payload System using Barrier Functions

    Authors: Nishanth Rao, Suresh Sundaram, Pushpak Jagtap

    Abstract: Aerial package transportation often requires complex spatial and temporal specifications to be satisfied in order to ensure safe and timely delivery from one point to another. It is usually efficient to transport versatile payloads using multiple UAVs that can work collaboratively to achieve the desired task. The complex temporal specifications can be handled coherently by applying Signal Temporal…

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: Submitted to ECC 2023

  33. arXiv:2211.13328  [pdf, other]

    cs.IR

    Search Behavior Prediction: A Hypergraph Perspective

    Authors: Yan Han, Edward W Huang, Wenqing Zheng, Nikhil Rao, Zhangyang Wang, Karthik Subbian

    Abstract: Although bipartite shopping graphs are a straightforward way to model search behavior, they suffer from two challenges: 1) The majority of items are sporadically searched and hence have noisy/sparse query associations, leading to a long-tail distribution. 2) Infrequent queries are more likely to link to popular items, leading to another hurdle known as disassortative mixing. To add…

    Submitted 28 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: WSDM 2023

  34. arXiv:2211.06548  [pdf, ps, other]

    cs.RO

    Computationally Light Spectrally Normalized Memory Neuron Network based Estimator for GPS-Denied operation of Micro UAV

    Authors: Nishanth Rao, Suresh Sundaram, Varun Raghavendra

    Abstract: This paper addresses the problem of position estimation in UAVs operating in a cluttered environment where GPS information is unavailable. A model learning-based approach is proposed that takes in the rotor RPMs and past state as input and predicts the one-step-ahead position of the UAV using a novel spectral-normalized memory neural network (SN-MNN). The spectral normalization guarantees stable a…

    Submitted 3 December, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: Submitted to L4DC 2023

  35. arXiv:2210.13461  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE q-bio.NC

    Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

    Authors: Rajesh P. N. Rao, Dimitrios C. Gklezakos, Vishwas Sathish

    Abstract: Predictive coding has emerged as a prominent model of how the brain learns through predictions, anticipating the importance accorded to predictive learning in recent AI architectures such as transformers. Here we propose a new framework for predictive coding called active predictive coding which can learn hierarchical world models and solve two radically different open problems in AI: (1) how do w…

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: 15 pages, 10 figures, 2 supplementary figures

  36. arXiv:2210.11753  [pdf, other]

    cs.CL

    TransLIST: A Transformer-Based Linguistically Informed Sanskrit Tokenizer

    Authors: Jivnesh Sandhan, Rathin Singha, Narein Rao, Suvendu Samanta, Laxmidhar Behera, Pawan Goyal

    Abstract: Sanskrit Word Segmentation (SWS) is essential in making digitized texts available and in deploying downstream tasks. It is, however, non-trivial because of the sandhi phenomenon that modifies the characters at the word boundaries, and needs special treatment. Existing lexicon-driven approaches for SWS make use of Sanskrit Heritage Reader, a lexicon-driven shallow parser, to generate the complete c…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP22 (Findings)

  37. arXiv:2210.11478  [pdf, other]

    q-bio.NC cs.AI

    Neural Co-Processors for Restoring Brain Function: Results from a Cortical Model of Grasping

    Authors: Matthew J. Bryan, Linxing Preston Jiang, Rajesh P N Rao

    Abstract: Objective: A major challenge in designing closed-loop brain-computer interfaces is finding optimal stimulation patterns as a function of ongoing neural activity for different subjects and objectives. Approach: To achieve goal-directed closed-loop neurostimulation, we propose "neural co-processors" which use artificial neural networks and deep learning to learn optimal closed-loop stimulation polic…

    Submitted 20 March, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 45 pages, 19 figures. Submitted to the IOP Journal of Neural Engineering

  38. arXiv:2210.09791  [pdf, other]

    cs.DC

    Enabling Autonomous Electron Microscopy for Networked Computation and Steering

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Maxim Ziatdinov, Debangshu Mukherjee, Olga Ovchinnikova, Kevin Roccapriore, Andrew R. Lupini, Sergei V. Kalinin

    Abstract: Advanced electron microscopy workflows require an ecosystem of microscope instruments and computing systems possibly located at different sites to conduct remotely steered and automated experiments. Current workflow executions involve manual operations for steering and measurement tasks, which are typically performed from control workstations co-located with microscopes; consequently, their operat…

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: 11 pages, 16 figures, accepted at IEEE eScience 2022 conference

  39. arXiv:2208.13301  [pdf, other]

    cs.DC

    ECP SOLLVE: Validation and Verification Testsuite Status Update and Compiler Insight for OpenMP

    Authors: Thomas Huber, Swaroop Pophale, Nolan Baker, Michael Carr, Nikhil Rao, Jaydon Reap, Kristina Holsapple, Joshua Hoke Davis, Tobias Burnus, Seyong Lee, David E. Bernholdt, Sunita Chandrasekaran

    Abstract: The OpenMP language continues to evolve with every new specification release, as does the need to validate and verify the new features that have been introduced. With the release of OpenMP 5.0 and OpenMP 5.1, plenty of new target offload and host-based features have been introduced to the programming model. While OpenMP continues to grow in maturity, there is an observable growth in the number of…

    Submitted 14 November, 2022; v1 submitted 28 August, 2022; originally announced August 2022.

  40. arXiv:2207.04375  [pdf, ps, other]

    cs.RO

    An Input-Output Feedback Linearization based Exponentially Stable Controller for Multi-UAV Payload Transport

    Authors: Nishanth Rao, Suresh Sundaram

    Abstract: In this paper, an exponentially stable trajectory tracking controller is proposed for multi-UAV payload transport. The multi-UAV payload system has a 2-DOF magnetic spherical joint between the UAVs and the vertical rigid links of the payload frame, so the UAVs can roll or pitch freely. These vertical links are rigidly attached to the payload and cannot move. An input-output feedback linearized mod…

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: Submitted to IEEE - Transactions on Robotics (IEEE - TRO)

  41. arXiv:2207.03593  [pdf, other]

    cs.LG

    Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

    Authors: Dimitrios C. Gklezakos, Rishi Jha, Rajesh P. N. Rao

    Abstract: Inspired by Gibson's notion of object affordances in human vision, we ask the question: how can an agent learn to predict an entire action policy for a novel object or environment given only a single glimpse? To tackle this problem, we introduce the concept of Universal Policy Functions (UPFs) which are state-to-action mappings that generalize not only to new goals but most importantly to novel, u…

    Submitted 7 July, 2022; originally announced July 2022.

  42. arXiv:2207.02368  [pdf, other]

    cs.IR cs.LG cs.SI

    Text Enriched Sparse Hyperbolic Graph Convolutional Networks

    Authors: Nurendra Choudhary, Nikhil Rao, Karthik Subbian, Chandan K. Reddy

    Abstract: Heterogeneous networks, which connect informative nodes containing text with different edge types, are routinely used to store and process information in various real-world applications. Graph Neural Networks (GNNs) and their hyperbolic variants provide a promising approach to encode such networks in a low-dimensional latent space through neighborhood aggregation and hierarchical feature extractio…

    Submitted 7 July, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Preprint under review. 13 pages, 10 figures, 6 tables

    ACM Class: I.2.4; I.2.6; G.2.2; F.2.2

  43. arXiv:2206.08462  [pdf, other]

    cs.CV cs.LG

    Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies

    Authors: Ares Fisher, Rajesh P. N. Rao

    Abstract: Human vision involves parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using capsule networks, reference frames and active predictive coding, but a generative model formulation has been lacking. We introduce Recursive Neural Programs (RNPs),…

    Submitted 25 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 9 pages, 6 figures. fixed LaTeX typo for algorithm reference

  44. arXiv:2206.06588  [pdf, other]

    cs.IR cs.LG

    Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search

    Authors: Chandan K. Reddy, Lluís Màrquez, Fran Valero, Nikhil Rao, Hugo Zaragoza, Sambaran Bandyopadhyay, Arnab Biswas, Anlu Xing, Karthik Subbian

    Abstract: Improving the quality of search results can significantly enhance users' experience and engagement with search engines. In spite of several recent advancements in the fields of machine learning and data mining, correctly classifying items for a particular user search query has been a long-standing challenge, which still has a large room for improvement. This paper introduces the "Shopping Queries D…

    Submitted 14 June, 2022; originally announced June 2022.

  45. arXiv:2206.04416  [pdf]

    cs.CY

    Analysis of Learner Independent Variables for Estimating Assessment Items Difficulty Level

    Authors: Shilpi Banerjee, N. J. Rao

    Abstract: The quality of assessment determines the quality of learning, and is characterized by validity, reliability and difficulty. Mastery of learning is generally represented by the difficulty levels of assessment items. A very large number of variables are identified in the literature to measure the difficulty level. These variables, which are not completely independent of one another, are categorized…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 16 pages

  46. arXiv:2206.03040  [pdf, other]

    stat.ML cs.IR cs.LG

    Learning Backward Compatible Embeddings

    Authors: Weihua Hu, Rajas Bansal, Kaidi Cao, Nikhil Rao, Karthik Subbian, Jure Leskovec

    Abstract: Embeddings, low-dimensional vector representation of objects, are fundamental in building modern machine learning systems. In industrial settings, there is usually an embedding team that trains an embedding model to solve intended tasks (e.g., product recommendation). The produced embeddings are then widely consumed by consumer teams to solve their unintended tasks (e.g., fraud detection). However…

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: KDD 2022, Applied Data Science Track

  47. arXiv:2205.10092  [pdf, other]

    cs.RO

    An efficient Deep Spatio-Temporal Context Aware decision Network (DST-CAN) for Predictive Manoeuvre Planning

    Authors: Jayabrata Chowdhury, Suresh Sundaram, Nishanth Rao, Narasimhan Sundararajan

    Abstract: To ensure the safety and efficiency of its maneuvers, an Autonomous Vehicle (AV) should anticipate the future intentions of surrounding vehicles using its sensor information. If an AV can predict its surrounding vehicles' future trajectories, it can make safe and efficient manoeuvre decisions. In this paper, we present such a Deep Spatio-Temporal Context-Aware decision Network (DST-CAN) model for…

    Submitted 8 July, 2024; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: 12 pages, 8 figures

  48. Comments on Comments: Where Code Review and Documentation Meet

    Authors: Nikitha Rao, Jason Tsay, Martin Hirzel, Vincent J. Hellendoorn

    Abstract: A central function of code review is to increase understanding; helping reviewers understand a code change aids in knowledge transfer and finding bugs. Comments in code largely serve a similar purpose, helping future readers understand the program. It is thus natural to study what happens when these two forms of understanding collide. We ask: what documentation-related comments do reviewers make a…

    Submitted 31 March, 2022; originally announced April 2022.

  49. arXiv:2202.08335  [pdf, other]

    cs.LG

    Task-Agnostic Graph Explanations

    Authors: Yaochen Xie, Sumeet Katariya, Xianfeng Tang, Edward Huang, Nikhil Rao, Karthik Subbian, Shuiwang Ji

    Abstract: Graph Neural Networks (GNNs) have emerged as powerful tools to encode graph-structured data. Due to their broad applications, there is an increasing need to develop tools to explain how GNNs make decisions given graph-structured data. Existing learning-based GNN explanation approaches are task-specific in training and hence suffer from crucial drawbacks. Specifically, they are incapable of produci…

    Submitted 23 September, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by NeurIPS 2022

  50. arXiv:2201.13033  [pdf, ps, other]

    cs.RO

    Integrated Decision Control Approach for Cooperative Safety-Critical Payload Transport in a Cluttered Environment

    Authors: Nishanth Rao, Suresh Sundaram

    Abstract: In this paper, the problem of coordinated transportation of heavy payload by a team of UAVs in a cluttered environment is addressed. The payload is modeled as a rigid body and is assumed to track a pre-computed global flight trajectory from a start point to a goal point. Due to the presence of local dynamic obstacles in the environment, the UAVs must ensure that there is no collision between the p…

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems (IEEE T - ITS)