FutureFill: Fast Generation from Convolutional Sequence Models

Authors: Naman Agarwal, Xinyi Chen, Evan Dogariu, Vlad Feinberg, Daniel Suo, Peter Bartlett, Elad Hazan

Abstract: We address the challenge of efficient auto-regressive generation in sequence prediction models by introducing FutureFill: a method for fast generation that applies to any sequence prediction algorithm based on convolutional operators. Our approach reduces the generation time requirement from linear to square root relative to the context length. Additionally, FutureFill requires a prefill cache sized only by the number of tokens generated, which is smaller than the cache requirements for standard convolutional and attention-based models. We validate our theoretical findings with experimental evidence demonstrating correctness and efficiency gains in a synthetic generation task.

Submitted 2 October, 2024; originally announced October 2024.

arXiv:2410.03339 [pdf, other]

Tarzan: Passively-Learned Real-Time Rate Control for Video Conferencing

Authors: Neil Agarwal, Rui Pan, Francis Y. Yan, Ravi Netravali

Abstract: Rate control algorithms are at the heart of video conferencing platforms, determining target bitrates that match dynamic network characteristics for high quality. Recent data-driven strategies have shown promise for this challenging task, but the performance degradation they introduce during training has been a nonstarter for many production services, precluding adoption. This paper aims to bolster the practicality of data-driven rate control by presenting an alternative avenue for experiential learning: leveraging purely existing telemetry logs produced by the incumbent algorithm in production. We observe that these logs contain effective decisions, although often at the wrong times or in the wrong order. To realize this approach despite the inherent uncertainty that log-based learning brings (i.e., lack of feedback for new decisions), our system, Tarzan, combines a variety of robust learning techniques (i.e., conservatively reasoning about alternate behavior to minimize risk and using a richer model formulation to account for environmental noise). Across diverse networks (emulated and real-world), Tarzan outperforms the widely deployed GCC algorithm, increasing average video bitrates by 15-39% while reducing freeze rates by 60-100%.

Submitted 4 October, 2024; originally announced October 2024.

arXiv:2409.12251 [pdf]

Empowering Abilities: Increasing Representation of Students with Disabilities in the STEM Field

Authors: Esperanza Moreno, Piyush Kumar, Richard O Adansi, Dorothy Moreno, Demy Rodriguez, Raul Baez Ramirez, Audrey R Kapsa, Arturo Rodriguez, Neelam Agarwal, Vinod Kumar, Beverley A Calvo, Vivek Tandon

Abstract: The ExploreSTEM Summer Camps 2023 were designed to deliver inclusive STEM education to students aged 14 to 22 years with disabilities. This paper presents a thorough examination of the 2023 camp program, emphasizing the pivotal role of inclusive STEM education in potentially shaping students' personal and academic trajectories. The curriculum encompassed four weeklong fundamental STEM domains: Internet of Things (IoT), Computational Engineering, Artificial Intelligence (AI), and Augmented and Virtual Reality (AR/VR). Within Camp 1, students actively engaged with Dash robots, employing dedicated programming environments to command actions and gather sensor data, fostering interactions with the IoT platform and facilitating seamless data transmission. Camp 2 was dedicated to acquainting students with foundational computational engineering principles, establishing a robust framework for comprehending intricate engineering concepts. Camp 3 commenced with insightful presentations elucidating AI applications across multifaceted industries, including engineering, healthcare, and education, illuminating AI's pervasive influence on contemporary society. The primary aim of Camp 4 was to introduce students to the immersive domains of AR and VR, showcasing their applications beyond conventional STEM disciplines into everyday life experiences. The amalgamation of informative presentations, interactive activities, and a nurturing learning environment cultivated an engaging and enriching experience for all participants.
By embracing inclusivity and harnessing innovative pedagogical approaches, the ExploreSTEM Summer Camps empowered students to explore, innovate, and excel within the dynamic realm of STEM education.

Submitted 18 September, 2024; originally announced September 2024.

arXiv:2408.06113 [pdf, other]

IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI

Authors: Yash Rampuria, Deep Boliya, Shreyash Gupta, Gopalan Iyengar, Ayush Rohilla, Mohak Vyas, Chaitanya Langde, Mehul Vijay Chanda, Ronak Gautam Matai, Kothapalli Namitha, Ajinkya Pawar, Bhaskar Biswas, Nakul Agarwal, Rajit Khandelwal, Rohan Kumar, Shubham Agarwal, Vishwam Patel, Abhimanyu Singh Rathore, Amna Rahman, Ayush Mishra, Yash Tangri

Abstract: This work presents the design and development of IIT Bombay Racing's Formula Student style autonomous racecar algorithm capable of running at the racing events of Formula Student-AI, held in the UK. The car employs a cutting-edge sensor suite of the compute unit NVIDIA Jetson Orin AGX, 2 ZED2i stereo cameras, 1 Velodyne Puck VLP16 LiDAR and SBG Systems Ellipse N GNSS/INS IMU. It features deep learning algorithms and control systems to navigate complex tracks and execute maneuvers without any human intervention. The design process involved extensive simulations and testing to optimize the vehicle's performance and ensure its safety. The algorithms have been tested on a small scale, in-house manufactured 4-wheeled robot and on simulation software. The results obtained for testing various algorithms in perception, simultaneous localization and mapping, path planning and controls have been detailed.

Submitted 12 August, 2024; originally announced August 2024.

Comments: 8 pages, 19 figures

arXiv:2408.01549 [pdf]

Reducing COVID-19 Misinformation Spread by Introducing Information Diffusion Delay Using Agent-based Modeling

Authors: Mustafa Alassad, Nitin Agarwal

Abstract: With the explosive growth of the Coronavirus Pandemic (COVID-19), misinformation on social media has developed into a global phenomenon with widespread and detrimental societal effects. Despite recent progress and efforts in detecting COVID-19 misinformation on social media networks, this task remains challenging due to the complexity, diversity, multi-modality, and high costs of fact-checking or annotation. In this research, we introduce a systematic and multidisciplinary agent-based modeling approach to limit the spread of COVID-19 misinformation and interpret the dynamic actions of users and communities in evolutionary online (or offline) social media networks. Our model was applied to a Twitter network associated with an armed protest demonstration against the COVID-19 lockdown in Michigan state in May 2020. We implemented a one-median problem to categorize the Twitter network into six key communities (nodes) and identified information exchange (links) within the network. We measured the response time to COVID-19 misinformation spread in the network and employed a cybernetic organizational method to monitor the Twitter network. The overall misinformation mitigation strategy was evaluated, and agents were allocated to interact with the network based on the measured response time and feedback. The proposed model prioritized the communities based on the agents' response times at the operational level.
It then optimized agent allocation to limit the spread of COVID-19-related misinformation from different communities, improved the information diffusion delay threshold to up to 3 minutes, and ultimately enhanced the mitigation process to reduce misinformation spread across the entire network.

Submitted 2 August, 2024; originally announced August 2024.

arXiv:2407.14502 [pdf, other]

M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models

Authors: Seunggeun Chi, Hyung-gun Chi, Hengbo Ma, Nakul Agarwal, Faizan Siddiqui, Karthik Ramani, Kwonjoon Lee

Abstract: We introduce the Multi-Motion Discrete Diffusion Models (M2D2M), a novel approach for human motion generation from textual descriptions of multiple actions, utilizing the strengths of discrete diffusion models. This approach adeptly addresses the challenge of generating multi-motion sequences, ensuring seamless transitions of motions and coherence across a series of actions. The strength of M2D2M lies in its dynamic transition probability within the discrete diffusion model, which adapts transition probabilities based on the proximity between motion tokens, encouraging mixing between different modes. Complemented by a two-phase sampling strategy that includes independent and joint denoising steps, M2D2M effectively generates long-term, smooth, and contextually coherent human motion sequences, utilizing a model trained for single-motion generation. Extensive experiments demonstrate that M2D2M surpasses current state-of-the-art benchmarks for motion generation from text descriptions, showcasing its efficacy in interpreting language semantics and generating dynamic, realistic motions.

Submitted 19 July, 2024; originally announced July 2024.

arXiv:2407.03013 [pdf]

Disentangling heterogeneity and disorder during ultrafast surface melting of orbital order

Authors: Maurizio Monti, Khalid M. Siddiqui, Daniel Perez-Salinas, Naman Agarwal, Martin Bremholm, Xiang Li, Dharmalingam Prabhakaran, Xin Liu, Danylo Babich, Mathias Sander, Yunpei Deng, Henrik T. Lemke, Roman Mankowsky, Xuerong Liu, Simon E. Wall

Abstract: Understanding how light modifies long-range order is key to improving our ability to control material functionality on an ultrafast timescale. Transient spatial heterogeneity has been proposed in many materials, but isolating the dynamics of different regions experimentally has been challenging. Here we address this issue and measure the dynamics of orbital order melting in the layered manganite, La0.5Sr1.5MnO4, and isolate the surface dynamics from the bulk for the first time. Bulk measurements show orbital order is rapidly suppressed, but the correlation length surprisingly increases. However, the surface dynamics show a stronger suppression and a significant decrease in correlation length. By isolating the surface changes, we find that light preferentially melts a less ordered surface and the loss of long-range order is likely driven by the formation of local and disordered polarons. Melting the disordered surface effectively increases the average correlation of the bulk probed volume, resolving the contradictory response. These results show that surface scattering methods are necessary to understand both surface and bulk dynamics in heterogeneous materials.

Submitted 3 July, 2024; originally announced July 2024.

Comments: 22 pages, 8 figures

arXiv:2406.12250 [pdf]

doi: 10.1038/s41467-024-49942-2

Observation of stacking engineered magnetic phase transitions within moiré supercells of twisted van der Waals magnets

Authors: Senlei Li, Zeliang Sun, Nathan J. McLaughlin, Afsana Sharmin, Nishkarsh Agarwal, Mengqi Huang, Suk Hyun Sung, Hanyi Lu, Shaohua Yan, Hechang Lei, Robert Hovden, Hailong Wang, Hua Chen, Liuyan Zhao, Chunhui Rita Du

Abstract: Twist engineering of magnetic van der Waals (vdW) moiré superlattices provides an attractive way to achieve precise nanoscale control over the spin degree of freedom on two-dimensional flatland. Despite the very recent demonstrations of moiré magnetism featuring exotic phases with noncollinear spin order in twisted vdW magnet chromium triiodide CrI3, the local magnetic interactions, spin dynamics, and magnetic phase transitions within and across individual moiré supercells remain elusive. Taking advantage of a scanning single-spin magnetometry platform, here we report observation of two distinct magnetic phase transitions with separate critical temperatures within a moiré supercell of small-angle twisted double trilayer CrI3. By measuring temperature dependent spin fluctuations at the coexisting ferromagnetic and antiferromagnetic regions in twisted CrI3, we explicitly show that the Curie temperature of the ferromagnetic state is higher than the Néel temperature of the antiferromagnetic one by ~10 K. Our mean-field calculations attribute such a spatial and thermodynamic phase separation to the stacking order modulated interlayer exchange coupling at the twisted interface of the moiré superlattices. The presented results highlight twist engineering as a promising tuning knob to realize on-demand control of not only the nanoscale spin order of moiré quantum matter but also its dynamic magnetic responses, which may find relevant applications in developing transformative vdW electronic and magnetic devices.

Submitted 17 June, 2024; originally announced June 2024.

Journal ref: Nat. Commun. 15, 5712 (2024)

arXiv:2406.11704 [pdf, other]

Nemotron-4 340B Technical Report

Authors: Nvidia, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek, et al. (58 additional authors not shown)

Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation benchmarks, and were sized to fit on a single DGX H100 with 8 GPUs when deployed in FP8 precision. We believe that the community can benefit from these models in various research studies and commercial applications, especially for generating synthetic data to train smaller language models. Notably, over 98% of data used in our model alignment process is synthetically generated, showcasing the effectiveness of these models in generating synthetic data. To further support open research and facilitate model development, we are also open-sourcing the synthetic data generation pipeline used in our model alignment process.

Submitted 6 August, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

arXiv:2405.20305 [pdf, other]

Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models

Authors: Himangi Mittal, Nakul Agarwal, Shao-Yuan Lo, Kwonjoon Lee

Abstract: We introduce PlausiVL, a large video-language model for anticipating action sequences that are plausible in the real-world. While significant efforts have been made towards anticipating future actions, prior approaches do not take into account the aspect of plausibility in an action sequence. To address this limitation, we explore the generative capability of a large video-language model in our work and further, develop the understanding of plausibility in an action sequence by introducing two objective functions, a counterfactual-based plausible action sequence learning loss and a long-horizon action repetition loss. We utilize temporal logical constraints as well as verb-noun action pair logical constraints to create implausible/counterfactual action sequences and use them to train the model with plausible action sequence learning loss. This loss helps the model to differentiate between plausible and not plausible action sequences and also helps the model to learn implicit temporal cues crucial for the task of action anticipation. The long-horizon action repetition loss puts a higher penalty on the actions that are more prone to repetition over a longer temporal window. With this penalization, the model is able to generate diverse, plausible action sequences. We evaluate our approach on two large-scale datasets, Ego4D and EPIC-Kitchens-100, and show improvements on the task of action anticipation.

Submitted 30 May, 2024; originally announced May 2024.

Comments: CVPR 2024

arXiv:2405.02141 [pdf, other]

Multi-Objective Recommendation via Multivariate Policy Learning

Authors: Olivier Jeunen, Jatin Mandav, Ivan Potapov, Nakul Agarwal, Sourabh Vaid, Wenzhe Shi, Aleksei Ustimenko

Abstract: Real-world recommender systems often need to balance multiple objectives when deciding which recommendations to present to users. These include behavioural signals (e.g. clicks, shares, dwell time), as well as broader objectives (e.g. diversity, fairness). Scalarisation methods are commonly used to handle this balancing task, where a weighted average of per-objective reward signals determines the final score used for ranking. Naturally, how these weights are computed exactly, is key to success for any online platform. We frame this as a decision-making task, where the scalarisation weights are actions taken to maximise an overall North Star reward (e.g. long-term user retention or growth). We extend existing policy learning methods to the continuous multivariate action domain, proposing to maximise a pessimistic lower bound on the North Star reward that the learnt policy will yield. Typical lower bounds based on normal approximations suffer from insufficient coverage, and we propose an efficient and effective policy-dependent correction for this. We provide guidance to design stochastic data collection policies, as well as highly sensitive reward signals. Empirical observations from simulations, offline and online experiments highlight the efficacy of our deployed approach.

Submitted 16 September, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

Comments: Accepted as a full paper in the 2024 ACM Conference on Recommender Systems (RecSys '24)

arXiv:2405.01722 [pdf, other]

Quantifying spectral signatures of non-Markovianity beyond the Born-Redfield master equation

Authors: A. Keefe, N. Agarwal, A. Kamal

Abstract: Memory or time-non-local effects in open quantum dynamics pose theoretical as well as practical challenges in the understanding and control of noisy quantum systems. While there has been a comprehensive and concerted effort towards developing diagnostics for non-Markovian dynamics, all existing measures rely on time-domain measurements which are typically slow and expensive as they require averaging several runs to resolve small transient features on a broad background, and scale unfavorably with system size and complexity. In this work, we propose a spectroscopic measure of non-Markovianity which can detect persistent non-Markovianity in the system steady state. In addition to being experimentally viable, the proposed measure has a direct information theoretic interpretation: a large value indicates the information loss per unit bandwidth of making the Markov approximation. In the same vein, we derive a frequency-domain quantum master equation (FD-QME) that goes beyond the standard Born-Redfield description and retains the full memory of the state of the reduced system. Using the FD-QME and the proposed measure, we are able to reliably diagnose and quantify non-Markovianity in several system-environment settings including those with environmental correlations and retardation effects.

Submitted 11 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

Comments: 15+ pages, 8 figures including 3 appendices

arXiv:2403.18907 [pdf, other]

Open system dynamics in interacting quantum field theories

Authors: Brenden Bowen, Nishant Agarwal, Archana Kamal

Abstract: A quantum system that interacts with an environment generally undergoes non-unitary evolution described by a non-Markovian or Markovian master equation. In this paper, we construct the non-Markovian Redfield master equation for a quantum scalar field that interacts with a second field through a bilinear or nonlinear interaction on a Minkowski background. We use the resulting master equation to set up coupled differential equations that can be solved to obtain the equal-time two-point function of the system field. We show how the equations simplify under various approximations including the Markovian limit, and argue that the Redfield equation-based solution provides a perturbative resummation to the standard second order Dyson series result. For the bilinear interaction, we explicitly show that the Redfield solution is closer to the exact solution compared to the perturbation theory-based one. Further, the environment correlation function is oscillatory and non-decaying in this case, making the Markovian master equation a poor approximation. For the nonlinear interaction, on the other hand, the environment correlation function is sharply peaked and the Redfield solution matches that obtained using a Markovian master equation in the late-time limit.

Submitted 27 March, 2024; originally announced March 2024.

Comments: 19 pages, 6 figures

arXiv:2403.04978 [pdf, other]

Stacking as Accelerated Gradient Descent

Authors: Naman Agarwal, Pranjal Awasthi, Satyen Kale, Eric Zhao

Abstract: Stacking, a heuristic technique for training deep residual networks by progressively increasing the number of layers and initializing new layers by copying parameters from older layers, has proven quite successful in improving the efficiency of training deep neural networks. In this paper, we propose a theoretical explanation for the efficacy of stacking: viz., stacking implements a form of Nesterov's accelerated gradient descent. The theory also covers simpler models such as the additive ensembles constructed in boosting methods, and provides an explanation for a similar widely-used practical heuristic for initializing the new classifier in each round of boosting. We also prove that for certain deep linear residual networks, stacking does provide accelerated training, via a new potential function analysis of the Nesterov's accelerated gradient method which allows errors in updates. We conduct proof-of-concept experiments to validate our theory as well.

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2402.18702 [pdf]

Characterizing Multimedia Information Environment through Multi-modal Clustering of YouTube Videos

Authors: Niloofar Yousefi, Mainuddin Shaik, Nitin Agarwal

Abstract: This study aims to investigate the comprehensive characterization of information content in multimedia (videos), particularly on YouTube. The research presents a multi-method framework for characterizing multimedia content by clustering signals from various modalities, such as audio, video, and text. With a focus on South China Sea videos as a case study, this approach aims to enhance our understanding of online content, especially on YouTube. The dataset includes 160 videos, and our findings offer insights into content themes and patterns within different modalities of a video based on clusters. Text modality analysis revealed topical themes related to geopolitical countries, strategies, and global security, while video and audio modality analysis identified distinct patterns of signals related to diverse sets of videos, including news analysis/reporting, educational content, and interviews. Furthermore, our findings uncover instances of content repurposing within video clusters, which were identified using the barcode technique and audio similarity assessments. These findings indicate potential content amplification techniques. In conclusion, this study uniquely enhances our current understanding of multimedia content information based on modality clustering techniques.

Submitted 28 February, 2024; originally announced February 2024.

Comments: 14 pages, In the 4th International Conference on SMART MULTIMEDIA, 2024

arXiv:2402.07114 [pdf, other]

Towards Quantifying the Preconditioning Effect of Adam

Authors: Rudrajit Das, Naman Agarwal, Sujay Sanghavi, Inderjit S. Dhillon

Abstract: There is a notable dearth of results characterizing the preconditioning effect of Adam and showing how it may alleviate the curse of ill-conditioning -- an issue plaguing gradient descent (GD). In this work, we perform a detailed analysis of Adam's preconditioning effect for quadratic functions and quantify to what extent Adam can mitigate the dependence on the condition number of the Hessian. Our key finding is that Adam can suffer less from the condition number but at the expense of suffering a dimension-dependent quantity. Specifically, for a $d$-dimensional quadratic with a diagonal Hessian having condition number $κ$, we show that the effective condition number-like quantity controlling the iteration complexity of Adam without momentum is $\mathcal{O}(\min(d, κ))$. For a diagonally dominant Hessian, we obtain a bound of $\mathcal{O}(\min(d \sqrt{d κ}, κ))$ for the corresponding quantity. Thus, when $d < \mathcal{O}(κ^p)$ where $p = 1$ for a diagonal Hessian and $p = 1/3$ for a diagonally dominant Hessian, Adam can outperform GD (which has an $\mathcal{O}(κ)$ dependence). On the negative side, our results suggest that Adam can be worse than GD for a sufficiently non-diagonal Hessian even if $d \ll \mathcal{O}(κ^{1/3})$; we corroborate this with empirical evidence. Finally, we extend our analysis to functions satisfying per-coordinate Lipschitz smoothness and a modified version of the Polyak-Łojasiewicz condition.

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2402.00636 [pdf, other]

Ultra-Cold Cryogenic TEM with Liquid Helium and High Stability

Authors: Emily Rennich, Suk Hyun Sung, Nishkarsh Agarwal, Maya Gates, Robert Kerns, Robert Hovden, Ismail El Baggari

Abstract: Cryogenic transmission electron microscopy has revolutionized structural biology and materials science, but achieving temperatures below the boiling point of liquid nitrogen remains a long-standing aspiration. We introduce an ultra-cold liquid helium transmission electron microscope specimen holder, featuring continuous cryogen flow and vibration decoupling. This instrument is compatible with modern aberration-corrected microscopes and achieves sub-25 K base temperature, ${\pm}$2 mK thermal stability over many hours, and atomic resolution--setting the stage for a new era of cryogenic electron microscopy.

Submitted 1 February, 2024; originally announced February 2024.

Comments: 5 pages, 2 figures

arXiv:2401.05118 [pdf, ps, other]

On Escape rate for subshift with Markov measure

Authors: Nikita Agarwal, Haritha Cheriyath, Sharvari Neetin Tikekar

Abstract: In this paper, we consider a subshift of finite type with Markov measure. By considering a union of cylinders as holes, we investigate the exponential growth rate of measure of points whose orbits do not escape into the hole over a fixed number of iterations. We present two formulations for this escape rate: one based on the spectral radius of the Hadamard product of a related adjacency matrix and the stochastic matrix with respect to which the Markov measure is defined, and the other utilizing a recurrence relation. These formulations enable a comparative analysis of escape rates into distinct holes.

Submitted 6 June, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

arXiv:2401.03599 [pdf, other]

doi 10.1109/LRA.2023.3342554

Disentangled Neural Relational Inference for Interpretable Motion Prediction

Authors: Victoria M. Dax, Jiachen Li, Enna Sachdeva, Nakul Agarwal, Mykel J. Kochenderfer

Abstract: Effective interaction modeling and behavior prediction of dynamic agents play a significant role in interactive motion planning for autonomous robots. Although existing methods have improved prediction accuracy, few research efforts have been devoted to enhancing prediction model interpretability and out-of-distribution (OOD) generalizability. This work addresses these two challenging aspects by designing a variational auto-encoder framework that integrates graph-based representations and time-sequence models to efficiently capture spatio-temporal relations between interactive agents and predict their dynamics. Our model infers dynamic interaction graphs in a latent space augmented with interpretable edge features that characterize the interactions. Moreover, we aim to enhance model interpretability and performance in OOD scenarios by disentangling the latent space of edge features, thereby strengthening model versatility and robustness. We validate our approach through extensive experiments on both simulated and real-world datasets. The results show superior performance compared to existing methods in modeling spatio-temporal relations, motion prediction, and identifying time-invariant latent features.

Submitted 7 January, 2024; originally announced January 2024.

Journal ref: IEEE Robotics and Automation Letters, Date: FEBRUARY 2024, Volume: 9, Issue: 2, ISSN: 2377-3766, pp1452-1459

arXiv:2312.11534 [pdf, ps, other]

Improved Differentially Private and Lazy Online Convex Optimization

Authors: Naman Agarwal, Satyen Kale, Karan Singh, Abhradeep Guha Thakurta

Abstract: We study the task of $(ε, δ)$-differentially private online convex optimization (OCO). In the online setting, the release of each distinct decision or iterate carries with it the potential for privacy loss. This problem has a long history of research starting with Jain et al. [2012] and the best known results for the regime of ε not being very small are presented in Agarwal et al. [2023]. In this paper we improve upon the results of Agarwal et al. [2023] in terms of the dimension factors as well as removing the requirement of smoothness. Our results are now the best known rates for DP-OCO in this regime. Our algorithm builds upon the work of [Asi et al., 2023] which introduced the idea of explicitly limiting the number of switches via rejection sampling. The main innovation in our algorithm is the use of sampling from a strongly log-concave density which allows us to trade off the dimension factors better, leading to improved results.

Submitted 20 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

arXiv:2312.08021 [pdf, other]

Improving search relevance of Azure Cognitive Search by Bayesian optimization

Authors: Nitin Agarwal, Ashish Kumar, Kiran R, Manish Gupta, Laurent Boué

Abstract: Azure Cognitive Search (ACS) has emerged as a major contender in "Search as a Service" cloud products in recent years. However, one of the major challenges for ACS users is to improve the relevance of the search results for their specific use cases. In this paper, we propose a novel method to find the optimal ACS configuration that maximizes search relevance for a specific use case (product search, document search, ...). The proposed solution improves key online marketplace metrics such as click through rates (CTR) by formulating the search relevance problem as hyperparameter tuning. We have observed significant improvements in real-world search call to action (CTA) rate in multiple marketplaces by introducing optimized weights generated from the proposed approach.

Submitted 13 December, 2023; originally announced December 2023.

Journal ref: Microsoft Journal of Applied Research, Volume 20, 2024

arXiv:2312.06837 [pdf, other]

Spectral State Space Models

Authors: Naman Agarwal, Daniel Suo, Xinyi Chen, Elad Hazan

Abstract: This paper studies sequence modeling for prediction tasks with long range dependencies. We propose a new formulation for state space models (SSMs) based on learning linear dynamical systems with the spectral filtering algorithm (Hazan et al. (2017)). This gives rise to a novel sequence prediction architecture we call a spectral state space model. Spectral state space models have two primary advantages. First, they have provable robustness properties as their performance depends on neither the spectrum of the underlying dynamics nor the dimensionality of the problem. Second, these models are constructed with fixed convolutional filters that do not require learning while still outperforming SSMs in both theory and practice. The resulting models are evaluated on synthetic dynamical systems and long-range prediction tasks of various modalities. These evaluations support the theoretical benefits of spectral filtering for tasks requiring very long range memory.

Submitted 11 July, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

arXiv:2311.13627 [pdf, other]

Vamos: Versatile Action Models for Video Understanding

Authors: Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun

Abstract: What makes good representations for video understanding, such as anticipating future activities, or answering video-conditioned questions? While earlier approaches focus on end-to-end learning directly from video pixels, we propose to revisit text-based representations, such as general-purpose video captions, which are interpretable and can be directly consumed by large language models (LLMs). Intuitively, different video understanding tasks may require representations that are complementary and at different granularity. To this end, we propose versatile action models (Vamos), a learning framework powered by a large language model as the ``reasoner'', and can flexibly leverage visual embedding and free-form text descriptions as its input. To interpret the important text evidence for question answering, we generalize the concept bottleneck model to work with tokens and nonlinear models, which uses hard attention to select a small subset of tokens from the free-form text as inputs to the LLM reasoner. We evaluate Vamos on five complementary benchmarks, Ego4D, NeXT-QA, IntentQA, Spacewalk-18, and EgoSchema, on its capability to model temporal dynamics, encode visual history, and perform reasoning. Surprisingly, we observe that text-based representations consistently achieve competitive performance on all benchmarks, and that visual embeddings provide marginal or no performance improvement, demonstrating the effectiveness of text-based video representation in the LLM era. We also demonstrate that our token bottleneck model is able to select relevant evidence from free-form text, support test-time intervention, and achieves nearly 5 times inference speedup while keeping a competitive question answering performance. Code and models are publicly released at https://brown-palm.github.io/Vamos/

Submitted 13 July, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

Comments: Accepted to ECCV 2024 (European Conference on Computer Vision). Code and models are released at https://brown-palm.github.io/Vamos/

arXiv:2311.11892 [pdf]

Multimodal Characterization of Emotion within Multimedia Space

Authors: Dayo Samuel Banjo, Connice Trimmingham, Niloofar Yousefi, Nitin Agarwal

Abstract: Technological advancement and its omnipresent connection have pushed humans past the boundaries and limitations of a computer screen, physical state, or geographical location. It has provided a depth of avenues that facilitate human-computer interaction that was once inconceivable such as audio and body language detection. Given the complex modularities of emotions, it becomes vital to study human-computer interaction, as it is the commencement of a thorough understanding of the emotional state of users and, in the context of social networks, the producers of multimodal information. This study first acknowledges the accuracy of classification found within multimodal emotion detection systems compared to unimodal solutions. Second, it explores the characterization of multimedia content produced based on their emotions and the coherence of emotion in different modalities by utilizing deep learning models to classify emotion across different modalities.

Submitted 20 November, 2023; originally announced November 2023.

Comments: 8 pages, Published in International Conference on Computers and Computation (COMPUTE 2022), November 03-04, 2022, San Francisco, United States

arXiv:2311.05791 [pdf]

Detecting Suspicious Commenter Mob Behaviors on YouTube Using Graph2Vec

Authors: Shadi Shajari, Mustafa Alassad, Nitin Agarwal

Abstract: YouTube, a widely popular online platform, has transformed the dynamics of content consumption and interaction for users worldwide. With its extensive range of content creators and viewers, YouTube serves as a hub for video sharing, entertainment, and information dissemination. However, the exponential growth of users and their active engagement on the platform has raised concerns regarding suspicious commenter behaviors, particularly in the comment section. This paper presents a social network analysis-based methodology for detecting suspicious commenter mob-like behaviors among YouTube channels and the similarities therein. The method aims to characterize channels based on the level of such behavior and identify common patterns across them. To evaluate the effectiveness of the proposed model, we conducted an analysis of 20 YouTube channels, consisting of 7,782 videos, 294,199 commenters, and 596,982 comments. These channels were specifically selected for propagating false views about the U.S. Military. The analysis revealed significant similarities among the channels, shedding light on the prevalence of suspicious commenter behavior. By understanding these similarities, we contribute to a better understanding of the dynamics of suspicious behavior on YouTube channels, which can inform strategies for addressing and mitigating such behavior.

Submitted 9 November, 2023; originally announced November 2023.

arXiv:2311.00180 [pdf, other]

Object-centric Video Representation for Long-term Action Anticipation

Authors: Ce Zhang, Changcheng Fu, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, Chen Sun

Abstract: This paper focuses on building object-centric representations for long-term action anticipation in videos. Our key motivation is that objects provide important cues to recognize and predict human-object interactions, especially when the predictions are longer term, as an observed "background" object could be used by the human actor in the future. We observe that existing object-based video recognition frameworks either assume the existence of in-domain supervised object detectors or follow a fully weakly-supervised pipeline to infer object locations from action labels. We propose to build object-centric video representations by leveraging visual-language pretrained models. This is achieved by "object prompts", an approach to extract task-specific object-centric representations from general-purpose pretrained models without finetuning. To recognize and predict human-object interactions, we use a Transformer-based neural architecture which allows the "retrieval" of relevant objects for action anticipation at various time scales. We conduct extensive evaluations on the Ego4D, 50Salads, and EGTEA Gaze+ benchmarks. Both quantitative and qualitative results confirm the effectiveness of our proposed method.

Submitted 31 October, 2023; originally announced November 2023.

Comments: This is an accepted WACV 2024 paper. Our code is available at https://github.com/brown-palm/ObjectPrompt

arXiv:2310.14079 [pdf, other]

To Copy, or not to Copy; That is a Critical Issue of the Output Softmax Layer in Neural Sequential Recommenders

Authors: Haw-Shiuan Chang, Nikhil Agarwal, Andrew McCallum

Abstract: Recent studies suggest that the existing neural models have difficulty handling repeated items in sequential recommendation tasks. However, our understanding of this difficulty is still limited. In this study, we substantially advance this field by identifying a major source of the problem: the single hidden state embedding and static item embeddings in the output softmax layer. Specifically, the similarity structure of the global item embeddings in the softmax layer sometimes forces the single hidden state embedding to be close to new items when copying is a better choice, while sometimes forcing the hidden state to be close to the items from the input inappropriately. To alleviate the problem, we adapt the recently-proposed softmax alternatives such as softmax-CPR to sequential recommendation tasks and demonstrate that the new softmax architectures unleash the capability of the neural encoder on learning when to copy and when to exclude the items from the input sequence. By only making some simple modifications on the output softmax layer for SASRec and GRU4Rec, softmax-CPR achieves consistent improvement in 12 datasets. With almost the same model size, our best method not only improves the average NDCG@10 of GRU4Rec in 5 datasets with duplicated items by 10% (4%-17% individually) but also improves 7 datasets without duplicated items by 24% (8%-39%)!

Submitted 21 October, 2023; originally announced October 2023.

Comments: WSDM 2024

arXiv:2309.13470 [pdf, other]

HAVE-Net: Hallucinated Audio-Visual Embeddings for Few-Shot Classification with Unimodal Cues

Authors: Ankit Jha, Debabrata Pal, Mainak Singha, Naman Agarwal, Biplab Banerjee

Abstract: Recognition of remote sensing (RS) or aerial images is currently of great interest, and advancements in deep learning algorithms added flavor to it in recent years. Occlusion, intra-class variance, lighting, etc., might arise while training neural networks using unimodal RS visual input. Even though joint training of audio-visual modalities improves classification performance in a low-data regime, it has yet to be thoroughly investigated in the RS domain. Here, we aim to solve a novel problem where both the audio and visual modalities are present during the meta-training of a few-shot learning (FSL) classifier; however, one of the modalities might be missing during the meta-testing stage. This problem formulation is pertinent in the RS domain, given the difficulties in data acquisition or sensor malfunctioning. To mitigate, we propose a novel few-shot generative framework, Hallucinated Audio-Visual Embeddings-Network (HAVE-Net), to meta-train cross-modal features from limited unimodal data. Precisely, these hallucinated features are meta-learned from base classes and used for few-shot classification on novel classes during the inference phase. The experimental results on the benchmark ADVANCE and AudioSetZSL datasets show that our hallucinated modality augmentation strategy for few-shot classification outperforms the classifier performance trained with the real multimodal information at least by 0.8-2%.

Submitted 23 September, 2023; originally announced September 2023.

Comments: 8 Page, 2 Figures, 2 Tables, Accepted in Adapting to Change: Reliable Multimodal Learning Across Domains Workshop, ECML PKDD 2023

arXiv:2309.06597 [pdf, other]

Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning

Authors: Enna Sachdeva, Nakul Agarwal, Suhas Chundi, Sean Roelofs, Jiachen Li, Mykel Kochenderfer, Chiho Choi, Behzad Dariush

Abstract: The widespread adoption of commercial autonomous vehicles (AVs) and advanced driver assistance systems (ADAS) may largely depend on their acceptance by society, for which their perceived trustworthiness and interpretability to riders are crucial. In general, this task is challenging because modern autonomous systems software relies heavily on black-box artificial intelligence models. Towards this goal, this paper introduces a novel dataset, Rank2Tell, a multi-modal ego-centric dataset for Ranking the importance level and Telling the reason for the importance. Using various close and open-ended visual question answering, the dataset provides dense annotations of various semantic, spatial, temporal, and relational attributes of various important objects in complex traffic scenarios. The dense annotations and unique attributes of the dataset make it a valuable resource for researchers working on visual scene understanding and related fields. Furthermore, we introduce a joint model for joint importance level ranking and natural language captions generation to benchmark our dataset and demonstrate performance with quantitative evaluations.

Submitted 8 November, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

arXiv:2307.16368 [pdf, other]

AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?

Authors: Qi Zhao, Shijie Wang, Ce Zhang, Changcheng Fu, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun

Abstract: Can we better anticipate an actor's future actions (e.g. mix eggs) by knowing what commonly happens after his/her current action (e.g. crack eggs)? What if we also know the longer-term goal of the actor (e.g. making egg fried rice)? The long-term action anticipation (LTA) task aims to predict an actor's future behavior from video observations in the form of verb and noun sequences, and it is crucial for human-machine interaction. We propose to formulate the LTA task from two perspectives: a bottom-up approach that predicts the next actions autoregressively by modeling temporal dynamics; and a top-down approach that infers the goal of the actor and plans the needed procedure to accomplish the goal. We hypothesize that large language models (LLMs), which have been pretrained on procedure text data (e.g. recipes, how-tos), have the potential to help LTA from both perspectives. It can help provide the prior knowledge on the possible next actions, and infer the goal given the observed part of a procedure, respectively. To leverage the LLMs, we propose a two-stage framework, AntGPT. It first recognizes the actions already performed in the observed videos and then asks an LLM to predict the future actions via conditioned generation, or to infer the goal and plan the whole procedure by chain-of-thought prompting. Empirical results on the Ego4D LTA v1 and v2 benchmarks, EPIC-Kitchens-55, as well as EGTEA GAZE+ demonstrate the effectiveness of our proposed approach. AntGPT achieves state-of-the-art performance on all above benchmarks, and can successfully infer the goal and thus perform goal-conditioned "counterfactual" prediction via qualitative analysis. Code and model will be released at https://brown-palm.github.io/AntGPT

Submitted 31 March, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

Comments: ICLR 2024 Camera Ready

arXiv:2307.15924 [pdf, other]

Correlator webs of massive multiparton amplitudes at four loops: A study of boomerang webs

Authors: Neelima Agarwal, Sourav Pal, Aditya Srivastav, Anurag Tripathi

Abstract: Logarithm of the soft function can be organized into sets of Feynman diagrams known as Cwebs. We introduced a new formalism in~\cite{Agarwal:2022wyk}, that allows to determine several of the building blocks of Cweb mixing matrices without explicit computations. In~\cite{Agarwal:2022xec} we used this formalism to obtain the diagonal blocks of four general classes of Cwebs to all orders in perturbation theory which also covered all the four loop Boomerang Cwebs connecting four Wilson lines. In this work we present complete mixing matrices and exponentiated colour factors for Boomerang Cwebs at four loops that connect three and four Wilson lines. Also, we present a more efficient version of the algorithm of generating Cwebs that was presented in~\cite{Agarwal:2020nyc}. This new algorithm has been used to generate the Cwebs in the present work.

Submitted 29 May, 2024; v1 submitted 29 July, 2023; originally announced July 2023.

Comments: 59 pages, 5 figures, 1 table, published version, minor changes, conclusion remain unchanged

arXiv:2307.04587 [pdf, other]

doi 10.1038/s41467-024-45711-3

Endotaxial Stabilization of 2D Charge Density Waves with Long-range Order

Authors: Suk Hyun Sung, Nishkarsh Agarwal, Ismail El Baggari, Yin Min Goh, Patrick Kezer, Noah Schnitzer, Yu Liu, Wenjian Lu, Yuping Sun, Lena F. Kourkoutis, John T. Heron, Kai Sun, Robert Hovden

Abstract: Charge density waves are emergent quantum states that spontaneously reduce crystal symmetry, drive metal-insulator transitions, and precede superconductivity. In low-dimensions, distinct quantum states arise, however, thermal fluctuations and external disorder destroy long-range order. Here we stabilize ordered two-dimensional (2D) charge density waves through endotaxial synthesis of confined monolayers of 1T-TaS$_2$. Specifically, an ordered incommensurate charge density wave (oIC-CDW) is realized in 2D with dramatically enhanced amplitude and resistivity. By enhancing CDW order, the hexatic nature of charge density waves becomes observable. Upon heating via in-situ TEM, the CDW continuously melts in a reversible hexatic process wherein topological defects form in the charge density wave. From these results, new regimes of the CDW phase diagram for 1T-TaS$_2$ are derived and consistent with the predicted emergence of vestigial quantum order.

Submitted 10 July, 2023; originally announced July 2023.

arXiv:2307.03876 [pdf]

Revealing intrinsic domains and fluctuations of moiré magnetism by a wide-field quantum microscope

Authors: Mengqi Huang, Zeliang Sun, Gerald Yan, Hongchao Xie, Nishkarsh Agarwal, Gaihua Ye, Suk Hyun Sung, Hanyi Lu, Jingcheng Zhou, Shaohua Yan, Shangjie Tian, Hechang Lei, Robert Hovden, Rui He, Hailong Wang, Liuyan Zhao, Chunhui Rita Du

Abstract: Moiré magnetism featured by stacking engineered atomic registry and lattice interactions has recently emerged as an appealing quantum state of matter at the forefront of condensed matter physics research. Nanoscale imaging of moiré magnets is highly desirable and serves as a prerequisite to investigate a broad range of intriguing physics underlying the interplay between topology, electronic correlations, and unconventional nanomagnetism. Here we report spin defect-based wide-field imaging of magnetic domains and spin fluctuations in twisted double trilayer (tDT) chromium triiodide CrI3. We explicitly show that intrinsic moiré domains of opposite magnetizations appear over arrays of moiré supercells in low-twist-angle tDT CrI3. In contrast, spin fluctuations measured in tDT CrI3 manifest little spatial variations on the same mesoscopic length scale due to the dominant driving force of intralayer exchange interaction. Our results enrich the current understanding of exotic magnetic phases sustained by moiré magnetism and highlight the opportunities provided by quantum spin sensors in probing microscopic spin related phenomena on two-dimensional flatland.

Submitted 7 July, 2023; originally announced July 2023.

arXiv:2306.17601 [pdf, other]

Next-to-leading power corrections to the event shape variables

Authors: Neelima Agarwal, Melissa van Beekveld, Eric Laenen, Shubham Mishra, Ayan Mukhopadhyay, Anurag Tripathi

Abstract: We investigate the origin of next-to-leading power corrections to the event shapes thrust and $c$-parameter, at next-to-leading order. For both event shapes we trace the origin of such terms in the exact calculation, and compare with a recent approach involving the eikonal approximation and momentum shifts that follow from the Low-Burnett-Kroll-Del Duca theorem. We assess the differences both analytically and numerically. For the $c$-parameter both exact and approximate results are expressed in terms of elliptic integrals, but near the elastic limit it exhibits patterns similar to the thrust results.

Submitted 30 June, 2023; originally announced June 2023.

Comments: 23 pages, 8 figures and 1 table

arXiv:2306.10871 [pdf, other]

Stability of Reset and Impulsive Continuous-time Linear Switched Systems

Authors: Swapnil Tripathi, Nikita Agarwal

Abstract: We study the stability of reset and impulsive switched systems. We find time constraints (dwell time and flee time) on switching signals which stabilize a given reset switched system. For a given collection of matrices, we find an assignment of resets and time constraints on switching signals which guarantee stability of the reset switched system. Similar results are obtained for impulsive switched systems as well. Two techniques, namely, analysis of the flow of the system and the multiple Lyapunov function approach, are used to obtain the results. The results are later generalized to obtain mode-dependent time constraints for stability of these systems.

Submitted 19 June, 2023; originally announced June 2023.

MSC Class: 37N35 (Primary); 93C05; 93D05 (Secondary)

arXiv:2306.07179 [pdf, other]

Benchmarking Neural Network Training Algorithms

Authors: George E. Dahl, Frank Schneider, Zachary Nado, Naman Agarwal, Chandramouli Shama Sastry, Philipp Hennig, Sourabh Medapati, Runa Eschenhagen, Priya Kasimbeg, Daniel Suo, Juhan Bae, Justin Gilmer, Abel L. Peirson, Bilal Khan, Rohan Anil, Mike Rabbat, Shankar Krishnan, Daniel Snider, Ehsan Amid, Kongtao Chen, Chris J. Maddison, Rakshith Vasudev, Michal Badura, Ankush Garg, Peter Mattson

Abstract: Training algorithms, broadly construed, are an essential part of every deep learning pipeline. Training algorithm improvements that speed up training across a wide variety of workloads (e.g., better update rules, tuning protocols, learning rate schedules, or data selection schemes) could save time, save computational resources, and lead to better, more accurate, models. Unfortunately, as a community, we are currently unable to reliably identify training algorithm improvements, or even determine the state-of-the-art training algorithm. In this work, using concrete experiments, we argue that real progress in speeding up training requires new benchmarks that resolve three basic challenges faced by empirical comparisons of training algorithms: (1) how to decide when training is complete and precisely measure training time, (2) how to handle the sensitivity of measurements to exact workload details, and (3) how to fairly compare algorithms that require hyperparameter tuning. In order to address these challenges, we introduce a new, competitive, time-to-result benchmark using multiple workloads running on fixed hardware, the AlgoPerf: Training Algorithms benchmark. Our benchmark includes a set of workload variants that make it possible to detect benchmark submissions that are more robust to workload changes than current widely-used methods. Finally, we evaluate baseline submissions constructed using various optimizers that represent current practice, as well as other optimizers that have recently received attention in the literature. These baseline results collectively demonstrate the feasibility of our benchmark, show that non-trivial gaps between methods exist, and set a provisional state-of-the-art for future benchmark submissions to try and surpass.

Submitted 12 June, 2023; originally announced June 2023.

Comments: 102 pages, 8 figures, 41 tables

arXiv:2304.08890 [pdf]

doi 10.1002/advs.202302550

Transient non-collinear magnetic state for all-optical magnetization switching

Authors: Sergii Parchenko, Antoni Frej, Hiroki Ueda, Robert Carley, Laurent Mercadier, Natalia Gerasimova, Giuseppe Mercurio, Justine Schlappa, Alexander Yaroslavtsev, Naman Agarwal, Rafael Gort, Andreas Scherz, Anatoly Zvezdin, Andrzej Stupakiewicz, Urs Staub

Abstract: Resonant absorption of a photon by bound electrons in a solid can promote an electron to another orbital state or transfer it to a neighboring atomic site. Such a transition in a magnetically ordered material could affect the magnetic order. While this process is an obvious road map for optical control of magnetization, experimental demonstration of such a process remains challenging. Exciting a significant fraction of magnetic ions requires a very intense incoming light beam, as orbital resonances are often weak compared to above-band-gap excitations. In the latter case, a sizeable reduction of the magnetization occurs as the absorbed energy increases the spin temperature, masking the non-thermal optical effects. Here, using ultrafast x-ray spectroscopy, we were able to resolve changes in the magnetization state induced by resonant absorption of infrared photons in Co-doped yttrium iron garnet, with negligible thermal effects. We found that the optical excitation of the Co ions affects the two distinct magnetic Fe sublattices differently, resulting in a transient non-collinear magnetic state. The present results indicate that the all-optical magnetization switching most likely occurs due to the creation of a transient, non-collinear magnetic state followed by coherent spin rotations of the Fe moments.

Submitted 18 April, 2023; originally announced April 2023.

Journal ref: Adv. Sci. 10, 2302550 (2023)

arXiv:2304.07681 [pdf]

Commenter Behavior Characterization on YouTube Channels

Authors: Shadi Shajari, Nitin Agarwal, Mustafa Alassad

Abstract: YouTube is the second most visited website in the world and receives comments from millions of commenters daily. The comments section acts as a space for discussions among commenters, but it could also be a breeding ground for problematic behavior. In particular, the presence of suspicious commenters who engage in activities that deviate from the norms of constructive and respectful discourse can negatively impact the community and the quality of the online experience. This paper presents a social network analysis-based methodology for detecting commenter mobs on YouTube. These mobs of commenters collaborate to boost engagement on certain videos. The method provides a way to characterize channels based on the level of suspicious commenter behavior and detect coordination among channels. To evaluate our model, we analyzed 20 YouTube channels, 7,782 videos, 294,199 commenters, and 596,982 comments that propagated false views about the U.S. Military. The analysis concluded with evidence of commenter mob activities, possible coordinated suspicious behavior on the channels, and an explanation of the behavior of co-commenter communities.

Submitted 15 April, 2023; originally announced April 2023.

arXiv:2304.04126 [pdf]

doi 10.31219/osf.io/8nj7y

Deliberative Democracy, Perspective from Indo-Pacific Blogosphere: A Survey

Authors: Abiola Akinnubi, Nitin Agarwal

Abstract: Deliberation and communication within the national space have had numerous implications on how citizens online and offline perceive government. It has also impacted the relationship between opposition and incumbent governments in the Indo-Pacific region. Authoritarian regimes have historically had control over the dissemination of information, thereby controlling power and limiting challenges from citizens who are not comfortable with the status quo. Social media and blogs have allowed citizens of these countries to find a way to communicate, and the exchange of information continues to rise. The quest by both authoritarian and democratic regimes to control or influence the discussion in the public sphere has given rise to concepts like cybertroopers, congressional bloggers, and commentator bloggers, among others. Cybertroopers have become the de facto online soldiers of authoritarian regimes who must embrace democracy. While commentator and congressional bloggers have acted with different strategies, commentator bloggers educate online citizens with knowledgeable information to influence the citizens. Congressional bloggers are political officeholders who use blogging to communicate their positions on ongoing national issues. Therefore, this work has explored various concepts synonymous with the Indo-Pacific public sphere and how it shapes elections and democracy.

Submitted 10 April, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

arXiv:2303.04829 [pdf, other]

doi 10.1103/PhysRevResearch.6.023113

Initial value formulation of a quantum damped harmonic oscillator

Authors: Nishant Agarwal, Yi-Zen Chu

Abstract: The in-in formalism and its influence functional generalization are widely used to describe the out-of-equilibrium dynamics of unitary and open quantum systems, respectively. In this paper, we build on these techniques to develop an effective theory of a quantum damped harmonic oscillator and use it to study initial state-dependence, decoherence, and thermalization. We first consider a Gaussian initial state and quadratic influence functional and obtain general equations for the Green's functions of the oscillator. We solve the equations in the specific case of time-local dissipation and use the resulting Green's functions to obtain the purity and unequal-time two-point correlations of the oscillator. We find that the dynamics must include a non-vanishing noise term to yield physical results for the purity and that the oscillator decoheres in time such that the late-time density operator is thermal. We show that the frequency spectrum or unequal-time correlations can, however, distinguish between the damped oscillator and an isolated oscillator in thermal equilibrium, and obtain a generalized fluctuation-dissipation relation for the damped oscillator. We briefly consider time-nonlocal dissipation as well, to show that the fluctuation-dissipation relation is satisfied for a specific choice of dissipation kernels. Lastly, we develop a double in-out path integral approach to go beyond Gaussian initial states and show that our equal-time results for time-local dissipation are in fact non-perturbative in the initial state.

Submitted 6 May, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

Comments: 24 pages, including 2 appendices, 2 figures. Expanded discussion of Wick's theorem in section IIIA, updated discussion of fluctuation-dissipation relation in section IVC, added discussion of time-nonlocal dissipation in section V. Matches published version

Journal ref: Phys. Rev. Res. 6, 023113 (2024)

arXiv:2303.02181 [pdf, other]

Thermalization and localization in a discretized quantum field theory

Authors: Spasen Chaykov, Brenden Bowen, Nishant Agarwal

Abstract: Localization marks the breakdown of thermalization in subregions of quantum many-body systems in the presence of sufficiently large disorder. In this paper, we use numerical techniques to study thermalization and localization in a many-body system of coupled quantum harmonic oscillators obtained by discretizing a scalar quantum field theory in Minkowski spacetime. We consider a Gaussian initial state, constructed through a global mass quench, with a quadratic Hamiltonian, and solve for the system's exact dynamics without and with disorder in one and two spatial dimensions. We find that finite-size systems localize for sufficiently large disorder in both cases, such that the entanglement entropy of subregions retains its initial area-law behavior, and the system no longer develops long-range correlations. To probe the thermalization-to-localization transition further, we define a frequency gap ratio that measures adjacent gaps in the phase space eigenvalues of the Hamiltonian and study how it varies with disorder strength and system size. We find signatures of a chaotic regime at intermediate disorder in two spatial dimensions and argue that it is a finite-size effect, such that the system would localize for arbitrarily small disorder in the continuum in both one and two spatial dimensions, consistent with Anderson localization. Lastly, we use the frequency gap ratio to argue that in three spatial dimensions, on the other hand, the system would only localize for disorder strengths above a critical value in the continuum, again consistent with Anderson localization.

Submitted 3 March, 2023; originally announced March 2023.

Comments: 10 pages, 5 figures

arXiv:2302.14270 [pdf]

Comparing Toxicity Across Social Media Platforms for COVID-19 Discourse

Authors: Nahiyan Bin Noor, Niloofar Yousefi, Billy Spann, Nitin Agarwal

Abstract: The emergence of toxic information on social networking sites, such as Twitter, Parler, and Reddit, has become a growing concern. Consequently, this study aims to assess the level of toxicity in COVID-19 discussions on Twitter, Parler, and Reddit. Using data analysis from January 1 through December 31, 2020, we examine the development of toxicity over time and compare the findings across the three platforms. The results indicate that Parler had lower toxicity levels than both Twitter and Reddit in discussions related to COVID-19. In contrast, Reddit showed the highest levels of toxicity, largely due to various anti-vaccine forums that spread misinformation about COVID-19 vaccines. Notably, our analysis of COVID-19 vaccination conversations on Twitter also revealed a significant presence of conspiracy theories among individuals with highly toxic attitudes. Our computational approach provides decision-makers with useful information about reducing the spread of toxicity within online communities. The study's findings highlight the importance of taking action to encourage more uplifting and productive online discourse across all platforms.

Submitted 26 April, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

Journal ref: IARIA. (2023) 21-26

arXiv:2212.06610 [pdf, other]

doi 10.1007/JHEP02(2023)258

Deciphering Colour Building Blocks of Massive Multiparton Amplitudes at 4-loops and beyond

Authors: Neelima Agarwal, Sourav Pal, Aditya Srivastav, Anurag Tripathi

Abstract: The soft functions in non-abelian gauge theories exponentiate, and their logarithms can be organised in terms of the collections of Feynman diagrams called Cwebs. The colour factors that appear in the logarithm are controlled by the web mixing matrices. Direct construction of the diagonal blocks of Cwebs using the new concepts of Normal ordering, basis Cweb and Fused-Web was recently carried out in~\cite{Agarwal:2022wyk}. In this article we establish a correspondence between the boomerang webs introduced in~\cite{Gardi:2021gzz} and non-boomerang Cwebs. We use this correspondence together with the Uniqueness theorem and the Fused-Web formalism introduced in~\cite{Agarwal:2022wyk} to obtain the diagonal blocks of four general classes of Cwebs to all orders in perturbation theory, which also cover all the four-loop Boomerang Cwebs connecting four Wilson lines. We also fully construct the mixing matrix of a special Cweb to all orders in perturbation theory.

Submitted 2 March, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

Comments: 57 pages, 33 figures, Published Version. arXiv admin note: text overlap with arXiv:2204.05936

Journal ref: Journal of High Energy Physics 2023

arXiv:2212.06283 [pdf, ps, other]

Variance-Reduced Conservative Policy Iteration

Authors: Naman Agarwal, Brian Bullins, Karan Singh

Abstract: We study the sample complexity of reducing reinforcement learning to a sequence of empirical risk minimization problems over the policy space. Such reductions-based algorithms exhibit local convergence in the function space, as opposed to the parameter space for policy gradient algorithms, and thus are unaffected by the possibly non-linear or discontinuous parameterization of the policy class. We propose a variance-reduced variant of Conservative Policy Iteration that improves the sample complexity of producing an $\varepsilon$-functional local optimum from $O(\varepsilon^{-4})$ to $O(\varepsilon^{-3})$. Under state-coverage and policy-completeness assumptions, the algorithm enjoys $\varepsilon$-global optimality after sampling $O(\varepsilon^{-2})$ times, improving upon the previously established $O(\varepsilon^{-3})$ sample requirement.

Submitted 25 January, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

Comments: To appear in proceedings of ALT 2023; updated references

arXiv:2211.11219 [pdf, other]

Best of Both Worlds in Online Control: Competitive Ratio and Policy Regret

Authors: Gautam Goel, Naman Agarwal, Karan Singh, Elad Hazan

Abstract: We consider the fundamental problem of online control of a linear dynamical system from two different viewpoints: regret minimization and competitive analysis. We prove that the optimal competitive policy is well-approximated by a convex parameterized policy class, known as disturbance-action control (DAC) policies. Using this structural result, we show that several recently proposed online control algorithms achieve the best of both worlds: sublinear regret vs. the best DAC policy selected in hindsight, and optimal competitive ratio, up to an additive correction which grows sublinearly in the time horizon. We further conclude that sublinear regret vs. the optimal competitive policy is attainable when the linear dynamical system is unknown, and even when a stabilizing controller for the dynamics is not available a priori.

Submitted 21 November, 2022; originally announced November 2022.

arXiv:2211.06068 [pdf, ps, other]

A combinatorial approach to study subshifts associated with multigraphs

Authors: Nikita Agarwal, Haritha Cheriyath, Sharvari Neetin Tikekar

Abstract: A subshift of finite type over finitely many symbols can be described as a collection of all infinite walks on a digraph with at most a single edge from a vertex to another. The associated finite set $\F$ of forbidden words is a constraint which determines the language of the shift entirely. In this paper, in order to describe infinite walks on a multigraph, we introduce the notion of multiplicity of a word (finite walk) and define repeated words as those having multiplicity at least $2$. In general, for given collections $\F$ of forbidden words and $\R$ of repeated words with pre-assigned multiplicities, we define the notion of a generalized language, which is a multiset. We obtain a subshift associated with $\F$ and $\R$ such that its entropy is calculated using the generalized language. We also study the relationship between the language of this subshift and the generalized language. We then obtain a combinatorial expression for the generating function that enumerates the number of words of fixed length in this generalized language. This gives the Perron root and eigenvectors of the adjacency matrix with integer entries associated to the underlying multigraph. Using this, the topological entropy and an alternate definition of Parry measure for the associated edge shift are obtained. We also discuss some properties of Markov measures on this subshift.

Submitted 15 March, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

arXiv:2211.05239 [pdf, other]

RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure

Authors: Mark Zhao, Dhruv Choudhary, Devashish Tyagi, Ajay Somani, Max Kaplan, Sung-Han Lin, Sarunya Pumma, Jongsoo Park, Aarti Basant, Niket Agarwal, Carole-Jean Wu, Christos Kozyrakis

Abstract: We present RecD (Recommendation Deduplication), a suite of end-to-end infrastructure optimizations across the Deep Learning Recommendation Model (DLRM) training pipeline. RecD addresses immense storage, preprocessing, and training overheads caused by feature duplication inherent in industry-scale DLRM training datasets. Feature duplication arises because DLRM datasets are generated from interactions. While each user session can generate multiple training samples, many features' values do not change across these samples. We demonstrate how RecD exploits this property, end-to-end, across a deployed training pipeline. RecD optimizes data generation pipelines to decrease dataset storage and preprocessing resource demands and to maximize duplication within a training batch. RecD introduces a new tensor format, InverseKeyedJaggedTensors (IKJTs), to deduplicate feature values in each batch. We show how DLRM model architectures can leverage IKJTs to drastically increase training throughput. RecD improves the training and preprocessing throughput and storage efficiency by up to 2.48x, 1.79x, and 3.71x, respectively, in an industry-scale DLRM training system.

Submitted 1 May, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

Comments: Published in the Proceedings of the Sixth Conference on Machine Learning and Systems (MLSys 2023)

arXiv:2211.04265 [pdf, other]

doi 10.1107/S1600577523000619

Photon shot-noise limited transient absorption soft X-ray spectroscopy at the European XFEL

Authors: Loïc Le Guyader, Andrea Eschenlohr, Martin Beye, William Schlotter, Florian Döring, Cammille Carinan, David Hickin, Naman Agarwal, Christine Boeglin, Uwe Bovensiepen, Jens Buck, Robert Carley, Andrea Castoldi, Alessandro D'Elia, Jan-Torben Delitz, Wajid Ehsan, Robin Engel, Florian Erdinger, Hans Fangohr, Peter Fischer, Carlo Fiorini, Alexander Föhlisch, Luca Gelisio, Michael Gensch, Natalia Gerasimova, et al. (39 additional authors not shown)

Abstract: Femtosecond transient soft X-ray Absorption Spectroscopy (XAS) is a very promising technique that can be employed at X-ray Free Electron Lasers (FELs) to investigate out-of-equilibrium dynamics for material and energy research. Here we present a dedicated setup for soft X-rays available at the Spectroscopy & Coherent Scattering (SCS) instrument at the European X-ray Free Electron Laser (EuXFEL). It consists of a beam-splitting off-axis zone plate (BOZ) used in transmission to create three copies of the incoming beam, which are used to measure the transmitted intensity through the excited and unexcited sample, as well as to monitor the incoming intensity. Since these three intensity signals are detected shot-by-shot and simultaneously, this setup allows normalized shot-by-shot analysis of the transmission. For photon detection, the DSSC imaging detector, which is capable of recording up to 800 images at 4.5 MHz frame rate during the FEL burst, is employed and allows approaching the photon shot-noise limit. We review the setup and its capabilities, as well as the online and offline analysis tools provided to users.

Submitted 4 January, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

Comments: 11 figures

Journal ref: J. Synchrotron Rad. (2023). 30, 284-300

arXiv:2210.13162 [pdf, other]

The interplay of local electron correlations and ultrafast spin dynamics in fcc Ni

Authors: Tobias Lojewski, Mohamed F. Elhanoty, Loïc Le Guyader, Oscar Grånäs, Naman Agarwal, Christine Boeglin, Robert Carley, Andrea Castoldi, Christian David, Carsten Deiter, Florian Döring, Robin Y. Engel, Florian Erdinger, Hans Fangohr, Carlo Fiorini, Peter Fischer, Natalia Gerasimova, Rafael Gort, Frank de Groot, Karsten Hansen, Steffen Hauf, David Hickin, Manuel Izquierdo, Benjamin E. Van Kuiken, Yaroslav Kvashnin, et al. (26 additional authors not shown)

Abstract: The complex electronic structure of metallic ferromagnets is determined by a balance between exchange interaction, electron hopping leading to band formation, and local Coulomb repulsion. The interplay between the respective terms of the Hamiltonian is of fundamental interest, since it produces most, if not all, of the exotic phenomena observed in the solid state. By combining high energy and temporal resolution in femtosecond time-resolved X-ray absorption spectroscopy with ab initio time-dependent density functional theory we analyze the electronic structure in fcc Ni on the time scale of these interactions in a pump-probe experiment. We distinguish transient broadening and energy shifts in the absorption spectra, which we demonstrate to be caused by electron repopulation and correlation-induced modifications of the electronic structure, respectively. Importantly, the theoretical description of this experimental result hence requires taking the local Coulomb interaction into account, revealing a temporal interplay between band formation, exchange interaction, and Coulomb repulsion.

Submitted 24 October, 2022; originally announced October 2022.

arXiv:2210.05355 [pdf, ps, other]

Multi-User Reinforcement Learning with Low Rank Rewards

Authors: Naman Agarwal, Prateek Jain, Suhas Kowshik, Dheeraj Nagaraj, Praneeth Netrapalli

Abstract: In this work, we consider the problem of collaborative multi-user reinforcement learning. In this setting there are multiple users with the same state-action space and transition probabilities but with different rewards. Under the assumption that the reward matrix of the $N$ users has a low-rank structure -- a standard and practically successful assumption in the offline collaborative filtering setting -- the question is whether we can design algorithms with significantly lower sample complexity compared to the ones that learn the MDP individually for each user. Our main contribution is an algorithm which explores rewards collaboratively with $N$ user-specific MDPs and can learn rewards efficiently in two key settings: tabular MDPs and linear MDPs. When $N$ is large and the rank is constant, the sample complexity per MDP depends logarithmically on the size of the state-space, which represents an exponential reduction (in the state-space size) when compared to the standard "non-collaborative" algorithms.

Submitted 22 May, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

Showing 1–50 of 176 results for author: Agarwal, N