-
Making public reputation out of private assessments
Authors:
Youngsuk Mun,
Quang Anh Le,
Seung Ki Baek
Abstract:
Reputation is not just a simple opinion that an individual has about another but a social construct that emerges through communication. Despite the huge importance in coordinating human behavior, such a communicative aspect has remained relatively unexplored in the field of indirect reciprocity. In this work, we bridge the gap between private assessment and public reputation: We begin by clarifying what we mean by reputation and argue that the formation of reputation can be modeled by a bi-stochastic matrix, provided that both assessment and behavior are regarded as continuous variables. By choosing bi-stochastic matrices that represent averaging processes, we show that only four norms among the leading eight, which judge a good person's cooperation toward a bad one as good, will keep cooperation asymptotically or neutrally stable against assessment error in a homogeneous society where every member has adopted the same norm. However, when one of those four norms is used by the resident population, the opinion averaging process allows neutral invasion of mutant norms with small differences in the assessment rule. Our approach provides a theoretical framework for describing the formation of reputation in mathematical terms.
Submitted 9 October, 2024;
originally announced October 2024.
-
Digital Twin for O-RAN Towards 6G
Authors:
Huan X. Nguyen,
Kexuan Sun,
Duc To,
Quoc-Tuan Vien,
Tuan Anh Le
Abstract:
In future wireless systems of beyond 5G and 6G, addressing diverse applications with varying quality requirements is essential. Open Radio Access Network (O-RAN) architectures offer the potential for dynamic resource adaptation based on traffic demands. However, achieving real-time resource orchestration remains a challenge. Simultaneously, Digital Twin (DT) technology holds promise for testing and analysing complex systems, offering a unique platform for addressing dynamic operation and automation in O-RAN architectures. Yet, developing DTs for complex 5G/6G networks poses challenges, including data exchanges, ML model training data availability, network dynamics, processing power limitations, interdisciplinary collaboration needs, and a lack of standardized methodologies. This paper provides an overview of the Open RAN architecture, its trends and challenges, and proposes DT concepts for O-RAN with solution examples showcasing their integration into the framework.
Submitted 3 October, 2024;
originally announced October 2024.
-
High-energy nuclear scattering of neutrinos
Authors:
Anh Dung Le,
Heikki Mäntysaari
Abstract:
We study the energy dependence of the total and diffractive neutrino-nucleon and neutrino-nucleus cross sections at very high energies. The calculation employs the QCD dipole model and the small-$x$ nonlinear Balitsky-Kovchegov evolution. We show the sensitivity of the nuclear effect quantification to the nuclear setup, and predict up to $\sim 10\%$ nuclear suppression in the inclusive neutrino-oxygen scattering stemming from the nonlinear evolution. The diffractive contribution to the total scattering is small, only a few percent. The $\left|q\bar{q}g\right>$ component of the $W^{\pm}$ boson is found to contribute significantly to the diffractive process, reaching up to $\sim 40\%$ of the diffractive cross section.
Submitted 1 October, 2024;
originally announced October 2024.
-
Inclusive and diffractive neutrino-nucleus scattering at high energy
Authors:
Anh Dung Le,
Heikki Mäntysaari
Abstract:
We calculate the energy dependence of inclusive and diffractive neutrino-nucleus deep-inelastic scattering cross sections within the dipole picture, focusing on the ultra-high-energy regime. We predict an up to $\sim 10\%$ nuclear suppression in the inclusive neutrino-Oxygen scattering originating from the non-linear QCD dynamics in the small-$x$ Balitsky-Kovchegov evolution. Diffraction is found to be a small $1\dots 4\%$ contribution to the total cross section across a wide range of neutrino energies relevant for current and near-future experiments. The diffractive cross section is calculated separately for the coherent and incoherent channels that are found to be of equal importance. Additionally, we include the dominant contribution from the $|q\bar q g\rangle$ Fock state of the $W^\pm$ and $Z$ bosons in the high-$Q^2$ limit, along with the lowest-order $|q\bar q\rangle$ contribution. The $|q\bar{q}g\rangle$ contribution is found to be numerically significant, reaching up to 40\% of the diffractive cross section.
Submitted 25 September, 2024;
originally announced September 2024.
-
Modeling water radiolysis with Geant4-DNA: Impact of the temporal structure of the irradiation pulse under oxygen conditions
Authors:
Tuan Anh Le,
Hoang Ngoc Tran,
Serena Fattori,
Viet Cuong Phan,
Sebastien Incerti
Abstract:
The differences in H2O2 production between conventional (CONV) and ultra-high dose rate (UHDR) irradiations in water radiolysis are still not fully understood. The lower levels of this radiolytic species, as a critical end product of water radiolysis, are particularly relevant for investigating the connection between the high-density energy deposition during short-duration physical events (ionizations or excitations) and biological responses of the FLASH effect. In this study, we developed a new Geant4-DNA chemistry model to simulate radiolysis considering the time structure of the irradiation pulse at different absorbed doses to liquid water of 0.01, 0.1, 1, and 2 Gy under 1 MeV electron irradiation. The model allows the description of the beam's temporal structure, including the pulse duration, the pulse repetition frequency, and the pulse amplitude for the different beam irradiation conditions through a wide dose rate range, from 0.01 Gy/s up to about $10^5$ Gy/s, at various oxygen concentrations. The preliminary results indicate a correlation between the temporal structure of the pulses and a significant reduction in the production of reactive oxygen species (ROS) at different dose rates.
Submitted 18 September, 2024;
originally announced September 2024.
-
Firefly Algorithm for Movable Antenna Arrays
Authors:
Manh Kha Hoang,
Tuan Anh Le,
Kieu-Xuan Thuc,
Tong Van Luyen,
Xin-She Yang,
Derrick Wing Kwan Ng
Abstract:
This letter addresses a multivariate optimization problem for linear movable antenna arrays (MAAs). Particularly, the position and beamforming vectors of the under-investigated MAA are optimized simultaneously to maximize the minimum beamforming gain across several intended directions, while ensuring interference levels at various unintended directions remain below specified thresholds. To this end, a swarm-intelligence-based firefly algorithm (FA) is introduced to acquire an effective solution to the optimization problem. Simulation results reveal the superior performance of the proposed FA approach compared to the state-of-the-art approach employing alternating optimization and successive convex approximation. This is attributed to the FA's effectiveness in handling non-convex multivariate and multimodal optimization problems without resorting to approximations.
Submitted 6 September, 2024;
originally announced September 2024.
-
Dynamically syndetic sets and the combinatorics of syndetic, idempotent filters
Authors:
Daniel Glasscock,
Anh N. Le
Abstract:
A subset of the positive integers is \emph{dynamically central syndetic} if it contains the set of times that a point returns to a neighborhood of itself under a minimal transformation of a compact metric space. These sets are part of the highly-influential link between dynamics and combinatorics forged by Furstenberg and Weiss in the 1970's. Our main result is a characterization of dynamically central syndetic sets as precisely those sets that belong to syndetic, idempotent filters. Idempotent filters are combinatorial objects that abound in ergodic Ramsey theory but have been largely unnoticed and unexplored. We develop the algebra of these objects for the proof of the main theorem and with an eye toward future applications.
The main result is best contextualized as a ``global'' analogue to Bergelson and Hindman's ``local'' characterization of Furstenberg's central sets as members of minimal, idempotent ultrafilters. It leads to a dual characterization of sets of topological pointwise recurrence, allowing us to answer a question of Glasner, Tsankov, Weiss, and Zucker. We draw numerous striking contrasts between pointwise recurrence and set recurrence, a topic with a long history in the subject and its applications, and answer four questions posed by Host, Kra, and Maass. We also show that the intersection of a dynamically central syndetic set with a set of pointwise recurrence must be piecewise syndetic, generalizing results of Dong, Glasner, Huang, Shao, Weiss, and Ye.
Submitted 22 August, 2024;
originally announced August 2024.
-
Quantum pathways interference in laser-induced electron diffraction revealed by a semiclassical method
Authors:
Phi-Hung Tran,
Van-Hung Hoang,
Anh-Thu Le
Abstract:
We develop a novel method for strong-laser-field physics based on the combination of the semiclassical Herman-Kluk propagator and the strong-field approximation and demonstrate its high accuracy on the calculations of photoelectron momentum distribution (PMD) for atoms and molecules in intense lasers. For rescattered electrons, we show that for a given time that electron tunnels to the continuum, there are typically multiple trajectories that lead to the same final momentum in the high-energy region. These trajectories start with slightly different initial transverse momenta and carry different phases giving rise to the interference structures in the PMD, which can also be associated with the laser-free electron-ion differential cross section. This is in contrast to the well-known long and short trajectories, which result in different interference patterns. Our results can be used to extend current capabilities of the laser-induced electron diffraction and other ultrafast imaging and strong-field spectroscopic techniques.
Submitted 22 August, 2024;
originally announced August 2024.
-
Machine Learning with Physics Knowledge for Prediction: A Survey
Authors:
Joe Watson,
Chen Song,
Oliver Weeger,
Theo Gruner,
An T. Le,
Kay Hansel,
Ahmed Hendawy,
Oleg Arenz,
Will Trojak,
Miles Cranmer,
Carlo D'Eramo,
Fabian Bülow,
Tanmay Goyal,
Jan Peters,
Martin W. Hoffman
Abstract:
This survey examines the broad suite of methods and models for combining machine learning with physics knowledge for prediction and forecast, with a focus on partial differential equations. These methods have attracted significant interest due to their potential impact on advancing scientific research and industrial practices by improving predictive models with small- or large-scale datasets and expressive predictive models with useful inductive biases. The survey has two parts. The first considers incorporating physics knowledge on an architectural level through objective functions, structured predictive models, and data augmentation. The second considers data as physics knowledge, which motivates looking at multi-task, meta, and contextual learning as an alternative approach to incorporating physics knowledge in a data-driven fashion. Finally, we also provide an industrial perspective on the application of these methods and a survey of the open-source ecosystem for physics-informed machine learning.
Submitted 19 August, 2024;
originally announced August 2024.
-
A Finite Difference Scheme for (2+1)D Cubic-Quintic Nonlinear Schrödinger Equations with Nonlinear Damping
Authors:
Anh Ha Le,
Toan T. Huynh,
Quan M. Nguyen
Abstract:
Solitons of the purely cubic nonlinear Schrödinger equation in a space dimension of $n \geq 2$ suffer critical and supercritical collapses. These solitons can be stabilized in a cubic-quintic nonlinear medium. In this paper, we analyze the Crank-Nicolson finite difference scheme for the (2+1)D cubic-quintic nonlinear Schrödinger equation with cubic damping. We show that both the discrete solution, in the discrete $L^2$-norm, and discrete energy are bounded. By using appropriate settings and estimations, the existence and the uniqueness of the numerical solution are proved. In addition, the error estimations are established in terms of second order for both space and time in discrete $L^2$-norm and $H^1$-norm. Numerical simulations for the (2+1)D cubic-quintic nonlinear Schrödinger equation with cubic damping are conducted to validate the convergence.
Submitted 6 August, 2024; v1 submitted 17 July, 2024;
originally announced July 2024.
-
Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model
Authors:
Duy M. H. Nguyen,
An T. Le,
Trung Q. Nguyen,
Nghiem T. Diep,
Tai Nguyen,
Duy Duong-Tran,
Jan Peters,
Li Shen,
Mathias Niepert,
Daniel Sonntag
Abstract:
Prompt learning methods are gaining increasing attention due to their ability to customize large vision-language models to new domains using pre-trained contextual knowledge and minimal training data. However, existing works typically rely on optimizing unified prompt inputs, often struggling with fine-grained classification tasks due to insufficient discriminative attributes. To tackle this, we consider a new framework based on a dual context of both domain-shared and class-specific contexts, where the latter is generated by Large Language Models (LLMs) such as GPTs. Such dual prompt methods enhance the model's feature representation by joining implicit and explicit factors encoded in LLM knowledge. Moreover, we formulate the Unbalanced Optimal Transport (UOT) theory to quantify the relationships between constructed prompts and visual tokens. Through partial matching, UOT can properly align discrete sets of visual tokens and prompt embeddings under different mass distributions, which is particularly valuable for handling irrelevant or noisy elements, ensuring that the preservation of mass does not restrict transport solutions. Furthermore, UOT's characteristics integrate seamlessly with image augmentation, expanding the training sample pool while maintaining a reasonable distance between perturbed images and prompt inputs. Extensive experiments across few-shot classification and adapter settings substantiate the superiority of our model over current state-of-the-art baselines.
Submitted 5 July, 2024;
originally announced July 2024.
-
Good things come in three: Generating SO Post Titles with Pre-Trained Models, Self Improvement and Post Ranking
Authors:
Duc Anh Le,
Anh M. T. Bui,
Phuong T. Nguyen,
Davide Di Ruscio
Abstract:
Stack Overflow is a prominent Q&A forum, supporting developers in seeking suitable resources on programming-related matters. Having high-quality question titles is an effective means to attract developers' attention. Unfortunately, this is often underestimated, leaving room for improvement. Research has been conducted, predominantly leveraging pre-trained models to generate titles from code snippets and problem descriptions. Yet, getting high-quality titles is still a challenging task, attributed to both the quality of the input data (e.g., containing noise and ambiguity) and inherent constraints in sequence generation models. In this paper, we present FILLER as a solution to generating Stack Overflow post titles using a fine-tuned language model with self-improvement and post ranking. Our study focuses on enhancing pre-trained language models for generating titles for Stack Overflow posts, employing a training and subsequent fine-tuning paradigm for these models. To this end, we integrate the model's predictions into the training process, enabling it to learn from its errors, thereby lessening the effects of exposure bias. Moreover, we apply a post-ranking method to produce a variety of sample candidates, subsequently selecting the most suitable one. To evaluate FILLER, we perform experiments using benchmark datasets, and the empirical findings indicate that our model provides high-quality recommendations. Moreover, it significantly outperforms all the baselines, including Code2Que, SOTitle, CCBERT, M3NSCT5, and GPT3.5-turbo. A user study also shows that FILLER provides more relevant titles than SOTitle and GPT3.5-turbo.
Submitted 21 June, 2024;
originally announced June 2024.
-
Symmetries in 3D photoelectron momentum spectroscopy as precursory methods for dichroic and enantiosensitive measurements
Authors:
Michael Davino,
Edward McManus,
Tobias Saule,
Phi-Hung Tran,
Andrés F. Ordóñez,
George Gibson,
Anh-Thu Le,
Carlos A. Trallero-Herrero
Abstract:
3D photoelectron angular distributions (PADs) are measured from an atomic target ionized by ultrafast, elliptical fields of opposite handedness. Comparing these PADs to one another and to numeric simulations, a difficult to avoid systematic error in their orientation is identified and subsequently corrected by imposing the dichroic symmetry by which they are necessarily related. We show that this correction can be directly applied to molecular targets in the same fields. This paves the way for measurement of enantiosensitive information which has yet to be accessed experimentally.
Submitted 20 June, 2024;
originally announced June 2024.
-
Arena 3.0: Advancing Social Navigation in Collaborative and Highly Dynamic Environments
Authors:
Linh Kästner,
Volodymyir Shcherbyna,
Huajian Zeng,
Tuan Anh Le,
Maximilian Ho-Kyoung Schreff,
Halid Osmaev,
Nam Truong Tran,
Diego Diaz,
Jan Golebiowski,
Harold Soh,
Jens Lambrecht
Abstract:
Building upon our previous contributions, this paper introduces Arena 3.0, an extension of Arena-Bench, Arena 1.0, and Arena 2.0. Arena 3.0 is a comprehensive software stack containing multiple modules and simulation environments focusing on the development, simulation, and benchmarking of social navigation approaches in collaborative environments. We significantly enhance the realism of human behavior simulation by incorporating a diverse array of new social force models and interaction patterns, encompassing both human-human and human-robot dynamics. The platform provides a comprehensive set of new task modes, designed for extensive benchmarking and testing, and is capable of generating realistic and human-centric environments dynamically, catering to a broad spectrum of social navigation scenarios. In addition, the platform's functionalities have been abstracted across three widely used simulators, each tailored for specific training and testing purposes. The platform's efficacy has been validated through an extensive benchmark and user evaluations of the platform by a global community of researchers and students, who noted the substantial improvement compared to previous versions and expressed interest in using the platform for future research and development. Arena 3.0 is openly available at https://github.com/Arena-Rosnav.
Submitted 2 June, 2024;
originally announced June 2024.
-
Active Galactic Nuclei and STaR fOrmation in Nearby Galaxies (AGNSTRONG). I. Sample and Strategy
Authors:
Huynh Anh N. Le,
Chen Qin,
Yongquan Xue,
Shifu Zhu,
Kim Ngan N. Nguyen,
Ruisong Xia,
Xiaozhi Lin
Abstract:
We introduce our project, AGNSTRONG (Active Galactic Nuclei and STaR fOrmation in Nearby Galaxies). Our research goals encompass investigating the kinematic properties of ionized and molecular gas outflows, understanding the impact of AGN feedback, and exploring the coevolution dynamics between AGN strength activity and star formation activity. We aim to conduct a thorough analysis to determine whether there is an increase or suppression in SFRs among targets with and without powerful relativistic jets. Our sample consists of 35 nearby AGNs with and without powerful relativistic jet detections. Utilizing sub-millimeter (sub-mm) continuum observations at 450 μm and 850 μm from SCUBA-2 at the James Clerk Maxwell Telescope, we determine star-formation rates (SFRs) for our sources using spectral energy distribution (SED) fitting models. Additionally, we employ high-quality, spatially resolved spectra from UV-optical to near-infrared bands obtained with the Double Spectrograph and Triple Spectrograph mounted on the 200-inch Hale telescope at Palomar Observatory to study their multiphase gas outflow properties. This paper presents an overview of our sample selection methodology, research strategy, and initial results of our project. We find that the SFRs determined without including the sub-mm data in the SED fitting are overestimated by approximately 0.08 dex compared to those estimated with the inclusion of sub-mm data. Additionally, we compare the estimated SFRs in our work with those traced by the 4000Å break, as provided by the MPA-JHU catalog. We find that our determined SFRs are systematically higher than those traced by the 4000Å break. Finally, we outline our future research plans.
Submitted 15 May, 2024;
originally announced May 2024.
-
Diameter of Commuting Graphs of Lie Algebras
Authors:
Hieu V. Ha,
Hoa D. Quang,
Vu A. Le,
Tuyen T. M. Nguyen
Abstract:
In this paper, we study the connectedness of the commuting graph of a general Lie algebra and provide a process to determine whether the commuting graph is connected or not, as well as to compute an upper bound for its diameter. In addition, we will examine the connectedness and diameter of the commuting graphs of some remarkable classes of Lie algebras, including: (1) a class of Lie algebras with one- or two-dimensional derived algebras; and (2) a class of solvable Lie algebras over the real field of dimension up to $4$.
Submitted 10 May, 2024;
originally announced May 2024.
-
Greedy Heuristics for Sampling-based Motion Planning in High-Dimensional State Spaces
Authors:
Phone Thiha Kyaw,
Anh Vu Le,
Lim Yi,
Prabakaran Veerajagadheswar,
Mohan Rajesh Elara,
Dinh Tung Vo,
Minh Bui Vu
Abstract:
Sampling-based motion planning algorithms are very effective at finding solutions in high-dimensional continuous state spaces as they do not require prior approximations of the problem domain compared to traditional discrete graph-based searches. The anytime version of the Rapidly-exploring Random Trees (RRT) algorithm, denoted as RRT*, often finds high-quality solutions by incrementally approximating and searching the problem domain through random sampling. However, due to its low sampling efficiency and slow convergence rate, research has proposed many variants of RRT*, incorporating different heuristics and sampling strategies to overcome the constraints in complex planning problems. Yet, these approaches address specific convergence aspects of RRT* limitations, leaving a need for a sampling-based algorithm that can quickly find better solutions in complex high-dimensional state spaces with a faster convergence rate for practical motion planning applications. This article unifies and leverages the greedy search and heuristic techniques used in various RRT* variants to develop a greedy version of the anytime Rapidly-exploring Random Trees algorithm, denoted as Greedy RRT* (G-RRT*). It improves the initial solution-finding time of RRT* by maintaining two trees rooted at both the start and goal ends, advancing toward each other using greedy connection heuristics. It also accelerates the convergence rate of RRT* by introducing a greedy version of direct informed sampling procedure, which guides the sampling towards the promising region of the problem domain based on heuristics. We validate our approach on simulated planning problems, manipulation problems on Barrett WAM Arms, and on a self-reconfigurable robot, Panthera. Results show that G-RRT* produces asymptotically optimal solution paths and outperforms state-of-the-art RRT* variants, especially in high-dimensional planning problems.
Submitted 6 May, 2024;
originally announced May 2024.
-
Graph Neural Network based Active and Passive Beamforming for Distributed STAR-RIS-Assisted Multi-User MISO Systems
Authors:
Ha An Le,
Trinh Van Chien,
Wan Choi
Abstract:
This paper investigates a joint active and passive beamforming design for distributed simultaneous transmitting and reflecting (STAR) reconfigurable intelligent surface (RIS) assisted multi-user (MU)- mutiple input single output (MISO) systems, where the energy splitting (ES) mode is considered for the STAR-RIS. We aim to design the active beamforming vectors at the base station (BS) and the passi…
▽ More
This paper investigates a joint active and passive beamforming design for distributed simultaneous transmitting and reflecting (STAR) reconfigurable intelligent surface (RIS) assisted multi-user (MU)- mutiple input single output (MISO) systems, where the energy splitting (ES) mode is considered for the STAR-RIS. We aim to design the active beamforming vectors at the base station (BS) and the passive beamforming at the STAR-RIS to maximize the user sum rate under transmitting power constraints. The formulated problem is non-convex and nontrivial to obtain the global optimum due to the coupling between active beamforming vectors and STAR-RIS phase shifts. To efficiently solve the problem, we propose a novel graph neural network (GNN)-based framework. Specifically, we first model the interactions among users and network entities are using a heterogeneous graph representation. A heterogeneous graph neural network (HGNN) implementation is then introduced to directly optimizes beamforming vectors and STAR-RIS coefficients with the system objective. Numerical results show that the proposed approach yields efficient performance compared to the previous benchmarks. Furthermore, the proposed GNN is scalable with various system configurations.
Submitted 15 October, 2024; v1 submitted 3 May, 2024;
originally announced May 2024.
-
Well-posedness of McKean-Vlasov SDEs with density-dependent drift
Authors:
Anh-Dung Le
Abstract:
In this paper, we study the well-posedness of McKean-Vlasov stochastic differential equations (SDEs) whose drift depends pointwise on the marginal density and satisfies a local integrability condition in the time-space variables. The drift is assumed to be Lipschitz continuous in the distribution variable with respect to the Wasserstein metric $W_p$. Our approach is by approximation with mollified SDEs. We establish a new estimate about Hölder continuity in time of the marginal density. Then we deduce that the marginal distributions (resp. marginal densities) of the mollified SDEs converge in $W_p$ (resp. topology of compact convergence) to the solution of the Fokker-Planck equation associated with the density-dependent SDE. We prove strong existence of a solution. Weak and strong uniqueness are obtained when $p=1$, the drift coefficient is bounded, and the diffusion coefficient is distribution free.
Submitted 30 April, 2024;
originally announced April 2024.
-
ViLLM-Eval: A Comprehensive Evaluation Suite for Vietnamese Large Language Models
Authors:
Trong-Hieu Nguyen,
Anh-Cuong Le,
Viet-Cuong Nguyen
Abstract:
The rapid advancement of large language models (LLMs) necessitates the development of new benchmarks to accurately assess their capabilities. To address this need for Vietnamese, this work introduces ViLLM-Eval, a comprehensive evaluation suite designed to measure the advanced knowledge and reasoning abilities of foundation models within a Vietnamese context. ViLLM-Eval consists of multiple-choice questions and next-word prediction tasks spanning various difficulty levels and diverse disciplines, ranging from humanities to science and engineering. A thorough evaluation of the most advanced LLMs on ViLLM-Eval revealed that even the best performing models have significant room for improvement in understanding and responding to Vietnamese language tasks. ViLLM-Eval is believed to be instrumental in identifying key strengths and weaknesses of foundation models, ultimately promoting their development and enhancing their performance for Vietnamese users. This paper provides a thorough overview of ViLLM-Eval as part of the Vietnamese Large Language Model shared task, held within the 10th International Workshop on Vietnamese Language and Speech Processing (VLSP 2023).
Submitted 18 April, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
A Critique of Chen's "The 2-MAXSAT Problem Can Be Solved in Polynomial Time"
Authors:
Tran Duy Anh Le,
Michael P. Reidy,
Eliot J. Smith
Abstract:
In this paper, we examine Yangjun Chen's technical report titled ``The 2-MAXSAT Problem Can Be Solved in Polynomial Time'' [Che23], which revises and expands upon their conference paper of the same name [Che22]. Chen's paper purports to build a polynomial-time algorithm for the ${\rm NP}$-complete problem 2-MAXSAT by converting a 2-CNF formula into a graph that is then searched. We show through multiple counterexamples that Chen's proposed algorithms contain flaws, and we find that the structures they create lack properly formalized definitions. Furthermore, we elaborate on how the author fails to prove the correctness of their algorithms and how they make overgeneralizations in their time analysis of their proposed solution. Due to these issues, we conclude that Chen's technical report [Che23] and conference paper [Che22] both fail to provide a proof that ${\rm P}={\rm NP}$.
Submitted 21 February, 2024;
originally announced April 2024.
-
Bessel-beam direct-write of the etch-mask in a nano-film of alumina for high-efficiency Si solar cells
Authors:
Tomas Katkus,
Soon Hock Ng,
Haoran Mu,
Nguyen Hoai An Le,
Dominyka Stonyte,
Zahra Khajehsaeidimahabadi,
Gediminas Seniutinas,
Justas Baltrukonis,
Orestas Ulcinas,
Mindaugas Mikutis,
Vytautas Sabonis,
Yoshiaki Nishijima,
Michael Rienacker,
Jan Krugener,
Robby Peibst,
Sajeev John,
Saulius Juodkazis
Abstract:
Large surface area applications such as high-efficiency > 26% solar cells require surface patterning with 1-10 micrometers periodic patterns at high fidelity over 1-10 cm^2 areas (before up scaling to 1 m^2) to perform at, or exceed, the Lambertian (ray optics) limit of light trapping. Here we show a pathway to high-resolution sub-1 micrometer etch mask patterning by ablation using direct femtosecond laser writing performed at room conditions (without the need for a vacuum-based lithography approach). A Bessel beam was used to alleviate the required high surface tracking tolerance for ablation of 0.3-0.8 micrometer diameter holes in ~40 nm alumina Al2O3-mask at high writing speed, 7.5 cm/s; a patterning rate 1 cm^2 per 20 min. The plasma etching protocol was optimised for a zero-mesa formation of photonic crystal (PhC) trapping structures and smooth surfaces at the nanoscale level. Scaling up in area and throughput of the demonstrated approach is outlined.
Submitted 21 March, 2024;
originally announced March 2024.
-
On the Certification of the Kinematics of 3-DOF Spherical Parallel Manipulators
Authors:
Alexandre Lê,
Guillaume Rance,
Fabrice Rouillier,
Damien Chablat
Abstract:
This paper aims to study a specific kind of parallel robot: Spherical Parallel Manipulators (SPM) that are capable of unlimited rolling. A focus is made on the kinematics of such mechanisms, especially taking into account uncertainties (e.g. on conception & fabrication parameters, measures) and their propagations. Such considerations are crucial if we want to control our robot correctly without any undesirable behavior in its workspace (e.g. effects of singularities). In this paper, we will consider two different approaches to study the kinematics and the singularities of the robot of interest: symbolic and semi-numerical. By doing so, we can compute a singularity-free zone in the work- and joint spaces, considering given uncertainties on the parameters. In this zone, we can use any control law to inertially stabilize the upper platform of the robot.
Submitted 6 May, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging
Authors:
Wei Zhang,
Hongcheng Guo,
Anjie Le,
Jian Yang,
Jiaheng Liu,
Zhoujun Li,
Tieqiao Zheng,
Shi Xu,
Runqiang Zang,
Liangfan Zheng,
Bo Zhang
Abstract:
Logs produced by extensive software systems are integral to monitoring system behaviors. Advanced log analysis facilitates the detection, alerting, and diagnosis of system faults. Log parsing, which entails transforming raw log messages into structured templates, constitutes a critical phase in the automation of log analytics. Existing log parsers fail to identify the correct templates due to reliance on human-made rules. Besides, these methods focus on statistical features while ignoring semantic information in log messages. To address these challenges, we introduce a cutting-edge \textbf{L}og parsing framework with \textbf{E}ntropy sampling and Chain-of-Thought \textbf{M}erging (Lemur). Specifically, to discard the tedious manual rules, we propose a novel sampling method inspired by information entropy, which efficiently clusters typical logs. Furthermore, to enhance the merging of log templates, we design a chain-of-thought method for large language models (LLMs). LLMs exhibit exceptional semantic comprehension, deftly distinguishing between parameters and invariant tokens. We have conducted experiments on large-scale public datasets. Extensive evaluation demonstrates that Lemur achieves state-of-the-art performance and impressive efficiency.
Submitted 1 March, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
Earth's Alfvén wings driven by the April 2023 Coronal Mass Ejection
Authors:
Li-Jen Chen,
Daniel Gershman,
Brandon Burkholder,
Yuxi Chen,
Menelaos Sarantos,
Lan Jian,
James Drake,
Chuanfei Dong,
Harsha Gurram,
Jason Shuster,
Daniel Graham,
Olivier Le Contel,
Steven Schwartz,
Stephen Fuselier,
Hadi Madanian,
Craig Pollock,
Haoming Liang,
Matthew Argall,
Richard Denton,
Rachel Rice,
Jason Beedle,
Kevin Genestreti,
Akhtar Ardakani,
Adam Stanier,
Ari Le
, et al. (11 additional authors not shown)
Abstract:
We report a rare regime of Earth's magnetosphere interaction with sub-Alfvénic solar wind in which the windsock-like magnetosphere transforms into one with Alfvén wings. In the magnetic cloud of a Coronal Mass Ejection (CME) on April 24, 2023, NASA's Magnetospheric Multiscale mission distinguishes the following features: (1) unshocked and accelerated cold CME plasma coming directly against Earth's dayside magnetosphere; (2) dynamical wing filaments representing new channels of magnetic connection between the magnetosphere and foot points of the Sun's erupted flux rope; (3) cold CME ions observed with energized counter-streaming electrons, evidence of CME plasma captured due to reconnection between magnetic-cloud and Alfvén-wing field lines. The reported measurements advance our knowledge of CME interaction with planetary magnetospheres, and open new opportunities to understand how sub-Alfvénic plasma flows impact astrophysical bodies such as Mercury, moons of Jupiter, and exoplanets close to their host stars.
Submitted 3 May, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Doing Experiments and Revising Rules with Natural Language and Probabilistic Reasoning
Authors:
Wasu Top Piriyakulkij,
Cassidy Langenfeld,
Tuan Anh Le,
Kevin Ellis
Abstract:
We give a model of how to infer natural language rules by doing experiments. The model integrates Large Language Models (LLMs) with Monte Carlo algorithms for probabilistic inference, interleaving online belief updates with experiment design under information-theoretic criteria. We conduct a human-model comparison on a Zendo-style task, finding that a critical ingredient for modeling the human data is to assume that humans also consider fuzzy, probabilistic rules, in addition to assuming that humans perform approximately-Bayesian belief updates. We also compare with recent algorithms for using LLMs to generate and revise hypotheses, finding that our online inference method yields higher accuracy at recovering the true underlying rule, and provides better support for designing optimal experiments.
Submitted 22 May, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks
Authors:
Duy M. H. Nguyen,
Nina Lukashina,
Tai Nguyen,
An T. Le,
TrungTin Nguyen,
Nhat Ho,
Jan Peters,
Daniel Sonntag,
Viktor Zaverkin,
Mathias Niepert
Abstract:
A molecule's 2D representation consists of its atoms, their attributes, and the molecule's covalent bonds. A 3D (geometric) representation of a molecule is called a conformer and consists of its atom types and Cartesian coordinates. Every conformer has a potential energy, and the lower this energy, the more likely it occurs in nature. Most existing machine learning methods for molecular property prediction consider either 2D molecular graphs or 3D conformer structure representations in isolation. Inspired by recent work on using ensembles of conformers in conjunction with 2D graph representations, we propose $\mathrm{E}$(3)-invariant molecular conformer aggregation networks. The method integrates a molecule's 2D representation with that of multiple of its conformers. Contrary to prior work, we propose a novel 2D-3D aggregation mechanism based on a differentiable solver for the Fused Gromov-Wasserstein Barycenter problem and the use of an efficient conformer generation method based on distance geometry. We show that the proposed aggregation mechanism is $\mathrm{E}$(3) invariant and propose an efficient GPU implementation. Moreover, we demonstrate that the aggregation mechanism helps to significantly outperform state-of-the-art molecule property prediction methods on established datasets.
Submitted 19 August, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Robust Inverse Graphics via Probabilistic Inference
Authors:
Tuan Anh Le,
Pavel Sountsov,
Matthew D. Hoffman,
Ben Lee,
Brian Patton,
Rif A. Saurous
Abstract:
How do we infer a 3D scene from a single image in the presence of corruptions like rain, snow or fog? Straightforward domain randomization relies on knowing the family of corruptions ahead of time. Here, we propose a Bayesian approach, dubbed robust inverse graphics (RIG), that relies on a strong scene prior and an uninformative uniform corruption prior, making it applicable to a wide range of corruptions. Given a single image, RIG performs posterior inference jointly over the scene and the corruption. We demonstrate this idea by training a neural radiance field (NeRF) scene prior and using a secondary NeRF to represent the corruptions over which we place an uninformative prior. RIG, trained only on clean data, outperforms depth estimators and alternative NeRF approaches that perform point estimation instead of full inference. The results hold for a number of scene prior architectures based on normalizing flows and diffusion models. For the latter, we develop reconstruction-guidance with auxiliary latents (ReGAL), a diffusion conditioning algorithm that is applicable in the presence of auxiliary latent variables such as the corruption. RIG demonstrates how scene priors can be used beyond generation tasks.
Submitted 11 June, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Intermittent Electron-Only Reconnection at Lunar Mini-Magnetospheres
Authors:
Adam Stanier,
Li-Jen Chen,
Ari Le,
Jasper Halekas,
Rhyan Sawyer
Abstract:
Lunar crustal magnetic anomalies (LCMA) are sub-ion-gyroradius structures that have been shown to stand off the solar wind (SW) plasma from the Moon's surface, forming shock-like discontinuities and reflecting incident SW protons. In this letter, the results of high-resolution, two-dimensional fully kinetic simulations show a bursty electron-only magnetic reconnection in the SW-LCMA interaction region, characterized by the quasi-periodic formation and ejection of magnetic islands and strong parallel electron flows along the X-point separator lines. The islands are observed to modify the magnetic pressure pile-up and Hall electric field above the LCMA, leading to sharp increases in reflected protons that drive electromagnetic fluctuations downstream and short distances upstream in the SW.
Submitted 2 February, 2024;
originally announced February 2024.
-
Interpolation sets for dynamical systems
Authors:
Andreas Koutsogiannis,
Anh N. Le,
Joel Moreira,
Ronnie Pavlov,
Florian K. Richter
Abstract:
Originating in harmonic analysis, interpolation sets were first studied in dynamics by Glasner and Weiss in the 1980s. A set $S \subset \mathbb{N}$ is an interpolation set for a class of topological dynamical systems $\mathcal{C}$ if any bounded sequence on $S$ can be extended to a sequence that arises from a system in $\mathcal{C}$. In this paper, we provide combinatorial characterizations of interpolation sets for:
$\bullet$ (totally) minimal systems;
$\bullet$ topologically (weak) mixing systems;
$\bullet$ strictly ergodic systems; and
$\bullet$ zero entropy systems.
Additionally, we prove some results on a slightly different notion, called weak interpolation sets, for several classes of systems. We also answer a question of Host, Kra, and Maass concerning the connection between sets of pointwise recurrence for distal systems and $IP$-sets.
Submitted 13 August, 2024; v1 submitted 27 January, 2024;
originally announced January 2024.
-
Intersective sets for sparse sets of integers
Authors:
Pierre-Yves Bienvenu,
John T. Griesmer,
Anh N. Le,
Thái Hoàng Lê
Abstract:
For $E \subset \mathbb{N}$, a subset $R \subset \mathbb{N}$ is $E$-intersective if for every $A \subset E$ having positive upper relative density, we have $R \cap (A - A) \neq \varnothing$. On the other hand, $R$ is chromatically $E$-intersective if for every finite partition $E=\bigcup_{i=1}^k E_i$, there exists $i$ such that $R\cap (E_i-E_i)\neq\varnothing$. When $E=\mathbb{N}$, we recover the usual notions of intersectivity and chromatic intersectivity.
In this article, we investigate to which extent known intersectivity results hold in the relative setting when $E = \mathbb{P}$, the set of primes, or other sparse subsets of $\mathbb{N}$. Among other things, we prove:
-There exists an intersective set that is not $\mathbb{P}$-intersective.
-However, every $\mathbb{P}$-intersective set is intersective.
-There exists a chromatically $\mathbb{P}$-intersective set which is not intersective (and therefore not $\mathbb{P}$-intersective).
-The set of shifted Chen primes $\mathbb{P}_{\mathrm{Chen}} + 1$ is $\mathbb{P}$-intersective (and therefore intersective).
Submitted 15 January, 2024;
originally announced January 2024.
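The defining condition $R \cap (A - A) \neq \varnothing$ can be illustrated on finite sets. The following is only a minimal sketch: the function names are hypothetical, the sets are finite stand-ins for subsets of $\mathbb{N}$, and it checks the condition for a single given $A$ rather than for every $A$ of positive upper relative density, so it does not decide intersectivity. Since $R$ is assumed to consist of positive integers, only positive differences are computed.

```python
from itertools import combinations


def difference_set(a):
    """Positive differences {x - y : x, y in a, x > y} of a finite set."""
    ordered = sorted(set(a), reverse=True)
    # combinations() yields pairs (x, y) with x earlier in the list, so x > y.
    return {x - y for x, y in combinations(ordered, 2)}


def meets_difference_set(r, a):
    """True if the finite set r contains some positive difference of a."""
    return bool(set(r) & difference_set(a))


if __name__ == "__main__":
    primes = {2, 3, 5, 7}
    print(difference_set(primes))
    print(meets_difference_set({4, 9}, primes))
    print(meets_difference_set({7}, {2, 3, 5}))
```

In the relative setting of the abstract, `a` would range over subsets of the primes; the sketch only shows what a single check looks like.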
-
Design and Evaluation of a Socially Assistive Robot Schoolwork Companion for College Students with ADHD
Authors:
Amy O'Connell,
Ashveen Banga,
Jennifer Ayissi,
Nikki Yaminrafie,
Ellen Ko,
Andrew Le,
Bailey Cislowski,
Maja Matarić
Abstract:
College students with ADHD respond positively to simple socially assistive robots (SARs) that monitor attention and provide non-verbal feedback, but studies have been done only in brief in-lab sessions. We present an initial design and evaluation of an in-dorm SAR study companion for college students with ADHD. This work represents the introductory stages of an ongoing user-centered, participatory design process. In a three-week within-subjects user study, university students (N=11) with self-reported symptoms of adult ADHD had a SAR study companion in their dorm room for two weeks and a computer-based system for one week. Toward developing SARs for long-term, in-dorm use, we focus on 1) evaluating the usability and desire for SAR study companions by college students with ADHD and 2) collecting participant feedback about the SAR design and functionality. Participants responded positively to the robot; after one week of regular use, 91% (10 of 11) chose to continue using the robot voluntarily in the second week.
Submitted 11 January, 2024;
originally announced January 2024.
-
Joint Power Allocation and User Scheduling in Integrated Satellite-Terrestrial Cell-Free Massive MIMO IoT Systems
Authors:
Trinh Van Chien,
Ha An Le,
Ta Hai Tung,
Hien Quoc Ngo,
Symeon Chatzinotas
Abstract:
Both space and ground communications have been proven effective solutions under different perspectives in Internet of Things (IoT) networks. This paper investigates multiple-access scenarios, where plenty of IoT users are cooperatively served by a satellite in space and access points (APs) on the ground. Available users in each coherence interval are split into scheduled and unscheduled subsets to optimize limited radio resources. We compute the uplink ergodic throughput of each scheduled user under imperfect channel state information (CSI) and non-orthogonal pilot signals. As maximum-ratio combining is deployed locally at the ground gateway and the APs, the uplink ergodic throughput is obtained in a closed-form expression. The analytical results explicitly unveil the effects of channel conditions and pilot contamination on each scheduled user. By maximizing the sum throughput, the system can simultaneously determine scheduled users and perform power allocation based on either a model-based approach with alternating optimization or a learning-based approach with a graph neural network. Numerical results show that integrated satellite-terrestrial cell-free massive multiple-input multiple-output systems can significantly improve the sum ergodic throughput over coherence intervals. The integrated systems can schedule the vast majority of users; some might be out of service due to the limited power budget.
Submitted 8 January, 2024;
originally announced January 2024.
-
On Czerwinski's "${\rm P} \neq {\rm NP}$ relative to a ${\rm P}$-complete oracle"
Authors:
Michael C. Chavrimootoo,
Tran Duy Anh Le,
Michael P. Reidy,
Eliot J. Smith
Abstract:
In this paper, we take a closer look at Czerwinski's "${\rm P}\neq{\rm NP}$ relative to a ${\rm P}$-complete oracle" [Cze23]. There are (uncountably) infinitely many relativized worlds where ${\rm P}$ and ${\rm NP}$ differ, and it is well-known that for any ${\rm P}$-complete problem $A$, ${\rm P}^A \neq {\rm NP}^A \iff {\rm P}\neq {\rm NP}$. The paper defines two sets ${\rm D}_{\rm P}$ and ${\rm D}_{\rm NP}$ and builds the purported proof of their main theorem on the claim that an oracle Turing machine with ${\rm D}_{\rm NP}$ as its oracle and that accepts ${\rm D}_{\rm P}$ must make $\Theta(2^n)$ queries to the oracle. We invalidate the latter by proving that there is an oracle Turing machine with ${\rm D}_{\rm NP}$ as its oracle that accepts ${\rm D}_{\rm P}$ and yet only makes one query to the oracle. We thus conclude that Czerwinski's paper [Cze23] fails to establish that ${\rm P} \neq {\rm NP}$.
Submitted 7 December, 2023;
originally announced December 2023.
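The equivalence the abstract calls well-known follows in one step, because an oracle for a language in ${\rm P}$ gives neither class extra power: each query can be answered by a polynomial-time subroutine. A short sketch in LaTeX, where $A$ is any language in ${\rm P}$ (this step uses only membership in ${\rm P}$, not completeness):

```latex
A \in {\rm P}
\;\Longrightarrow\;
{\rm P}^A = {\rm P} \ \text{ and } \ {\rm NP}^A = {\rm NP},
\qquad\text{hence}\qquad
{\rm P}^A \neq {\rm NP}^A \iff {\rm P} \neq {\rm NP}.
```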
-
Inertial Line-Of-Sight Stabilization Using a 3-DOF Spherical Parallel Manipulator with Coaxial Input Shafts
Authors:
Alexandre Le,
Guillaume Rance,
Fabrice Rouillier,
Damien Chablat
Abstract:
This article dives into the use of a 3-RRR Spherical Parallel Manipulator (SPM) for the purpose of inertial Line Of Sight (LOS) stabilization. Such a parallel robot provides three Degrees of Freedom (DOF) in orientation and is studied from the kinematic point of view. In particular, one guarantees that the singular loci (with the resulting numerical instabilities and inappropriate behavior of the mechanism) are far away from the prescribed workspace. Once the kinematics of the device is certified, a control strategy needs to be implemented in order to stabilize the LOS through the upper platform of the mechanism. Such a work is done with MATLAB Simulink using a SimMechanics model of our robot.
Submitted 2 January, 2024; v1 submitted 5 December, 2023;
originally announced December 2023.
-
Training Chain-of-Thought via Latent-Variable Inference
Authors:
Du Phan,
Matthew D. Hoffman,
David Dohan,
Sholto Douglas,
Tuan Anh Le,
Aaron Parisi,
Pavel Sountsov,
Charles Sutton,
Sharad Vikram,
Rif A. Saurous
Abstract:
Large language models (LLMs) solve problems more accurately and interpretably when instructed to work out the answer step by step using a ``chain-of-thought'' (CoT) prompt. One can also improve LLMs' performance on a specific task by supervised fine-tuning, i.e., by using gradient ascent on some tunable parameters to maximize the average log-likelihood of correct answers from a labeled training set. Naively combining CoT with supervised tuning requires supervision not just of the correct answers, but also of detailed rationales that lead to those answers; these rationales are expensive to produce by hand. Instead, we propose a fine-tuning strategy that tries to maximize the \emph{marginal} log-likelihood of generating a correct answer using CoT prompting, approximately averaging over all possible rationales. The core challenge is sampling from the posterior over rationales conditioned on the correct answer; we address it using a simple Markov-chain Monte Carlo (MCMC) expectation-maximization (EM) algorithm inspired by the self-taught reasoner (STaR), memoized wake-sleep, Markovian score climbing, and persistent contrastive divergence. This algorithm also admits a novel control-variate technique that drives the variance of our gradient estimates to zero as the model improves. Applying our technique to GSM8K and the tasks in BIG-Bench Hard, we find that this MCMC-EM fine-tuning technique typically improves the model's accuracy on held-out examples more than STaR or prompt-tuning with or without CoT.
Submitted 28 November, 2023;
originally announced December 2023.
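The marginal log-likelihood objective described in the abstract can be written out as follows. This is a sketch in assumed notation, not taken from the paper: $x$ is the question, $z$ a rationale, $y$ the correct answer, and $\theta$ the tunable parameters. The second line is the standard identity for latent-variable models, and it shows why sampling from the posterior over rationales is the core difficulty.

```latex
\log p_\theta(y \mid x)
  = \log \sum_{z} p_\theta(z \mid x)\, p_\theta(y \mid z, x),
\qquad
\nabla_\theta \log p_\theta(y \mid x)
  = \mathbb{E}_{z \sim p_\theta(z \mid x, y)}
    \bigl[ \nabla_\theta \log p_\theta(z, y \mid x) \bigr].
```

The expectation is taken under $p_\theta(z \mid x, y)$, the posterior over rationales given the correct answer, which is the distribution the abstract's MCMC procedure is meant to sample from.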
-
The Seoul National University AGN Monitoring Project III: H$β$ lag measurements of 32 luminous AGNs and the high-luminosity end of the size--luminosity relation
Authors:
Jong-Hak Woo,
Shu Wang,
Suvendu Rakshit,
Hojin Cho,
Donghoon Son,
Vardha N. Bennert,
Elena Gallo,
Edmund Hodges-Kluck,
Tommaso Treu,
Aaron J. Barth,
Wanjin Cho,
Adi Foord,
Jaehyuk Geum,
Hengxiao Guo,
Yashashree Jadhav,
Yiseul Jeon,
Kyle M. Kabasares,
Won-Suk Kang,
Changseok Kim,
Minjin Kim,
Tae-Woo Kim,
Huynh Anh N. Le,
Matthew A. Malkan,
Amit Kumar Mandal,
Daeseong Park,
et al. (6 additional authors not shown)
Abstract:
We present the main results from a long-term reverberation mapping campaign carried out for the Seoul National University Active Galactic Nuclei (AGN) Monitoring Project. High-quality data were obtained during 2015-2021 for 32 luminous AGNs (i.e., continuum luminosity in the range of $10^{44-46}$ erg s$^{-1}$) at a regular cadence of 20-30 days for spectroscopy and 3-5 days for photometry. We obtain time lag measurements between the variability in the H$β$ emission and the continuum for 32 AGNs; twenty-five of those have the best lag measurements based on our quality assessment, examining correlation strength and the posterior lag distribution. Our study significantly increases the current sample of reverberation-mapped AGNs, particularly at the moderate to high luminosity end. Combining our results with literature measurements, we derive a H$β$ broad line region size--luminosity relation with a shallower slope than reported in the literature. For a given luminosity, most of our measured lags are shorter than the expectation, implying that single-epoch black hole mass estimators based on previous calibrations could suffer large systematic uncertainties.
Submitted 26 November, 2023;
originally announced November 2023.
-
Scattering from an external field in quantum chromodynamics at high energies: from foundations to interdisciplinary connections
Authors:
Athanasia-Konstantina Angelopoulou,
Anh Dung Le,
Stéphane Munier
Abstract:
We review the factorization of the $S$-matrix elements in the context of particle scattering off an external field, which can serve as a model for the field of a large nucleus. The factorization takes the form of a convolution of light cone wave functions describing the physical incoming and outgoing states in terms of bare partons, and products of Wilson lines. The latter represent the interaction between the bare partons and the external field. Specializing to elastic scattering amplitudes of onia at very high energies, we introduce the color dipole model, which formulates the calculation of the modulus-squared of the wave functions in quantum chromodynamics with the help of a branching random walk, and the scattering amplitudes as observables on this classical stochastic process. Methods developed for general branching processes produce analytical formulas for the asymptotics of such observables, and thus enable one to derive exact large-rapidity expressions for onium-nucleus cross sections, from which electron-nucleus cross sections may be inferred.
Submitted 14 October, 2024; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Evaluating the Surrogate Index as a Decision-Making Tool Using 200 A/B Tests at Netflix
Authors:
Vickie Zhang,
Michael Zhao,
Maria Dimakopoulou,
Anh Le,
Nathan Kallus
Abstract:
Surrogate index approaches have recently become a popular method of estimating longer-term impact from shorter-term outcomes. In this paper, we leverage 1098 test arms from 200 A/B tests at Netflix to empirically investigate to what degree decisions made using a surrogate index utilizing 14 days of data would align with those made using direct measurement of day 63 treatment effects. Focusing specifically on linear "auto-surrogate" models that utilize the shorter-term observations of the long-term outcome of interest, we find that the statistical inferences that we would draw from using the surrogate index are ~95% consistent with those from directly measuring the long-term treatment effect. Moreover, when we restrict ourselves to the set of tests that would be "launched" (i.e. positive and statistically significant) based on the 63-day directly measured treatment effects, we find that relying instead on the surrogate index achieves 79% and 65% recall.
Submitted 30 January, 2024; v1 submitted 20 November, 2023;
originally announced November 2023.
-
Coherent postionization dynamics of molecules based on adiabatic strong-field approximation
Authors:
Shan Xue,
Wenli Yang,
Ping Li,
Yuxuan Zhang,
Pengji Ding,
Song-Feng Zhao,
Hongchuan Du,
Anh-Thu Le
Abstract:
Open-system density matrix methods typically employ incoherent population injection to investigate the postionization dynamics in strong laser fields. The presence of coherence injection has long been a subject of debate. In this context, we introduce a coherence injection model based on the adiabatic strong-field approximation (ASFA). This model effectively predicts ionic coherence resulting from directional tunnel ionization. With increasing field strength, the degree of coherence predicted by the ASFA model gradually deviates from that of the SFA model but remains much milder compared to the results of the simple and partial-wave expansion models. The impact of coherence injection on the postionization molecular dynamics is explored in O$_2$ and N$_2$. We find that the ionization-induced vibrational coherence strongly enhances the population inversion of $X^2 Σ_g^+ -B^2 Σ_u^+ $ in N$_2^+$ and the dissociation probability of O$_2^+$. Conversely, the ionization-induced vibronic coherences have inhibitory effects on the related transitions. These findings reveal the significance of including the vibronic-state-resolved coherence injection in simulating molecular dynamics following strong-field ionization.
Submitted 19 November, 2023;
originally announced November 2023.
-
Generalized Firefly Algorithm for Optimal Transmit Beamforming
Authors:
Tuan Anh Le,
Xin-She Yang
Abstract:
This paper proposes a generalized Firefly Algorithm (FA) to solve an optimization framework having objective function and constraints as multivariate functions of independent optimization variables. Four representative examples of how the proposed generalized FA can be adopted to solve downlink beamforming problems are shown for classic transmit beamforming, cognitive beamforming, reconfigurable-intelligent-surfaces-aided (RIS-aided) transmit beamforming, and RIS-aided wireless power transfer (WPT). Complexity analyses indicate that in large-antenna regimes the proposed FA approaches require less computational complexity than their corresponding interior point methods (IPMs) do, yet demand a higher complexity than the iterative and the successive convex approximation (SCA) approaches do. Simulation results reveal that the proposed FA attains the same global optimal solution as that of the IPM for an optimization problem in cognitive beamforming. On the other hand, the proposed FA approaches outperform the iterative, IPM and SCA approaches in terms of obtaining better solutions for the optimization problems in classic transmit beamforming, RIS-aided transmit beamforming and RIS-aided WPT, respectively.
Submitted 27 October, 2023;
originally announced October 2023.
-
Unraveling the Complex Structure of AGN-driven Outflows. VI. Strong Ionized Outflows in Type 1 AGNs and the Outflow Size-Luminosity Relation
Authors:
Changseok Kim,
Jong-Hak Woo,
Rongxin Luo,
Aeree Chung,
Junhyun Baek,
Huynh Anh N. Le,
Donghoon Son
Abstract:
We present spatially resolved gas kinematics, ionization, and energetics of 11 type 1 and 5 type 2 active galactic nuclei (AGNs) with strong ionized gas outflows at z $<0.3$ using Gemini Multi-Object Spectrograph Integral Field Unit (GMOS-IFU) data. We find a strongly blueshifted region in [OIII] velocity maps, representing an approaching cone in biconical outflows, and blueshifted and redshifted regions in H$α$ velocity maps, which show gravitationally rotating kinematics. AGN photoionization is dominant in the central region of most targets, and some of them also show ring-like structures of LINER or composite that surround the AGN-dominated center. Following our previous studies, we kinematically determine outflow sizes by the ratio between [OIII] and stellar velocity dispersion. Outflow sizes of type 1 AGNs follow the same kinematic outflow size-[OIII] luminosity relation obtained from the type 2 IFU sample in Kang & Woo and Luo (updated slope $0.29\pm0.04$), while they are limited to the central kpc scales, indicating the lack of global impact of outflows on the interstellar medium. Small mass outflow rates and large star formation rates of the combined sample support that there is no evidence of rapid star formation quenching by outflows, which is consistent with the delayed AGN feedback.
Submitted 12 October, 2023; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Accelerating Motion Planning via Optimal Transport
Authors:
An T. Le,
Georgia Chalvatzaki,
Armin Biess,
Jan Peters
Abstract:
Motion planning is still an open problem for many disciplines, e.g., robotics, autonomous driving, due to their need for high computational resources that hinder real-time, efficient decision-making. A class of methods striving to provide smooth solutions is gradient-based trajectory optimization. However, those methods usually suffer from bad local minima, while for many settings, they may be inapplicable due to the absence of easy-to-access gradients of the optimization objectives. In response to these issues, we introduce Motion Planning via Optimal Transport (MPOT) -- a \textit{gradient-free} method that optimizes a batch of smooth trajectories over highly nonlinear costs, even for high-dimensional tasks, while imposing smoothness through a Gaussian Process dynamics prior via the planning-as-inference perspective. To facilitate batch trajectory optimization, we introduce an original zero-order and highly-parallelizable update rule: the Sinkhorn Step, which uses the regular polytope family for its search directions. Each regular polytope, centered on trajectory waypoints, serves as a local cost-probing neighborhood, acting as a \textit{trust region} where the Sinkhorn Step "transports" local waypoints toward low-cost regions. We theoretically show that Sinkhorn Step guides the optimizing parameters toward local minima regions of non-convex objective functions. We then show the efficiency of MPOT in a range of problems from low-dimensional point-mass navigation to high-dimensional whole-body robot motion planning, evincing its superiority compared to popular motion planners, paving the way for new applications of optimal transport in motion planning.
Submitted 28 October, 2023; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Averages of completely multiplicative functions over the Gaussian integers -- a dynamical approach
Authors:
Sebastián Donoso,
Anh N. Le,
Joel Moreira,
Wenbo Sun
Abstract:
We prove a pointwise convergence result for additive ergodic averages associated with certain multiplicative actions of the Gaussian integers. We derive several applications in dynamics and number theory, including:
(i) Wirsing's theorem for Gaussian integers: if $f\colon \mathbb{G} \to \mathbb{R}$ is a bounded completely multiplicative function, then the following limit exists: $$\lim_{N \to \infty} \frac{1}{N^2} \sum_{1 \leq m, n \leq N} f(m + {\rm i} n).$$ (ii) An answer to a special case of a question of Frantzikinakis and Host: for any completely multiplicative real-valued function $f: \mathbb{N} \to \mathbb{R}$, the following limit exists: $$\lim_{N \to \infty} \frac{1}{N^2} \sum_{1 \leq m, n \leq N} f(m^2 + n^2).$$ (iii) A variant of a theorem of Bergelson and Richter on ergodic averages along the $Ω$ function: if $(X,T)$ is a uniquely ergodic system with unique invariant measure $μ$, then for any $x\in X$ and $f\in C(X)$, $$\lim_{N\to\infty}\frac{1}{N^2}\sum_{1 \leq m, n \leq N} f(T^{Ω(m^2 + n^2)}x)=\int_Xf \ dμ.$$
Submitted 6 March, 2024; v1 submitted 13 September, 2023;
originally announced September 2023.
-
Double RIS-Assisted MIMO Systems Over Spatially Correlated Rician Fading Channels and Finite Scatterers
Authors:
Ha An Le,
Trinh Van Chien,
Van Duc Nguyen,
Wan Choi
Abstract:
This paper investigates double RIS-assisted MIMO communication systems over Rician fading channels with finite scatterers, spatial correlation, and the existence of a double-scattering link between the transceiver. First, the statistical information is derived in closed form for the aggregated channels, unveiling various influences of the system and environment on the average channel power gains. Next, we study two active and passive beamforming designs corresponding to two objectives. The first problem maximizes channel capacity by jointly optimizing the active precoding and combining matrices at the transceivers and passive beamforming at the double RISs subject to the transmitting power constraint. In order to tackle the inherently non-convex issue, we propose an efficient alternating optimization algorithm (AO) based on the alternating direction method of multipliers (ADMM). The second problem enhances communication reliability by jointly training the encoder and decoder at the transceivers and the phase shifters at the RISs. Each neural network representing a system entity in an end-to-end learning framework is proposed to minimize the symbol error rate of the detected symbols by controlling the transceiver and the RISs phase shifts. Numerical results verify our analysis and demonstrate the superior improvements of phase shift designs to boost system performance.
Submitted 8 September, 2023;
originally announced September 2023.
-
A preliminary study of photometric redshifts based on the Wide Field Survey Telescope
Authors:
Yu Liu,
Xiao-zhi Lin,
Yong-quan Xue,
Huynh Anh N. Le
Abstract:
The Wide Field Survey Telescope (WFST) is a dedicated time-domain multi-band ($u$, $g$, $r$, $i$, and $z$) photometric survey facility under construction. In this paper, we present a preliminary study that assesses the quality of photometric redshifts based on WFST by utilizing mock observations derived with the galaxy catalog in the COSMOS/UltraVISTA field. We apply the template fitting technique to estimate photometric redshifts by using the ZEBRA photometric-redshift code and adopting a modified set of adaptive templates. We evaluate the bias (median relative offset between the output photometric redshifts and input redshifts), normalized median absolute deviation ($σ_{\rm NMAD}$) and outlier fraction ($f_{\rm outlier}$) of photometric redshifts in two typical WFST observational cases, the single 30-second exposure observations (hereafter shallow mode) and co-added 50-minute exposure observations (hereafter deep mode). We find bias$\la0.006$, $σ_{\rm NMAD}\la0.03$, and $f_{\rm outlier}\la5\%$ in the shallow mode and bias$\approx 0.005$, $σ_{\rm NMAD}\approx 0.06$, and $f_{\rm outlier}\approx 17\%$--$27\%$ in the deep mode, respectively, under various lunar phases. Combining the WFST mock observational data with that from the upcoming CSST and Euclid surveys, we demonstrate that the $z_{\rm phot}$ results can be significantly improved, with $f_{\rm outlier}\approx 1\%$ and $σ_{\rm NMAD}\approx 0.02$.
Submitted 1 September, 2023;
originally announced September 2023.
-
Neural Amortized Inference for Nested Multi-agent Reasoning
Authors:
Kunal Jha,
Tuan Anh Le,
Chuanyang Jin,
Yen-Ling Kuo,
Joshua B. Tenenbaum,
Tianmin Shu
Abstract:
Multi-agent interactions, such as communication, teaching, and bluffing, often rely on higher-order social inference, i.e., understanding how others infer oneself. Such intricate reasoning can be effectively modeled through nested multi-agent reasoning. Nonetheless, the computational complexity escalates exponentially with each level of reasoning, posing a significant challenge. However, humans effortlessly perform complex social inferences as part of their daily lives. To bridge the gap between human-like inference capabilities and computational limitations, we propose a novel approach: leveraging neural networks to amortize high-order social inference, thereby expediting nested multi-agent reasoning. We evaluate our method in two challenging multi-agent interaction domains. The experimental results demonstrate that our method is computationally efficient while exhibiting minimal degradation in accuracy.
Submitted 21 August, 2023;
originally announced August 2023.
-
A Rank-One Optimization Framework and Its Applications to Transmit Beamforming
Authors:
Tuan Anh Le,
Derrick Wing Kwan Ng,
Xin-She Yang
Abstract:
This paper proposes an elegant optimization framework consisting of a mix of linear-matrix-inequality and second-order-cone constraints. The proposed framework generalizes the semidefinite relaxation (SDR) enabled solution to the typical transmit beamforming problems presented in the form of quadratically constrained quadratic programs (QCQPs) in the literature. It is proved that the optimization problems subsumed under the framework always admit a rank-one optimal solution when they are feasible and their optimal solutions are not trivial. This finding indicates that the relaxation is tight as the optimal solution of the original beamforming QCQP can be straightforwardly obtained from that of the SDR counterpart without any loss of optimality. Four representative examples of transmit beamforming, i.e., transmit beamforming with perfect channel state information (CSI), transmit beamforming with imperfect CSI, chance-constraint approach for imperfect CSI, and reconfigurable-intelligent-surface (RIS) aided beamforming, are shown to demonstrate how the proposed optimization framework can be realized in deriving the SDR counterparts for different beamforming designs.
Submitted 18 August, 2023;
originally announced August 2023.
-
Coherently diffractive dissociation in electron-hadron collisions: from HERA to the future EIC
Authors:
Tuomas Lappi,
Anh Dung Le,
Heikki Mäntysaari
Abstract:
We present numerical results on diffractive dissociation with large invariant mass diffractive final states in the scattering of an electron off a hadron. The diffractive large-mass resummation is performed using the nonlinear Kovchegov-Levin equation, taking into account running coupling corrections. For the scattering off the proton, a (modified) McLerran-Venugopalan amplitude is used as the initial condition for the nonlinear evolution, with free parameters being constrained by the HERA inclusive data. The results show a reasonable description of the HERA diffractive structure function data at moderately large diffractive mass when the impact parameter profile is constrained by the low-mass diffractive cross section data. The calculation is extended to nuclear scattering, where the initial condition is generalized from the proton case employing the optical Glauber model. The nonlinear large-mass resummation predicts a strong nuclear modification in diffractive scattering off a nuclear target in kinematics accessible at the future Electron-Ion collider.
Submitted 14 August, 2023;
originally announced August 2023.
-
Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models
Authors:
Joao Carvalho,
An T. Le,
Mark Baierl,
Dorothea Koert,
Jan Peters
Abstract:
Learning priors on trajectory distributions can help accelerate robot motion planning optimization. Given previously successful plans, learning trajectory generative models as priors for a new planning problem is highly desirable. Prior works propose several ways of utilizing this prior to bootstrap the motion planning problem: either sampling the prior for initializations or using the prior distribution in a maximum-a-posteriori formulation for trajectory optimization. In this work, we propose learning diffusion models as priors. We can then sample directly from the posterior trajectory distribution conditioned on task goals, by leveraging the inverse denoising process of diffusion models. Furthermore, diffusion has been recently shown to effectively encode data multimodality in high-dimensional settings, which is particularly well-suited for large trajectory datasets. To demonstrate our method's efficacy, we compare our proposed method, Motion Planning Diffusion, against several baselines in simulated planar robot and 7-dof robot arm manipulator environments. To assess the generalization capabilities of our method, we test it in environments with previously unseen obstacles. Our experiments show that diffusion models are strong priors to encode high-dimensional trajectory distributions of robot motions.
Submitted 26 March, 2024; v1 submitted 3 August, 2023;
originally announced August 2023.