-
The magnetic field in quiescent star-forming filament G16.96+0.27
Authors:
Qi-Lao Gu,
Tie Liu,
Zhi-Qiang Shen,
Sihan Jiao,
Julien Montillaud,
Mika Juvela,
Xing Lu,
Chang Won Lee,
Junhao Liu,
Pak Shing Li,
Xunchuan Liu,
Doug Johnstone,
Woojin Kwon,
Kee-Tae Kim,
Ken'ichi Tatematsu,
Patricio Sanhueza,
Isabelle Ristorcelli,
Patrick Koch,
Qizhou Zhang,
Kate Pattle,
Naomi Hirano,
Dana Alina,
James Di Francesco
Abstract:
We present 850 μm thermal dust polarization observations with a resolution of 14.4"(~ 0.13 pc) towards an infrared dark cloud G16.96+0.27 using JCMT/POL-2. The average magnetic field orientation, which roughly agrees with the larger-scale magnetic field orientation traced by the Planck 353 GHz data, is approximately perpendicular to the filament structure. The estimated plane-of-sky magnetic field…
▽ More
We present 850 μm thermal dust polarization observations with a resolution of 14.4"(~ 0.13 pc) towards an infrared dark cloud G16.96+0.27 using JCMT/POL-2. The average magnetic field orientation, which roughly agrees with the larger-scale magnetic field orientation traced by the Planck 353 GHz data, is approximately perpendicular to the filament structure. The estimated plane-of-sky magnetic field strength is ~ 96 μG and ~ 60 μG using two variants of the Davis-Chandrasekhar-Fermi methods. We calculate the virial and magnetic critical parameters to evaluate the relative importance of gravity, the magnetic field, and turbulence. The magnetic field and turbulence are both weaker than gravity, but magnetic fields and turbulence together are equal to gravity, suggesting that G16.96+0.27 is in a quasi-equilibrium state. The cloud-magnetic-field alignment is found to have a trend moving away from perpendicularity in the dense regions, which may serve as a tracer of potential fragmentation in such quiescent filaments.
△ Less
Submitted 21 October, 2024;
originally announced October 2024.
-
Improving Parallel Program Performance Through DSL-Driven Code Generation with LLM Optimizers
Authors:
Anjiang Wei,
Allen Nie,
Thiago S. F. X. Teixeira,
Rohan Yadav,
Wonchan Lee,
Ke Wang,
Alex Aiken
Abstract:
Mapping computations to processors and assigning data to memory are critical for maximizing performance in parallel programming. These mapping decisions are managed through the development of specialized low-level system code, called mappers, crafted by performance engineers. Each mapper is tailored to a specific application and optimized for the underlying machine architecture, a process that req…
▽ More
Mapping computations to processors and assigning data to memory are critical for maximizing performance in parallel programming. These mapping decisions are managed through the development of specialized low-level system code, called mappers, crafted by performance engineers. Each mapper is tailored to a specific application and optimized for the underlying machine architecture, a process that requires days of refinement and tuning from an expert. Despite advances in system research, automating mapper generation remains a challenge due to the complexity of making millions of decisions to find the optimal solution and generate the solution as code. We introduce an approach that leverages recent advances in LLM-based optimizers for mapper design. In under ten minutes, our method automatically discovers mappers that surpass human expert designs in scientific applications by up to 1.34X speedup. For parallel matrix multiplication algorithms, our mapper achieves up to 1.31X of the expert-designed solution. To achieve this, we simplify the complexity of low-level code generation by introducing a domain-specific language (DSL) that abstracts the low-level system programming details and defines a structured search space for LLMs to explore. To maximize the application performance, we use an LLM optimizer to improve an agentic system that generates the mapper code. As a result, this approach significantly reduces the workload for performance engineers while achieving substantial performance gains across diverse applications. Finally, our results demonstrate the effectiveness of LLM-based optimization in system design and suggest its potential for addressing other complex system challenges.
△ Less
Submitted 21 October, 2024;
originally announced October 2024.
-
The ALMA-QUARKS Survey: Fibers' role in star formation unveiled in an intermediate-mass protocluster region of the Vela D cloud
Authors:
Dongting Yang,
HongLi Liu,
Tie Liu,
Anandmayee Tej,
Xunchuan Liu,
Jinhua He,
Guido Garay,
Amelia Stutz,
Lei Zhu,
Sheng-Li Qin,
Fengwei Xu,
Pak-Shing Li,
Mika Juvela,
Pablo Garcia,
Paul F. Goldsmith,
Siju Zhang,
Xindi Tang,
Patricio Sanhueza,
Shanghuo Li,
Chang Won Lee,
Swagat Ranjan Das,
Wenyu Jiao,
Xiaofeng Mai,
Prasanta Gorai,
Yichen Zhang
, et al. (10 additional authors not shown)
Abstract:
In this paper, we present a detailed analysis of the IRS 17 filament within the intermediate-mass protocluster IRAS 08448-4343 (of $\sim\,10^3\,\rm M_{\odot}$), using ALMA data from the ATOMS 3-mm and QUARKS 1.3-mm surveys. The IRS 17 filament, which spans $\sim$54000 au ($0.26\,\rm pc$) in length and $\sim$4000 au ($0.02\,\rm pc$) in width, exhibits a complex, multi-component velocity field, and…
▽ More
In this paper, we present a detailed analysis of the IRS 17 filament within the intermediate-mass protocluster IRAS 08448-4343 (of $\sim\,10^3\,\rm M_{\odot}$), using ALMA data from the ATOMS 3-mm and QUARKS 1.3-mm surveys. The IRS 17 filament, which spans $\sim$54000 au ($0.26\,\rm pc$) in length and $\sim$4000 au ($0.02\,\rm pc$) in width, exhibits a complex, multi-component velocity field, and harbours hierarchical substructures. These substructures include three bundles of seven velocity-coherent fibers, and 29 dense ($n\sim 10^8\,\rm cm^{-3}$) condensations. The fibers have a median length of $\sim 4500\,\rm au$ and a median width of $\sim 1400\,\rm au$. Among these fibers, four are identified as ``fertile", each hosting at least three dense condensations, which are regarded as the ``seeds" of star formation. While the detected cores are randomly spaced within the IRS\,17 filament based on the 3-mm dust continuum image, periodic spacing ($\sim1600\,\rm au$) of condensations is observed in the fertile fibers according to the 1.3-mm dust map, consistent with the predictions of linear isothermal cylinder fragmentation models. These findings underscore the crucial role of fibers in star formation and suggest a hierarchical fragmentation process that extends from the filament to the fibers, and ultimately, to the smallest-scale condensations.
△ Less
Submitted 20 October, 2024;
originally announced October 2024.
-
A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné
, et al. (1758 additional authors not shown)
Abstract:
The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by…
▽ More
The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by CHIME/FRB, as well as X-ray glitches and X-ray bursts detected by NICER and NuSTAR close to the time of one of the FRBs. We do not detect any significant GW emission from any of the events. Instead, using a short-duration GW search (for bursts $\leq$ 1 s) we derive 50\% (90\%) upper limits of $10^{48}$ ($10^{49}$) erg for GWs at 300 Hz and $10^{49}$ ($10^{50}$) erg at 2 kHz, and constrain the GW-to-radio energy ratio to $\leq 10^{14} - 10^{16}$. We also derive upper limits from a long-duration search for bursts with durations between 1 and 10 s. These represent the strictest upper limits on concurrent GW emission from FRBs.
△ Less
Submitted 11 October, 2024;
originally announced October 2024.
-
Observations of pre- and proto-brown dwarfs in nearby clouds: paving the way to further constraining theories of brown dwarf formation
Authors:
Aina Palau,
Nuria Huelamo,
David Barrado,
Michael M. Dunham,
Chang Won Lee
Abstract:
Brown Dwarfs (BDs) are crucial objects in our understanding of both star and planet formation. However, there is still an unconcluded debate about which is the dominant formation mechanism of these objects. For this, it is mandatory to study BDs in their earliest evolutionary stages (what we call pre- and proto-BDs), comparable to the `pre-stellar' and `Class 0/I' stages well characterized for the…
▽ More
Brown Dwarfs (BDs) are crucial objects in our understanding of both star and planet formation. However, there is still an unconcluded debate about which is the dominant formation mechanism of these objects. For this, it is mandatory to study BDs in their earliest evolutionary stages (what we call pre- and proto-BDs), comparable to the `pre-stellar' and `Class 0/I' stages well characterized for the formation of low-mass stars. In this review, the recent efforts aimed at searching, identifying and characterising pre- and proto-BD candidates in nearby star-forming regions are presented, and revised requirements for an object to be a promising proto-BD or pre-BD candidate are provided, based on a new, unexplored so far, relation between the internal luminosity and the accreted mass. By applying these requirements, a list of 67 promising proto-BD candidates is presented, along with a compilation of possible pre-BDs from the literature. Updated correlations of protostellar properties such as mass infall rate or outflow momentum rate with bolometric luminosity are provided down to the low-mass BD regime, where no significant deviations are apparent. Furthermore, the number of proto-BD candidates in different clouds of the Solar Neighborhood seem to follow the known relations of number of protostars with cloud properties. In addition, proto(star-to-BD) ratios for the different clouds are also explored, unveiling a particular underproduction of low-mass proto-BD candidates in Ophiuchus compared to Lupus and Taurus. Possible explanations for this behavior are discussed, including heating of the Ophiuchus cloud by the nearby OB stars. The overall results of this work tend to favor a star-like process for BD formation down to the planetary boundary, of about 10 Mjup, below which other mechanisms might be at work.
△ Less
Submitted 21 October, 2024; v1 submitted 10 October, 2024;
originally announced October 2024.
-
Can Transformers Reason Logically? A Study in SAT Solving
Authors:
Leyan Pan,
Vijay Ganesh,
Jacob Abernethy,
Chris Esposo,
Wenke Lee
Abstract:
We theoretically and empirically study the logical reasoning capabilities of LLMs in the context of the Boolean satisfiability (SAT) problem. First, we construct a decoder-only Transformer that can solve SAT using backtracking and deduction via Chain-of-Thought (CoT). We prove its correctness by showing trace equivalence to the well-known DPLL SAT-solving algorithm. Second, to support the implemen…
▽ More
We theoretically and empirically study the logical reasoning capabilities of LLMs in the context of the Boolean satisfiability (SAT) problem. First, we construct a decoder-only Transformer that can solve SAT using backtracking and deduction via Chain-of-Thought (CoT). We prove its correctness by showing trace equivalence to the well-known DPLL SAT-solving algorithm. Second, to support the implementation of this abstract construction, we design a compiler $\texttt{PARAT}$ that takes as input a procedural specification and outputs a transformer model implementing this specification. Third, rather than $\textit{programming}$ a transformer to reason, we evaluate empirically whether it can be $\textit{trained}$ to do so by learning directly from algorithmic traces ("reasoning paths") of the DPLL algorithm.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
Embedded State Estimation for Optimization of Cislunar Space Domain Awareness Constellation Design
Authors:
Thomas H. Clareson,
Matthew C. Fox,
Dominic K. Amato,
Hang Woon Lee
Abstract:
The traffic in cislunar space is expected to increase over the coming years, leading to a higher likelihood of conjunction events among active satellites, orbital debris, and non-cooperative satellites. This increase necessitates enhanced space domain awareness (SDA) capabilities that include state estimation for targets of interest. Both Earth surface-based and space-based observation platforms i…
▽ More
The traffic in cislunar space is expected to increase over the coming years, leading to a higher likelihood of conjunction events among active satellites, orbital debris, and non-cooperative satellites. This increase necessitates enhanced space domain awareness (SDA) capabilities that include state estimation for targets of interest. Both Earth surface-based and space-based observation platforms in geosynchronous orbit or below face challenges such as range, exclusion, and occlusion that hinder observation. Motivated by the need to place space-based observers in the cislunar space regime to overcome these challenges, this paper proposes a cislunar SDA constellation design and analysis framework that integrates state estimation into an optimization problem for determining the placement of observers for optimal state estimation performance on a set of targets. The proposed multi-observer placement optimization problem samples from a range of possible target orbits. Upon convergence, the optimized constellation is validated against a broader set of targets to assess its effectiveness. Two comparative analyses are presented to evaluate the effects of changes in the sensor tasking procedure and sensor fidelity on the optimized constellation, comparing these to a single observer baseline case. The results demonstrate that the optimized constellations can provide accurate state estimation for various orbit families.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
FINALLY: fast and universal speech enhancement with studio-like quality
Authors:
Nicholas Babaev,
Kirill Tamogashev,
Azat Saginbaev,
Ivan Shchekotov,
Hanbin Bae,
Hosang Sung,
WonJun Lee,
Hoon-Young Cho,
Pavel Andreev
Abstract:
In this paper, we address the challenge of speech enhancement in real-world recordings, which often contain various forms of distortion, such as background noise, reverberation, and microphone artifacts. We revisit the use of Generative Adversarial Networks (GANs) for speech enhancement and theoretically show that GANs are naturally inclined to seek the point of maximum density within the conditio…
▽ More
In this paper, we address the challenge of speech enhancement in real-world recordings, which often contain various forms of distortion, such as background noise, reverberation, and microphone artifacts. We revisit the use of Generative Adversarial Networks (GANs) for speech enhancement and theoretically show that GANs are naturally inclined to seek the point of maximum density within the conditional clean speech distribution, which, as we argue, is essential for the speech enhancement task. We study various feature extractors for perceptual loss to facilitate the stability of adversarial training, developing a methodology for probing the structure of the feature space. This leads us to integrate WavLM-based perceptual loss into MS-STFT adversarial training pipeline, creating an effective and stable training procedure for the speech enhancement model. The resulting speech enhancement model, which we refer to as FINALLY, builds upon the HiFi++ architecture, augmented with a WavLM encoder and a novel training pipeline. Empirical results on various datasets confirm our model's ability to produce clear, high-quality speech at 48 kHz, achieving state-of-the-art performance in the field of speech enhancement.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas
Authors:
Seungjong Sun,
Eungu Lee,
Seo Yeon Baek,
Seunghyun Hwang,
Wonbyung Lee,
Dongyan Nan,
Bernard J. Jansen,
Jang Hyun Kim
Abstract:
This study is the first to explore whether multi-modal large language models (LLMs) can align their behaviors with visual personas, addressing a significant gap in the literature that predominantly focuses on text-based personas. We developed a novel dataset of 5K fictional avatar images for assignment as visual personas to LLMs, and analyzed their negotiation behaviors based on the visual traits…
▽ More
This study is the first to explore whether multi-modal large language models (LLMs) can align their behaviors with visual personas, addressing a significant gap in the literature that predominantly focuses on text-based personas. We developed a novel dataset of 5K fictional avatar images for assignment as visual personas to LLMs, and analyzed their negotiation behaviors based on the visual traits depicted in these images, with a particular focus on aggressiveness. The results indicate that LLMs assess the aggressiveness of images in a manner similar to humans and output more aggressive negotiation behaviors when prompted with an aggressive visual persona. Interestingly, the LLM exhibited more aggressive negotiation behaviors when the opponent's image appeared less aggressive than their own, and less aggressive behaviors when the opponents image appeared more aggressive.
△ Less
Submitted 4 October, 2024;
originally announced October 2024.
-
CounterQuill: Investigating the Potential of Human-AI Collaboration in Online Counterspeech Writing
Authors:
Xiaohan Ding,
Kaike Ping,
Uma Sushmitha Gunturi,
Buse Carik,
Sophia Stil,
Lance T Wilhelm,
Taufiq Daryanto,
James Hawdon,
Sang Won Lee,
Eugenia H Rho
Abstract:
Online hate speech has become increasingly prevalent on social media platforms, causing harm to individuals and society. While efforts have been made to combat this issue through content moderation, the potential of user-driven counterspeech as an alternative solution remains underexplored. Existing counterspeech methods often face challenges such as fear of retaliation and skill-related barriers.…
▽ More
Online hate speech has become increasingly prevalent on social media platforms, causing harm to individuals and society. While efforts have been made to combat this issue through content moderation, the potential of user-driven counterspeech as an alternative solution remains underexplored. Existing counterspeech methods often face challenges such as fear of retaliation and skill-related barriers. To address these challenges, we introduce CounterQuill, an AI-mediated system that assists users in composing effective and empathetic counterspeech. CounterQuill provides a three-step process: (1) a learning session to help users understand hate speech and counterspeech; (2) a brainstorming session that guides users in identifying key elements of hate speech and exploring counterspeech strategies; and (3) a co-writing session that enables users to draft and refine their counterspeech with CounterQuill. We conducted a within-subjects user study with 20 participants to evaluate CounterQuill in comparison to ChatGPT. Results show that CounterQuill's guidance and collaborative writing process provided users a stronger sense of ownership over their co-authored counterspeech. Users perceived CounterQuill as a writing partner and thus were more willing to post the co-written counterspeech online compared to the one written with ChatGPT.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
ALLO: A Photorealistic Dataset and Data Generation Pipeline for Anomaly Detection During Robotic Proximity Operations in Lunar Orbit
Authors:
Selina Leveugle,
Chang Won Lee,
Svetlana Stolpner,
Chris Langley,
Paul Grouchy,
Steven Waslander,
Jonathan Kelly
Abstract:
NASA's forthcoming Lunar Gateway space station, which will be uncrewed most of the time, will need to operate with an unprecedented level of autonomy. Enhancing autonomy on the Gateway presents several unique challenges, one of which is to equip the Canadarm3, the Gateway's external robotic system, with the capability to perform worksite monitoring. Monitoring will involve using the arm's inspecti…
▽ More
NASA's forthcoming Lunar Gateway space station, which will be uncrewed most of the time, will need to operate with an unprecedented level of autonomy. Enhancing autonomy on the Gateway presents several unique challenges, one of which is to equip the Canadarm3, the Gateway's external robotic system, with the capability to perform worksite monitoring. Monitoring will involve using the arm's inspection cameras to detect any anomalies within the operating environment, a task complicated by the widely-varying lighting conditions in space. In this paper, we introduce the visual anomaly detection and localization task for space applications and establish a benchmark with our novel synthetic dataset called ALLO (for Anomaly Localization in Lunar Orbit). We develop a complete data generation pipeline to create ALLO, which we use to evaluate the performance of state-of-the-art visual anomaly detection algorithms. Given the low tolerance for risk during space operations and the lack of relevant data, we emphasize the need for novel, robust, and accurate anomaly detection methods to handle the challenging visual conditions found in lunar orbit and beyond.
△ Less
Submitted 30 September, 2024;
originally announced September 2024.
-
Search for proton decay via $p\rightarrow{e^+η}$ and $p\rightarrow{μ^+η}$ with a 0.37 Mton-year exposure of Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
:,
N. Taniuchi,
K. Abe,
S. Abe,
Y. Asaoka,
C. Bronner,
M. Harada,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
M. Nakahata,
S. Nakayama,
Y. Noguchi
, et al. (267 additional authors not shown)
Abstract:
A search for proton decay into $e^+/μ^+$ and a $η$ meson has been performed using data from a 0.373 Mton$\cdot$year exposure (6050.3 live days) of Super-Kamiokande. Compared to previous searches this work introduces an improved model of the intranuclear $η$ interaction cross section, resulting in a factor of two reduction in uncertainties from this source and $\sim$10\% increase in signal efficien…
▽ More
A search for proton decay into $e^+/μ^+$ and a $η$ meson has been performed using data from a 0.373 Mton$\cdot$year exposure (6050.3 live days) of Super-Kamiokande. Compared to previous searches this work introduces an improved model of the intranuclear $η$ interaction cross section, resulting in a factor of two reduction in uncertainties from this source and $\sim$10\% increase in signal efficiency. No significant data excess was found above the expected number of atmospheric neutrino background events resulting in no indication of proton decay into either mode. Lower limits on the proton partial lifetime of $1.4\times\mathrm{10^{34}~years}$ for $p\rightarrow e^+η$ and $7.3\times\mathrm{10^{33}~years}$ for $p\rightarrow μ^+η$ at the 90$\%$ C.L. were set. These limits are around 1.5 times longer than our previous study and are the most stringent to date.
△ Less
Submitted 29 September, 2024;
originally announced September 2024.
-
Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds
Authors:
Hanbin Bae,
Pavel Andreev,
Azat Saginbaev,
Nicholas Babaev,
Won-Jun Lee,
Hosang Sung,
Hoon-Young Cho
Abstract:
This paper introduces a speech enhancement solution tailored for true wireless stereo (TWS) earbuds on-device usage. The solution was specifically designed to support conversations in noisy environments, with active noise cancellation (ANC) activated. The primary challenges for speech enhancement models in this context arise from computational complexity that limits on-device usage and latency tha…
▽ More
This paper introduces a speech enhancement solution tailored for true wireless stereo (TWS) earbuds on-device usage. The solution was specifically designed to support conversations in noisy environments, with active noise cancellation (ANC) activated. The primary challenges for speech enhancement models in this context arise from computational complexity that limits on-device usage and latency that must be less than 3 ms to preserve a live conversation. To address these issues, we evaluated several crucial design elements, including the network architecture and domain, design of loss functions, pruning method, and hardware-specific optimization. Consequently, we demonstrated substantial improvements in speech enhancement quality compared with that in baseline models, while simultaneously reducing the computational complexity and algorithmic latency.
△ Less
Submitted 27 September, 2024;
originally announced September 2024.
-
GRB 240529A: A Tale of Two Shocks
Authors:
Tian-Rui Sun,
Jin-Jun Geng,
Jing-Zhi Yan,
You-Dong Hu,
Xue-Feng Wu,
Alberto J. Castro-Tirado,
Chao Yang,
Yi-Ding Ping,
Chen-Ran Hu,
Fan Xu,
Hao-Xuan Gao,
Ji-An Jiang,
Yan-Tian Zhu,
Yongquan Xue,
Ignacio Pérez-García,
Si-Yu Wu,
Emilio Fernández-García,
María D. Caballero-García,
Rubén Sánchez-Ramírez,
Sergiy Guziy,
Ignacio Olivares,
Carlos Jesus Pérez del Pulgar,
A. Castellón,
Sebastián Castillo,
Ding-Rong Xiong
, et al. (44 additional authors not shown)
Abstract:
Thanks to the rapidly increasing time-domain facilities, we are entering a golden era of research on gamma-ray bursts (GRBs). In this Letter, we report our observations of GRB 240529A with the Burst Optical Observer and Transient Exploring System, the 1.5-meter telescope at Observatorio Sierra Nevada, the 2.5-meter Wide Field Survey Telescope of China, the Large Binocular Telescope, and the Telesc…
▽ More
Thanks to the rapidly increasing time-domain facilities, we are entering a golden era of research on gamma-ray bursts (GRBs). In this Letter, we report our observations of GRB 240529A with the Burst Optical Observer and Transient Exploring System, the 1.5-meter telescope at Observatorio Sierra Nevada, the 2.5-meter Wide Field Survey Telescope of China, the Large Binocular Telescope, and the Telescopio Nazionale Galileo. The prompt emission of GRB 240529A shows two comparable energetic episodes separated by a quiescence time of roughly 400 s. Combining all available data on the GRB Coordinates Network, we reveal the simultaneous apparent X-ray plateau and optical re-brightening around $10^3-10^4$ s after the burst. Rather than the energy injection from the magnetar as widely invoked for similar GRBs, the multi-wavelength emissions could be better explained as two shocks launched from the central engine separately. The optical peak time and our numerical modeling suggest that the initial bulk Lorentz factor of the later shock is roughly 50, which indicates that the later jet should be accretion-driven and have a higher mass loading than a typical one. The quiescence time between the two prompt emission episodes may be caused by the transition between different accretion states of a central magnetar or black hole, or the fall-back accretion process. A sample of similar bursts with multiple emission episodes in the prompt phase and sufficient follow-up could help to probe the underlying physics of GRB central engines.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling
Authors:
Laurent Dillard,
Hyeonsoo Lee,
Weonsuk Lee,
Tae Soo Kim,
Ali Diba,
Thijs Kooi
Abstract:
When developing Computer Aided Detection (CAD) systems for Digital Breast Tomosynthesis (DBT), the complexity arising from the volumetric nature of the modality poses significant technical challenges for obtaining large-scale accurate annotations. Without access to large-scale annotations, the resulting model may not generalize to different domains. Given the costly nature of obtaining DBT annotat…
▽ More
When developing Computer Aided Detection (CAD) systems for Digital Breast Tomosynthesis (DBT), the complexity arising from the volumetric nature of the modality poses significant technical challenges for obtaining large-scale accurate annotations. Without access to large-scale annotations, the resulting model may not generalize to different domains. Given the costly nature of obtaining DBT annotations, how to effectively increase the amount of data used for training DBT CAD systems remains an open challenge.
In this paper, we present SelectiveKD, a semi-supervised learning framework for building cancer detection models for DBT, which only requires a limited number of annotated slices to reach high performance. We achieve this by utilizing unlabeled slices available in a DBT stack through a knowledge distillation framework in which the teacher model provides a supervisory signal to the student model for all slices in the DBT volume. Our framework mitigates the potential noise in the supervisory signal from a sub-optimal teacher by implementing a selective dataset expansion strategy using pseudo labels.
We evaluate our approach with a large-scale real-world dataset of over 10,000 DBT exams collected from multiple device manufacturers and locations. The resulting SelectiveKD process effectively utilizes unannotated slices from a DBT stack, leading to significantly improved cancer classification performance (AUC) and generalization performance.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
ALMASOP. The Localized and Chemically rich Features near the Bases of the Protostellar Jet in HOPS 87
Authors:
Shih-Ying Hsu,
Chin-Fei Lee,
Sheng-Yuan Liu,
Doug Johnstone,
Tie Liu,
Satoko Takahashi,
Leonardo Bronfman,
Huei-Ru Vivien Chen,
Somnath Dutta,
David J. Eden,
Neal J. Evans II,
Naomi Hirano,
Mika Juvela,
Yi-Jehng Kuan,
Woojin Kwon,
Chang Won Lee,
Jeong-Eun Lee,
Shanghuo Li,
Chun-Fan Liu,
Xunchuan Liu,
Qiuyi Luo,
Sheng-Li Qin,
Dipen Sahu,
Patricio Sanhueza,
Hsien Shang
, et al. (2 additional authors not shown)
Abstract:
HOPS 87 is a Class 0 protostellar core known to harbor an extremely young bipolar outflow and a hot corino. We report the discovery of localized, chemically rich regions near the bases of the two-lobe bipolar molecular outflow in HOPS 87 containing molecules such as H$_2$CO, $^{13}$CS, H$_2$S, OCS, and CH$_3$OH, the simplest complex organic molecule (COM). The locations and kinematics suggest that…
▽ More
HOPS 87 is a Class 0 protostellar core known to harbor an extremely young bipolar outflow and a hot corino. We report the discovery of localized, chemically rich regions near the bases of the two-lobe bipolar molecular outflow in HOPS 87 containing molecules such as H$_2$CO, $^{13}$CS, H$_2$S, OCS, and CH$_3$OH, the simplest complex organic molecule (COM). The locations and kinematics suggest that these localized features are due to jet-driven shocks rather than being part of the hot corino region encasing the protostar. The COM compositions of the molecular gas in these jet-localized regions are relatively simpler than those in the hot corino zone. We speculate that this simplicity is due to either the liberation of ice with a less complex chemical history or the effects of shock chemistry. Our study highlights the dynamic interplay between the protostellar bipolar outflow, disk, inner core environment, and the surrounding medium, contributing to our understanding of molecular complexity in solar-like young stellar objects.
△ Less
Submitted 22 September, 2024;
originally announced September 2024.
-
Unravelling and circumventing failure mechanisms in chalcogenide optical phase change materials
Authors:
Cosmin Constantin Popescu,
Kiumars Aryana,
Brian Mills,
Tae Woo Lee,
Louis Martin-Monier,
Luigi Ranno,
Jia Xu Brian Sia,
Khoi Phuong Dao,
Hyung-Bin Bae,
Vladimir Liberman,
Steven Vitale,
Myungkoo Kang,
Kathleen A. Richardson,
Carlos A. Ríos Ocampo,
Dennis Calahan,
Yifei Zhang,
William M. Humphreys,
Hyun Jung Kim,
Tian Gu,
Juejun Hu
Abstract:
Chalcogenide optical phase change materials (PCMs) have garnered significant interest for their growing applications in programmable photonics, optical analog computing, active metasurfaces, and beyond. Limited endurance or cycling lifetime is however increasingly becoming a bottleneck toward their practical deployment for these applications. To address this issue, we performed a systematic study…
▽ More
Chalcogenide optical phase change materials (PCMs) have garnered significant interest for their growing applications in programmable photonics, optical analog computing, active metasurfaces, and beyond. Limited endurance or cycling lifetime is however increasingly becoming a bottleneck toward their practical deployment for these applications. To address this issue, we performed a systematic study elucidating the cycling failure mechanisms of Ge$_2$Sb$_2$Se$_4$Te (GSST), a common optical PCM tailored for infrared photonic applications, in an electrothermal switching configuration commensurate with their applications in on-chip photonic devices. We further propose a set of design rules building on insights into the failure mechanisms, and successfully implemented them to boost the endurance of the GSST device to over 67,000 cycles.
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
Escaping Local Minima: Hybrid Artificial Potential Field with Wall-Follower for Decentralized Multi-Robot Navigation
Authors:
Joonkyung Kim,
Sangjin Park,
Wonjong Lee,
Woojun Kim,
Nakju Doh,
Changjoo Nam
Abstract:
We tackle the challenges of decentralized multi-robot navigation in environments with nonconvex obstacles, where complete environmental knowledge is unavailable. While reactive methods like Artificial Potential Field (APF) offer simplicity and efficiency, they suffer from local minima, causing robots to become trapped due to their lack of global environmental awareness. Other existing solutions ei…
▽ More
We tackle the challenges of decentralized multi-robot navigation in environments with nonconvex obstacles, where complete environmental knowledge is unavailable. While reactive methods like Artificial Potential Field (APF) offer simplicity and efficiency, they suffer from local minima, causing robots to become trapped due to their lack of global environmental awareness. Other existing solutions either rely on inter-robot communication, are limited to single-robot scenarios, or struggle to overcome nonconvex obstacles effectively.
Our proposed methods enable collision-free navigation using only local sensor and state information without a map. By incorporating a wall-following (WF) behavior into the APF approach, our method allows robots to escape local minima, even in the presence of nonconvex and dynamic obstacles including other robots. We introduce two algorithms for switching between APF and WF: a rule-based system and an encoder network trained on expert demonstrations. Experimental results show that our approach achieves substantially higher success rates compared to state-of-the-art methods, highlighting its ability to overcome the limitations of local minima in complex environments
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
The CRAFT Coherent (CRACO) upgrade I: System Description and Results of the 110-ms Radio Transient Pilot Survey
Authors:
Z. Wang,
K. W. Bannister,
V. Gupta,
X. Deng,
M. Pilawa,
J. Tuthill,
J. D. Bunton,
C. Flynn,
M. Glowacki,
A. Jaini,
Y. W. J. Lee,
E. Lenc,
J. Lucero,
A. Paek,
R. Radhakrishnan,
N. Thyagarajan,
P. Uttarkar,
Y. Wang,
N. D. R. Bhat,
C. W. James,
V. A. Moss,
Tara Murphy,
J. E. Reynolds,
R. M. Shannon,
L. G. Spitler
, et al. (18 additional authors not shown)
Abstract:
We present the first results from a new backend on the Australian Square Kilometre Array Pathfinder, the Commensal Realtime ASKAP Fast Transient COherent (CRACO) upgrade. CRACO records millisecond time resolution visibility data, and searches for dispersed fast transient signals including fast radio bursts (FRB), pulsars, and ultra-long period objects (ULPO). With the visibility data, CRACO can lo…
▽ More
We present the first results from a new backend on the Australian Square Kilometre Array Pathfinder, the Commensal Realtime ASKAP Fast Transient COherent (CRACO) upgrade. CRACO records millisecond time resolution visibility data, and searches for dispersed fast transient signals including fast radio bursts (FRB), pulsars, and ultra-long period objects (ULPO). With the visibility data, CRACO can localise the transient events to arcsecond-level precision after the detection. Here, we describe the CRACO system and report the result from a sky survey carried out by CRACO at 110ms resolution during its commissioning phase. During the survey, CRACO detected two FRBs (including one discovered solely with CRACO, FRB 20231027A), reported more precise localisations for four pulsars, discovered two new RRATs, and detected one known ULPO, GPM J1839-10, through its sub-pulse structure. We present a sensitivity calibration of CRACO, finding that it achieves the expected sensitivity of 11.6 Jy ms to bursts of 110 ms duration or less. CRACO is currently running at a 13.8 ms time resolution and aims at a 1.7 ms time resolution before the end of 2024. The planned CRACO has an expected sensitivity of 1.5 Jy ms to bursts of 1.7 ms duration or less, and can detect 10x more FRBs than the current CRAFT incoherent sum system (i.e., 0.5-2 localised FRBs per day), enabling us to better constrain the FRB emission mechanism model and use them as cosmological probes.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
Optimal Classification-based Anomaly Detection with Neural Networks: Theory and Practice
Authors:
Tian-Yi Zhou,
Matthew Lau,
Jizhou Chen,
Wenke Lee,
Xiaoming Huo
Abstract:
Anomaly detection is an important problem in many application areas, such as network security. Many deep learning methods for unsupervised anomaly detection produce good empirical performance but lack theoretical guarantees. By casting anomaly detection into a binary classification problem, we establish non-asymptotic upper bounds and a convergence rate on the excess risk on rectified linear unit…
▽ More
Anomaly detection is an important problem in many application areas, such as network security. Many deep learning methods for unsupervised anomaly detection produce good empirical performance but lack theoretical guarantees. By casting anomaly detection into a binary classification problem, we establish non-asymptotic upper bounds and a convergence rate on the excess risk on rectified linear unit (ReLU) neural networks trained on synthetic anomalies. Our convergence rate on the excess risk matches the minimax optimal rate in the literature. Furthermore, we provide lower and upper bounds on the number of synthetic anomalies that can attain this optimality. For practical implementation, we relax some conditions to improve the search for the empirical risk minimizer, which leads to competitive performance to other classification-based methods for anomaly detection. Overall, our work provides the first theoretical guarantees of unsupervised neural network-based anomaly detectors and empirical insights on how to design them well.
△ Less
Submitted 12 September, 2024;
originally announced September 2024.
-
Orbital inversion and emergent lattice dynamics in infinite layer CaCoO$_2$
Authors:
Daniel Jost,
Eder G. Lomeli,
Woo Jin Kim,
Emily M. Been,
Matteo Rossi,
Stefano Agrestini,
Kejin Zhou,
Chunjing Jia,
Brian Moritz,
Zhi-Xun Shen,
Harold Y. Hwang,
Thomas P. Devereaux,
Wei-Sheng Lee
Abstract:
The layered cobaltate CaCoO$_2$ exhibits a unique herringbone-like structure. Serving as a potential prototype for a new class of complex lattice patterns, we study the properties of CaCoO$_2$ using X-ray absorption spectroscopy (XAS) and resonant inelastic X-ray scattering (RIXS). Our results reveal a significant inter-plane hybridization between the Ca $4s-$ and Co $3d-$orbitals, leading to an i…
▽ More
The layered cobaltate CaCoO$_2$ exhibits a unique herringbone-like structure. Serving as a potential prototype for a new class of complex lattice patterns, we study the properties of CaCoO$_2$ using X-ray absorption spectroscopy (XAS) and resonant inelastic X-ray scattering (RIXS). Our results reveal a significant inter-plane hybridization between the Ca $4s-$ and Co $3d-$orbitals, leading to an inversion of the textbook orbital occupation of a square planar geometry. Further, our RIXS data reveal a strong low energy mode, with anomalous intensity modulations as a function of momentum transfer close to a quasi-static response suggestive of electronic and/or orbital ordering. These findings indicate that the newly discovered herringbone structure exhibited in CaCoO$_2$ may serve as a promising laboratory for the design of materials having strong electronic, orbital and lattice correlations.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
A Geodetic and Astrometric VLBI Experiment at 22/43/88/132 GHz
Authors:
Shuangjing Xu,
Taehyun Jung,
Bo Zhang,
Ming Hui Xu,
Do-Young Byun,
Xuan He,
Nobuyuki Sakai,
Oleg Titov,
Fengchun Shu,
Hyo-Ryoung Kim,
Jungho Cho,
Sung-Moon Yoo,
Byung-Kyu Choi,
Woo Kyoung Lee,
Yan Sun,
Xiaofeng Mai,
Guangli Wang
Abstract:
Extending geodetic and astrometric Very Long Baseline Interferometry (VLBI) observations from traditional centimeter wavebands to millimeter wavebands offers numerous scientific potentials and benefits. However, it was considered quite challenging due to various factors, including the increased effects of atmospheric opacity and turbulence at millimeter wavelengths. Here, we present the results of…
▽ More
Extending geodetic and astrometric Very Long Baseline Interferometry (VLBI) observations from traditional centimeter wavebands to millimeter wavebands offers numerous scientific potentials and benefits. However, it was considered quite challenging due to various factors, including the increased effects of atmospheric opacity and turbulence at millimeter wavelengths. Here, we present the results of the first geodetic-mode VLBI experiment, simultaneously observing 82 sources at 22/43/88/132 GHz (K/Q/W/D bands) using the Korean VLBI Network (KVN). We introduced the frequency phase transfer (FPT) method to geodetic VLBI analysis, an approach for calibrating atmospheric phase fluctuations at higher frequencies by transferring phase solutions from lower frequencies. With a 2-minute scan, FPT improved the signal-to-noise ratio (SNR) of most fringes, some by over 100%, thereby enhancing the detection rate of weak sources at millimeter wavebands. Additionally, FPT reduced systematic errors in group delay and delay rate, with the weighted root-mean-squares (WRMS) of the post-fitting residuals decreasing from 25.0 ps to 20.5 ps at the W band and from 39.3 ps to 27.6 ps at the D band. There were no notable differences observed in calibrating atmospheric phase fluctuations at the K band (WRMS = 12.4 ps) and Q band (WRMS = 11.8 ps) with the KVN baselines. This experiment demonstrated that the millimeter waveband can be used for geodetic and astrometric applications with high precision.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking
Authors:
Jihyun Lee,
Solee Im,
Wonjun Lee,
Gary Geunbae Lee
Abstract:
Dialogue State Tracking (DST) is a key part of task-oriented dialogue systems, identifying important information in conversations. However, its accuracy drops significantly in spoken dialogue environments due to named entity errors from Automatic Speech Recognition (ASR) systems. We introduce a simple yet effective data augmentation method that targets those entities to improve the robustness of D…
▽ More
Dialogue State Tracking (DST) is a key part of task-oriented dialogue systems, identifying important information in conversations. However, its accuracy drops significantly in spoken dialogue environments due to named entity errors from Automatic Speech Recognition (ASR) systems. We introduce a simple yet effective data augmentation method that targets those entities to improve the robustness of DST model. Our novel method can control the placement of errors using keyword-highlighted prompts while introducing phonetically similar errors. As a result, our method generated sufficient error patterns on keywords, leading to improved accuracy in noised and low-accuracy ASR environments.
△ Less
Submitted 10 September, 2024;
originally announced September 2024.
-
BACKRUNNER: Mitigating Smart Contract Attacks in the Real World
Authors:
Chaofan Shou,
Yuanyu Ke,
Yupeng Yang,
Qi Su,
Or Dadosh,
Assaf Eli,
David Benchimol,
Doudou Lu,
Daniel Tong,
Dex Chen,
Zoey Tan,
Jacob Chia,
Koushik Sen,
Wenke Lee
Abstract:
Billions of dollars have been lost due to vulnerabilities in smart contracts. To counteract this, researchers have proposed attack frontrunning protections designed to preempt malicious transactions by inserting "whitehat" transactions ahead of them to protect the assets. In this paper, we demonstrate that existing frontrunning protections have become ineffective in real-world scenarios. Specifica…
▽ More
Billions of dollars have been lost due to vulnerabilities in smart contracts. To counteract this, researchers have proposed attack frontrunning protections designed to preempt malicious transactions by inserting "whitehat" transactions ahead of them to protect the assets. In this paper, we demonstrate that existing frontrunning protections have become ineffective in real-world scenarios. Specifically, we collected 158 recent real-world attack transactions and discovered that 141 of them can bypass state-of-the-art frontrunning protections. We systematically analyze these attacks and show how inherent limitations of existing frontrunning techniques hinder them from protecting valuable assets in the real world. We then propose a new approach involving 1) preemptive hijack, and 2) attack backrunning, which circumvent the existing limitations and can help protect assets before and after an attack. Our approach adapts the exploit used in the attack to the same or similar contracts before and after the attack to safeguard the assets. We conceptualize adapting exploits as a program repair problem and apply established techniques to implement our approach into a full-fledged framework, BACKRUNNER. Running on previous attacks in 2023, BACKRUNNER can successfully rescue more than \$410M. In the real world, it has helped rescue over \$11.2M worth of assets in 28 separate incidents within two months.
△ Less
Submitted 10 September, 2024;
originally announced September 2024.
-
Programming Refusal with Conditional Activation Steering
Authors:
Bruce W. Lee,
Inkit Padhi,
Karthikeyan Natesan Ramamurthy,
Erik Miehling,
Pierre Dognin,
Manish Nagireddy,
Amit Dhurandhar
Abstract:
LLMs have shown remarkable capabilities, but precisely controlling their response behavior remains challenging. Existing activation steering methods alter LLM behavior indiscriminately, limiting their practical applicability in settings where selective responses are essential, such as content moderation or domain-specific assistants. In this paper, we propose Conditional Activation Steering (CAST)…
▽ More
LLMs have shown remarkable capabilities, but precisely controlling their response behavior remains challenging. Existing activation steering methods alter LLM behavior indiscriminately, limiting their practical applicability in settings where selective responses are essential, such as content moderation or domain-specific assistants. In this paper, we propose Conditional Activation Steering (CAST), which analyzes LLM activation patterns during inference to selectively apply or withhold activation steering based on the input context. Our method is based on the observation that different categories of prompts activate distinct patterns in the model's hidden states. Using CAST, one can systematically control LLM behavior with rules like "if input is about hate speech or adult content, then refuse" or "if input is not about legal advice, then refuse." This allows for selective modification of responses to specific content while maintaining normal responses to other content, all without requiring weight optimization. We release an open-source implementation of our framework.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
JCMT 850 $\micron$ continuum observations of density structures in the G35 molecular complex
Authors:
Xianjin Shen,
Hong-Li Liu,
Zhiyuan Ren,
Anandmayee Tej,
Di Li,
Hauyu Baobab Liu,
Gary A. Fuller,
Jinjin Xie,
Sihan Jiao,
Aiyuan Yang,
Patrick M. Koch,
Fengwei Xu,
Patricio Sanhueza,
Pham N. Diep,
Nicolas Peretto,
Ram K. Yadav,
Busaba H. Kramer,
Koichiro Sugiyama,
Mark Rawlings,
Chang Won Lee,
Ken'ichi Tatematsu,
Daniel Harsono,
David Eden,
Woojin Kwon,
Chao-Wei Tsai
, et al. (10 additional authors not shown)
Abstract:
Filaments are believed to play a key role in high-mass star formation. We present a systematic study of the filaments and their hosting clumps in the G35 molecular complex using JCMT SCUBA-2 850 $\micron$ continuum data. We identified five clouds in the complex and 91 filaments within them, some of which form 10 hub-filament systems (HFSs), each with at least 3 hub-composing filaments. We also com…
▽ More
Filaments are believed to play a key role in high-mass star formation. We present a systematic study of the filaments and their hosting clumps in the G35 molecular complex using JCMT SCUBA-2 850 $\micron$ continuum data. We identified five clouds in the complex and 91 filaments within them, some of which form 10 hub-filament systems (HFSs), each with at least 3 hub-composing filaments. We also compiled a catalogue of 350 dense clumps, 183 of which are associated with the filaments. We investigated the physical properties of the filaments and clumps, such as mass, density, and size, and their relation to star formation. We find that the global mass-length trend of the filaments is consistent with a turbulent origin, while the hub-composing filaments of high line masses ($m_{\rm l}\,>$\,230\,$\mathrm{M_{\odot}~pc^{-1}}$) in HFSs deviate from this relation, possibly due to feedback from massive star formation. We also find that the most massive and densest clumps (R\,$>$\,0.2\,pc, M\,$>35\,\mathrm{M_{\odot}}$, $\mathrmΣ>\,0.05\,\mathrm{g~cm^{-2}}$) are located in the filaments and in the hubs of HFS with the latter bearing a higher probability of occurrence of high-mass star-forming signatures, highlighting the preferential sites of HFSs for high-mass star formation. We do not find significant variation in the clump mass surface density across different evolutionary environments of the clouds, which may reflect the balance between mass accretion and stellar feedback.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Origin of nonlinear photocurrents in chiral multifold semimetal CoSi unveiled by terahertz emission spectroscopy
Authors:
Yao-Jui Chan,
Syed Mohammed Faizanuddin,
Raju Kalaivanan,
Sankar Raman,
Hsin Lin,
Uddipta Kar,
Akhilesh Kr. Singh,
Wei-Li Lee,
Ranganayakulu K. Vankayala,
Min-Nan Ou,
Yu-Chieh Wen
Abstract:
Spectroscopic identification of distinct nonlinear photocurrents unveils quantum geometric properties of electron wavefunctions and the momentum-space topological structures. This is especially interesting, but still puzzling, for chiral topological semimetals with possibilities of hosting giant quantized circular photogalvanic effect. Here we report a comprehensive terahertz (THz) emission spectr…
▽ More
Spectroscopic identification of distinct nonlinear photocurrents unveils quantum geometric properties of electron wavefunctions and the momentum-space topological structures. This is especially interesting, but still puzzling, for chiral topological semimetals with possibilities of hosting giant quantized circular photogalvanic effect. Here we report a comprehensive terahertz (THz) emission spectroscopic analysis of nonlinear photoconductivity of chiral multifold CoSi at 0.26 ~ 1 eV. We find a large linear shift conductivity (17 μA/V2), and confirm a giant injection conductivity (167 μA/V2) as a consequence of strongly interfered non-quantized contributions from the vicinity of multifold nodes with opposite chiralities. The bulk injection current excited by the pump field with a complex wavevector is shown to carry both longitudinal and transverse components. Symmetry analyses further unveil weak nonlocal photon drag effect in addition to the photogalvanic effect. This work not only highlights chiral transition metal monosilicides for mid-infrared photovoltaic applications via various nonlinear optical channels, but also consolidates the THz spectroscopy for quantitative photovoltaic research.
△ Less
Submitted 15 September, 2024; v1 submitted 9 September, 2024;
originally announced September 2024.
-
Disentangled Representations for Short-Term and Long-Term Person Re-Identification
Authors:
Chanho Eom,
Wonkyung Lee,
Geon Lee,
Bumsub Ham
Abstract:
We address the problem of person re-identification (reID), that is, retrieving person images from a large dataset, given a query image of the person of interest. A key challenge is to learn person representations robust to intra-class variations, as different persons could have the same attribute, and persons' appearances look different, e.g., with viewpoint changes. Recent reID methods focus on l…
▽ More
We address the problem of person re-identification (reID), that is, retrieving person images from a large dataset, given a query image of the person of interest. A key challenge is to learn person representations robust to intra-class variations, as different persons could have the same attribute, and persons' appearances look different, e.g., with viewpoint changes. Recent reID methods focus on learning person features discriminative only for a particular factor of variations (e.g., human pose), which also requires corresponding supervisory signals (e.g., pose annotations). To tackle this problem, we propose to factorize person images into identity-related and unrelated features. Identity-related features contain information useful for specifying a particular person (e.g., clothing), while identity-unrelated ones hold other factors (e.g., human pose). To this end, we propose a new generative adversarial network, dubbed identity shuffle GAN (IS-GAN). It disentangles identity-related and unrelated features from person images through an identity-shuffling technique that exploits identification labels alone without any auxiliary supervisory signals. We restrict the distribution of identity-unrelated features or encourage the identity-related and unrelated features to be uncorrelated, facilitating the disentanglement process. Experimental results validate the effectiveness of IS-GAN, showing state-of-the-art performance on standard reID benchmarks, including Market-1501, CUHK03, and DukeMTMC-reID. We further demonstrate the advantages of disentangling person representations on a long-term reID task, setting a new state of the art on a Celeb-reID dataset.
△ Less
Submitted 8 September, 2024;
originally announced September 2024.
-
Envisioning an Optimal Network of Space-Based Lasers for Orbital Debris Remediation
Authors:
David O. Williams Rogers,
Matthew C. Fox,
Paul R. Stysley,
Hang Woon Lee
Abstract:
The rapid increase in resident space objects, including satellites and orbital debris, threatens the safety and sustainability of space missions. This paper explores orbital debris remediation using laser ablation with a network of collaborative space-based lasers. A novel delta-v vector analysis framework quantifies the effects of multiple simultaneous laser-to-debris (L2D) engagements by leverag…
▽ More
The rapid increase in resident space objects, including satellites and orbital debris, threatens the safety and sustainability of space missions. This paper explores orbital debris remediation using laser ablation with a network of collaborative space-based lasers. A novel delta-v vector analysis framework quantifies the effects of multiple simultaneous laser-to-debris (L2D) engagements by leveraging a vector composition of imparted delta-v vectors. The paper introduces the Concurrent Location-Scheduling Problem (CLSP), which optimizes the placement of laser platforms and schedules L2D engagements to maximize debris remediation capacity. Due to the computational complexity of CLSP, it is decomposed into two sequential subproblems: (1) optimal laser platform locations are determined using the Maximal Covering Location Problem, and (2) a novel integer linear programming-based approach schedules L2D engagements within the network configuration to maximize remediation capacity. Computational experiments are conducted to evaluate the proposed framework's effectiveness under various mission scenarios, demonstrating key network functions such as collaborative nudging, deorbiting, and just-in-time collision avoidance. A cost-benefit analysis further explores how varying the number and distribution of laser platforms affects debris remediation capacity, providing insights into optimizing the performance of space-based laser networks.
△ Less
Submitted 4 September, 2024;
originally announced September 2024.
-
Concurrent Data Structures Made Easy (Extended Version)
Authors:
Callista Le,
Kiran Gopinathan,
Koon Wen Lee,
Seth Gilbert,
Ilya Sergey
Abstract:
Design of an efficient thread-safe concurrent data structure is a balancing act between its implementation complexity and performance. Lock-based concurrent data structures, which are relatively easy to derive from their sequential counterparts and to prove thread-safe, suffer from poor throughput under even light multi-threaded workload. At the same time, lock-free concurrent structures allow for…
▽ More
Design of an efficient thread-safe concurrent data structure is a balancing act between its implementation complexity and performance. Lock-based concurrent data structures, which are relatively easy to derive from their sequential counterparts and to prove thread-safe, suffer from poor throughput under even light multi-threaded workload. At the same time, lock-free concurrent structures allow for high throughput, but are notoriously difficult to get right and require careful reasoning to formally establish their correctness.
We explore a solution to this conundrum based on batch parallelism, an approach for designing concurrent data structures via a simple insight: efficiently processing a batch of a priori known operations in parallel is easier than optimising performance for a stream of arbitrary asynchronous requests. Alas, batch-parallel structures have not seen wide practical adoption due to (i) the inconvenience of having to structure multi-threaded programs to explicitly group operations and (ii) the lack of a systematic methodology to implement batch-parallel structures as simply as lock-based ones.
We present OBatcher-an OCaml library that streamlines the design, implementation, and usage of batch-parallel structures. It solves the first challenge (how to use) by suggesting a new lightweight implicit batching design that is built on top of generic asynchronous programming mechanisms. The second challenge (how to implement) is addressed by identifying a family of strategies for converting common sequential structures into efficient batch-parallel ones. We showcase OBatcher with a diverse set of benchmarks. Our evaluation of all the implementations on large asynchronous workloads shows that (a) they consistently outperform the corresponding coarse-grained lock-based implementations and that (b) their throughput scales reasonably with the number of processors.
△ Less
Submitted 25 August, 2024;
originally announced August 2024.
-
Bundling instability of lophotrichous bacteria
Authors:
Jeungeun Park,
Yongsam Kim,
Wanho Lee,
Veronika Pfeifer,
Valeriia Muraveva,
Carsten Beta,
Sookkyung Lim
Abstract:
We present a mathematical model of lophotrichous bacteria, motivated by Pseudomonas putida, which swim through fluid by rotating a cluster of multiple flagella extended from near one pole of the cell body. Although the flagella rotate individually, they are typically bundled together, enabling the bacterium to exhibit three primary modes of motility: push, pull, and wrapping. One key determinant o…
▽ More
We present a mathematical model of lophotrichous bacteria, motivated by Pseudomonas putida, which swim through fluid by rotating a cluster of multiple flagella extended from near one pole of the cell body. Although the flagella rotate individually, they are typically bundled together, enabling the bacterium to exhibit three primary modes of motility: push, pull, and wrapping. One key determinant of these modes is the coordination between motor torque and rotational direction of motors. The computational variations in this coordination reveal a wide spectrum of dynamical motion regimes, which are modulated by hydrodynamic interactions between flagellar filaments. These dynamic modes can be categorized into two groups based on the collective behavior of flagella, i.e., bundled and unbundled configurations. For some of these configurations, experimental examples from fluorescence microscopy recordings of swimming P. putida cells are also presented. Furthermore, we analyze the characteristics of stable bundles, such as push and pull, and investigate the dependence of swimming behaviors on the elastic properties of the flagella.
△ Less
Submitted 23 August, 2024;
originally announced August 2024.
-
Free-breathing 3D cardiac extracellular volume (ECV) mapping using a linear tangent space alignment (LTSA) model
Authors:
Wonil Lee,
Paul Kyu Han,
Thibault Marin,
Ismaël B. G. Mounime,
Samira Vafay Eslahi,
Yanis Djebra,
Didi Chi,
Felicitas J. Bijari,
Marc D. Normandin,
Georges El Fakhri,
Chao Ma
Abstract:
$\textbf{Purpose:}$ To develop a new method for free-breathing 3D extracellular volume (ECV) mapping of the whole heart at 3T. $\textbf{Methods:}…
▽ More
$\textbf{Purpose:}$ To develop a new method for free-breathing 3D extracellular volume (ECV) mapping of the whole heart at 3T. $\textbf{Methods:}$ A free-breathing 3D cardiac ECV mapping method was developed at 3T. T1 mapping was performed before and after contrast agent injection using a free-breathing ECG-gated inversion-recovery sequence with spoiled gradient echo readout. A linear tangent space alignment (LTSA) model-based method was used to reconstruct high-frame-rate dynamic images from (k,t)-space data sparsely sampled along a random stack-of-stars trajectory. Joint T1 and transmit B1 estimation was performed voxel-by-voxel for pre- and post-contrast T1 mapping. To account for the time-varying T1 after contrast agent injection, a linearly time-varying T1 model was introduced for post-contrast T1 mapping. ECV maps were generated by aligning pre- and post-contrast T1 maps through affine transformation. $\textbf{Results:}$ The feasibility of the proposed method was demonstrated using in vivo studies with six healthy volunteers at 3T. We obtained 3D ECV maps at a spatial resolution of 1.9$\times$1.9$\times$4.5 $mm^{3}$ and a FOV of 308$\times$308$\times$144 $mm^{3}$, with a scan time of 10.1$\pm$1.4 and 10.6$\pm$1.6 min before and after contrast agent injection, respectively. The ECV maps and the pre- and post-contrast T1 maps obtained by the proposed method were in good agreement with the 2D MOLLI method both qualitatively and quantitatively. $\textbf{Conclusion:}$ The proposed method allows for free-breathing 3D ECV mapping of the whole heart within a practically feasible imaging time. The estimated ECV values from the proposed method were comparable to those from the existing method. $\textbf{Keywords:}$ cardiac extracellular volume (ECV) mapping, cardiac T1 mapping, linear tangent space alignment (LTSA), manifold learning
△ Less
Submitted 22 August, 2024;
originally announced August 2024.
-
Tuning THz magnons in a mixed van-der-Waals antiferromagnet
Authors:
F. Le Mardele,
I. Mohelsky,
D. Jana,
A. Pawbake,
J. Dzian,
W. -L. Lee,
K. Raju,
R. Sankar,
C. Faugeras,
M. Potemski,
M. E. Zhitomirsky,
M. Orlita
Abstract:
Alloying stands out as a pivotal technological method employed across various compounds, be they metallic, magnetic, or semiconducting, serving to fine-tune their properties to meet specific requirements. Ternary semiconductors represent a prominent example of such alloys. They offer fine-tuning of electronic bands, the band gap in particular, thus granting the technology of semiconductor heterost…
▽ More
Alloying stands out as a pivotal technological method employed across various compounds, be they metallic, magnetic, or semiconducting, serving to fine-tune their properties to meet specific requirements. Ternary semiconductors represent a prominent example of such alloys. They offer fine-tuning of electronic bands, the band gap in particular, thus granting the technology of semiconductor heterostructures devices, key elements in current electronics and optoelectronics. In the realm of magnetically ordered systems, akin to electronic bands in solids, spin waves exhibit characteristic dispersion relations, featuring sizeable magnon gaps in many antiferromagnets. The engineering of the magnon gap constitutes a relevant direction in current research on antiferromagnets, aiming to leverage their distinct properties for THz technologies, spintronics, or magnonics. In this study, we showcase the tunability of the magnon gap across the THz spectral range within an alloy comprising representative semiconducting van-der-Waals antiferromagnets FePS$_3$ and NiPS$_3$. These constituents share identical in-plane crystal structures, magnetic unit cells and the direction of the magnetic anisotropy, but differ in the amplitude and sign of the latter. Altogether these attributes result in the wide tunability of the magnon gap in the Fe$_{1-x}$Ni$_x$PS$_3$ alloy in which the magnetic order is imposed by stronger, perpendicular anisotropy of iron.
△ Less
Submitted 22 August, 2024;
originally announced August 2024.
-
Designing elegant Bell inequalities
Authors:
Kwangil Bae,
Junghee Ryu,
Ilkwon Sohn,
Wonhyuk Lee
Abstract:
Elegant Bell inequality is well known for its much exploited property, being maximally violated by maximal entanglement, mutually unbiased bases, and symmetric informationally complete positive operator-valued measure elements. It is the only one with such property known so far. We present a method to construct Bell inequalities with violation feature analogous to original elegant Bell inequality…
▽ More
Elegant Bell inequality is well known for its much exploited property, being maximally violated by maximal entanglement, mutually unbiased bases, and symmetric informationally complete positive operator-valued measure elements. It is the only one with such property known so far. We present a method to construct Bell inequalities with violation feature analogous to original elegant Bell inequality in high dimension from a simple analytic quantum bound. A Bell inequality with such feature is derived in three dimension for the first time. It shows larger violation than existing Bell inequalities of similar classes while requiring arguably small number of measurements.
△ Less
Submitted 23 August, 2024; v1 submitted 21 August, 2024;
originally announced August 2024.
-
RedWhale: An Adapted Korean LLM Through Efficient Continual Pretraining
Authors:
Anh-Dung Vo,
Minseong Jung,
Wonbeen Lee,
Daewoo Choi
Abstract:
The field of Natural Language Processing (NLP) has seen significant advancements with the development of Large Language Models (LLMs). However, much of this research remains focused on English, often overlooking low-resource languages like Korean. This oversight presents challenges due to the unique non-alphabetic token structure of Korean and the substantial memory and computational demands requi…
▽ More
The field of Natural Language Processing (NLP) has seen significant advancements with the development of Large Language Models (LLMs). However, much of this research remains focused on English, often overlooking low-resource languages like Korean. This oversight presents challenges due to the unique non-alphabetic token structure of Korean and the substantial memory and computational demands required for LLM training, which frequently lead to memory constraints and out-of-memory errors. To address these issues, we present RedWhale, a model specifically tailored for Korean language processing. RedWhale is developed using an efficient continual pretraining approach that includes a comprehensive Korean corpus preprocessing pipeline, a specialized tokenizer, an optimized model initialization technique, and a multistage pretraining strategy. These innovations collectively reduce training time and computational costs while maintaining high levels of accuracy and comprehension. By leveraging cross-lingual transfer learning, which exploits shared linguistic similarities across languages, RedWhale builds on English models to enhance Korean language processing. Experimental results demonstrate that RedWhale outperforms other leading models on Korean NLP benchmarks, including the Korean Balanced Evaluation of Significant Tasks (KoBEST), showing superior understanding and generation of Korean text. Furthermore, RedWhale showed no signs of convergence even after pretraining on 9.7 billion tokens, indicating the potential for further improvements with additional training. This work represents a significant advancement in bridging the linguistic divide, particularly in enhancing NLP capabilities for the Korean language.
△ Less
Submitted 20 August, 2024;
originally announced August 2024.
-
The spatial correlation between CN line and dust continuum emitting regions in high-mass star-forming cloud
Authors:
Jihye Hwang,
Chang Won Lee,
Jongsoo Kim,
Eun Jung Chung,
Kee-Tae Kim
Abstract:
Measuring the strength of three dimensional (3D) magnetic field vector is challenging as it is not easy to recognize whether its line-of-sight (LOS) and plane-of-sky (POS) components are obtained from the same region. CN ($N = 1 - 0$) emission has been used to get the LOS component of a magnetic field (B$_\mathrm{LOS}$) from its Zeeman splitting lines, while dust continuum emission has been used t…
▽ More
Measuring the strength of three dimensional (3D) magnetic field vector is challenging as it is not easy to recognize whether its line-of-sight (LOS) and plane-of-sky (POS) components are obtained from the same region. CN ($N = 1 - 0$) emission has been used to get the LOS component of a magnetic field (B$_\mathrm{LOS}$) from its Zeeman splitting lines, while dust continuum emission has been used to get the POS component of a magnetic field (B$_\mathrm{POS}$). We use the CN ($N = 1 - 0$) data observed with the Taeduk Radio Astronomy Observatory (TRAO) 14-m telescope and the dust continuum data from $Herschel$ archive toward six high-mass star-forming regions in order to test whether CN line and dust continuum emission can trace a similar region and thus can be used for inferring 3D magnetic field strength. Our comparison between CN and H$_2$ column densities for all targets indicates that CN line emission tends to be strong toward bright continuum regions. The positions of peak CN column densities are particularly well correlated with those of peak H$_2$ column densities at least over the H$_2$ column density of 8.0 $\times$ 10$^{22}$ cm$^{-2}$ within one or two telescope beam size in all targets, implying that CN line and dust continuum emitting regions are likely spatially coincident. This enabled us to make the reliable measurement of 3D magnetic field strengths of five targets by taking a vector sum of their B$_\mathrm{LOS}$ and B$_\mathrm{POS}$, helping to decide the magnetical criticality of the targets as supercritical or transcritical.
△ Less
Submitted 3 October, 2024; v1 submitted 19 August, 2024;
originally announced August 2024.
-
Diversity and stylization of the contemporary user-generated visual arts in the complexity-entropy plane
Authors:
Seunghwan Kim,
Byunghwee Lee,
Wonjae Lee
Abstract:
The advent of computational and numerical methods in recent times has provided new avenues for analyzing art historiographical narratives and tracing the evolution of art styles therein. Here, we investigate an evolutionary process underpinning the emergence and stylization of contemporary user-generated visual art styles using the complexity-entropy (C-H) plane, which quantifies local structures…
▽ More
The advent of computational and numerical methods in recent times has provided new avenues for analyzing art historiographical narratives and tracing the evolution of art styles therein. Here, we investigate an evolutionary process underpinning the emergence and stylization of contemporary user-generated visual art styles using the complexity-entropy (C-H) plane, which quantifies local structures in paintings. Informatizing 149,780 images curated in DeviantArt and Behance platforms from 2010 to 2020, we analyze the relationship between local information of the C-H space and multi-level image features generated by a deep neural network and a feature extraction algorithm. The results reveal significant statistical relationships between the C-H information of visual artistic styles and the dissimilarities of the multi-level image features over time within groups of artworks. By disclosing a particular C-H region where the diversity of image representations is noticeably manifested, our analyses reveal an empirical condition of emerging styles that are both novel in the C-H plane and characterized by greater stylistic diversity. Our research shows that visual art analyses combined with physics-inspired methodologies and machine learning, can provide macroscopic insights into quantitatively mapping relevant characteristics of an evolutionary process underpinning the creative stylization of uncharted visual arts of given groups and time.
△ Less
Submitted 21 August, 2024; v1 submitted 19 August, 2024;
originally announced August 2024.
-
Measuring Agreeableness Bias in Multimodal Models
Authors:
Jaehyuk Lim,
Bruce W. Lee
Abstract:
This paper examines a phenomenon in multimodal language models where pre-marked options in question images can significantly influence model responses. Our study employs a systematic methodology to investigate this effect: we present models with images of multiple-choice questions, which they initially answer correctly, then expose the same model to versions with pre-marked options. Our findings r…
▽ More
This paper examines a phenomenon in multimodal language models where pre-marked options in question images can significantly influence model responses. Our study employs a systematic methodology to investigate this effect: we present models with images of multiple-choice questions, which they initially answer correctly, then expose the same model to versions with pre-marked options. Our findings reveal a significant shift in the models' responses towards the pre-marked option, even when it contradicts their answers in the neutral settings. Comprehensive evaluations demonstrate that this agreeableness bias is a consistent and quantifiable behavior across various model architectures. These results show potential limitations in the reliability of these models when processing images with pre-marked options, raising important questions about their application in critical decision-making contexts where such visual cues might be present.
△ Less
Submitted 14 October, 2024; v1 submitted 17 August, 2024;
originally announced August 2024.
-
Language Models Show Stable Value Orientations Across Diverse Role-Plays
Authors:
Bruce W. Lee,
Yeongheon Lee,
Hyunsoo Cho
Abstract:
We demonstrate that large language models (LLMs) exhibit consistent value orientations despite adopting diverse personas, revealing a persistent inertia in their responses that remains stable across the variety of roles they are prompted to assume. To systematically explore this phenomenon, we introduce the role-play-at-scale methodology, which involves prompting LLMs with randomized, diverse pers…
▽ More
We demonstrate that large language models (LLMs) exhibit consistent value orientations despite adopting diverse personas, revealing a persistent inertia in their responses that remains stable across the variety of roles they are prompted to assume. To systematically explore this phenomenon, we introduce the role-play-at-scale methodology, which involves prompting LLMs with randomized, diverse personas and analyzing the macroscopic trend of their responses. Unlike previous works that simply feed these questions to LLMs as if testing human subjects, our role-play-at-scale methodology diagnoses inherent tendencies in a systematic and scalable manner by: (1) prompting the model to act in different random personas and (2) asking the same question multiple times for each random persona. This approach reveals consistent patterns in LLM responses across diverse role-play scenarios, indicating deeply encoded inherent tendencies. Our findings contribute to the discourse on value alignment in foundation models and demonstrate the efficacy of role-play-at-scale as a diagnostic tool for uncovering encoded biases in LLMs.
△ Less
Submitted 16 August, 2024;
originally announced August 2024.
-
Investigating Characteristics of Media Recommendation Solicitation in r/ifyoulikeblank
Authors:
Md Momen Bhuiyan,
Donghan Hu,
Andrew Jelson,
Tanushree Mitra,
Sang Won Lee
Abstract:
Despite the existence of search-based recommender systems like Google, Netflix, and Spotify, online users sometimes may turn to crowdsourced recommendations in places like the r/ifyoulikeblank subreddit. In this exploratory study, we probe why users go to r/ifyoulikeblank, how they look for recommendation, and how the subreddit users respond to recommendation requests. To answer, we collected samp…
▽ More
Despite the existence of search-based recommender systems like Google, Netflix, and Spotify, online users sometimes may turn to crowdsourced recommendations in places like the r/ifyoulikeblank subreddit. In this exploratory study, we probe why users go to r/ifyoulikeblank, how they look for recommendation, and how the subreddit users respond to recommendation requests. To answer, we collected sample posts from r/ifyoulikeblank and analyzed them using a qualitative approach. Our analysis reveals that users come to this subreddit for various reasons, such as exhausting popular search systems, not knowing what or how to search for an item, and thinking crowd have better knowledge than search systems. Examining users query and their description, we found novel information users provide during recommendation seeking using r/ifyoulikeblank. For example, sometimes they ask for artifacts recommendation based on the tools used to create them. Or, sometimes indicating a recommendation seeker's time constraints can help better suit recommendations to their needs. Finally, recommendation responses and interactions revealed patterns of how requesters and responders refine queries and recommendations. Our work informs future intelligent recommender systems design.
△ Less
Submitted 12 August, 2024;
originally announced August 2024.
-
An Investigation Into Explainable Audio Hate Speech Detection
Authors:
Jinmyeong An,
Wonjun Lee,
Yejin Jeon,
Jungseul Ok,
Yunsu Kim,
Gary Geunbae Lee
Abstract:
Research on hate speech has predominantly revolved around detection and interpretation from textual inputs, leaving verbal content largely unexplored. While there has been limited exploration into hate speech detection within verbal acoustic speech inputs, the aspect of interpretability has been overlooked. Therefore, we introduce a new task of explainable audio hate speech detection. Specifically…
▽ More
Research on hate speech has predominantly revolved around detection and interpretation from textual inputs, leaving verbal content largely unexplored. While there has been limited exploration into hate speech detection within verbal acoustic speech inputs, the aspect of interpretability has been overlooked. Therefore, we introduce a new task of explainable audio hate speech detection. Specifically, we aim to identify the precise time intervals, referred to as audio frame-level rationales, which serve as evidence for hate speech classification. Towards this end, we propose two different approaches: cascading and End-to-End (E2E). The cascading approach initially converts audio to transcripts, identifies hate speech within these transcripts, and subsequently locates the corresponding audio time frames. Conversely, the E2E approach processes audio utterances directly, which allows it to pinpoint hate speech within specific time frames. Additionally, due to the lack of explainable audio hate speech datasets that include audio frame-level rationales, we curated a synthetic audio dataset to train our models. We further validated these models on actual human speech utterances and found that the E2E approach outperforms the cascading method in terms of the audio frame Intersection over Union (IoU) metric. Furthermore, we observed that including frame-level rationales significantly enhances hate speech detection accuracy for the E2E approach.
\textbf{Disclaimer} The reader may encounter content of an offensive or hateful nature. However, given the nature of the work, this cannot be avoided.
△ Less
Submitted 12 August, 2024;
originally announced August 2024.
-
Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning
Authors:
Wonjun Lee,
San Kim,
Gary Geunbae Lee
Abstract:
Recent dialogue systems rely on turn-based spoken interactions, requiring accurate Automatic Speech Recognition (ASR). Errors in ASR can significantly impact downstream dialogue tasks. To address this, using dialogue context from user and agent interactions for transcribing subsequent utterances has been proposed. This method incorporates the transcription of the user's speech and the agent's resp…
▽ More
Recent dialogue systems rely on turn-based spoken interactions, requiring accurate Automatic Speech Recognition (ASR). Errors in ASR can significantly impact downstream dialogue tasks. To address this, using dialogue context from user and agent interactions for transcribing subsequent utterances has been proposed. This method incorporates the transcription of the user's speech and the agent's response as model input, using the accumulated context generated by each turn. However, this context is susceptible to ASR errors because it is generated by the ASR model in an auto-regressive fashion. Such noisy context can further degrade the benefits of context input, resulting in suboptimal ASR performance. In this paper, we introduce Context Noise Representation Learning (CNRL) to enhance robustness against noisy context, ultimately improving dialogue speech recognition accuracy. To maximize the advantage of context awareness, our approach includes decoder pre-training using text-based dialogue data and noise representation learning for a context encoder. Based on the evaluation of speech dialogues, our method shows superior results compared to baselines. Furthermore, the strength of our approach is highlighted in noisy environments where user speech is barely audible due to real-world noise, relying on contextual information to transcribe the input accurately.
△ Less
Submitted 12 August, 2024;
originally announced August 2024.
-
ICSFuzz: Collision Detector Bug Discovery in Autonomous Driving Simulators
Authors:
Weiwei Fu,
Heqing Huang,
Yifan Zhang,
Ke Zhang,
Jin Huang,
Wei-Bin Lee,
Jianping Wang
Abstract:
With the increasing adoption of autonomous vehicles, ensuring the reliability of autonomous driving systems (ADSs) deployed on autonomous vehicles has become a significant concern. Driving simulators have emerged as crucial platforms for testing autonomous driving systems, offering realistic, dynamic, and configurable environments. However, existing simulation-based ADS testers have largely overlo…
▽ More
With the increasing adoption of autonomous vehicles, ensuring the reliability of autonomous driving systems (ADSs) deployed on autonomous vehicles has become a significant concern. Driving simulators have emerged as crucial platforms for testing autonomous driving systems, offering realistic, dynamic, and configurable environments. However, existing simulation-based ADS testers have largely overlooked the reliability of the simulators, potentially leading to overlooked violation scenarios and subsequent safety security risks during real-world deployment. In our investigations, we identified that collision detectors in simulators could fail to detect and report collisions in certain collision scenarios, referred to as ignored collision scenarios.
This paper aims to systematically discover ignored collision scenarios to improve the reliability of autonomous driving simulators. To this end, we present ICSFuzz, a black-box fuzzing approach to discover ignored collision scenarios efficiently. Drawing upon the fact that the ignored collision scenarios are a sub-type of collision scenarios, our approach starts with the determined collision scenarios. Following the guidance provided by empirically studied factors contributing to collisions, we selectively mutate arbitrary collision scenarios in a step-wise manner toward the ignored collision scenarios and effectively discover them.
We compare ICSFuzz with DriveFuzz, a state-of-the-art simulation-based ADS testing method, by replacing its oracle with our ignored-collision-aware oracle. The evaluation demonstrates that ICSFuzz outperforms DriveFuzz by finding 10-20x more ignored collision scenarios with a 20-70x speedup. All the discovered ignored collisions have been confirmed by developers with one CVE ID assigned.
△ Less
Submitted 11 August, 2024;
originally announced August 2024.
-
DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba
Authors:
Chengran Yuan,
Zhanqi Zhang,
Jiawei Sun,
Shuo Sun,
Zefan Huang,
Christina Dao Wen Lee,
Dongen Li,
Yuhang Han,
Anthony Wong,
Keng Peng Tee,
Marcelo H. Ang Jr
Abstract:
Motion planning is a challenging task to generate safe and feasible trajectories in highly dynamic and complex environments, forming a core capability for autonomous vehicles. In this paper, we propose DRAMA, the first Mamba-based end-to-end motion planner for autonomous vehicles. DRAMA fuses camera, LiDAR Bird's Eye View images in the feature space, as well as ego status information, to generate…
▽ More
Motion planning is a challenging task to generate safe and feasible trajectories in highly dynamic and complex environments, forming a core capability for autonomous vehicles. In this paper, we propose DRAMA, the first Mamba-based end-to-end motion planner for autonomous vehicles. DRAMA fuses camera, LiDAR Bird's Eye View images in the feature space, as well as ego status information, to generate a series of future ego trajectories. Unlike traditional transformer-based methods with quadratic attention complexity for sequence length, DRAMA is able to achieve a less computationally intensive attention complexity, demonstrating potential to deal with increasingly complex scenarios. Leveraging our Mamba fusion module, DRAMA efficiently and effectively fuses the features of the camera and LiDAR modalities. In addition, we introduce a Mamba-Transformer decoder that enhances the overall planning performance. This module is universally adaptable to any Transformer-based model, especially for tasks with long sequence inputs. We further introduce a novel feature state dropout which improves the planner's robustness without increasing training and inference times. Extensive experimental results show that DRAMA achieves higher accuracy on the NAVSIM dataset compared to the baseline Transfuser, with fewer parameters and lower computational costs.
△ Less
Submitted 14 August, 2024; v1 submitted 7 August, 2024;
originally announced August 2024.
-
Hierarchical Neural Constructive Solver for Real-world TSP Scenarios
Authors:
Yong Liang Goh,
Zhiguang Cao,
Yining Ma,
Yanfei Dong,
Mohammed Haroon Dupty,
Wee Sun Lee
Abstract:
Existing neural constructive solvers for routing problems have predominantly employed transformer architectures, conceptualizing the route construction as a set-to-sequence learning task. However, their efficacy has primarily been demonstrated on entirely random problem instances that inadequately capture real-world scenarios. In this paper, we introduce realistic Traveling Salesman Problem (TSP)…
▽ More
Existing neural constructive solvers for routing problems have predominantly employed transformer architectures, conceptualizing the route construction as a set-to-sequence learning task. However, their efficacy has primarily been demonstrated on entirely random problem instances that inadequately capture real-world scenarios. In this paper, we introduce realistic Traveling Salesman Problem (TSP) scenarios relevant to industrial settings and derive the following insights: (1) The optimal next node (or city) to visit often lies within proximity to the current node, suggesting the potential benefits of biasing choices based on current locations. (2) Effectively solving the TSP requires robust tracking of unvisited nodes and warrants succinct grouping strategies. Building upon these insights, we propose integrating a learnable choice layer inspired by Hypernetworks to prioritize choices based on the current location, and a learnable approximate clustering algorithm inspired by the Expectation-Maximization algorithm to facilitate grouping the unvisited cities. Together, these two contributions form a hierarchical approach towards solving the realistic TSP by considering both immediate local neighbourhoods and learning an intermediate set of node representations. Our hierarchical approach yields superior performance compared to both classical and recent transformer models, showcasing the efficacy of the key designs.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
LLM-Empowered Resource Allocation in Wireless Communications Systems
Authors:
Woongsup Lee,
Jeonghun Park
Abstract:
The recent success of large language models (LLMs) has spurred their application in various fields. In particular, there have been efforts to integrate LLMs into various aspects of wireless communication systems. The use of LLMs in wireless communication systems has the potential to realize artificial general intelligence (AGI)-enabled wireless networks. In this paper, we investigate an LLM-based…
▽ More
The recent success of large language models (LLMs) has spurred their application in various fields. In particular, there have been efforts to integrate LLMs into various aspects of wireless communication systems. The use of LLMs in wireless communication systems has the potential to realize artificial general intelligence (AGI)-enabled wireless networks. In this paper, we investigate an LLM-based resource allocation scheme for wireless communication systems. Specifically, we formulate a simple resource allocation problem involving two transmit pairs and develop an LLM-based resource allocation approach that aims to maximize either energy efficiency or spectral efficiency. Additionally, we consider the joint use of low-complexity resource allocation techniques to compensate for the reliability shortcomings of the LLM-based scheme. After confirming the applicability and feasibility of LLM-based resource allocation, we address several key technical challenges that remain in applying LLMs in practice.
△ Less
Submitted 6 August, 2024;
originally announced August 2024.
-
Floquet engineering of topological phase transitions in quantum spin Hall $α$-$T_{3}$ system
Authors:
Kok Wai Lee,
Mateo Jalen Andrew Calderon,
Xiang-Long Yu,
Ching Hua Lee,
Yee Sin Ang,
Pei-Hao Fu
Abstract:
Floquet engineering of topological phase transitions driven by a high-frequency time-periodic field is a promising approach to realizing new topological phases of matter distinct from static states. Here, we theoretically investigate Floquet engineering topological phase transitions in the quantum spin Hall $α$-$T_{3}$ system driven by an off-resonant circularly polarized light. In addition to the…
▽ More
Floquet engineering of topological phase transitions driven by a high-frequency time-periodic field is a promising approach to realizing new topological phases of matter distinct from static states. Here, we theoretically investigate Floquet engineering topological phase transitions in the quantum spin Hall $α$-$T_{3}$ system driven by an off-resonant circularly polarized light. In addition to the quantum spin (anomalous) Hall insulator phase with multiple helical (chiral) edge states, spin-polarized topological metallic phases are observed, where the bulk topological band gap of one spin sub-band overlaps with the other gapless spin sub-band. Moreover, with a staggered potential, the topological invariants of the system depend on whether the middle band is occupied because of the breaking of particle-hole symmetry. Our work highlights the significance of Floquet engineering in realizing new topological phases in $α$-$T_{3}$ lattices.
△ Less
Submitted 27 August, 2024; v1 submitted 4 August, 2024;
originally announced August 2024.
-
Phonon screening of excitons in atomically thin semiconductors
Authors:
Woncheol Lee,
Antonios M. Alvertis,
Zhenglu Li,
Steven G. Louie,
Marina R. Filip,
Jeffrey B. Neaton,
Emmanouil Kioupakis
Abstract:
Atomically thin semiconductors, encompassing both 2D materials and quantum wells, exhibit a pronounced enhancement of excitonic effects due to geometric confinement. Consequently, these materials have become foundational platforms for the exploration and utilization of excitons. Recent ab initio studies have demonstrated that phonons can substantially screen electron-hole interactions in bulk semi…
▽ More
Atomically thin semiconductors, encompassing both 2D materials and quantum wells, exhibit a pronounced enhancement of excitonic effects due to geometric confinement. Consequently, these materials have become foundational platforms for the exploration and utilization of excitons. Recent ab initio studies have demonstrated that phonons can substantially screen electron-hole interactions in bulk semiconductors and strongly modify the properties of excitons. While excitonic properties of atomically thin semiconductors have been the subject of extensive theoretical investigations, the role of phonon screening on excitons in atomically thin structures remains unexplored. In this work, we demonstrate via ab initio GW-Bethe-Salpeter equation calculations that phonon screening can have a significant impact on optical excitations in atomically thin semiconductors. We further show that the degree of phonon screening can be tuned by structural engineering. We focus on atomically thin GaN quantum wells embedded in AlN and identify specific phonons in the surrounding material, AlN, that dramatically alter the lowest-lying exciton in monolayer GaN via screening. Our studies provide new intuition beyond standard models into the interplay among structural properties, phonon characteristics, and exciton properties in atomically thin semiconductors, and have implications for future experiments.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Early Planet Formation in Embedded Disks (eDisk) XVI: An asymmetric dust disk driving a multi-component molecular outflow in the young Class 0 protostar GSS30 IRS3
Authors:
Alejandro Santamaria-Miranda,
Itziar de Gregorio-Monsalvo,
Nagayoshi Ohashi,
John J. Tobin,
Jinshi Sai,
Jes K. Jorgensen,
Yusuke Aso,
Zhe-Yu Daniel Lin,
Christian Flores,
Miyu Kido,
Patrick M. Koch,
Woojin Kwon,
Chang Won Lee,
Zhi-Yun Li,
Leslie W. Looney,
Adele L. Plunkett,
Shigehisa Takakuwa,
Merel L. R van t Hoff,
Jonathan P. Williams,
Hsi-Wei Yen
Abstract:
We present the results of the ALMA Large Program Early Planet Formation in Embedded disks observations of the Class 0 protostar GSS30 IRS3. Our observations included 1.3 mm continuum with a resolution of 0.''05 (7.8 au) and several molecular species including $^{12}$CO, $^{13}$CO, C$^{18}$O, H$_{2}$CO and c-C$_{3}$H$_{2}$. The dust continuum analysis unveiled a disk-shaped structure with a major a…
▽ More
We present the results of the ALMA Large Program Early Planet Formation in Embedded disks observations of the Class 0 protostar GSS30 IRS3. Our observations included 1.3 mm continuum with a resolution of 0.''05 (7.8 au) and several molecular species including $^{12}$CO, $^{13}$CO, C$^{18}$O, H$_{2}$CO and c-C$_{3}$H$_{2}$. The dust continuum analysis unveiled a disk-shaped structure with a major axis size of $\sim$200 au. We observed an asymmetry in the minor axis of the continuum emission suggesting that the emission is optically thick and the disk is flared. On the other hand, we identified two prominent bumps along the major axis located at distances of 26 and 50 au from the central protostar. The origin of the bumps remains uncertain and might be due to an embedded substructure within the disk or the result of the temperature distribution instead of surface density due to optically thick continuum emission. The $^{12}$CO emission reveals a molecular outflow consisting of three distinct components: a collimated one, an intermediate velocity component exhibiting an hourglass shape, and a wider angle low-velocity component. We associate these components with the coexistence of a jet and a disk-wind. The C$^{18}$O emission traces both a Keplerian rotating circumstellar disk and the infall of the rotating envelope. We measured a stellar dynamical mass of 0.35$\pm$0.09 M$_{\odot}$.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
Stability and topological nature of charged Gauss-Bonnet AdS black holes in five dimensions
Authors:
Imtak Jeon,
Bum-Hoon Lee,
Wonwoo Lee,
Madhu Mishra
Abstract:
We examine the thermodynamic characteristics and phase structures of a black hole, where the black hole horizon could be a hypersurface with positive, zero, or negative constant curvature, within the framework of Einstein-Maxwell theory, incorporating a negative cosmological constant and a Gauss-Bonnet correction. Our research follows the topological approach to black hole thermodynamics where we…
▽ More
We examine the thermodynamic characteristics and phase structures of a black hole, where the black hole horizon could be a hypersurface with positive, zero, or negative constant curvature, within the framework of Einstein-Maxwell theory, incorporating a negative cosmological constant and a Gauss-Bonnet correction. Our research follows the topological approach to black hole thermodynamics where we treat anti-de Sitter (AdS) black holes as topological defects in thermodynamic space. We study the nature of the black hole's critical points and local stability by computing the winding numbers/topological charge associated with the zero point of the vector field, derived from the temperature of extremal points and generalized off-shell Gibbs free energy, respectively. Black holes are classified into different topological classes based on their topological number. In this study, we found unlike the charged AdS black hole, the charged GB AdS black hole exhibits a critical point. Our findings reveal the occurrence of a liquid/gas-like first-order phase transition between small-large black hole phases of the spherical charged GB AdS black hole. We conclude that the charged GB AdS and charged AdS black holes belong to different topological classes in the grand canonical ensemble. Furthermore, connecting with the previous studies, we conclude that the charged AdS and charged GB AdS black holes in canonical and charged GB in the grand canonical ensemble belong to the same topological classes.
△ Less
Submitted 29 July, 2024;
originally announced July 2024.