subscribe to arXiv mailings

A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné , et al. (1758 additional authors not shown)

Abstract: The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by… ▽ More The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by CHIME/FRB, as well as X-ray glitches and X-ray bursts detected by NICER and NuSTAR close to the time of one of the FRBs. We do not detect any significant GW emission from any of the events. Instead, using a short-duration GW search (for bursts $\leq$ 1 s) we derive 50\% (90\%) upper limits of $10^{48}$ ($10^{49}$) erg for GWs at 300 Hz and $10^{49}$ ($10^{50}$) erg at 2 kHz, and constrain the GW-to-radio energy ratio to $\leq 10^{14} - 10^{16}$. We also derive upper limits from a long-duration search for bursts with durations between 1 and 10 s. These represent the strictest upper limits on concurrent GW emission from FRBs. △ Less

Submitted 11 October, 2024; originally announced October 2024.

Comments: 15 pages of text including references, 4 figures, 5 tables

Report number: LIGO-P2400192

arXiv:2410.08326 [pdf, other]

Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices

Authors: Yiwei Zhao, Ziyun Li, Win-San Khwa, Xiaoyu Sun, Sai Qian Zhang, Syed Shakib Sarwar, Kleber Hugo Stangherlin, Yi-Lun Lu, Jorge Tomas Gomez, Jae-Sun Seo, Phillip B. Gibbons, Barbara De Salvo, Chiao Liu

Abstract: Low-Latency and Low-Power Edge AI is essential for Virtual Reality and Augmented Reality applications. Recent advances show that hybrid models, combining convolution layers (CNN) and transformers (ViT), often achieve superior accuracy/performance tradeoff on various computer vision and machine learning (ML) tasks. However, hybrid ML models can pose system challenges for latency and energy-efficien… ▽ More Low-Latency and Low-Power Edge AI is essential for Virtual Reality and Augmented Reality applications. Recent advances show that hybrid models, combining convolution layers (CNN) and transformers (ViT), often achieve superior accuracy/performance tradeoff on various computer vision and machine learning (ML) tasks. However, hybrid ML models can pose system challenges for latency and energy-efficiency due to their diverse nature in dataflow and memory access patterns. In this work, we leverage the architecture heterogeneity from Neural Processing Units (NPU) and Compute-In-Memory (CIM) and perform diverse execution schemas to efficiently execute these hybrid models. We also introduce H4H-NAS, a Neural Architecture Search framework to design efficient hybrid CNN/ViT models for heterogeneous edge systems with both NPU and CIM. Our H4H-NAS approach is powered by a performance estimator built with NPU performance results measured on real silicon, and CIM performance based on industry IPs. H4H-NAS searches hybrid CNN/ViT models with fine granularity and achieves significant (up to 1.34%) top-1 accuracy improvement on ImageNet dataset. Moreover, results from our Algo/HW co-design reveal up to 56.08% overall latency and 41.72% energy improvements by introducing such heterogeneous computing over baseline solutions. The framework guides the design of hybrid network architectures and system architectures of NPU+CIM heterogeneous systems. △ Less

Submitted 10 October, 2024; originally announced October 2024.

arXiv:2410.03145 [pdf, other]

Margin Matching Preference Optimization: Enhanced Model Alignment with Granular Feedback

Authors: Kyuyoung Kim, Ah Jeong Seo, Hao Liu, Jinwoo Shin, Kimin Lee

Abstract: Large language models (LLMs) fine-tuned with alignment techniques, such as reinforcement learning from human feedback, have been instrumental in developing some of the most capable AI systems to date. Despite their success, existing methods typically rely on simple binary labels, such as those indicating preferred outputs in pairwise preferences, which fail to capture the subtle differences in rel… ▽ More Large language models (LLMs) fine-tuned with alignment techniques, such as reinforcement learning from human feedback, have been instrumental in developing some of the most capable AI systems to date. Despite their success, existing methods typically rely on simple binary labels, such as those indicating preferred outputs in pairwise preferences, which fail to capture the subtle differences in relative quality between pairs. To address this limitation, we introduce an approach called Margin Matching Preference Optimization (MMPO), which incorporates relative quality margins into optimization, leading to improved LLM policies and reward models. Specifically, given quality margins in pairwise preferences, we design soft target probabilities based on the Bradley-Terry model, which are then used to train models with the standard cross-entropy objective. Experiments with both human and AI feedback data demonstrate that MMPO consistently outperforms baseline methods, often by a substantial margin, on popular benchmarks including MT-bench and RewardBench. Notably, the 7B model trained with MMPO achieves state-of-the-art performance on RewardBench as of June 2024, outperforming other models of the same scale. Our analysis also shows that MMPO is more robust to overfitting, leading to better-calibrated models. △ Less

Submitted 4 October, 2024; originally announced October 2024.

Comments: EMNLP 2024 Findings

arXiv:2409.20117 [pdf, other]

Masked Autoregressive Model for Weather Forecasting

Authors: Doyi Kim, Minseok Seo, Hakjin Lee, Junghoon Seo

Abstract: The growing impact of global climate change amplifies the need for accurate and reliable weather forecasting. Traditional autoregressive approaches, while effective for temporal modeling, suffer from error accumulation in long-term prediction tasks. The lead time embedding method has been suggested to address this issue, but it struggles to maintain crucial correlations in atmospheric events. To o… ▽ More The growing impact of global climate change amplifies the need for accurate and reliable weather forecasting. Traditional autoregressive approaches, while effective for temporal modeling, suffer from error accumulation in long-term prediction tasks. The lead time embedding method has been suggested to address this issue, but it struggles to maintain crucial correlations in atmospheric events. To overcome these challenges, we propose the Masked Autoregressive Model for Weather Forecasting (MAM4WF). This model leverages masked modeling, where portions of the input data are masked during training, allowing the model to learn robust spatiotemporal relationships by reconstructing the missing information. MAM4WF combines the advantages of both autoregressive and lead time embedding methods, offering flexibility in lead time modeling while iteratively integrating predictions. We evaluate MAM4WF across weather, climate forecasting, and video frame prediction datasets, demonstrating superior performance on five test datasets. △ Less

Submitted 30 September, 2024; originally announced September 2024.

Comments: 10 page. arXiv admin note: substantial text overlap with arXiv:2303.07849

arXiv:2409.19726 [pdf]

On-Chip Terahertz Spectroscopy for Dual-Gated van der Waals Heterostructures at Cryogenic Temperatures

Authors: Junseok Seo, Zhengguang Lu, Jixiang Yang, Fangzhou Xia, Shenyong Ye, Yuxuan Yao, Tonghang Han, Lihan Shi, Kenji Watanabe, Takashi Taniguchi, Long Ju

Abstract: Van der Waals heterostructures have emerged as a versatile platform to study correlated and topological electron physics. Spectroscopy experiments in the THz regime are crucial, since the energy of THz photons matches that of relevant excitations and charge dynamics. However, their micron-size and complex (dual-)gated structures have challenged such measurements. Here, we demonstrate on-chip THz s… ▽ More Van der Waals heterostructures have emerged as a versatile platform to study correlated and topological electron physics. Spectroscopy experiments in the THz regime are crucial, since the energy of THz photons matches that of relevant excitations and charge dynamics. However, their micron-size and complex (dual-)gated structures have challenged such measurements. Here, we demonstrate on-chip THz spectroscopy on a dual-gated bilayer graphene device at liquid helium temperature. To avoid unwanted THz absorption by metallic gates, we developed a scheme of operation by combining semiconducting gates and optically controlled gating. This allows us to measure the clean THz response of graphene without being affected by the gates. We observed the THz signatures of electric-field-induced bandgap opening at the charge neutrality. We measured Drude conductivities at varied charge densities and extracted key parameters, including effective masses and scattering rates. This work paves the way for studying novel emergent phenomena in dual-gated two-dimensional materials. △ Less

Submitted 29 September, 2024; originally announced September 2024.

arXiv:2409.19633 [pdf, other]

Search for proton decay via $p\rightarrow{e^+η}$ and $p\rightarrow{μ^+η}$ with a 0.37 Mton-year exposure of Super-Kamiokande

Authors: Super-Kamiokande Collaboration, :, N. Taniuchi, K. Abe, S. Abe, Y. Asaoka, C. Bronner, M. Harada, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, M. Nakahata, S. Nakayama, Y. Noguchi , et al. (267 additional authors not shown)

Abstract: A search for proton decay into $e^+/μ^+$ and a $η$ meson has been performed using data from a 0.373 Mton$\cdot$year exposure (6050.3 live days) of Super-Kamiokande. Compared to previous searches this work introduces an improved model of the intranuclear $η$ interaction cross section, resulting in a factor of two reduction in uncertainties from this source and $\sim$10\% increase in signal efficien… ▽ More A search for proton decay into $e^+/μ^+$ and a $η$ meson has been performed using data from a 0.373 Mton$\cdot$year exposure (6050.3 live days) of Super-Kamiokande. Compared to previous searches this work introduces an improved model of the intranuclear $η$ interaction cross section, resulting in a factor of two reduction in uncertainties from this source and $\sim$10\% increase in signal efficiency. No significant data excess was found above the expected number of atmospheric neutrino background events resulting in no indication of proton decay into either mode. Lower limits on the proton partial lifetime of $1.4\times\mathrm{10^{34}~years}$ for $p\rightarrow e^+η$ and $7.3\times\mathrm{10^{33}~years}$ for $p\rightarrow μ^+η$ at the 90$\%$ C.L. were set. These limits are around 1.5 times longer than our previous study and are the most stringent to date. △ Less

Submitted 29 September, 2024; originally announced September 2024.

arXiv:2409.18416 [pdf, other]

A Simulation Study of Low-Power Relativistic Jets: Flow Dynamics and Radio Morphology of FR-I Jets

Authors: Ayan Bhattacharjee, Jeongbhin Seo, Dongsu Ryu, Hyesung Kang

Abstract: Radio galaxies are classified into two primary categories based on their morphology: center-brightened FR-I and edge-brightened FR-II. It is believed that the jet power and interactions with the ambient medium govern the deceleration and decollimation of the jet-spine flows, which, in turn, influence this dichotomy. Using high-resolution, three-dimensional relativistic hydrodynamic simulations, we… ▽ More Radio galaxies are classified into two primary categories based on their morphology: center-brightened FR-I and edge-brightened FR-II. It is believed that the jet power and interactions with the ambient medium govern the deceleration and decollimation of the jet-spine flows, which, in turn, influence this dichotomy. Using high-resolution, three-dimensional relativistic hydrodynamic simulations, we follow the development of flow structures on sub-kpc to kpc scales in kinetically dominant low-power relativistic jets. We find that the bulk Lorentz factor of the jet spine and the advance speed of the jet head, which depend on the energy injection flux and the jet-to-background density contrast, primarily determine the dynamics and structures of the jet-induced flows. The entrainment of ambient gas and the background density and pressure gradient may also play significant roles. To emulate radio morphology, we produce the synthetic maps of the synchrotron surface brightness for the simulated jets, by employing simple models for magnetic field distribution and nonthermal electron population and considering relativistic beaming effects at different inclination angles. Both the flow structures and radio maps capture the longitudinal and transverse structures of the jet-spine and shear layer, consistent with observations. We also compare different background effects and argue that the loss of pressure confinement beyond the galactic core may be a key factor in the flaring and disruption of FR-I jets. Our results confirm that mildly relativistic jets could explain the one-sidedness or asymmetries with the boosted main jet and deboosted counterjet pairs. △ Less

Submitted 26 September, 2024; originally announced September 2024.

Comments: 20 pages, 9 figures, submitted to Astrophysical Journal

arXiv:2409.16073 [pdf, other]

Open-World Object Detection with Instance Representation Learning

Authors: Sunoh Lee, Minsik Jeon, Jihong Min, Junwon Seo

Abstract: While humans naturally identify novel objects and understand their relationships, deep learning-based object detectors struggle to detect and relate objects that are not observed during training. To overcome this issue, Open World Object Detection(OWOD) has been introduced to enable models to detect unknown objects in open-world scenarios. However, OWOD methods fail to capture the fine-grained rel… ▽ More While humans naturally identify novel objects and understand their relationships, deep learning-based object detectors struggle to detect and relate objects that are not observed during training. To overcome this issue, Open World Object Detection(OWOD) has been introduced to enable models to detect unknown objects in open-world scenarios. However, OWOD methods fail to capture the fine-grained relationships between detected objects, which are crucial for comprehensive scene understanding and applications such as class discovery and tracking. In this paper, we propose a method to train an object detector that can both detect novel objects and extract semantically rich features in open-world conditions by leveraging the knowledge of Vision Foundation Models(VFM). We first utilize the semantic masks from the Segment Anything Model to supervise the box regression of unknown objects, ensuring accurate localization. By transferring the instance-wise similarities obtained from the VFM features to the detector's instance embeddings, our method then learns a semantically rich feature space of these embeddings. Extensive experiments show that our method learns a robust and generalizable feature space, outperforming other OWOD-based feature extraction methods. Additionally, we demonstrate that the enhanced feature from our model increases the detector's applicability to tasks such as open-world tracking. △ Less

Submitted 24 September, 2024; originally announced September 2024.

Comments: Our project website can be found at https://sunohlee.github.io/OWODRep/

arXiv:2409.13342 [pdf, other]

Validity of Feature Importance in Low-Performing Machine Learning for Tabular Biomedical Data

Authors: Youngro Lee, Giacomo Baruzzo, Jeonghwan Kim, Jongmo Seo, Barbara Di Camillo

Abstract: In tabular biomedical data analysis, tuning models to high accuracy is considered a prerequisite for discussing feature importance, as medical practitioners expect the validity of feature importance to correlate with performance. In this work, we challenge the prevailing belief, showing that low-performing models may also be used for feature importance. We propose experiments to observe changes in… ▽ More In tabular biomedical data analysis, tuning models to high accuracy is considered a prerequisite for discussing feature importance, as medical practitioners expect the validity of feature importance to correlate with performance. In this work, we challenge the prevailing belief, showing that low-performing models may also be used for feature importance. We propose experiments to observe changes in feature rank as performance degrades sequentially. Using three synthetic datasets and six real biomedical datasets, we compare the rank of features from full datasets to those with reduced sample sizes (data cutting) or fewer features (feature cutting). In synthetic datasets, feature cutting does not change feature rank, while data cutting shows higher discrepancies with lower performance. In real datasets, feature cutting shows similar or smaller changes than data cutting, though some datasets exhibit the opposite. When feature interactions are controlled by removing correlations, feature cutting consistently shows better stability. By analyzing the distribution of feature importance values and theoretically examining the probability that the model cannot distinguish feature importance between features, we reveal that models can still distinguish feature importance despite performance degradation through feature cutting, but not through data cutting. We conclude that the validity of feature importance can be maintained even at low performance levels if the data size is adequate, which is a significant factor contributing to suboptimal performance in tabular medical data analysis. This paper demonstrates the potential for utilizing feature importance analysis alongside statistical analysis to compare features relatively, even when classifier performance is not satisfactory. △ Less

Submitted 20 September, 2024; originally announced September 2024.

arXiv:2409.12578 [pdf, other]

CLE-SH: Comprehensive Literal Explanation package for SHapley values by statistical validity

Authors: Youngro Lee, Kyungjin Kim, Jongmo Seo

Abstract: Recently, SHapley Additive exPlanations (SHAP) has been widely utilized in various research domains. This is particularly evident in medical applications, where SHAP analysis serves as a crucial tool for identifying biomarkers and assisting in result validation. However, despite its frequent usage, SHAP is often not applied in a manner that maximizes its potential contributions. A review of recent… ▽ More Recently, SHapley Additive exPlanations (SHAP) has been widely utilized in various research domains. This is particularly evident in medical applications, where SHAP analysis serves as a crucial tool for identifying biomarkers and assisting in result validation. However, despite its frequent usage, SHAP is often not applied in a manner that maximizes its potential contributions. A review of recent papers employing SHAP reveals that many studies subjectively select a limited number of features as 'important' and analyze SHAP values by approximately observing plots without assessing statistical significance. Such superficial application may hinder meaningful contributions to the applied fields. To address this, we propose a library package designed to simplify the interpretation of SHAP values. By simply inputting the original data and SHAP values, our library provides: 1) the number of important features to analyze, 2) the pattern of each feature via univariate analysis, and 3) the interaction between features. All information is extracted based on its statistical significance and presented in simple, comprehensible sentences, enabling users of all levels to understand the interpretations. We hope this library fosters a comprehensive understanding of statistically valid SHAP results. △ Less

Submitted 19 September, 2024; originally announced September 2024.

arXiv:2409.10459 [pdf, other]

Efficiently Crowdsourcing Visual Importance with Punch-Hole Annotation

Authors: Minsuk Chang, Soohyun Lee, Aeri Cho, Hyeon Jeon, Seokhyeon Park, Cindy Xiong Bearfield, Jinwook Seo

Abstract: We introduce a novel crowdsourcing method for identifying important areas in graphical images through punch-hole labeling. Traditional methods, such as gaze trackers and mouse-based annotations, which generate continuous data, can be impractical in crowdsourcing scenarios. They require many participants, and the outcome data can be noisy. In contrast, our method first segments the graphical image… ▽ More We introduce a novel crowdsourcing method for identifying important areas in graphical images through punch-hole labeling. Traditional methods, such as gaze trackers and mouse-based annotations, which generate continuous data, can be impractical in crowdsourcing scenarios. They require many participants, and the outcome data can be noisy. In contrast, our method first segments the graphical image with a grid and drops a portion of the patches (punch holes). Then, we iteratively ask the labeler to validate each annotation with holes, narrowing down the annotation only having the most important area. This approach aims to reduce annotation noise in crowdsourcing by standardizing the annotations while enhancing labeling efficiency and reliability. Preliminary findings from fundamental charts demonstrate that punch-hole labeling can effectively pinpoint critical regions. This also highlights its potential for broader application in visualization research, particularly in studying large-scale users' graphical perception. Our future work aims to enhance the algorithm to achieve faster labeling speed and prove its utility through large-scale experiments. △ Less

Submitted 16 September, 2024; originally announced September 2024.

Comments: 2 pages, 1 figure, presented at IEEE VIS 2024 poster session

arXiv:2409.08231 [pdf, other]

Design Optimization of Nuclear Fusion Reactor through Deep Reinforcement Learning

Authors: Jinsu Kim, Jaemin Seo

Abstract: This research explores the application of Deep Reinforcement Learning (DRL) to optimize the design of a nuclear fusion reactor. DRL can efficiently address the challenging issues attributed to multiple physics and engineering constraints for steady-state operation. The fusion reactor design computation and the optimization code applicable to parallelization with DRL are developed. The proposed fra… ▽ More This research explores the application of Deep Reinforcement Learning (DRL) to optimize the design of a nuclear fusion reactor. DRL can efficiently address the challenging issues attributed to multiple physics and engineering constraints for steady-state operation. The fusion reactor design computation and the optimization code applicable to parallelization with DRL are developed. The proposed framework enables finding the optimal reactor design that satisfies the operational requirements while reducing building costs. Multi-objective design optimization for a fusion reactor is now simplified by DRL, indicating the high potential of the proposed framework for advancing the efficient and sustainable design of future reactors. △ Less

Submitted 12 September, 2024; originally announced September 2024.

Comments: 16 pages

arXiv:2409.07467 [pdf, other]

Flexible Control in Symbolic Music Generation via Musical Metadata

Authors: Sangjun Han, Jiwon Ham, Chaeeun Lee, Heejin Kim, Soojong Do, Sihyuk Yi, Jun Seo, Seoyoon Kim, Yountae Jung, Woohyung Lim

Abstract: In this work, we introduce the demonstration of symbolic music generation, focusing on providing short musical motifs that serve as the central theme of the narrative. For the generation, we adopt an autoregressive model which takes musical metadata as inputs and generates 4 bars of multitrack MIDI sequences. During training, we randomly drop tokens from the musical metadata to guarantee flexible… ▽ More In this work, we introduce the demonstration of symbolic music generation, focusing on providing short musical motifs that serve as the central theme of the narrative. For the generation, we adopt an autoregressive model which takes musical metadata as inputs and generates 4 bars of multitrack MIDI sequences. During training, we randomly drop tokens from the musical metadata to guarantee flexible control. It provides users with the freedom to select input types while maintaining generative performance, enabling greater flexibility in music composition. We validate the effectiveness of the strategy through experiments in terms of model capacity, musical fidelity, diversity, and controllability. Additionally, we scale up the model and compare it with other music generation model through a subjective test. Our results indicate its superiority in both control and music quality. We provide a URL link https://www.youtube.com/watch?v=-0drPrFJdMQ to our demonstration video. △ Less

Submitted 28 August, 2024; originally announced September 2024.

arXiv:2409.07048 [pdf, other]

Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations

Authors: Keumgang Cha, Donggeun Yu, Junghoon Seo

Abstract: The prominence of generalized foundation models in vision-language integration has witnessed a surge, given their multifarious applications. Within the natural domain, the procurement of vision-language datasets to construct these foundation models is facilitated by their abundant availability and the ease of web crawling. Conversely, in the remote sensing domain, although vision-language datasets… ▽ More The prominence of generalized foundation models in vision-language integration has witnessed a surge, given their multifarious applications. Within the natural domain, the procurement of vision-language datasets to construct these foundation models is facilitated by their abundant availability and the ease of web crawling. Conversely, in the remote sensing domain, although vision-language datasets exist, their volume is suboptimal for constructing robust foundation models. This study introduces an approach to curate vision-language datasets by employing an image decoding machine learning model, negating the need for human-annotated labels. Utilizing this methodology, we amassed approximately 9.6 million vision-language paired datasets in VHR imagery. The resultant model outperformed counterparts that did not leverage publicly available vision-language datasets, particularly in downstream tasks such as zero-shot classification, semantic localization, and image-text retrieval. Moreover, in tasks exclusively employing vision encoders, such as linear probing and k-NN classification, our model demonstrated superior efficacy compared to those relying on domain-specific vision-language datasets. △ Less

Submitted 11 September, 2024; originally announced September 2024.

Comments: This study was primarily conducted during the latter half of 2023

arXiv:2409.05227 [pdf, other]

BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration

Authors: Yuzong Chen, Jian Meng, Jae-sun Seo, Mohamed S. Abdelfattah

Abstract: Bit-level sparsity methods skip ineffectual zero-bit operations and are typically applicable within bit-serial deep learning accelerators. This type of sparsity at the bit-level is especially interesting because it is both orthogonal and compatible with other deep neural network (DNN) efficiency methods such as quantization and pruning. In this work, we improve the practicality and efficiency of b… ▽ More Bit-level sparsity methods skip ineffectual zero-bit operations and are typically applicable within bit-serial deep learning accelerators. This type of sparsity at the bit-level is especially interesting because it is both orthogonal and compatible with other deep neural network (DNN) efficiency methods such as quantization and pruning. In this work, we improve the practicality and efficiency of bitlevel sparsity through a novel algorithmic bit-pruning, averaging, and compression method, and a co-designed efficient bit-serial hardware accelerator. On the algorithmic side, we introduce bidirectional bit sparsity (BBS). The key insight of BBS is that we can leverage bit sparsity in a symmetrical way to prune either zero-bits or one-bits. This significantly improves the load balance of bit-serial computing and guarantees the level of sparsity to be more than 50%. On top of BBS, we further propose two bit-level binary pruning methods that require no retraining, and can be seamlessly applied to quantized DNNs. Combining binary pruning with a new tensor encoding scheme, BBS can both skip computation and reduce the memory footprint associated with bi-directional sparse bit columns. On the hardware side, we demonstrate the potential of BBS through BitVert, a bitserial architecture with an efficient PE design to accelerate DNNs with low overhead, exploiting our proposed binary pruning. Evaluation on seven representative DNN models shows that our approach achieves: (1) on average 1.66$\times$ reduction in model sizewith negligible accuracy loss of < 0.5%; (2) up to 3.03$\times$ speedupand 2.44$\times$ energy saving compared to prior DNN accelerators. △ Less

Submitted 8 September, 2024; originally announced September 2024.

Comments: Accepted by IEEE/ACM MICRO 2024

arXiv:2408.15233 [pdf]

Signatures of Chiral Superconductivity in Rhombohedral Graphene

Authors: Tonghang Han, Zhengguang Lu, Yuxuan Yao, Lihan Shi, Jixiang Yang, Junseok Seo, Shenyong Ye, Zhenghan Wu, Muyang Zhou, Haoyang Liu, Gang Shi, Zhenqi Hua, Kenji Watanabe, Takashi Taniguchi, Peng Xiong, Liang Fu, Long Ju

Abstract: Chiral superconductors are unconventional superconducting states that break time reversal symmetry spontaneously and typically feature Cooper pairing at non-zero angular momentum. Such states may host Majorana fermions and provide an important platform for topological physics research and fault-tolerant quantum computing. Despite of intensive search and prolonged studies of several candidate syste… ▽ More Chiral superconductors are unconventional superconducting states that break time reversal symmetry spontaneously and typically feature Cooper pairing at non-zero angular momentum. Such states may host Majorana fermions and provide an important platform for topological physics research and fault-tolerant quantum computing. Despite of intensive search and prolonged studies of several candidate systems, chiral superconductivity has remained elusive so far. Here we report the discovery of unconventional superconductivity in rhombohedral tetra-layer graphene. We observed two superconducting states in the gate-induced flat conduction bands with Tc up to 300 mK and charge density ne as low as 2.4*1011 cm-2, appearing robustly in three different devices, where electrons reside close to a proximate WSe2 layer, far away from WSe2, and in the absence of WSe2 respectively. Spontaneous time-reversal-symmetry-breaking (TRSB) due to electron's orbital motion is found, and several observations indicate the chiral nature of these superconducting states, including 1. In the superconducting state, Rxx shows fluctuations at zero magnetic field and magnetic hysteresis versus an out-of-plane magnetic field B, which are absent from all other superconductors; 2. one superconducting state develops within a spin- and valley-polarized quarter-metal phase, and is robust against the neighboring spin-valley-polarized quarter-metal state under B; 3. the normal states show anomalous Hall signals at zero magnetic field and magnetic hysteresis. We also observed a critical B > 0.9 Tesla, higher than any graphene superconductivity reported so far and indicates a strong-coupling superconductivity close the BCS-BEC crossover. Our observations establish a pure carbon material for the study of topological superconductivity, and pave the way to explore Majorana modes and topological quantum computing. △ Less

Submitted 27 August, 2024; originally announced August 2024.

arXiv:2408.10203 [pdf]

Extended Quantum Anomalous Hall States in Graphene/hBN Moiré Superlattices

Authors: Zhengguang Lu, Tonghang Han, Yuxuan Yao, Jixiang Yang, Junseok Seo, Lihan Shi, Shenyong Ye, Kenji Watanabe, Takashi Taniguchi, Long Ju

Abstract: Electrons in topological flat bands can form novel topological states driven by the correlation effects. The penta-layer rhombohedral graphene/hBN moire superlattice has been shown to host fractional quantum anomalous Hall effect (FQAHE) at ~400 mK, triggering discussions around the underlying mechanism and the role of moire effects. In particular, novel electron crystal states with non-trivial to… ▽ More Electrons in topological flat bands can form novel topological states driven by the correlation effects. The penta-layer rhombohedral graphene/hBN moire superlattice has been shown to host fractional quantum anomalous Hall effect (FQAHE) at ~400 mK, triggering discussions around the underlying mechanism and the role of moire effects. In particular, novel electron crystal states with non-trivial topology have been proposed. Here we report DC electrical transport measurement in rhombohedral penta- and tetra-layer graphene/hBN moire superlattices at electronic temperatures down to ~40 mK. We observed two more FQAH states in the penta-layer devices than previously reported. In a new tetra-layer device, we observed FQAHE at filling factors v = 3/5 and 2/3 at 300 mK. With a small bias current and the lowest temperature, we observed a new extended quantum anomalous Hall (EQAH) state and magnetic hysteresis, where Rxy = h/e2 and vanishing Rxx span a wide range of moire filling factor v from 0.5 to up to 1.3. By increasing the temperature or current, FQAHE can be recovered -- suggesting the break-down of the EQAH states and a phase transition into the fractional quantum Hall liquid. Furthermore, we observed displacement field-induced quantum phase transitions from the EQAH states to Fermi liquid, FQAH liquid and the likely composite Fermi liquid. Our observation establishes a new topological phase of electrons with quantized Hall resistance at zero magnetic field, and enriches the emergent quantum phenomena in materials with topological flat bands. △ Less

Submitted 19 August, 2024; originally announced August 2024.

arXiv:2408.09906 [pdf]

Diverse Impacts of Spin-Orbit Coupling on Superconductivity in Rhombohedral Graphene

Authors: Jixiang Yang, Xiaoyan Shi, Shenyong Ye, Chiho Yoon, Zhengguang Lu, Vivek Kakani, Tonghang Han, Junseok Seo, Lihan Shi, Kenji Watanabe, Takashi Taniguchi, Fan Zhang, Long Ju

Abstract: Engineering non-Abelian quasiparticles by combining superconductivity and topological states have been proposed as a route to realize topological quantum computation. Rhombohedral multilayer graphene with layer number N>=3 has been shown as a promising platform, as it hosts integer and fractional quantum anomalous Hall effects when proximitized by transition metal dichalcogenide (TMD) and a moire… ▽ More Engineering non-Abelian quasiparticles by combining superconductivity and topological states have been proposed as a route to realize topological quantum computation. Rhombohedral multilayer graphene with layer number N>=3 has been shown as a promising platform, as it hosts integer and fractional quantum anomalous Hall effects when proximitized by transition metal dichalcogenide (TMD) and a moire potential. However, superconductivity in similar devices have remained largely unexplored, although proximitized spin-orbit-coupling (SOC) effect has been shown to strengthen or induce superconductivity in both crystalline and twisted graphene. Here we report electron transport measurements of TMD-proximitized rhombohedral trilayer graphene (RTG) at temperatures down to 40 mK. We observed a new hole-doped superconducting state SC4 with a transition temperature Tc of 230 mK. On the electron-doped side, we identified a new isospin-symmetry breaking three-quarter-metal (TQM) phase. Near this three-quarter-metal state, the state SC3, very weak in bare RTG, is fully developed into a superconducting state at 110 mK. By performing fermiology analysis based on the quantum oscillation measurement, we showed that the SC3 and SC4 states reside at the phase boundaries between different isospin-symmetry-breaking states. These observations are aligned with the existing understanding that SOC enhances graphene superconductivity. Surprisingly, the original superconducting state SC1 in bare RTG is strongly suppressed in the presence of TMD, and we cannot find it down to the base temperature of our measurement. Our observations form the basis of exploring superconductivity and non-Abelian quasiparticles in rhombohedral graphene devices, and provide experimental evidence that challenges the understanding of the impacts of SOC on graphene superconductivity. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Comments: 35 pages; 4 figures, 1 table, 13 extended data figures;

arXiv:2408.05784 [pdf, other]

Quantum Support Vector Machine-Based Classification of GPS Signal Reception Conditions

Authors: Suhui Jeong, Sanghyun Kim, Jiwon Seo

Abstract: Global Positioning System (GPS) plays a critical role in navigation by utilizing satellite signals, but its accuracy in urban environments is often compromised by signal obstructions. Previous research has categorized GPS reception conditions into line-of-sight (LOS), non-line-of-sight (NLOS), and LOS+NLOS scenarios to enhance accuracy. This paper introduces a novel approach using quantum support… ▽ More Global Positioning System (GPS) plays a critical role in navigation by utilizing satellite signals, but its accuracy in urban environments is often compromised by signal obstructions. Previous research has categorized GPS reception conditions into line-of-sight (LOS), non-line-of-sight (NLOS), and LOS+NLOS scenarios to enhance accuracy. This paper introduces a novel approach using quantum support vector machines (QSVM) with a ZZ feature map and fidelity quantum kernel to classify urban GPS signal reception conditions, comparing its performance against classical SVM methods. While classical SVM has been previously explored for this purpose, our study is the first to apply QSVM to this classification task. We conducted experiments using datasets from two distinct urban locations to train and evaluate SVM and QSVM models. Our results demonstrate that QSVM achieves superior classification accuracy compared to classical SVM for urban GPS signal datasets. Additionally, we emphasize the importance of appropriately scaling raw data when utilizing QSVM. △ Less

Submitted 11 August, 2024; originally announced August 2024.

Comments: Submitted to IEEE QCE 2024

arXiv:2408.03609 [pdf, other]

doi 10.1109/MWC.011.2300354

HELPS for Emergency Location Service: Hyper-Enhanced Local Positioning System

Authors: Hichan Moon, Hyosoon Park, Jiwon Seo

Abstract: In this study, we propose a novel positioning and searching system for emergency location services, namely the hyper-enhanced local positioning system (HELPS), which is applicable to all mobile phone users, including legacy feature phone users. In the case of an emergency, rescuers are dispatched with portable signal measurement equipment around the estimated location of the emergency caller. Each… ▽ More In this study, we propose a novel positioning and searching system for emergency location services, namely the hyper-enhanced local positioning system (HELPS), which is applicable to all mobile phone users, including legacy feature phone users. In the case of an emergency, rescuers are dispatched with portable signal measurement equipment around the estimated location of the emergency caller. Each signal measurement device measures the uplink signal from the mobile phone of the caller. After calculating the rough location of the caller's mobile phone based on these measurements, rescuers can efficiently search for the caller using the received uplink signal strength. Thus, the positioning accuracy in a conventional sense is not a limitation for rescuers in finding the caller. HELPS is not a traditional positioning system but rather a system with humans in the loop designed to reduce search time in emergencies. HELPS can provide emergency location information even in environments where the GPS or Wi-Fi is not functional. Furthermore, for HELPS operation, no hardware changes or software installations are required on the caller's mobile phone. △ Less

Submitted 7 August, 2024; originally announced August 2024.

Comments: Submitted to IEEE Wireless Communications

arXiv:2408.02971 [pdf, other]

Wave Interpolation Neural Operator: Interpolated Prediction of Electric Fields Across Untrained Wavelengths

Authors: Joonhyuk Seo, Chanik Kang, Dongjin Seo, Haejun Chung

Abstract: Designing photonic structures requires electromagnetic simulations, which often require high computational costs. Researchers have developed surrogate solvers for predicting electric fields to alleviate the computational issues. However, existing surrogate solvers are limited to performing inference at fixed simulation conditions and require retraining for different conditions. To address this, we… ▽ More Designing photonic structures requires electromagnetic simulations, which often require high computational costs. Researchers have developed surrogate solvers for predicting electric fields to alleviate the computational issues. However, existing surrogate solvers are limited to performing inference at fixed simulation conditions and require retraining for different conditions. To address this, we propose Wave Interpolation Neural Operator (WINO), a novel surrogate solver enabling simulation condition interpolation across a continuous spectrum of broadband wavelengths. WINO introduces the Fourier Group Convolution Shuffling operator and a new conditioning method to efficiently predict electric fields from both trained and untrained wavelength data, achieving significant improvements in parameter efficiency and spectral interpolation performance. Our model demonstrates approximately 100 times faster performance than traditional finite-difference frequency-domain simulations. Moreover, compared to the state-of-the-art model, we achieve a 74% reduction in parameters and 80.5% improvements in prediction accuracy for untrained wavelengths, and 13.2% improvements for trained wavelengths. △ Less

Submitted 6 August, 2024; originally announced August 2024.

Comments: 9 pages, 5 figures, 4 tables / Appendix: 4 pages, 4 figures, 3 tables

arXiv:2407.20845 [pdf, other]

Assessing Graphical Perception of Image Embedding Models using Channel Effectiveness

Authors: Soohyun Lee, Minsuk Chang, Seokhyeon Park, Jinwook Seo

Abstract: Recent advancements in vision models have greatly improved their ability to handle complex chart understanding tasks, like chart captioning and question answering. However, it remains challenging to assess how these models process charts. Existing benchmarks only roughly evaluate model performance without evaluating the underlying mechanisms, such as how models extract image embeddings. This limit… ▽ More Recent advancements in vision models have greatly improved their ability to handle complex chart understanding tasks, like chart captioning and question answering. However, it remains challenging to assess how these models process charts. Existing benchmarks only roughly evaluate model performance without evaluating the underlying mechanisms, such as how models extract image embeddings. This limits our understanding of the model's ability to perceive fundamental graphical components. To address this, we introduce a novel evaluation framework to assess the graphical perception of image embedding models. For chart comprehension, we examine two main aspects of channel effectiveness: accuracy and discriminability of various visual channels. Channel accuracy is assessed through the linearity of embeddings, measuring how well the perceived magnitude aligns with the size of the stimulus. Discriminability is evaluated based on the distances between embeddings, indicating their distinctness. Our experiments with the CLIP model show that it perceives channel accuracy differently from humans and shows unique discriminability in channels like length, tilt, and curvature. We aim to develop this work into a broader benchmark for reliable visual encoders, enhancing models for precise chart comprehension and human-like perception in future applications. △ Less

Submitted 30 July, 2024; originally announced July 2024.

Comments: In Proceedings of the 2024 IEEE Visualization and Visual Analytics (VIS)

arXiv:2407.18950 [pdf, other]

Unexplainability of Artificial Intelligence Judgments in Kant's Perspective

Authors: Jongwoo Seo

Abstract: Kant's Critique of Pure Reason, a major contribution to the history of epistemology, proposes a table of categories to elucidate the structure of the a priori principle of human judgment. The technology of artificial intelligence (AI), based on functionalism, claims to simulate or replicate human judgment. To assess this claim, it is necessary to study whether AI judgment possesses the characteris… ▽ More Kant's Critique of Pure Reason, a major contribution to the history of epistemology, proposes a table of categories to elucidate the structure of the a priori principle of human judgment. The technology of artificial intelligence (AI), based on functionalism, claims to simulate or replicate human judgment. To assess this claim, it is necessary to study whether AI judgment possesses the characteristics of human judgment. This paper argues that AI judgments exhibit a form that cannot be understood in terms of the characteristics of human judgments according to Kant. Because the characteristics of judgment overlap, we can call this AI's uncertainty. Then, I show that concepts without physical intuitions are not easy to explain when their functions are shown through vision. Finally, I illustrate that even if AI makes sentences through subject and predicate in natural language, which are components of judgment, it is difficult to determine whether AI understands the concepts to the level humans can accept. This shows that it is questionable whether the explanation through natural language is reliable. △ Less

Submitted 8 September, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

Comments: 9 pages, 3 figures

arXiv:2407.16329 [pdf, other]

PhenoFlow: A Human-LLM Driven Visual Analytics System for Exploring Large and Complex Stroke Datasets

Authors: Jaeyoung Kim, Sihyeon Lee, Hyeon Jeon, Keon-Joo Lee, Hee-Joon Bae, Bohyoung Kim, Jinwook Seo

Abstract: Acute stroke demands prompt diagnosis and treatment to achieve optimal patient outcomes. However, the intricate and irregular nature of clinical data associated with acute stroke, particularly blood pressure (BP) measurements, presents substantial obstacles to effective visual analytics and decision-making. Through a year-long collaboration with experienced neurologists, we developed PhenoFlow, a… ▽ More Acute stroke demands prompt diagnosis and treatment to achieve optimal patient outcomes. However, the intricate and irregular nature of clinical data associated with acute stroke, particularly blood pressure (BP) measurements, presents substantial obstacles to effective visual analytics and decision-making. Through a year-long collaboration with experienced neurologists, we developed PhenoFlow, a visual analytics system that leverages the collaboration between human and Large Language Models (LLMs) to analyze the extensive and complex data of acute ischemic stroke patients. PhenoFlow pioneers an innovative workflow, where the LLM serves as a data wrangler while neurologists explore and supervise the output using visualizations and natural language interactions. This approach enables neurologists to focus more on decision-making with reduced cognitive load. To protect sensitive patient information, PhenoFlow only utilizes metadata to make inferences and synthesize executable codes, without accessing raw patient data. This ensures that the results are both reproducible and interpretable while maintaining patient privacy. The system incorporates a slice-and-wrap design that employs temporal folding to create an overlaid circular visualization. Combined with a linear bar graph, this design aids in exploring meaningful patterns within irregularly measured BP data. Through case studies, PhenoFlow has demonstrated its capability to support iterative analysis of extensive clinical datasets, reducing cognitive load and enabling neurologists to make well-informed decisions. Grounded in long-term collaboration with domain experts, our research demonstrates the potential of utilizing LLMs to tackle current challenges in data-driven clinical decision-making for acute ischemic stroke patients. △ Less

Submitted 23 July, 2024; originally announced July 2024.

Comments: 11 pages, 5 figures, paper to appear in IEEE Transactions on Visualization and Computer Graphics (TVCG) (Proc. IEEE VIS 2024)

arXiv:2407.16322 [pdf, other]

Offsetting Perceptual Bias in Visual Clustering: The Role of Point Size Adjustment in Variable Display Sizes

Authors: Taehyun Yang, Hyeon Jeon, Jinwook Seo

Abstract: Scatterplots are frequently shared across different displays in collaborative and communicative visual analytics. However, variations in displays diversify scatterplot sizes. Such variations can influence the perception of clustering patterns, introducing potential biases leading to misinterpretations in cluster analysis. In this research, we explore how scatterplot size affects cluster assignment… ▽ More Scatterplots are frequently shared across different displays in collaborative and communicative visual analytics. However, variations in displays diversify scatterplot sizes. Such variations can influence the perception of clustering patterns, introducing potential biases leading to misinterpretations in cluster analysis. In this research, we explore how scatterplot size affects cluster assignment and investigate how we can offset such bias. We first conduct a controlled study asking participants to perform visual clustering on scatterplots of varying sizes. We found that changes in scatterplot size significantly alter cluster perception in three key features. In our subsequent experiment, we examine how adjusting point sizes can mitigate this bias. As a result, we verify that adjusting point size can effectively counteract the perceptual biases caused by varying scatterplot sizes. We wrap up our research by discussing the necessity and applicability of our findings in realworld applications. △ Less

Submitted 23 July, 2024; originally announced July 2024.

Comments: work in progress

arXiv:2407.14300 [pdf, other]

Transversal cycles and paths in tournaments

Authors: Debsoumya Chakraborti, Jaehoon Kim, Hyunwoo Lee, Jaehyeon Seo

Abstract: Thomason [$\textit{Trans. Amer. Math. Soc.}$ 296.1 (1986)] proved that every sufficiently large tournament contains Hamilton paths and cycles with all possible orientations, except possibly the consistently oriented Hamilton cycle. This paper establishes $\textit{transversal}$ generalizations of these classical results. For a collection $\mathbf{T}=\{T_1,\dots,T_m\}$ of not-necessarily distinct to… ▽ More Thomason [$\textit{Trans. Amer. Math. Soc.}$ 296.1 (1986)] proved that every sufficiently large tournament contains Hamilton paths and cycles with all possible orientations, except possibly the consistently oriented Hamilton cycle. This paper establishes $\textit{transversal}$ generalizations of these classical results. For a collection $\mathbf{T}=\{T_1,\dots,T_m\}$ of not-necessarily distinct tournaments on the common vertex set $V$, an $m$-edge directed subgraph $\mathcal{D}$ with the vertices in $V$ is called a transversal if there exists an bijection $\varphi\colon E(\mathcal{D})\to [m]$ such that $e\in E(T_{\varphi(e)})$ for all $e\in E(\mathcal{D})$. We prove that for sufficiently large $n$, there exist transversal Hamilton cycles of all possible orientations possibly except the consistently oriented one. We also obtain a similar result for the transversal Hamilton paths of all possible orientations. These results generalize the classical theorem of Thomason, and our approach provides another proof of this theorem. △ Less

Submitted 19 July, 2024; originally announced July 2024.

arXiv:2407.12867 [pdf, other]

Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run

Authors: Gayathri Raman, Samuele Ronchini, James Delaunay, Aaron Tohuvavohu, Jamie A. Kennea, Tyler Parsotan, Elena Ambrosi, Maria Grazia Bernardini, Sergio Campana, Giancarlo Cusumano, Antonino D'Ai, Paolo D'Avanzo, Valerio D'Elia, Massimiliano De Pasquale, Simone Dichiara, Phil Evans, Dieter Hartmann, Paul Kuin, Andrea Melandri, Paul O'Brien, Julian P. Osborne, Kim Page, David M. Palmer, Boris Sbarufatti, Gianpiero Tagliaferri , et al. (1797 additional authors not shown)

Abstract: We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wav… ▽ More We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wave Transient Catalogs (GWTC-3). Targeted searches were carried out on the entire GW sample using the maximum--likelihood NITRATES pipeline on the BAT data made available via the GUANO infrastructure. We do not detect any significant electromagnetic emission that is temporally and spatially coincident with any of the GW candidates. We report flux upper limits in the 15-350 keV band as a function of sky position for all the catalog candidates. For GW candidates where the Swift-BAT false alarm rate is less than 10$^{-3}$ Hz, we compute the GW--BAT joint false alarm rate. Finally, the derived Swift-BAT upper limits are used to infer constraints on the putative electromagnetic emission associated with binary black hole mergers. △ Less

Submitted 13 July, 2024; originally announced July 2024.

Comments: 50 pages, 10 figures, 4 tables

arXiv:2407.12401 [pdf, other]

Geometric Remove-and-Retrain (GOAR): Coordinate-Invariant eXplainable AI Assessment

Authors: Yong-Hyun Park, Junghoon Seo, Bomseok Park, Seongsu Lee, Junghyo Jo

Abstract: Identifying the relevant input features that have a critical influence on the output results is indispensable for the development of explainable artificial intelligence (XAI). Remove-and-Retrain (ROAR) is a widely accepted approach for assessing the importance of individual pixels by measuring changes in accuracy following their removal and subsequent retraining of the modified dataset. However, w… ▽ More Identifying the relevant input features that have a critical influence on the output results is indispensable for the development of explainable artificial intelligence (XAI). Remove-and-Retrain (ROAR) is a widely accepted approach for assessing the importance of individual pixels by measuring changes in accuracy following their removal and subsequent retraining of the modified dataset. However, we uncover notable limitations in pixel-perturbation strategies. When viewed from a geometric perspective, we discover that these metrics fail to discriminate between differences among feature attribution methods, thereby compromising the reliability of the evaluation. To address this challenge, we introduce an alternative feature-perturbation approach named Geometric Remove-and-Retrain (GOAR). Through a series of experiments with both synthetic and real datasets, we substantiate that GOAR transcends the limitations of pixel-centric metrics. △ Less

Submitted 17 July, 2024; originally announced July 2024.

Comments: Accepted in XAI in Action Workshop @ NeurIPS2023

arXiv:2407.12227 [pdf, other]

Development of MMC-based lithium molybdate cryogenic calorimeters for AMoRE-II

Authors: A. Agrawal, V. V. Alenkov, P. Aryal, H. Bae, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, S. Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev , et al. (84 additional authors not shown)

Abstract: The AMoRE collaboration searches for neutrinoless double beta decay of $^{100}$Mo using molybdate scintillating crystals via low temperature thermal calorimetric detection. The early phases of the experiment, AMoRE-pilot and AMoRE-I, have demonstrated competitive discovery potential. Presently, the AMoRE-II experiment, featuring a large detector array with about 90 kg of $^{100}$Mo isotope, is und… ▽ More The AMoRE collaboration searches for neutrinoless double beta decay of $^{100}$Mo using molybdate scintillating crystals via low temperature thermal calorimetric detection. The early phases of the experiment, AMoRE-pilot and AMoRE-I, have demonstrated competitive discovery potential. Presently, the AMoRE-II experiment, featuring a large detector array with about 90 kg of $^{100}$Mo isotope, is under construction.This paper discusses the baseline design and characterization of the lithium molybdate cryogenic calorimeters to be used in the AMoRE-II detector modules. The results from prototype setups that incorporate new housing structures and two different crystal masses (316 g and 517 - 521 g), operated at 10 mK temperature, show energy resolutions (FWHM) of 7.55 - 8.82 keV at the 2.615 MeV $^{208}$Tl $γ$ line, and effective light detection of 0.79 - 0.96 keV/MeV. The simultaneous heat and light detection enables clear separation of alpha particles with a discrimination power of 12.37 - 19.50 at the energy region around $^6$Li(n, $α$)$^3$H with Q-value = 4.785 MeV. Promising detector performances were demonstrated at temperatures as high as 30 mK, which relaxes the temperature constraints for operating the large AMoRE-II array. △ Less

Submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.11406 [pdf, other]

Revisiting the Impact of Pursuing Modularity for Code Generation

Authors: Deokyeong Kang, Ki Jung Seo, Taeuk Kim

Abstract: Modular programming, which aims to construct the final program by integrating smaller, independent building blocks, has been regarded as a desirable practice in software development. However, with the rise of recent code generation agents built upon large language models (LLMs), a question emerges: is this traditional practice equally effective for these new tools? In this work, we assess the impa… ▽ More Modular programming, which aims to construct the final program by integrating smaller, independent building blocks, has been regarded as a desirable practice in software development. However, with the rise of recent code generation agents built upon large language models (LLMs), a question emerges: is this traditional practice equally effective for these new tools? In this work, we assess the impact of modularity in code generation by introducing a novel metric for its quantitative measurement. Surprisingly, unlike conventional wisdom on the topic, we find that modularity is not a core factor for improving the performance of code generation models. We also explore potential explanations for why LLMs do not exhibit a preference for modular code compared to non-modular code. △ Less

Submitted 7 October, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

Comments: EMNLP 2024 Findings

arXiv:2407.09153 [pdf]

doi 10.1038/s41467-024-49841-6

Topological Fermi-arc surface state covered by floating electrons on a two-dimensional electride

Authors: Chan-young Lim, Min-Seok Kim, Dong Cheol Lim, Sunghun Kim, Yeonghoon Lee, Jaehoon Cha, Gyubin Lee, Sang Yong Song, Dinesh Thapa, Jonathan D. Denlinger, Seong-Gon Kim, Sung Wng Kim, Jungpil Seo, Yeongkwan Kim

Abstract: Two-dimensional electrides can acquire topologically non-trivial phases due to intriguing interplay between the cationic atomic layers and anionic electron layers. However, experimental evidence of topological surface states has yet to be verified. Here, via angle-resolved photoemission spectroscopy (ARPES) and scanning tunnelling microscopy (STM), we probe the magnetic Weyl states of the ferromag… ▽ More Two-dimensional electrides can acquire topologically non-trivial phases due to intriguing interplay between the cationic atomic layers and anionic electron layers. However, experimental evidence of topological surface states has yet to be verified. Here, via angle-resolved photoemission spectroscopy (ARPES) and scanning tunnelling microscopy (STM), we probe the magnetic Weyl states of the ferromagnetic electride $[Gd_{2}$C]^{2+}\cdot2e^{-}$. In particular, the presence of Weyl cones and Fermi-arc states is demonstrated through photon energy-dependent ARPES measurements, agreeing with theoretical band structure calculations. Notably, the STM measurements reveal that the Fermi-arc states exist underneath a floating quantum electron liquid on the top Gd layer, forming double-stacked surface states in a heterostructure. Our work thus not only unveils the non-trivial topology of the $[Gd_{2}$C]^{2+}\cdot2e^{-}$ electride but also realizes a surface heterostructure that can host phenomena distinct from the bulk. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 22 pages, 6 figures

Journal ref: Nat. Commun. 15 (2024) 5615

arXiv:2407.05618 [pdf, other]

Improved limit on neutrinoless double beta decay of \mohundred~from AMoRE-I

Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (83 additional authors not shown)

Abstract: AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate c… ▽ More AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate crystals, at the Yangyang Underground Laboratory for over two years. The exposure was 8.02 kg$\cdot$year (or 3.89 kg$_{\mathrm{^{100}Mo}}\cdot$year) and the total background rate near the Q-value was 0.025 $\pm$ 0.002 counts/keV/kg/year. We observed no indication of $0νββ$ decay and report a new lower limit of the half-life of $^{100}$Mo $0νββ$ decay as $ T^{0ν}_{1/2}>3.0\times10^{24}~\mathrm{years}$ at 90\% confidence level. The effective Majorana mass limit range is $m_{ββ}<$(210--610) meV using nuclear matrix elements estimated in the framework of different models, including the recent shell model calculations. △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: 7 pages, 4 figures

arXiv:2407.03010 [pdf, other]

Context-Aware Video Instance Segmentation

Authors: Seunghun Lee, Jiwan Seo, Kiljoon Han, Minwoo Choi, Sunghoon Im

Abstract: In this paper, we introduce the Context-Aware Video Instance Segmentation (CAVIS), a novel framework designed to enhance instance association by integrating contextual information adjacent to each object. To efficiently extract and leverage this information, we propose the Context-Aware Instance Tracker (CAIT), which merges contextual data surrounding the instances with the core instance features… ▽ More In this paper, we introduce the Context-Aware Video Instance Segmentation (CAVIS), a novel framework designed to enhance instance association by integrating contextual information adjacent to each object. To efficiently extract and leverage this information, we propose the Context-Aware Instance Tracker (CAIT), which merges contextual data surrounding the instances with the core instance features to improve tracking accuracy. Additionally, we introduce the Prototypical Cross-frame Contrastive (PCC) loss, which ensures consistency in object-level features across frames, thereby significantly enhancing instance matching accuracy. CAVIS demonstrates superior performance over state-of-the-art methods on all benchmark datasets in video instance segmentation (VIS) and video panoptic segmentation (VPS). Notably, our method excels on the OVIS dataset, which is known for its particularly challenging videos. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: Project page: https://seung-hun-lee.github.io/projects/CAVIS/

arXiv:2406.19707 [pdf, other]

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management

Authors: Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim

Abstract: Transformer-based large language models (LLMs) demonstrate impressive performance across various natural language processing tasks. Serving LLM inference for generating long contents, however, poses a challenge due to the enormous memory footprint of the transient state, known as the key-value (KV) cache, which scales with the sequence length and batch size. In this paper, we present InfiniGen, a… ▽ More Transformer-based large language models (LLMs) demonstrate impressive performance across various natural language processing tasks. Serving LLM inference for generating long contents, however, poses a challenge due to the enormous memory footprint of the transient state, known as the key-value (KV) cache, which scales with the sequence length and batch size. In this paper, we present InfiniGen, a novel KV cache management framework tailored for long-text generation, which synergistically works with modern offloading-based inference systems. InfiniGen leverages the key insight that a few important tokens that are essential for computing the subsequent attention layer in the Transformer can be speculated by performing a minimal rehearsal with the inputs of the current layer and part of the query weight and key cache of the subsequent layer. This allows us to prefetch only the essential KV cache entries (without fetching them all), thereby mitigating the fetch overhead from the host memory in offloading-based LLM serving systems. Our evaluation on several representative LLMs shows that InfiniGen improves the overall performance of a modern offloading-based system by up to 3.00x compared to prior KV cache management methods while offering substantially better model accuracy. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Comments: OSDI 2024

arXiv:2406.16042 [pdf, other]

Pose-dIVE: Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification

Authors: Inès Hyeonsu Kim, JoungBin Lee, Woojeong Jin, Soowon Son, Kyusun Cho, Junyoung Seo, Min-Seop Kwak, Seokju Cho, JeongYeol Baek, Byeongwon Lee, Seungryong Kim

Abstract: Person re-identification (Re-ID) often faces challenges due to variations in human poses and camera viewpoints, which significantly affect the appearance of individuals across images. Existing datasets frequently lack diversity and scalability in these aspects, hindering the generalization of Re-ID models to new camera systems. We propose Pose-dIVE, a novel data augmentation approach that incorpor… ▽ More Person re-identification (Re-ID) often faces challenges due to variations in human poses and camera viewpoints, which significantly affect the appearance of individuals across images. Existing datasets frequently lack diversity and scalability in these aspects, hindering the generalization of Re-ID models to new camera systems. We propose Pose-dIVE, a novel data augmentation approach that incorporates sparse and underrepresented human pose and camera viewpoint examples into the training data, addressing the limited diversity in the original training data distribution. Our objective is to augment the training dataset to enable existing Re-ID models to learn features unbiased by human pose and camera viewpoint variations. To achieve this, we leverage the knowledge of pre-trained large-scale diffusion models. By conditioning the diffusion model on both the human pose and camera viewpoint concurrently through the SMPL model, we generate training data with diverse human poses and camera viewpoints. Experimental results demonstrate the effectiveness of our method in addressing human pose bias and enhancing the generalizability of Re-ID models compared to other data augmentation-based Re-ID approaches. △ Less

Submitted 15 October, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

arXiv:2406.13502 [pdf, other]

ManWav: The First Manchu ASR Model

Authors: Jean Seo, Minha Kang, Sungjoo Byun, Sangah Lee

Abstract: This study addresses the widening gap in Automatic Speech Recognition (ASR) research between high resource and extremely low resource languages, with a particular focus on Manchu, a critically endangered language. Manchu exemplifies the challenges faced by marginalized linguistic communities in accessing state-of-the-art technologies. In a pioneering effort, we introduce the first-ever Manchu ASR… ▽ More This study addresses the widening gap in Automatic Speech Recognition (ASR) research between high resource and extremely low resource languages, with a particular focus on Manchu, a critically endangered language. Manchu exemplifies the challenges faced by marginalized linguistic communities in accessing state-of-the-art technologies. In a pioneering effort, we introduce the first-ever Manchu ASR model ManWav, leveraging Wav2Vec2-XLSR-53. The results of the first Manchu ASR is promising, especially when trained with our augmented data. Wav2Vec2-XLSR-53 fine-tuned with augmented data demonstrates a 0.02 drop in CER and 0.13 drop in WER compared to the same base model fine-tuned with original data. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: ACL2024/Field Matters

arXiv:2406.09698 [pdf, other]

Projected background and sensitivity of AMoRE-II

Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (81 additional authors not shown)

Abstract: AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located ap… ▽ More AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located approximately 1000 meters deep in Jeongseon, Korea. The goal of AMoRE-II is to reach up to $T^{0νββ}_{1/2}$ $\sim$ 6 $\times$ 10$^{26}$ years, corresponding to an effective Majorana mass of 15 - 29 meV, covering all the inverted mass hierarchy regions. To achieve this, the background level of the experimental configurations and possible background sources of gamma and beta events should be well understood. We have intensively performed Monte Carlo simulations using the GEANT4 toolkit in all the experimental configurations with potential sources. We report the estimated background level that meets the 10$^{-4}$counts/(keV$\cdot$kg$\cdot$yr) requirement for AMoRE-II in the region of interest (ROI) and show the projected half-life sensitivity based on the simulation study. △ Less

Submitted 14 October, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.08714 [pdf, other]

Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator

Authors: Mandovi Mukherjee, Xiangyu Mao, Nael Rahman, Coleman DeLude, Joe Driscoll, Sudarshan Sharma, Payman Behnam, Uday Kamal, Jongseok Woo, Daehyun Kim, Sharjeel Khan, Jianming Tong, Jamin Seo, Prachi Sinha, Madhavan Swaminathan, Tushar Krishna, Santosh Pande, Justin Romberg, Saibal Mukhopadhyay

Abstract: A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous… ▽ More A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous control to extract concurrency in compute as well as low latency. It achieves a $518$ MHz per channel bandwidth in a prototype $4$-node system. The maximum emulation range supported in this paradigm is $9.5$ km with $0.24$ $μ$s of per-sample emulation latency. 2). The FPGA-based implementation, evaluated on a Xilinx ZCU104 board, demonstrates a $9$-node test case (two Transmitters, one Receiver, and $6$ passive reflectors) with an emulation range of $1.13$ km to $27.3$ km at $215$ MHz bandwidth. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2405.20042

CycleFormer : TSP Solver Based on Language Modeling

Authors: Jieun Yook, Junpyo Seo, Joon Huh, Han Joon Byun, Byung-ro Moon

Abstract: We propose a new transformer model for the Traveling Salesman Problem (TSP) called CycleFormer. We identified distinctive characteristics that need to be considered when applying a conventional transformer model to TSP and aimed to fully incorporate these elements into the TSP-specific transformer. Unlike the token sets in typical language models, which are limited and static, the token (node) set… ▽ More We propose a new transformer model for the Traveling Salesman Problem (TSP) called CycleFormer. We identified distinctive characteristics that need to be considered when applying a conventional transformer model to TSP and aimed to fully incorporate these elements into the TSP-specific transformer. Unlike the token sets in typical language models, which are limited and static, the token (node) set in TSP is unlimited and dynamic. To exploit this fact to the fullest, we equated the encoder output with the decoder linear layer and directly connected the context vector of the encoder to the decoder encoding. Additionally, we added a positional encoding to the encoder tokens that reflects the two-dimensional nature of TSP, and devised a circular positional encoding for the decoder tokens that considers the cyclic properties of a tour. By incorporating these ideas, CycleFormer outperforms state-of-the-art (SOTA) transformer models for TSP from TSP-50 to TSP-500. Notably, on TSP-500, the optimality gap was reduced by approximately 2.8 times, from 3.09% to 1.10%, compared to the existing SOTA. The code will be made available at https://github.com/Giventicket/CycleFormer. △ Less

Submitted 4 October, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

Comments: The paper's content (experiments) is insufficient

arXiv:2405.17840 [pdf, other]

Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents

Authors: Andrew H. Lee, Sina J. Semnani, Galo Castillo-López, Gäel de Chalendar, Monojit Choudhury, Ashna Dua, Kapil Rajesh Kavitha, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Alexis Lombard, Mehrad Moradshahi, Gihyun Park, Nasredine Semmar, Jiwon Seo, Tianhao Shen, Manish Shrivastava, Deyi Xiong, Monica S. Lam

Abstract: Creating multilingual task-oriented dialogue (TOD) agents is challenging due to the high cost of training data acquisition. Following the research trend of improving training data efficiency, we show for the first time, that in-context learning is sufficient to tackle multilingual TOD. To handle the challenging dialogue state tracking (DST) subtask, we break it down to simpler steps that are mor… ▽ More Creating multilingual task-oriented dialogue (TOD) agents is challenging due to the high cost of training data acquisition. Following the research trend of improving training data efficiency, we show for the first time, that in-context learning is sufficient to tackle multilingual TOD. To handle the challenging dialogue state tracking (DST) subtask, we break it down to simpler steps that are more compatible with in-context learning where only a handful of few-shot examples are used. We test our approach on the multilingual TOD dataset X-RiSAWOZ, which has 12 domains in Chinese, English, French, Korean, Hindi, and code-mixed Hindi-English. Our turn-by-turn DST accuracy on the 6 languages range from 55.6% to 80.3%, seemingly worse than the SOTA results from fine-tuned models that achieve from 60.7% to 82.8%; our BLEU scores in the response generation (RG) subtask are also significantly lower than SOTA. However, after manual evaluation of the validation set, we find that by correcting gold label errors and improving dataset annotation schema, GPT-4 with our prompts can achieve (1) 89.6%-96.8% accuracy in DST, and (2) more than 99% correct response generation across different languages. This leads us to conclude that current automatic metrics heavily underestimate the effectiveness of in-context learning. △ Less

Submitted 16 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.17251 [pdf, other]

GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping

Authors: Junyoung Seo, Kazumi Fukuda, Takashi Shibuya, Takuya Narihira, Naoki Murata, Shoukang Hu, Chieh-Hsin Lai, Seungryong Kim, Yuki Mitsufuji

Abstract: Generating novel views from a single image remains a challenging task due to the complexity of 3D scenes and the limited diversity in the existing multi-view datasets to train a model on. Recent research combining large-scale text-to-image (T2I) models with monocular depth estimation (MDE) has shown promise in handling in-the-wild images. In these methods, an input view is geometrically warped to… ▽ More Generating novel views from a single image remains a challenging task due to the complexity of 3D scenes and the limited diversity in the existing multi-view datasets to train a model on. Recent research combining large-scale text-to-image (T2I) models with monocular depth estimation (MDE) has shown promise in handling in-the-wild images. In these methods, an input view is geometrically warped to novel views with estimated depth maps, then the warped image is inpainted by T2I models. However, they struggle with noisy depth maps and loss of semantic details when warping an input view to novel viewpoints. In this paper, we propose a novel approach for single-shot novel view synthesis, a semantic-preserving generative warping framework that enables T2I generative models to learn where to warp and where to generate, through augmenting cross-view attention with self-attention. Our approach addresses the limitations of existing methods by conditioning the generative model on source view images and incorporating geometric warping signals. Qualitative and quantitative evaluations demonstrate that our model outperforms existing methods in both in-domain and out-of-domain scenarios. Project page is available at https://GenWarp-NVS.github.io/. △ Less

Submitted 26 September, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

Comments: Accepted to NeurIPS 2024 / Project page: https://GenWarp-NVS.github.io

arXiv:2405.16855 [pdf, ps, other]

Maximal operators given by Fourier multipliers with dilation of fractional dimensions

Authors: Jin Bong Lee, Jinsol Seo

Abstract: In this paper, we investigate $L^p$ bounds of maximal Fourier multiplier operators with dilation of fractional dimensions. For the Fourier multipliers, we suggest a criterion related to dimensions of dilation sets which guarantees $L^p$ bounds of the maximal operators for each $p$. Our criterion covers Mikhlin-type multipliers, multipliers with limited decay, and multipliers with slow decay. In this paper, we investigate $L^p$ bounds of maximal Fourier multiplier operators with dilation of fractional dimensions. For the Fourier multipliers, we suggest a criterion related to dimensions of dilation sets which guarantees $L^p$ bounds of the maximal operators for each $p$. Our criterion covers Mikhlin-type multipliers, multipliers with limited decay, and multipliers with slow decay. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: 15 pages

MSC Class: 42B25; 42B15; 42B35; 42B37

arXiv:2405.16132 [pdf, other]

doi 10.1109/ITC-CSCC62988.2024.10628280

Efficient Quantum Circuit Encoding of Object Information in 2D Ray Casting

Authors: Seungjae Lee, Suhui Jeong, Jiwon Seo

Abstract: Quantum computing holds the potential to solve problems that are practically unsolvable by classical computers due to its ability to significantly reduce time complexity. We aim to harness this potential to enhance ray casting, a pivotal technique in computer graphics for simplifying the rendering of 3D objects. To perform ray casting in a quantum computer, we need to encode the defining parameter… ▽ More Quantum computing holds the potential to solve problems that are practically unsolvable by classical computers due to its ability to significantly reduce time complexity. We aim to harness this potential to enhance ray casting, a pivotal technique in computer graphics for simplifying the rendering of 3D objects. To perform ray casting in a quantum computer, we need to encode the defining parameters of primitives into qubits. However, during the current noisy intermediate-scale quantum (NISQ) era, challenges arise from the limited number of qubits and the impact of noise when executing multiple gates. Through logic optimization, we reduced the depth of quantum circuits as well as the number of gates and qubits. As a result, the event count of correct measurements from an IBM quantum computer significantly exceeded that of incorrect measurements. △ Less

Submitted 25 May, 2024; originally announced May 2024.

Comments: Submitted to ITC-CSCC 2024

arXiv:2405.14222 [pdf, other]

RAQ-VAE: Rate-Adaptive Vector-Quantized Variational Autoencoder

Authors: Jiwan Seo, Joonhyuk Kang

Abstract: Vector Quantized Variational AutoEncoder (VQ-VAE) is an established technique in machine learning for learning discrete representations across various modalities. However, its scalability and applicability are limited by the need to retrain the model to adjust the codebook for different data or model scales. We introduce the Rate-Adaptive VQ-VAE (RAQ-VAE) framework, which addresses this challenge… ▽ More Vector Quantized Variational AutoEncoder (VQ-VAE) is an established technique in machine learning for learning discrete representations across various modalities. However, its scalability and applicability are limited by the need to retrain the model to adjust the codebook for different data or model scales. We introduce the Rate-Adaptive VQ-VAE (RAQ-VAE) framework, which addresses this challenge with two novel codebook representation methods: a model-based approach using a clustering-based technique on an existing well-trained VQ-VAE model, and a data-driven approach utilizing a sequence-to-sequence (Seq2Seq) model for variable-rate codebook generation. Our experiments demonstrate that RAQ-VAE achieves effective reconstruction performance across multiple rates, often outperforming conventional fixed-rate VQ-VAE models. This work enhances the adaptability and performance of VQ-VAEs, with broad applications in data reconstruction, generation, and computer vision tasks. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: Under review

arXiv:2405.12488 [pdf, other]

First joint oscillation analysis of Super-Kamiokande atmospheric and T2K accelerator neutrino data

Authors: Super-Kamiokande, T2K collaborations, :, S. Abe, K. Abe, N. Akhlaq, R. Akutsu, H. Alarakia-Charles, A. Ali, Y. I. Alj Hakim, S. Alonso Monsalve, S. Amanai, C. Andreopoulos, L. H. V. Anthony, M. Antonova, S. Aoki, K. A. Apte, T. Arai, T. Arihara, S. Arimoto, Y. Asada, R. Asaka, Y. Ashida, E. T. Atkin, N. Babu , et al. (524 additional authors not shown)

Abstract: The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlapping in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of… ▽ More The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlapping in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of $19.7(16.3) \times 10^{20}$ protons on target in (anti)neutrino mode, the analysis finds a 1.9$σ$ exclusion of CP-conservation (defined as $J_{CP}=0$) and a preference for the normal mass ordering. △ Less

Submitted 15 October, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

Comments: 12 pages, 4 figures

arXiv:2405.11689 [pdf, other]

Investigation of suppression of $Υ(nS)$ in relativistic heavy-ion collisions at RHIC and LHC energies

Authors: Junlee Kim, Jaebeom Park, Byungsik Hong, Juhee Hong, Eun-Joo Kim, Yongsun Kim, MinJung Kweon, Su Houng Lee, Sanghoon Lim, Jinjoo Seo

Abstract: The primary purpose of studying quarkonium production in relativistic heavy-ion collisions is to understand the properties of the quark-gluon plasma. At various collision systems, measurements of quarkonium states of different binding energies, such as $Υ(nS)$, can provide comprehensive information. A model study has been performed to investigate the modification of $Υ(nS)$ production in Pb-Pb col… ▽ More The primary purpose of studying quarkonium production in relativistic heavy-ion collisions is to understand the properties of the quark-gluon plasma. At various collision systems, measurements of quarkonium states of different binding energies, such as $Υ(nS)$, can provide comprehensive information. A model study has been performed to investigate the modification of $Υ(nS)$ production in Pb-Pb collisions at $\sqrt{s_{\mathrm{NN}}}=$ 5.02 TeV and Au-Au collisions at $\sqrt{s_{\mathrm{NN}}}=$ 200 GeV. The Monte-Carlo simulation study is performed with a publicly available hydrodynamic simulation package for the quark-gluon plasma medium and a theoretical calculation of temperature-dependent thermal width of $Υ(nS)$ considering the gluo-dissociation and inelastic parton scattering for dissociation inside the medium. In addition, we perform a systematic study with different descriptions of initial collision geometry and formation time of $Υ(nS)$ to investigate their impacts on yield modification. The model calculation with a varied parameter set can describe the experimental data of $Υ(nS)$ in Pb-Pb collisions at 5.02 TeV and $Υ(2S)$ in Au-Au collisions at 200 GeV but underestimates the modification of $Υ(1S)$ at the lower collision energy. The nuclear absorption mechanism is explored to understand the discrepancy between the data and simulation. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: 9 pages, 11 figures

arXiv:2405.09879 [pdf, other]

Generative Unlearning for Any Identity

Authors: Juwon Seo, Sung-Hoon Lee, Tae-Young Lee, Seungjun Moon, Gyeong-Moon Park

Abstract: Recent advances in generative models trained on large-scale datasets have made it possible to synthesize high-quality samples across various domains. Moreover, the emergence of strong inversion networks enables not only a reconstruction of real-world images but also the modification of attributes through various editing methods. However, in certain domains related to privacy issues, e.g., human fa… ▽ More Recent advances in generative models trained on large-scale datasets have made it possible to synthesize high-quality samples across various domains. Moreover, the emergence of strong inversion networks enables not only a reconstruction of real-world images but also the modification of attributes through various editing methods. However, in certain domains related to privacy issues, e.g., human faces, advanced generative models along with strong inversion methods can lead to potential misuses. In this paper, we propose an essential yet under-explored task called generative identity unlearning, which steers the model not to generate an image of a specific identity. In the generative identity unlearning, we target the following objectives: (i) preventing the generation of images with a certain identity, and (ii) preserving the overall quality of the generative model. To satisfy these goals, we propose a novel framework, Generative Unlearning for Any Identity (GUIDE), which prevents the reconstruction of a specific identity by unlearning the generator with only a single image. GUIDE consists of two parts: (i) finding a target point for optimization that un-identifies the source latent code and (ii) novel loss functions that facilitate the unlearning procedure while less affecting the learned distribution. Our extensive experiments demonstrate that our proposed method achieves state-of-the-art performance in the generative machine unlearning task. The code is available at https://github.com/KHU-AGI/GUIDE. △ Less

Submitted 16 May, 2024; originally announced May 2024.

Comments: 15 pages, 17 figures, 10 tables, CVPR 2024 Poster

arXiv:2405.09765 [pdf, other]

doi 10.1109/ICASSP48485.2024.10446698

Unsupervised Extractive Dialogue Summarization in Hyperdimensional Space

Authors: Seongmin Park, Kyungho Kim, Jaejin Seo, Jihwa Lee

Abstract: We present HyperSum, an extractive summarization framework that captures both the efficiency of traditional lexical summarization and the accuracy of contemporary neural approaches. HyperSum exploits the pseudo-orthogonality that emerges when randomly initializing vectors at extremely high dimensions ("blessing of dimensionality") to construct representative and efficient sentence embeddings. Simp… ▽ More We present HyperSum, an extractive summarization framework that captures both the efficiency of traditional lexical summarization and the accuracy of contemporary neural approaches. HyperSum exploits the pseudo-orthogonality that emerges when randomly initializing vectors at extremely high dimensions ("blessing of dimensionality") to construct representative and efficient sentence embeddings. Simply clustering the obtained embeddings and extracting their medoids yields competitive summaries. HyperSum often outperforms state-of-the-art summarizers -- in terms of both summary accuracy and faithfulness -- while being 10 to 100 times faster. We open-source HyperSum as a strong baseline for unsupervised extractive summarization. △ Less

Submitted 15 May, 2024; originally announced May 2024.

Comments: ICASSP 2024

arXiv:2405.07267 [pdf, ps, other]

Fields, Bridges, and Foundations: How Researchers Browse Citation Network Visualizations

Authors: Kiroong Choe, Eunhye Kim, Sangwon Park, Jinwook Seo

Abstract: Visualizing citation relations with network structures is widely used, but the visual complexity can make it challenging for individual researchers trying to navigate them. We collected data from 18 researchers with an interface that we designed using network simplification methods and analyzed how users browsed and identified important papers. Our analysis reveals six major patterns used for iden… ▽ More Visualizing citation relations with network structures is widely used, but the visual complexity can make it challenging for individual researchers trying to navigate them. We collected data from 18 researchers with an interface that we designed using network simplification methods and analyzed how users browsed and identified important papers. Our analysis reveals six major patterns used for identifying papers of interest, which can be categorized into three key components: Fields, Bridges, and Foundations, each viewed from two distinct perspectives: layout-oriented and connection-oriented. The connection-oriented approach was found to be more reliable for selecting relevant papers, but the layout-oriented method was adopted more often, even though it led to unexpected results and user frustration. Our findings emphasize the importance of integrating these components and the necessity to balance visual layouts with meaningful connections to enhance the effectiveness of citation networks in academic browsing systems. △ Less

Submitted 11 September, 2024; v1 submitted 12 May, 2024; originally announced May 2024.

arXiv:2405.06265 [pdf, other]

Uncertainty-aware Semantic Mapping in Off-road Environments with Dempster-Shafer Theory of Evidence

Authors: Junyoung Kim, Junwon Seo

Abstract: Semantic mapping with Bayesian Kernel Inference (BKI) has shown promise in providing a richer understanding of environments by effectively leveraging local spatial information. However, existing methods face challenges in constructing accurate semantic maps or reliable uncertainty maps in perceptually challenging environments due to unreliable semantic predictions. To address this issue, we propos… ▽ More Semantic mapping with Bayesian Kernel Inference (BKI) has shown promise in providing a richer understanding of environments by effectively leveraging local spatial information. However, existing methods face challenges in constructing accurate semantic maps or reliable uncertainty maps in perceptually challenging environments due to unreliable semantic predictions. To address this issue, we propose an evidential semantic mapping framework, which integrates the evidential reasoning of Dempster-Shafer Theory of Evidence (DST) into the entire mapping pipeline by adopting Evidential Deep Learning (EDL) and Dempster's rule of combination. Additionally, the extended belief is devised to incorporate local spatial information based on their uncertainty during the mapping process. Comprehensive experiments across various off-road datasets demonstrate that our framework enhances the reliability of uncertainty maps, consistently outperforming existing methods in scenes with high perceptual uncertainties while showing semantic accuracy comparable to the best-performing semantic mapping techniques. △ Less

Submitted 10 May, 2024; originally announced May 2024.

Comments: Our project website can be found at https://kjyoung.github.io/Homepage/#/Projects/Fully-Evidential-Semantic-Mapping

Showing 1–50 of 492 results for author: Seo, J