Skip to main content

Showing 1–50 of 955 results for author: Bao, J

  1. arXiv:2410.13850  [pdf, other

    cs.LG cs.AI

    Influence Functions for Scalable Data Attribution in Diffusion Models

    Authors: Bruno Mlodozeniec, Runa Eschenhagen, Juhan Bae, Alexander Immer, David Krueger, Richard Turner

    Abstract: Diffusion models have led to significant advancements in generative modelling. Yet their widespread adoption poses challenges regarding data attribution and interpretability. In this paper, we aim to help address such challenges in diffusion models by developing an \textit{influence functions} framework. Influence function-based data attribution methods approximate how a model's output would have… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  2. arXiv:2410.10821  [pdf, other

    cs.CV

    Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models

    Authors: Jingzhi Bao, Xueting Li, Ming-Hsuan Yang

    Abstract: 3D meshes are widely used in computer vision and graphics for their efficiency in animation and minimal memory use, playing a crucial role in movies, games, AR, and VR. However, creating temporally consistent and realistic textures for mesh sequences remains labor-intensive for professional artists. On the other hand, while video diffusion models excel at text-driven video generation, they often l… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Project page: https://tex4d.github.io/

  3. arXiv:2410.08245  [pdf, other

    cs.LG cs.AI

    Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts

    Authors: Sukwon Yun, Inyoung Choi, Jie Peng, Yangfan Wu, Jingxuan Bao, Qiyiwen Zhang, Jiayi Xin, Qi Long, Tianlong Chen

    Abstract: Multimodal learning has gained increasing importance across various fields, offering the ability to integrate data from diverse sources such as images, text, and personalized records, which are frequently observed in medical domains. However, in scenarios where some modalities are missing, many existing frameworks struggle to accommodate arbitrary modality combinations, often relying heavily on a… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024 Spotlight

  4. arXiv:2410.04442  [pdf, other

    cs.LG stat.ML

    TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting

    Authors: Peiyuan Liu, Beiliang Wu, Yifan Hu, Naiqi Li, Tao Dai, Jigang Bao, Shu-tao Xia

    Abstract: Non-stationarity poses significant challenges for multivariate time series forecasting due to the inherent short-term fluctuations and long-term trends that can lead to spurious regressions or obscure essential long-term relationships. Most existing methods either eliminate or retain non-stationarity without adequately addressing its distinct impacts on short-term and long-term modeling. Eliminati… ▽ More

    Submitted 12 October, 2024; v1 submitted 6 October, 2024; originally announced October 2024.

  5. arXiv:2410.03937  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Clustering Alzheimer's Disease Subtypes via Similarity Learning and Graph Diffusion

    Authors: Tianyi Wei, Shu Yang, Davoud Ataee Tarzanagh, Jingxuan Bao, Jia Xu, Patryk Orzechowski, Joost B. Wagenaar, Qi Long, Li Shen

    Abstract: Alzheimer's disease (AD) is a complex neurodegenerative disorder that affects millions of people worldwide. Due to the heterogeneous nature of AD, its diagnosis and treatment pose critical challenges. Consequently, there is a growing research interest in identifying homogeneous AD subtypes that can assist in addressing these challenges in recent years. In this study, we aim to identify subtypes of… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: ICIBM'23': International Conference on Intelligent Biology and Medicine, Tampa, FL, USA, July 16-19, 2023

  6. arXiv:2410.01300  [pdf

    physics.chem-ph

    Atmospheric Pressure Ammonia Synthesis on AuRu Catalysts Enabled by Plasmon-Controlled Hydrogenation and Nitrogen-species Desorption

    Authors: Lin Yuan, Briley B. Bourgeois, Elijah Begin, Yirui Zhang, Alan X. Dai, Zhihua Cheng, Amy S. McKeown-Green, Zhichen Xue, Yi Cui, Kun Xu, Yu Wang, Matthew R. Jones, Yi Cui, Arun Majumdar, Junwei Lucas Bao, Jennifer A. Dionne

    Abstract: Ammonia is a key component of fertilizer and a potential clean fuel and hydrogen carrier. The Haber-Bosch process for ammonia synthesis consumes more than half of industrial hydrogen and contributes up to ~3% of global greenhouse gas emissions. Light-driven reactions via surface plasmon resonances offer a less energy-intensive pathway for ammonia production by altering reaction intermediates. Here… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: 21 pages, 4 figures, journal article submission soon

  7. arXiv:2409.19989  [pdf, other

    cs.CV cs.GR

    RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models

    Authors: Jangyeong Kim, Donggoo Kang, Junyoung Choi, Jeonga Wi, Junho Gwon, Jiun Bae, Dumim Yoon, Junghyun Han

    Abstract: Text-to-texture generation has recently attracted increasing attention, but existing methods often suffer from the problems of view inconsistencies, apparent seams, and misalignment between textures and the underlying mesh. In this paper, we propose a robust text-to-texture method for generating consistent and seamless textures that are well aligned with the mesh. Our method leverages state-of-the… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: 11 pages, 13 figures

  8. arXiv:2409.17224  [pdf, other

    hep-th

    Atomic Higgsings of 6D SCFTs

    Authors: Jiakang Bao, Hao Y. Zhang

    Abstract: In this paper, we study the full Higgs branch Hasse diagram for any given 6d $\mathcal{N}=(1,0)$ SCFT constructed via F-theory. This can be done by a procedure of determining all the minimal Higgsings on the generalized quiver of the 6d SCFT. We call this procedure the atomic Higgsing, which can be implemented iteratively. We present our general algorithms with many concrete examples of Hasse diag… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 75 pages and appendices

  9. arXiv:2409.16517  [pdf, other

    cs.AI

    SynChart: Synthesizing Charts from Language Models

    Authors: Mengchen Liu, Qixiu Li, Dongdong Chen, Dong Chen, Jianmin Bao, Yunsheng Li

    Abstract: With the release of GPT-4V(O), its use in generating pseudo labels for multi-modality tasks has gained significant popularity. However, it is still a secret how to build such advanced models from its base large language models (LLMs). This work explores the potential of using LLMs alone for data generation and develop competitive multi-modality models focusing on chart understanding. We construct… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  10. arXiv:2409.11738  [pdf, other

    eess.IV cs.CV

    Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing

    Authors: Seongmin Hong, Jaehyeok Bae, Jongho Lee, Se Young Chun

    Abstract: Compressed sensing (CS) has emerged to overcome the inefficiency of Nyquist sampling. However, traditional optimization-based reconstruction is slow and can not yield an exact image in practice. Deep learning-based reconstruction has been a promising alternative to optimization-based reconstruction, outperforming it in accuracy and computation speed. Finding an efficient sampling method with deep… ▽ More

    Submitted 18 September, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: 30 pages, 9.8 MB, Accepted to ECCV 2024

  11. arXiv:2409.10244  [pdf, other

    cs.SI

    ES-KT-24: A Multimodal Knowledge Tracing Benchmark Dataset with Educational Game Playing Video and Synthetic Text Generation

    Authors: Dohee Kim, Unggi Lee, Sookbun Lee, Jiyeong Bae, Taekyung Ahn, Jaekwon Park, Gunho Lee, Hyeoncheol Kim

    Abstract: This paper introduces ES-KT-24, a novel multimodal Knowledge Tracing (KT) dataset for intelligent tutoring systems in educational game contexts. Although KT is crucial in adaptive learning, existing datasets often lack game-based and multimodal elements. ES-KT-24 addresses these limitations by incorporating educational game-playing videos, synthetically generated question text, and detailed game l… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 11 pages, 5 figures

  12. arXiv:2409.08018  [pdf, other

    math.AP

    Emergence of peaked singularities in the Euler-Poisson system

    Authors: Junsik Bae, Sang-Hyuck Moon, Kwan Woo

    Abstract: We consider the one-dimensional Euler-Poisson system equipped with the Boltzmann relation and provide the exact asymptotic behavior of the peaked solitary wave solutions near the peak. This enables us to study the cold ion limit of the peaked solitary waves with the sharp range of Hölder exponents. Furthermore, we provide numerical evidence for $C^1$ blow-up solutions to the pressureless Euler-Poi… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 29 pages, 10 figures

  13. arXiv:2409.07688  [pdf, other

    hep-ph

    Charged Higgs Boson Phenomenology in the Dark Z mediated Fermionic Dark Matter Model

    Authors: Kyu Jung Bae, Jinn-Ouk Gong, Dong-Won Jung, Kang Young Lee, Chaehyun Yu, Chan Beom Park

    Abstract: We study the phenomenology of the charged Higgs boson, $H^\pm$,appearing in the fermionic dark matter model mediated by the dark $Z$ boson. This model is in favor of the light dark $Z$ boson, $Z'$, and the light additional neutral Higgs boson, $h$. We find that $H^\pm \to W^\pm h$ and the $H^\pm \to W^\pm Z'$ are dominant decay channels. Thus the promising final states are trilepton signals,… ▽ More

    Submitted 19 September, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

    Comments: 12 pages, 4 figures

    Report number: APCTP Pre2024-015

  14. How to Align Large Language Models for Teaching English? Designing and Developing LLM based-Chatbot for Teaching English Conversation in EFL, Findings and Limitations

    Authors: Jaekwon Park, Jiyoung Bae, Unggi Lee, Taekyung Ahn, Sookbun Lee, Dohee Kim, Aram Choi, Yeil Jeong, Jewoong Moon, Hyeoncheol Kim

    Abstract: This study investigates the design, development, and evaluation of a Large Language Model (LLM)-based chatbot for teaching English conversations in an English as a Foreign Language (EFL) context. Employing the Design and Development Research (DDR), we analyzed needs, established design principles, and iteratively refined a chatbot through experimenting various LLMs and alignment methods. Through b… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 56 pages

  15. arXiv:2409.00844  [pdf, other

    cs.LG cs.AI cs.CL

    Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries

    Authors: Blair Yang, Fuyang Cui, Keiran Paster, Jimmy Ba, Pashootan Vaezipoor, Silviu Pitis, Michael R. Zhang

    Abstract: The rapid development and dynamic nature of large language models (LLMs) make it difficult for conventional quantitative benchmarks to accurately assess their capabilities. We propose report cards, which are human-interpretable, natural language summaries of model behavior for specific skills or topics. We develop a framework to evaluate report cards based on three criteria: specificity (ability t… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: 11 pages, 8 figures

  16. From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education

    Authors: Unggi Lee, Jiyeong Bae, Yeonji Jung, Minji Kang, Gyuri Byun, Yeonseo Lee, Dohee Kim, Sookbun Lee, Jaekwon Park, Taekyung Ahn, Gunho Lee, Hyeoncheol Kim

    Abstract: Knowledge Tracing (KT) is a critical component in online learning, but traditional approaches face limitations in interpretability and cross-domain adaptability. This paper introduces Language Model-based Code Knowledge Tracing (CodeLKT), an innovative application of Language model-based Knowledge Tracing (LKT) to programming education. CodeLKT leverages pre-trained language models to process lear… ▽ More

    Submitted 30 August, 2024; originally announced September 2024.

    Comments: 9 pages, 2 figures

  17. arXiv:2408.17242  [pdf, ps, other

    math.PR math.DS

    The random periodic solutions for McKean-Vlasov stochastic differential equations

    Authors: Jianhai Bao, Goncalo Dos Reis, Yue Wu

    Abstract: In this paper, we study well-posedness of random periodic solutions of stochastic differential equations (SDEs) of McKean-Vlasov type driven by a two-sided Brownian motion, where the random periodic behaviour is characterised by the equations' long-time behaviour. Given the well-known connection between McKean-Vlasov SDEs and interacting particle systems, we show propagation of chaos and that the… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 24 pages, no figures

  18. arXiv:2408.12349  [pdf, other

    quant-ph

    Machine-learning certification of multipartite entanglement for noisy quantum hardware

    Authors: Andreas J. C. Fuchs, Eric Brunner, Jiheon Seong, Hyeokjea Kwon, Seungchan Seo, Joonwoo Bae, Andreas Buchleitner, Edoardo G. Carnio

    Abstract: Entanglement is a fundamental aspect of quantum physics, both conceptually and for its many applications. Classifying an arbitrary multipartite state as entangled or separable -- a task referred to as the separability problem -- poses a significant challenge, since a state can be entangled with respect to many different of its partitions. We develop a certification pipeline that feeds the statisti… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 15 pages, 5 figures

  19. arXiv:2408.11062  [pdf, other

    cs.CL cs.AI

    Interactive-T2S: Multi-Turn Interactions for Text-to-SQL with Large Language Models

    Authors: Guanming Xiong, Junwei Bao, Hongfei Jiang, Yang Song, Wen Zhao

    Abstract: This study explores text-to-SQL parsing by leveraging the powerful reasoning capabilities of large language models (LLMs). Despite recent advancements, existing LLM-based methods have not adequately addressed scalability, leading to inefficiencies when processing wide tables. Furthermore, current interaction-based approaches either lack a step-by-step, interpretable SQL generation process or fail… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 15 pages, 7 figures

    ACM Class: I.2.7

  20. arXiv:2408.07857  [pdf, other

    cs.SE cs.DB

    An Exploratory Case Study of Query Plan Representations

    Authors: Jinsheng Ba, Manuel Rigger

    Abstract: In database systems, a query plan is a series of concrete internal steps to execute a query. Multiple testing approaches utilize query plans for finding bugs. However, query plans are represented in a database-specific manner, so implementing these testing approaches requires a non-trivial effort, hindering their adoption. We envision that a unified query plan representation can facilitate the imp… ▽ More

    Submitted 15 August, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

  21. arXiv:2408.07416  [pdf, other

    cs.CV

    Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space

    Authors: Hyunjee Lee, Youngsik Yun, Jeongmin Bae, Seoha Kim, Youngjung Uh

    Abstract: Understanding the 3D semantics of a scene is a fundamental problem for various scenarios such as embodied agents. While NeRFs and 3DGS excel at novel-view synthesis, previous methods for understanding their semantics have been limited to incomplete 3D understanding: their segmentation results are 2D masks and their supervision is anchored at 2D pixels. This paper revisits the problem set to pursue… ▽ More

    Submitted 18 August, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: Project page: https://hyunji12.github.io/Open3DRF

  22. arXiv:2408.06828  [pdf, other

    cs.CV

    Photometric Inverse Rendering: Shading Cues Modeling and Surface Reflectance Regularization

    Authors: Jingzhi Bao, Guanying Chen, Shuguang Cui

    Abstract: This paper addresses the problem of inverse rendering from photometric images. Existing approaches for this problem suffer from the effects of self-shadows, inter-reflections, and lack of constraints on the surface reflectance, leading to inaccurate decomposition of reflectance and illumination due to the ill-posed nature of inverse rendering. In this work, we propose a new method for neural inver… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: Project page: https://jzbao03.site/projects/PIR/

  23. arXiv:2408.05177  [pdf, other

    cs.LG

    Beyond Closure Models: Learning Chaotic-Systems via Physics-Informed Neural Operators

    Authors: Chuwei Wang, Julius Berner, Zongyi Li, Di Zhou, Jiayun Wang, Jane Bae, Anima Anandkumar

    Abstract: Accurately predicting the long-term behavior of chaotic systems is crucial for various applications such as climate modeling. However, achieving such predictions typically requires iterative computations over a dense spatiotemporal grid to account for the unstable nature of chaotic systems, which is expensive and impractical in many real-world situations. An alternative approach to such a full-res… ▽ More

    Submitted 9 October, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  24. arXiv:2408.04048  [pdf, other

    astro-ph.EP astro-ph.SR

    A Survey of Protoplanetary Disks Using the Keck/NIRC2 Vortex Coronagraph

    Authors: Nicole L. Wallack, Jean-Baptiste Ruffio, Garreth Ruane, Bin B. Ren, Jerry W. Xuan, Marion Villenave, Dimitri Mawet, Karl Stapelfeldt, Jason J. Wang, Michael C. Liu, Olivier Absil, Carlos Alvarez, Jaehan Bae, Charlotte Bond, Michael Bottom, Benjamin Calvin, Élodie Choquet, Valentin Christiaens, Therese Cook, Bruno Femenía Castellá, Carlos Gomez Gonzalez, Greta Guidi, Elsa Huby, Joel Kastner, Heather A. Knutson , et al. (12 additional authors not shown)

    Abstract: Recent Atacama Large Millimeter/submillimeter Array (ALMA) observations of protoplanetary disks in the millimeter continuum have shown a variety of radial gaps, cavities, and spiral features. These substructures may be signposts for ongoing planet formation, and therefore these systems are promising targets for direct imaging planet searches in the near-infrared. To this end, we present results fr… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 23 pages, 14 figures, 3 tables, accepted for publication in AJ

  25. Decomposed Prompting to Answer Questions on a Course Discussion Board

    Authors: Brandon Jaipersaud, Paul Zhang, Jimmy Ba, Andrew Petersen, Lisa Zhang, Michael R. Zhang

    Abstract: We propose and evaluate a question-answering system that uses decomposed prompting to classify and answer student questions on a course discussion board. Our system uses a large language model (LLM) to classify questions into one of four types: conceptual, homework, logistics, and not answerable. This enables us to employ a different strategy for answering questions that fall under different types… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 6 pages. Published at International Conference on Artificial Intelligence in Education 2023. Code repository: https://github.com/brandonjaipersaud/piazza-qabot-gpt

    Journal ref: In: Artificial Intelligence in Education. AIED 2023. Communications in Computer and Information Science, vol 1831. Springer, Cham

  26. arXiv:2407.20501  [pdf, other

    cond-mat.str-el

    Magnon Spectra of Cuprates beyond Spin Wave Theory

    Authors: Jiahui Bao, Matthias Gohlke, Jeffrey G. Rau, Nic Shannon

    Abstract: The usual starting point for understanding magnons in cuprate antiferromagnets such as La$_2$CuO$_4$ is a spin model incorporating cyclic exchange, which descends from a one-band Hubbard model, and has parameters taken from fits based on non-interacting spin wave theory. Here we explore whether this provides a reliable description of experiment, using matrix product states (MPS) to calculate magno… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  27. arXiv:2407.19653  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Unraveling the role of Ta in the phase transition of Pb(Ta1+xSe2)2 using low-temperature Raman spectroscopy

    Authors: Yu Ma, Chi Sin Tang, Xiaohui Yang, Yi Wei Ho, Jun Zhou, Wenjun Wu, Shuo Sun, Jin-Ke Bao, Dingguan Wang, Xiao Lin, Magdalena Grzeszczyk, Shijie Wang, Mark B H Breese, Chuanbing Cai, Andrew T. S. Wee, Maciej Koperski, Zhu-An Xu, Xinmao Yin

    Abstract: Phase engineering strategies in two-dimensional transition metal dichalcogenides (2D-TMDs) have garnered significant attention due to their potential applications in electronics, optoelectronics, and energy storage. Various methods, including direct synthesis, pressure control, and chemical doping, have been employed to manipulate structural transitions in 2D-TMDs. Metal intercalation emerges as a… ▽ More

    Submitted 8 August, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

  28. arXiv:2407.19468  [pdf, other

    cs.CV cs.MM

    MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability

    Authors: Buyu Liu, Kai Wang, Yansong Liu, Jun Bao, Tingting Han, Jun Yu

    Abstract: This work aims to address the multi-view perspective RGB generation from text prompts given Bird-Eye-View(BEV) semantics. Unlike prior methods that neglect layout consistency, lack the ability to handle detailed text prompts, or are incapable of generalizing to unseen view points, MVPbev simultaneously generates cross-view consistent images of different perspective views with a two-stage design, a… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM MM24

  29. arXiv:2407.18619  [pdf, ps, other

    math.AP

    Singularity formation of hydromagnetic waves in cold plasma

    Authors: Junsik Bae, Junho Choi, Bongsuk Kwon

    Abstract: We study $C^1$ blow-up of the compressible fluid model introduced by Gardner and Morikawa, which describes the dynamics of a magnetized cold plasma. We propose sufficient conditions that lead to $C^1$ blow-up. In particular, we find that smooth solutions can break down in finite time even if the gradient of initial velocity is identically zero. The density and the gradient of the velocity become u… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 7 pages, 2 figures

  30. arXiv:2407.15669  [pdf, other

    math.AP

    Delta-shock for the pressureless Euler-Poisson system

    Authors: Junsik Bae, Yunjoo Kim, Bongsuk Kwon

    Abstract: We study singularity formation for the pressureless Euler-Poisson system of cold ion dynamics. In contrast to the Euler-Poisson system with pressure, when its smooth solutions experience $C^1$ blow-up, the $L^\infty$ norm of the density becomes unbounded, which is often referred to as a delta-shock. We provide a constructive proof of singularity formation to obtain an exact blow-up profile and the… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 31 pages, 2 figures. arXiv admin note: text overlap with arXiv:2405.02557

  31. arXiv:2407.15632  [pdf, ps, other

    math.CO

    Partial Difference Sets with Denniston Parameters in Elementary Abelian $p$-Groups

    Authors: Jingjun Bao, Qing Xiang, Meng Zhao

    Abstract: Denniston \cite{D1969} constructed partial difference sets (PDS) with parameters $(2^{3m}, (2^{m+r}-2^m+2^r)(2^m-1), 2^m-2^r+(2^{m+r}-2^m+2^r)(2^r-2), (2^{m+r}-2^m+2^r)(2^r-1))$ in elementary abelian groups of order $2^{3m}$ for all $m\geq 2$ and $1 \leq r < m$. These PDS correspond to maximal arcs in the Desarguesian projective planes PG$(2, 2^m)$. Davis et al. \cite{DHJP2024} and also De Winter… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 11 pages

  32. arXiv:2407.14829  [pdf, other

    cs.CL

    Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

    Authors: Jiayu Lin, Guanrong Chen, Bojun Jin, Chenyang Li, Shutong Jia, Wancong Lin, Yang Sun, Yuhang He, Caihua Yang, Jianzhu Bao, Jipeng Wu, Wen Su, Jinglu Chen, Xinyi Li, Tianyu Chen, Mingjie Han, Shuaiwen Du, Zijian Wang, Jiyin Li, Fuzhong Suo, Hao Wang, Nuanchen Lin, Xuanjing Huang, Changjian Jiang, RuiFeng Xu , et al. (4 additional authors not shown)

    Abstract: In this paper we present the results of the AI-Debater 2023 Challenge held by the Chinese Conference on Affect Computing (CCAC 2023), and introduce the related datasets. We organize two tracks to handle the argumentative generation tasks in different scenarios, namely, Counter-Argument Generation (Track 1) and Claim-based Argument Generation (Track 2). Each track is equipped with its distinct data… ▽ More

    Submitted 24 July, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

  33. arXiv:2407.12450  [pdf, other

    physics.acc-ph hep-ex

    Interim report for the International Muon Collider Collaboration (IMCC)

    Authors: C. Accettura, S. Adrian, R. Agarwal, C. Ahdida, C. Aimé, A. Aksoy, G. L. Alberghi, S. Alden, N. Amapane, D. Amorim, P. Andreetto, F. Anulli, R. Appleby, A. Apresyan, P. Asadi, M. Attia Mahmoud, B. Auchmann, J. Back, A. Badea, K. J. Bae, E. J. Bahng, L. Balconi, F. Balli, L. Bandiera, C. Barbagallo , et al. (362 additional authors not shown)

    Abstract: The International Muon Collider Collaboration (IMCC) [1] was established in 2020 following the recommendations of the European Strategy for Particle Physics (ESPP) and the implementation of the European Strategy for Particle Physics-Accelerator R&D Roadmap by the Laboratory Directors Group [2], hereinafter referred to as the the European LDG roadmap. The Muon Collider Study (MuC) covers the accele… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: This document summarises the International Muon Collider Collaboration (IMCC) progress and status of the Muon Collider R&D programme

  34. arXiv:2407.11332  [pdf, other

    cond-mat.supr-con cond-mat.mes-hall cond-mat.mtrl-sci

    In-situ local imaging of ferromagnetism and superconductivity in RbEuFe$_4$As$_4$

    Authors: Huiyuan Man, Yusuke Iguchi, Jin-Ke Bao, Duck Young Chung, Mercouri G. Kanatzidis

    Abstract: The coexistence of superconductivity and ferromagnetism is an intrinsically interesting research focus in condensed matter physics but the study is limited by low superconducting ($T_c$) and magnetic ($T_m$) transition temperatures in related materials. Here, we used a scanning superconducting quantum interference device to image the in-situ diamagnetic and ferromagnetic responses of RbEuFe$_4$As… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 11 pages, 5 figures

    Journal ref: Nano Letters (2024)

  35. arXiv:2407.10613  [pdf, other

    physics.plasm-ph

    Global destabilization of drift-tearing mode with coupling to discretized electron drift-wave instability

    Authors: J. Bao, W. L. Zhang, Z. Lin, H. S. Cai, D. J. Liu, H. T. Chen, C. Dong, J. T. Cao, D. Li

    Abstract: The global linear behaviors of 2/1 DTM in the collisional regime are investigated based on a concisely resistive drift-MHD model. Besides DTM, extra normal modes including EDW and SAW are coupled together and destabilized in different parameter regimes by considering resistivity in this system. The EVP approach is applied for solving the eigenstate spectra with the distribution of all unstable sol… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 23 pages, 15 figues

  36. arXiv:2407.08464  [pdf, other

    cs.LG cs.AI

    TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations

    Authors: Junik Bae, Kwanyoung Park, Youngwoon Lee

    Abstract: Unsupervised goal-conditioned reinforcement learning (GCRL) is a promising paradigm for developing diverse robotic skills without external supervision. However, existing unsupervised GCRL methods often struggle to cover a wide range of states in complex environments due to their limited exploration and sparse or noisy rewards for GCRL. To overcome these challenges, we propose a novel unsupervised… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Website: https://heatz123.github.io/tldr

  37. arXiv:2407.06272  [pdf, other

    astro-ph.EP astro-ph.GA astro-ph.SR

    ALMA high-resolution observations unveil planet formation shaping molecular emission in the PDS 70 disk

    Authors: L. Rampinelli, S. Facchini, M. Leemker, J. Bae, M. Benisty, R. Teague, C. J. Law, K. I. Öberg, B. Portilla-Revelo, A. J. Cridland

    Abstract: With two directly detected protoplanets, the PDS 70 system is a unique source in which to study the complex interplay between forming planets and their natal environment. The large dust cavity carved by the two giant planets can affect the disk chemistry, and therefore the molecular emission morphology. On the other hand, chemical properties of the gas component of the disk are expected to leave a… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 25 pages, 10 figures, 3 tables, 8 figures and one table in appendix. Accepted for publication in A&A

    Journal ref: A&A 689, A65 (2024)

  38. arXiv:2407.02767  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Comparison of Short-Range Order in GeSn Grown by Molecular Beam Epitaxy and Chemical Vapor Deposition

    Authors: Shang Liu, Yunfan Liang, Haochen Zhao, Nirosh M. Eldose, Jin-Hee Bae, Omar Concepcion, Xiaochen Jin, Shunda Chen, Ilias Bikmukhametov, Austin Akey, Cory T. Cline, Alejandra Cuervo Covian, Xiaoxin Wang, Tianshu Li, Yuping Zeng, Dan Buca, Shui-Qing Yu, Gregory J. Salamo, Shengbai Zhang, Jifeng Liu

    Abstract: Atomic short-range order (SRO) in direct-bandgap GeSn for infrared photonics has recently attracted attention due to its notable impact on band structures. However, the SRO in GeSn thin films grown by different methods have hardly been compared. This paper compares SRO in GeSn thin films of similar compositions grown by molecular beam epitaxy (MBE) and chemical vapor deposition (CVD) using atom pr… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  39. arXiv:2407.01679  [pdf, other

    astro-ph.EP astro-ph.SR

    Constraints on the gas-phase C/O ratio of DR Tau's outer disk from CS, SO, and C$_2$H observations

    Authors: Jane Huang, Edwin A. Bergin, Romane Le Gal, Sean M. Andrews, Jaehan Bae, Luke Keyte, J. A. Sturm

    Abstract: Millimeter wavelength observations of Class II protoplanetary disks often display strong emission from hydrocarbons and high CS/SO values, providing evidence that the gas-phase C/O ratio commonly exceeds 1 in their outer regions. We present new NOEMA observations of CS $5-4$, SO $7_6-6_5$ and $5_6-4_5$, C$_2$H $N=3-2$, HCN $3-2$, HCO$^+$ $3-2$, and H$^{13}$CO$^+$ $3-2$ in the DR Tau protoplanetary… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted by ApJ

  40. arXiv:2406.18531  [pdf, other

    q-bio.NC

    A principled framework to assess the information-theoretic fitness of brain functional sub-circuits

    Authors: Duy Duong-Tran, Nghi Nguyen, Shizhuo Mu, Jiong Chen, Jingxuan Bao, Frederick Xu, Sumita Garai, Jose Cadena-Pico, Alan David Kaplan, Tianlong Chen, Yize Zhao, Li Shen, Joaquín Goñi

    Abstract: In systems and network neuroscience, many common practices in brain connectomic analysis are often not properly scrutinized. One such practice is mapping a predetermined set of sub-circuits, like functional networks (FNs), onto subjects' functional connectomes (FCs) without adequately assessing the information-theoretic appropriateness of the partition. Another practice that goes unchallenged is t… ▽ More

    Submitted 23 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  41. arXiv:2406.14879  [pdf, other

    quant-ph

    Improved bounds on quantum uncommon information

    Authors: Yonghae Lee, Joonwoo Bae, Hayata Yamasaki, Soojoon Lee

    Abstract: In classical information theory, channel capacity quantifies the maximum number of messages that can be reliably transmitted using shared information. An equivalent concept, termed uncommon information, represents the number of messages required to be exchanged to completely share all information in common. However, this equivalence does not extend to quantum information theory. Specifically, quan… ▽ More

    Submitted 24 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: 24 pages, 7 figures, typos were corrected and readability was improved

  42. arXiv:2406.12909  [pdf, other

    cs.LG physics.comp-ph

    Scalable Training of Trustworthy and Energy-Efficient Predictive Graph Foundation Models for Atomistic Materials Modeling: A Case Study with HydraGNN

    Authors: Massimiliano Lupo Pasini, Jong Youl Choi, Kshitij Mehta, Pei Zhang, David Rogers, Jonghyun Bae, Khaled Z. Ibrahim, Ashwin M. Aji, Karl W. Schulz, Jorda Polo, Prasanna Balaprakash

    Abstract: We present our work on developing and training scalable, trustworthy, and energy-efficient predictive graph foundation models (GFMs) using HydraGNN, a multi-headed graph convolutional neural network architecture. HydraGNN expands the boundaries of graph neural network (GNN) computations in both training scale and data diversity. It abstracts over message passing algorithms, allowing both reproduct… ▽ More

    Submitted 16 October, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 20 pages, 25 figures

    MSC Class: 68T07; 68T09 ACM Class: C.2.4; I.2.11

  43. arXiv:2406.08392  [pdf, other

    cs.CV

    FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

    Authors: Xinzhi Mu, Li Chen, Bohan Chen, Shuyang Gu, Jianmin Bao, Dong Chen, Ji Li, Yuhui Yuan

    Abstract: Recently, the application of modern diffusion-based text-to-image generation models for creating artistic fonts, traditionally the domain of professional designers, has garnered significant interest. Diverging from the majority of existing studies that concentrate on generating artistic typography, our research aims to tackle a novel and more demanding challenge: the generation of text effects for… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Project-page: https://font-studio.github.io/

  44. arXiv:2406.04158  [pdf, other

    cs.CV eess.IV

    Sparse Multi-baseline SAR Cross-modal 3D Reconstruction of Vehicle Targets

    Authors: Da Li, Guoqiang Zhao, Houjun Sun, Jiacheng Bao

    Abstract: Multi-baseline SAR 3D imaging faces significant challenges due to data sparsity. In recent years, deep learning techniques have achieved notable success in enhancing the quality of sparse SAR 3D imaging. However, previous work typically rely on full-aperture high-resolution radar images to supervise the training of deep neural networks (DNNs), utilizing only single-modal information from radar dat… ▽ More

    Submitted 8 August, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  45. arXiv:2406.02893  [pdf, other

    cs.CL

    Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task

    Authors: Unggi Lee, Jiyeong Bae, Dohee Kim, Sookbun Lee, Jaekwon Park, Taekyung Ahn, Gunho Lee, Damji Stratton, Hyeoncheol Kim

    Abstract: Knowledge Tracing (KT) is a critical task in online learning for modeling student knowledge over time. Despite the success of deep learning-based KT models, which rely on sequences of numbers as data, most existing approaches fail to leverage the rich semantic information in the text of questions and concepts. This paper proposes Language model-based Knowledge Tracing (LKT), a novel framework that… ▽ More

    Submitted 9 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures, 3 tables

  46. arXiv:2406.02212  [pdf, other

    cs.CE

    Generative Pre-Trained Diffusion Paradigm for Zero-Shot Time Series Forecasting

    Authors: Jiarui Yang, Tao Dai, Naiqi Li, Junxi Wu, Peiyuan Liu, Jinmin Li, Jigang Bao, Haigang Zhang, Shutao Xia

    Abstract: In recent years, generative pre-trained paradigms such as Large Language Models (LLMs) and Large Vision Models (LVMs) have achieved revolutionary advancements and widespread real-world applications. Particularly, the emergence of pre-trained LLMs-based temporal works, compared to previous deep model approaches, has demonstrated superior generalization and robustness, showcasing the potential of ge… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  47. arXiv:2406.00416  [pdf, other

    stat.ML cs.LG eess.SP

    Representation and De-interleaving of Mixtures of Hidden Markov Processes

    Authors: Jiadi Bao, Mengtao Zhu, Yunjie Li, Shafei Wang

    Abstract: De-interleaving of the mixtures of Hidden Markov Processes (HMPs) generally depends on its representation model. Existing representation models consider Markov chain mixtures rather than hidden Markov, resulting in the lack of robustness to non-ideal situations such as observation noise or missing observations. Besides, de-interleaving methods utilize a search-based strategy, which is time-consumi… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 13 pages, 9 figures, submitted to IEEE transactions on Signal Processing

  48. arXiv:2405.20920  [pdf, other

    physics.geo-ph

    On the viscoelastic-electromagnetic-gravitational analogy

    Authors: Jose' M. Carcione, Jing Ba

    Abstract: The analogy between electromagnetism and gravitation was achieved by linearizing the tensorial gravitational equations of general relativity and converting them into a vector form corresponding to Maxwell's electromagnetic equations. On this basis, we use the equivalence with viscoelasticity (SH waves) and propose a theory of gravitational waves. We add a damping term to the differential equations… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 11 figures

  49. arXiv:2405.18710  [pdf, other

    cs.LG cs.AI

    To FP8 and Back Again: Quantifying the Effects of Reducing Precision on LLM Training Stability

    Authors: Joonhyung Lee, Jeongin Bae, Byeongwook Kim, Se Jung Kwon, Dongsoo Lee

    Abstract: The massive computational costs associated with large language model (LLM) pretraining have spurred great interest in reduced-precision floating-point representations to accelerate the process. As a result, the BrainFloat16 (BF16) precision has become the de facto standard for LLM training, with hardware support included in recent accelerators. This trend has gone even further in the latest proces… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  50. arXiv:2405.13954  [pdf, other

    cs.LG cs.AI cs.CL

    What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions

    Authors: Sang Keun Choe, Hwijeen Ahn, Juhan Bae, Kewen Zhao, Minsoo Kang, Youngseog Chung, Adithya Pratapa, Willie Neiswanger, Emma Strubell, Teruko Mitamura, Jeff Schneider, Eduard Hovy, Roger Grosse, Eric Xing

    Abstract: Large language models (LLMs) are trained on a vast amount of human-written data, but data providers often remain uncredited. In response to this issue, data valuation (or data attribution), which quantifies the contribution or value of each data to the model output, has been discussed as a potential solution. Nevertheless, applying existing data valuation methods to recent LLMs and their vast trai… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.