Showing 1–50 of 678 results for author: Lee, N

  1. arXiv:2410.12785  [pdf, other]

    cs.LG

    Metal Price Spike Prediction via a Neurosymbolic Ensemble Approach

    Authors: Nathaniel Lee, Noel Ngu, Harshdeep Singh Sahdev, Pramod Motaganahall, Al Mehdi Saadat Chowdhury, Bowen Xi, Paulo Shakarian

    Abstract: Predicting price spikes in critical metals such as Cobalt, Copper, Magnesium, and Nickel is crucial for mitigating economic risks associated with global trends like the energy transition and reshoring of manufacturing. While traditional models have focused on regression-based approaches, our work introduces a neurosymbolic ensemble framework that integrates multiple neural models with symbolic err…

    Submitted 16 October, 2024; originally announced October 2024.

  2. arXiv:2410.10539  [pdf]

    cond-mat.str-el

    Incommensurate Transverse Peierls Transition

    Authors: F. Z. Yang, K. F. Luo, Weizhe Zhang, Xiaoyu Guo, W. R. Meier, H. Ni, H. X. Li, P. Mercado Lozano, G. Fabbris, A. H. Said, C. Nelson, T. T. Zhang, A. F. May, M. A. McGuire, R. Juneja, L. Lindsay, H. N. Lee, J. -M. Zuo, M. F. Chi, X. Dai, Liuyan Zhao, H. Miao

    Abstract: In one-dimensional quantum materials, conducting electrons and the underlying lattices can undergo a spontaneous translational symmetry breaking, known as Peierls transition. For nearly a century, the Peierls transition has been understood within the paradigm of electron-electron interactions mediated by longitudinal acoustic phonons. This classical picture has recently been revised in topological…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Supplementary materials are available upon request

  3. arXiv:2410.07168  [pdf, other]

    cs.CL cs.SD eess.AS

    Sylber: Syllabic Embedding Representation of Speech from Raw Audio

    Authors: Cheol Jun Cho, Nicholas Lee, Akshat Gupta, Dhruv Agarwal, Ethan Chen, Alan W Black, Gopala K. Anumanchipalli

    Abstract: Syllables are compositional units of spoken language that play a crucial role in human speech perception and production. However, current neural speech representations lack structure, resulting in dense token sequences that are costly to process. To bridge this gap, we propose a new model, Sylber, that produces speech representations with clean and robust syllabic structure. Specifically, we propo…

    Submitted 9 October, 2024; originally announced October 2024.

  4. arXiv:2410.06238  [pdf, other]

    cs.LG cs.AI cs.CL

    EVOLvE: Evaluating and Optimizing LLMs For Exploration

    Authors: Allen Nie, Yi Su, Bo Chang, Jonathan N. Lee, Ed H. Chi, Quoc V. Le, Minmin Chen

    Abstract: Despite their success in many domains, large language models (LLMs) remain under-studied in scenarios requiring optimal decision-making under uncertainty. This is crucial as many real-world applications, ranging from personalized recommendations to healthcare interventions, demand that LLMs not only predict but also actively learn to make optimal decisions through exploration. In this work, we mea…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 28 pages

  5. arXiv:2410.04358  [pdf]

    physics.med-ph

    Enabling Clinical Use of Linear Energy Transfer in Proton Therapy for Head and Neck Cancer -- A Review of Implications for Treatment Planning and Adverse Events Study

    Authors: Jingyuan Chen, Yunze Yang, Hongying Feng, Chenbin Liu, Lian Zhang, Jason M. Holmes, Zhengliang Liu, Haibo Lin, Tianming Liu, Charles B. Simone II, Nancy Y. Lee, Steven E. Frank, Daniel J. Ma, Samir H. Patel, Wei Liu

    Abstract: Proton therapy offers significant advantages due to its unique physical and biological properties, particularly the Bragg peak, enabling precise dose delivery to tumors while sparing healthy tissues. However, the clinical implementation is challenged by the oversimplification of the relative biological effectiveness (RBE) as a fixed value of 1.1, which does not account for the complex interplay be…

    Submitted 6 October, 2024; originally announced October 2024.

  6. arXiv:2410.03336  [pdf, other]

    hep-ph

    Master integrals for $e^{+}e^{-}\rightarrow2γ$ process at large energies and angles

    Authors: Roman N. Lee, Vyacheslav A. Stotsky

    Abstract: We calculate two-loop massive master integrals for $e^{+}e^{-}\rightarrow2γ$ in terms of generalized power series with respect to electron mass. The coefficients of this series are expressed via Goncharov's polylogarithms. Our approach exploits a number of modern multiloop methods: IBP reduction, differential equations for master integrals, Frobenius method, reduction to $ε$-form, and DRA method.

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 13 pages

  7. arXiv:2409.14617  [pdf, other]

    cs.LG q-bio.BM q-bio.QM

    Protein-Mamba: Biological Mamba Models for Protein Function Prediction

    Authors: Bohao Xu, Yingzhou Lu, Yoshitaka Inoue, Namkyeong Lee, Tianfan Fu, Jintai Chen

    Abstract: Protein function prediction is a pivotal task in drug discovery, significantly impacting the development of effective and safe therapeutics. Traditional machine learning models often struggle with the complexity and variability inherent in predicting protein functions, necessitating more sophisticated approaches. In this work, we introduce Protein-Mamba, a novel two-stage model that leverages both…

    Submitted 22 September, 2024; originally announced September 2024.

  8. arXiv:2409.11402  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG cs.MM

    NVLM: Open Frontier-Class Multimodal LLMs

    Authors: Wenliang Dai, Nayeon Lee, Boxin Wang, Zhuoling Yang, Zihan Liu, Jon Barker, Tuomas Rintamaki, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping

    Abstract: We introduce NVLM 1.0, a family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models (e.g., Llama 3-V 405B and InternVL 2). Remarkably, NVLM 1.0 shows improved text-only performance over its LLM backbone after multimodal training. In terms of model desi…

    Submitted 17 September, 2024; originally announced September 2024.

  9. arXiv:2409.10400  [pdf, other]

    astro-ph.IM astro-ph.HE

    Sherpa: An Open Source Python Fitting Package

    Authors: Aneta Siemiginowska, Douglas Burke, Hans Moritz Günther, Nicholas P. Lee, Warren McLaughlin, David A. Principe, Harlan Cheer, Antonella Fruscione, Omar Laurino, Jonathan McDowell, Marie Terrell

    Abstract: We present an overview of Sherpa, an open source Python project, and discuss its development history, broad design concepts and capabilities. Sherpa contains powerful tools for combining parametric models into complex expressions that can be fit to data using a variety of statistics and optimization methods. It is easily extensible to include user-defined models, statistics, and optimization metho…

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: accepted by the Astrophysical Journal Supplement Series, 12 pages, 11 figures

  10. arXiv:2409.09647  [pdf, other]

    cs.SD cs.AI eess.AS

    Self-supervised Learning for Acoustic Few-Shot Classification

    Authors: Jingyong Liang, Bernd Meyer, Issac Ning Lee, Thanh-Toan Do

    Abstract: Labelled data are limited and self-supervised learning is one of the most important approaches for reducing labelling requirements. While it has been extensively explored in the image domain, it has so far not received the same amount of attention in the acoustic domain. Yet, reducing labelling is a key requirement for many acoustic applications. Specifically in bioacoustic, there are rarely suffi…

    Submitted 15 September, 2024; originally announced September 2024.

  11. arXiv:2409.04002  [pdf, ps, other]

    cs.IT eess.SP

    Low-Earth Orbit Satellite Network Analysis: Coverage under Distance-Dependent Shadowing

    Authors: Jinseok Choi, Jeonghun Park, Junse Lee, Namyoon Lee

    Abstract: This paper offers a thorough analysis of the coverage performance of Low Earth Orbit (LEO) satellite networks using a strongest satellite association approach, with a particular emphasis on shadowing effects modeled through a Poisson point process (PPP)-based network framework. We derive an analytical expression for the coverage probability, which incorporates key system parameters and a distance-…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 13 pages, 10 figures

  12. arXiv:2409.03483  [pdf, other]

    hep-th

    Defects and type D relativistic Toda lattice for some 5d gauge theories

    Authors: Kimyeong Lee, Norton Lee

    Abstract: We perform folding on the ADHM construction of the instanton moduli space from $SU$ to $SO$ group. A Young diagram description for the $SO$ instanton is obtained after modifying the real and complex moment maps of the ADHM data. We study the Bethe gauge correspondence between type D relativistic Toda lattice and 5d $\mathcal{N}=1$ folded theory. In particular we prove that the regular monodromy de…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 28+16 pages, 2 figures

    Report number: CGP24011

  13. arXiv:2409.02094  [pdf, other]

    cs.SE cs.PL

    Software Verification with CPAchecker 3.0: Tutorial and User Guide (Extended Version)

    Authors: Daniel Baier, Dirk Beyer, Po-Chun Chien, Marie-Christine Jakobs, Marek Jankola, Matthias Kettl, Nian-Ze Lee, Thomas Lemberger, Marian Lingsch-Rosenfeld, Henrik Wachowitz, Philipp Wendler

    Abstract: This tutorial provides an introduction to CPAchecker for users. CPAchecker is a flexible and configurable framework for software verification and testing. The framework provides many abstract domains, such as BDDs, explicit values, intervals, memory graphs, and predicates, and many program-analysis and model-checking algorithms, such as abstract interpretation, bounded model checking, Impact, inte…

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 39 pages, 17 figures, 6 tables

    ACM Class: D.2.4; F.3.1; D.3.1; F.4.3

  14. arXiv:2409.00608  [pdf, other]

    cs.CL cs.LG

    TinyAgent: Function Calling at the Edge

    Authors: Lutfi Eren Erdogan, Nicholas Lee, Siddharth Jha, Sehoon Kim, Ryan Tabrizi, Suhong Moon, Coleman Hooper, Gopala Anumanchipalli, Kurt Keutzer, Amir Gholami

    Abstract: Recent large language models (LLMs) have enabled the development of advanced agentic systems that can integrate various tools and APIs to fulfill user queries through function calling. However, the deployment of these LLMs on the edge has not been explored since they typically require cloud-based infrastructure due to their substantial model size and computational demands. To this end, we present…

    Submitted 21 October, 2024; v1 submitted 1 September, 2024; originally announced September 2024.

    Comments: EMNLP 2024 Demo

  15. arXiv:2408.13249  [pdf, other]

    cond-mat.mtrl-sci physics.app-ph

    Isolation and characterization of atomically thin mica phyllosilicates

    Authors: Kristine L. Haley, Noah F. Lee, Vergil M. Schreiber, Nicholas T. Pereira, Randy M. Sterbentz, Timothy Y. Chung, Joshua O. Island

    Abstract: One of the roadblocks to employing two-dimensional (2D) materials in next generation devices is the lack of high quality insulators. Insulating layered materials with inert and atomically flat surfaces are ideal for high performance transistors and this has been exemplified with commonly used boron nitride. While the list of insulating 2D materials is limited, the earth-abundant phyllosilicates ar…

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: 18 pages, 4 figures

  16. arXiv:2408.12145  [pdf, ps, other]

    eess.SP

    Spectrum Sharing Between Low Earth Orbit Satellite and Terrestrial Networks: A Stochastic Geometry Perspective Analysis

    Authors: Daeun Kim, Jeonghun Park, Jinseok Choi, Namyoon Lee

    Abstract: Low Earth orbit (LEO) satellite networks with mega constellations have the potential to provide 5G and beyond services ubiquitously. However, these networks may introduce mutual interference to both satellite and terrestrial networks, particularly when sharing spectrum resources. In this paper, we present a system-level performance analysis to address these interference issues using the tool of st…

    Submitted 22 August, 2024; originally announced August 2024.

  17. arXiv:2408.07660  [pdf, other]

    stat.ML cs.LG

    Off-Policy Reinforcement Learning with High Dimensional Reward

    Authors: Dong Neuck Lee, Michael R. Kosorok

    Abstract: Conventional off-policy reinforcement learning (RL) focuses on maximizing the expected return of scalar rewards. Distributional RL (DRL), in contrast, studies the distribution of returns with the distributional Bellman operator in a Euclidean space, leading to highly flexible choices for utility. This paper establishes robust theoretical foundations for DRL. We prove the contraction property of th…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 24 pages, 12 figures

    MSC Class: 68T05; 46B09 (Primary) 46B06 (Secondary)

  18. arXiv:2408.05422  [pdf, other]

    cs.IT eess.SP

    Sparsely Pre-transformed Polar Codes for Low-Latency SCL Decoding

    Authors: Geon Choi, Namyoon Lee

    Abstract: Deep polar codes, employing multi-layered polar kernel pre-transforms in series, are recently introduced variants of pre-transformed polar codes. These codes have demonstrated the ability to reduce the number of minimum weight codewords, thereby closely achieving finite-block length capacity with successive cancellation list (SCL) decoders in certain scenarios. However, when the list size of the S…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 13 pages

  19. Debiased Graph Poisoning Attack via Contrastive Surrogate Objective

    Authors: Kanghoon Yoon, Yeonjun In, Namkyeong Lee, Kibum Kim, Chanyoung Park

    Abstract: Graph neural networks (GNN) are vulnerable to adversarial attacks, which aim to degrade the performance of GNNs through imperceptible changes on the graph. However, we find that in fact the prevalent meta-gradient-based attacks, which utilizes the gradient of the loss w.r.t the adjacency matrix, are biased towards training nodes. That is, their meta-gradient is determined by a training procedure o…

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 9 pages. Proceeding ACM International Conference on Information and Knowledge Management (CIKM 2024) Proceeding

  20. arXiv:2407.15551  [pdf, other]

    cs.SE cs.PL

    MoXIchecker: An Extensible Model Checker for MoXI

    Authors: Salih Ates, Dirk Beyer, Po-Chun Chien, Nian-Ze Lee

    Abstract: MoXI is a new intermediate verification language introduced in 2024 to promote the standardization and open-source implementations for symbolic model checking by extending the SMT-LIB 2 language with constructs to define state-transition systems. The tool suite of MoXI provides a translator from MoXI to Btor2, which is a lower-level intermediate language for hardware verification, and a translatio…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 13 pages, 6 figures, 2 tables

    ACM Class: D.2.4; F.3.1; D.3.1; F.4.3

  21. arXiv:2407.14933  [pdf, other]

    cs.CL cs.AI cs.LG

    Consent in Crisis: The Rapid Decline of the AI Data Commons

    Authors: Shayne Longpre, Robert Mahari, Ariel Lee, Campbell Lund, Hamidah Oderinwale, William Brannon, Nayan Saxena, Naana Obeng-Marnu, Tobin South, Cole Hunter, Kevin Klyman, Christopher Klamm, Hailey Schoelkopf, Nikhil Singh, Manuel Cherep, Ahmad Anis, An Dinh, Caroline Chitongo, Da Yin, Damien Sileo, Deividas Mataciunas, Diganta Misra, Emad Alghamdi, Enrico Shippole, Jianguo Zhang, et al. (24 additional authors not shown)

    Abstract: General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma. To our knowledge, we conduct the first, large-scale, longitudinal audit of the consent protocols for the web domains underlying AI training corpora. Our audit of 14,000 web domains provides an expansive view of crawlable web data and how co…

    Submitted 24 July, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

    Comments: 41 pages (13 main), 5 figures, 9 tables

  22. arXiv:2407.12503  [pdf, other]

    hep-th hep-ph

    Polylogarithmic functions with prescribed branching locus and linear relations between them

    Authors: Roman N. Lee

    Abstract: We consider the problem of finding the set of classical polylogarithmic functions $\text{Li}_n$ with branching locus determined by the solution of $p_1\cdot p_2\cdot \ldots \cdot p_n=0$, where $p_1,\ldots, p_n$ are irreducible polynomials of several variables. We present an algorithm of constructing a complete set of possible arguments of $\text{Li}_n$ functions. The corresponding Mathematica code…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 7 pages

  23. arXiv:2407.10542  [pdf, other]

    cs.CV cs.AI

    3D Geometric Shape Assembly via Efficient Point Cloud Matching

    Authors: Nahyuk Lee, Juhong Min, Junha Lee, Seungwook Kim, Kanghee Lee, Jaesik Park, Minsu Cho

    Abstract: Learning to assemble geometric shapes into a larger target structure is a pivotal task in various practical applications. In this work, we tackle this problem by establishing local correspondences between point clouds of part shapes in both coarse- and fine-levels. To this end, we introduce Proxy Match Transform (PMT), an approximate high-order feature transform layer that enables reliable matchin…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted to ICML 2024

  24. arXiv:2407.10461  [pdf, ps, other]

    cs.IT

    Multibeam Satellite Communications with Massive MIMO: Asymptotic Performance Analysis and Design Insights

    Authors: Seyong Kim, Jinseok Choi, Wonjae Shin, Namyoon Lee, Jeonghun Park

    Abstract: To achieve high performance without substantial overheads associated with channel state information (CSI) of ground users, we consider a fixed-beam precoding approach, where a satellite forms multiple fixed-beams without relying on CSI, then select a suitable user set for each beam. Upon this precoding method, we put forth a satellite equipped with massive multiple-input multiple-output (MIMO), by…

    Submitted 15 July, 2024; originally announced July 2024.

  25. arXiv:2407.09043  [pdf, other]

    cs.AI

    Vision Language Model is NOT All You Need: Augmentation Strategies for Molecule Language Models

    Authors: Namkyeong Lee, Siddhartha Laghuvarapu, Chanyoung Park, Jimeng Sun

    Abstract: Recently, there has been a growing interest among researchers in understanding molecules and their textual descriptions through molecule language models (MoLM). However, despite some early promising developments, the advancement of MoLM still trails significantly behind that of vision language models (VLM). This is because unique challenges exist apart from VLM in the field of MoLM due to 1) a lim…

    Submitted 23 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: CIKM 2024 / ACL 2024 Workshop on Languages and Molecule

  26. arXiv:2406.15524  [pdf, other]

    cs.CL cs.LG

    Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization

    Authors: Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee

    Abstract: This work suggests fundamentally rethinking the current practice of pruning large language models (LLMs). The way it is done is by divide and conquer: split the model into submodels, sequentially prune them, and reconstruct predictions of the dense counterparts on small calibration data one at a time; the final model is obtained simply by putting the resulting sparse submodels together. While this…

    Submitted 10 October, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: EMNLP 2024 main

  27. arXiv:2406.09948  [pdf, other]

    cs.CL

    BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

    Authors: Junho Myung, Nayeon Lee, Yi Zhou, Jiho Jin, Rifki Afina Putri, Dimosthenis Antypas, Hsuvas Borkakoty, Eunsu Kim, Carla Perez-Almendros, Abinew Ali Ayele, Víctor Gutiérrez-Basulto, Yazmín Ibáñez-García, Hwaran Lee, Shamsuddeen Hassan Muhammad, Kiwoong Park, Anar Sabuhi Rzayev, Nina White, Seid Muhie Yimam, Mohammad Taher Pilehvar, Nedjma Ousidhoum, Jose Camacho-Collados, Alice Oh

    Abstract: Large language models (LLMs) often lack culture-specific knowledge of daily life, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural sensitivities are limited to a single language or collected from online sources such as Wikipedia, which do not reflect the mundane everyday lifestyles of diverse regions. That is, information about the food…

    Submitted 14 June, 2024; originally announced June 2024.

  28. arXiv:2406.06424  [pdf, other]

    cs.CV

    Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

    Authors: Jiwoo Hong, Sayak Paul, Noah Lee, Kashif Rasul, James Thorne, Jongheon Jeong

    Abstract: Modern alignment techniques based on human preferences, such as RLHF and DPO, typically employ divergence regularization relative to the reference model to ensure training stability. However, this often limits the flexibility of models during alignment, especially when there is a clear distributional discrepancy between the preference data and the reference model. In this paper, we focus on the al…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Preprint

  29. arXiv:2406.05761  [pdf, other]

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  30. arXiv:2406.00925  [pdf, other]

    hep-th

    Dimers for Type D Relativistic Toda Model

    Authors: Kimyeong Lee, Norton Lee

    Abstract: We construct dimer graphs for type D relativistic Toda models by introducing impurities to the $Y^{2N,0}$ square dimer graphs. By properly placing the impurities and change of canonical variables assigned to the 1-loops on the dimer graph, we introduce the "folding" of the graphs and get the type D relativistic Toda lattice Hamiltonian and monodromy matrix.

    Submitted 2 September, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 25+6 pages, 14 figures, add citation

    Report number: KIAS-P24038, CGP24008

  31. arXiv:2405.08614  [pdf, other]

    eess.SP

    FDD Massive MIMO: How to Optimally Combine UL Pilot and Limited DL CSI Feedback?

    Authors: Jungyeon Kim, Jinseok Choi, Jeonghun Park, Ahmed Alkhateeb, Namyoon Lee

    Abstract: In frequency-division duplexing (FDD) multiple-input multiple-output (MIMO) systems, obtaining accurate downlink channel state information (CSI) for precoding is vastly challenging due to the tremendous feedback overhead with the growing number of antennas. Utilizing uplink pilots for downlink CSI estimation is a promising approach that can eliminate CSI feedback. However, the downlink CSI estimat…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 13 pages, 10 figures

  32. arXiv:2404.18470  [pdf, other]

    cs.CE cs.AI cs.CL q-fin.RM q-fin.TR

    ECC Analyzer: Extract Trading Signal from Earnings Conference Calls using Large Language Model for Stock Performance Prediction

    Authors: Yupeng Cao, Zhi Chen, Qingyun Pei, Nathan Jinseok Lee, K. P. Subbalakshmi, Papa Momar Ndiaye

    Abstract: In the realm of financial analytics, leveraging unstructured data, such as earnings conference calls (ECCs), to forecast stock volatility is a critical challenge that has attracted both academics and investors. While previous studies have used multimodal deep learning-based models to obtain a general view of ECCs for volatility predicting, they often fail to capture detailed, complex information.…

    Submitted 29 August, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 9 pages, 1 figures, 2 tables

  33. arXiv:2404.14276  [pdf, other]

    stat.ML cs.LG

    A Bayesian Approach for Prioritising Driving Behaviour Investigations in Telematic Auto Insurance Policies

    Authors: Mark McLeod, Bernardo Perez-Orozco, Nika Lee, Davide Zilli

    Abstract: Automotive insurers increasingly have access to telematic information via black-box recorders installed in the insured vehicle, and wish to identify undesirable behaviour which may signify increased risk or uninsured activities. However, identification of such behaviour with machine learning is non-trivial, and results are far from perfect, requiring human investigation to verify suspected cases.…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: International Congress of Actuaries (2023)

  34. arXiv:2404.09959  [pdf, other]

    hep-ph hep-ex

    NNLO QCD corrections to polarized semi-inclusive DIS

    Authors: Saurav Goyal, Roman N. Lee, Sven-Olaf Moch, Vaibhav Pathak, Narayan Rana, V. Ravindran

    Abstract: Polarized semi-inclusive deep-inelastic scattering (SIDIS) is a key process in the quest for a resolution of the proton spin puzzle. We present the complete results for the polarized SIDIS process at next-to-next-to-leading order (NNLO) in perturbative quantum chromodynamics. Our analytical results include all partonic channels for the scattering of polarized leptons off hadrons and a spin-average…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 6 pages, 2 figures; 1 ancillary file

  35. Magnetic fields from small-scale primordial perturbations

    Authors: Nanoom Lee, Yacine Ali-Haimoud

    Abstract: Weak magnetic fields must have existed in the early Universe, as they were sourced by the cross product of electron density and temperature gradients through the Biermann-battery mechanism. In this paper we calculate the magnetic fields generated at cosmic dawn by a variety of small-scale primordial perturbations, carefully computing the evolution of electron density and temperature fluctuations,…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

  36. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  37. Generalized Calogero-Moser system and supergroup gauge origami

    Authors: Taro Kimura, Norton Lee

    Abstract: We study the integrability and the Bethe/Gauge correspondence of the Generalized Calogero-Moser system proposed by Berntson, Langmann and Lenells which we call the elliptic quadruple Calogero-Moser system (eqCM). We write down the Dunkl operators which give commuting Hamiltonians of the quantum integrable system. We identify the gauge theory in correspondence is a supergroup version of the gauge o…

    Submitted 30 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 28+4 pages. hyperlink fixed, add reference. arXiv admin note: text overlap with arXiv:1908.04928

    Report number: CGP24006

    Journal ref: Nucl.Phys.B1005(2024)116604

  38. arXiv:2403.18932  [pdf, other]

    cs.CL cs.AI

    Measuring Political Bias in Large Language Models: What Is Said and How It Is Said

    Authors: Yejin Bang, Delong Chen, Nayeon Lee, Pascale Fung

    Abstract: We propose to measure political bias in LLMs by analyzing both the content and style of their generated content regarding political issues. Existing benchmarks and measures focus on gender and racial biases. However, political bias exists in LLMs and can lead to polarization and other harms in downstream applications. In order to provide transparency to users, we advocate that there should be fine…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 16 pages

  39. arXiv:2403.16372  [pdf, other]

    cs.LG cs.DC eess.SP

    SignSGD with Federated Voting

    Authors: Chanho Park, H. Vincent Poor, Namyoon Lee

    Abstract: Distributed learning is commonly used for accelerating model training by harnessing the computational capabilities of multiple-edge devices. However, in practical applications, the communication delay emerges as a bottleneck due to the substantial information exchange required between workers and a central parameter server. SignSGD with majority voting (signSGD-MV) is an effective distributed lear…

    Submitted 24 March, 2024; originally announced March 2024.

  40. arXiv:2403.15692  [pdf, other]

    cs.IT eess.SP

    Block Orthogonal Sparse Superposition Codes for $ \sf{L}^3 $ Communications: Low Error Rate, Low Latency, and Low Power Consumption

    Authors: Donghwa Han, Bowhyung Lee, Min Jang, Donghun Lee, Seho Myung, Namyoon Lee

    Abstract: Block orthogonal sparse superposition (BOSS) code is a class of joint coded modulation methods, which can closely achieve the finite-blocklength capacity with a low-complexity decoder at a few coding rates under Gaussian channels. However, for fading channels, the code performance degrades considerably because coded symbols experience different channel fading effects. In this paper, we put forth n…

    Submitted 22 March, 2024; originally announced March 2024.

  41. arXiv:2403.15042  [pdf, other]

    cs.CL

    LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

    Authors: Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

    Abstract: Pretrained large language models (LLMs) are currently state-of-the-art for solving the vast majority of natural language processing tasks. While many real-world applications still require fine-tuning to reach satisfactory levels of performance, many of them are in the low-data regime, making fine-tuning challenging. To address this, we propose LLM2LLM, a targeted and iterative data augmentation st…

    Submitted 13 July, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: ACL 2024

  42. arXiv:2403.11762  [pdf, other]

    cs.IT eess.SP

    Full-Duplex MU-MIMO Systems with Coarse Quantization: How Many Bits Do We Need?

    Authors: Seunghyeong Yoo, Seokjun Park, Mintaek Oh, Namyoon Lee, Jinseok Choi

    Abstract: This paper investigates full-duplex (FD) multi-user multiple-input multiple-output (MU-MIMO) system design with coarse quantization. We first analyze the impact of self-interference (SI) on quantization in FD single-input single-output systems. The analysis elucidates that the minimum required number of analog-to-digital converter (ADC) bits is logarithmically proportional to the ratio of total re…

    Submitted 18 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  43. arXiv:2403.11096  [pdf, other]

    eess.SP

    Modeling and Coverage Analysis of K-Tier Integrated Satellite-Terrestrial Downlink Networks

    Authors: Jungbin Yim, Jeonghun Park, Namyoon Lee

    Abstract: Integrated satellite-terrestrial networks (ISTNs) can significantly expand network coverage while diminishing reliance on terrestrial infrastructure. Despite the enticing potential of ISTNs, there is no comprehensive mathematical performance analysis framework for these emerging networks. In this paper, we introduce a tractable approach to analyze the downlink coverage performance of multi-tier IS…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 13 pages, 9 figures

  44. arXiv:2403.11094  [pdf, other]

    eess.SP

    Nonlinear Self-Interference Cancellation With Learnable Orthonormal Polynomials for Full-Duplex Wireless Systems

    Authors: Hyowon Lee, Jungyeon Kim, Geon Choi, Ian P. Roberts, Jinseok Choi, Namyoon Lee

    Abstract: Nonlinear self-interference cancellation (SIC) is essential for full-duplex communication systems, which can offer twice the spectral efficiency of traditional half-duplex systems. The challenge of nonlinear SIC is similar to the classic problem of system identification in adaptive filter theory, whose crux lies in identifying the optimal nonlinear basis functions for a nonlinear system. This beco…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 13 pages, total 16 figures

  45. arXiv:2403.07821  [pdf, other]

    cs.SE

    Augmenting Interpolation-Based Model Checking with Auxiliary Invariants (Extended Version)

    Authors: Dirk Beyer, Po-Chun Chien, Nian-Ze Lee

    Abstract: Software model checking is a challenging problem, and generating relevant invariants is a key factor in proving the safety properties of a program. Program invariants can be obtained by various approaches, including lightweight procedures based on data-flow analysis and intensive techniques using Craig interpolation. Although data-flow analysis runs efficiently, it often produces invariants that a…

    Submitted 12 March, 2024; originally announced March 2024.

  46. arXiv:2403.07691  [pdf, other]

    cs.CL cs.AI

    ORPO: Monolithic Preference Optimization without Reference Model

    Authors: Jiwoo Hong, Noah Lee, James Thorne

    Abstract: While recent preference alignment algorithms for language models have demonstrated promising results, supervised fine-tuning (SFT) remains imperative for achieving successful convergence. In this paper, we study the crucial role of SFT within the context of preference alignment, emphasizing that a minor penalty for the disfavored generation style is sufficient for preference-aligned SFT. Building…

    Submitted 14 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Preprint

  47. arXiv:2403.05389  [pdf, other]

    physics.chem-ph

    Multi-reference coupled cluster theory using the normal ordered exponential ansatz

    Authors: Alexander Gunasekera, Nicholas Lee, David P. Tew

    Abstract: Properly spin-adapted coupled-cluster theory for general open-shell configurations remains an active area of research in electronic structure theory. In this contribution we examine Lindgren's normal-ordered exponential ansatz to correlate specific spin states using spin-free excitation operators, with the aid of automatic equation generation software. We present an intermediately normalised and s…

    Submitted 30 September, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  48. arXiv:2402.13889  [pdf, other]

    hep-th math-ph math.DG math.QA nlin.SI

    Bispectral duality and separation of variables from surface defect transition

    Authors: Saebyeok Jeong, Norton Lee

    Abstract: We study two types of surface observables $-$ the $\mathbf{Q}$-observables and the $\mathbf{H}$-observables $-$ of the 4d $\mathcal{N}=2$ $A_1$-quiver $U(N)$ gauge theory obtained by coupling a 2d $\mathcal{N}=(2,2)$ gauged linear sigma model. We demonstrate that the transition between the two surface defects manifests as a Fourier transformation between the surface observables. Utilizing the resu…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 62+11 pages; 10 figures

    Report number: CERN-TH-2024-024, CGP24003

  49. arXiv:2402.13888  [pdf, other]

    hep-th math-ph math.DG math.QA nlin.SI

    di-Langlands correspondence and extended observables

    Authors: Saebyeok Jeong, Norton Lee, Nikita Nekrasov

    Abstract: We explore the $\textit{difference Langlands correspondence}$ using the four dimensional ${\mathcal{N}}=2$ super-QCD. Surface defects and surface observables play the crucial role. As an application, we give the first construction of the full set of quantum integrals, i.e. commuting differential operators, such that the partition function of the so-called regular monodromy surface defect is their…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 50+11 pages

    Report number: CERN-TH-2023-220, CGP24002

  50. arXiv:2402.09903  [pdf, ps, other]

    math.CO

    Enumeration of multiplex juggling card sequences using generalized q-derivatives

    Authors: Yumin Cho, Jaehyun Kim, Jang Soo Kim, Nakyung Lee

    Abstract: In 2019, Butler, Choi, Kim, and Seo introduced a new type of juggling card that represents multiplex juggling patterns in a natural bijective way. They conjectured a formula for the generating function for the number of multiplex juggling cards with capacity 2. In this paper we prove their conjecture. More generally, we find an explicit formula for the generating function with any capacity. We als…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 17 pages, 4 figures