Skip to main content

Showing 101–150 of 1,068 results for author: Hong, J

  1. arXiv:2403.00261  [pdf, other

    cs.CV

    Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person Re-identification

    Authors: Jiahao Hong, Jialong Zuo, Chuchu Han, Ruochen Zheng, Ming Tian, Changxin Gao, Nong Sang

    Abstract: Recent unsupervised person re-identification (re-ID) methods achieve high performance by leveraging fine-grained local context. These methods are referred to as part-based methods. However, most part-based methods obtain local contexts through horizontal division, which suffer from misalignment due to various human poses. Additionally, the misalignment of semantic information in part features rest… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  2. arXiv:2402.19282  [pdf, other

    cs.CL

    WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

    Authors: Jiantao Qiu, Haijun Lv, Zhenjiang Jin, Rui Wang, Wenchang Ning, Jia Yu, ChaoBin Zhang, Zhenxiang Li, Pei Chu, Yuan Qu, Jin Shi, Lindong Lu, Runyu Peng, Zhiyuan Zeng, Huanze Tang, Zhikai Lei, Jiawei Hong, Keyu Chen, Zhaoye Fei, Ruiliang Xu, Wei Li, Zhongying Tu, Lin Dahua, Yu Qiao, Hang Yan , et al. (1 additional authors not shown)

    Abstract: This paper presents WanJuan-CC, a safe and high-quality open-sourced English webtext dataset derived from Common Crawl data. The study addresses the challenges of constructing large-scale pre-training datasets for language models, which require vast amounts of high-quality data. A comprehensive process was designed to handle Common Crawl data, including extraction, heuristic rule filtering, fuzzy… ▽ More

    Submitted 17 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  3. arXiv:2402.18076  [pdf, other

    eess.SY

    Online Ecological Gearshift Strategy via Neural Network with Soft-Argmax Operator

    Authors: Xi Luo, Shiying Dong, Jinlong Hong, Bingzhao Gao, Hong Chen

    Abstract: This paper presents a neural network optimizer with soft-argmax operator to achieve an ecological gearshift strategy in real-time. The strategy is reformulated as the mixed-integer model predictive control (MIMPC) problem to minimize energy consumption. Then the outer convexification is introduced to transform integer variables into relaxed binary controls. To approximate binary solutions properly… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 6 pages, 5 figures, submitted to 8th IFAC Conference on Nonlinear Model Predictive Control

  4. arXiv:2402.14573  [pdf

    cond-mat.mtrl-sci

    Local Manipulation of Skyrmion Lattice in Fe3GaTe2 at Room Temperature

    Authors: Shuaizhao Jin, Zhan Wang, Shouzhe Dong, Yiting Wang, Kun Han, Guangcheng Wang, Zunyi Deng, Xingan Jiang, Ying Zhang, Houbing Huang, Jiawang Hong, Xiaolei Wang, Tianlong Xia, Sang-Wook Cheong, Xueyun Wang

    Abstract: Motivated by advances in spintronic devices, an extensive exploration is underway to uncover materials that host topologically protected spin textures, exemplified by skyrmions. One critical challenge involved in the potential application of skyrmions in van der Waals (vdW) materials is the attainment and manipulation of skyrmions at room temperature. In this study, we report the creation of intri… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  5. arXiv:2402.14209  [pdf, other

    astro-ph.SR astro-ph.IM

    Developing an Automated Detection, Tracking and Analysis Method for Solar Filaments Observed by CHASE via Machine Learning

    Authors: Z. Zheng, Q. Hao, Y. Qiu, J. Hong, C. Li, M. D. Ding

    Abstract: Studies on the dynamics of solar filaments have significant implications for understanding their formation, evolution, and eruption, which are of great importance for space weather warning and forecasting. The H$α$ Imaging Spectrograph (HIS) onboard the recently launched Chinese H$α$ Solar Explorer (CHASE) can provide full-disk solar H$α$ spectroscopic observations, which bring us an opportunity t… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 13 pages, 9 figures, Accepted for publication in ApJ

  6. arXiv:2402.13259  [pdf, other

    stat.ME cs.CE math.NA math.PR

    Fast Discrete-Event Simulation of Markovian Queueing Networks through Euler Approximation

    Authors: L. Jeff Hong, Yingda Song, Tan Wang

    Abstract: The efficient management of large-scale queueing networks is critical for a variety of sectors, including healthcare, logistics, and customer service, where system performance has profound implications for operational effectiveness and cost management. To address this key challenge, our paper introduces simulation techniques tailored for complex, large-scale Markovian queueing networks. We develop… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  7. arXiv:2402.11632  [pdf, other

    eess.SP

    Reliable long timescale decision-directed channel estimation for OFDM system

    Authors: Xun Wang, Xin Xie, Cunqing Hua, Jianan Hong, Pengwenlong Gu

    Abstract: Decision-directed channel estimation (DDCE) is one kind of blind channel estimation method that tracks the channel blindly by an iterative algorithm without relying on the pilots, which can increase the utilization of wireless resource. However, one major problem of DDCE is the performance degradation caused by error accumulation during the tracking process. In this paper, we propose an reliable D… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  8. arXiv:2402.11592  [pdf, other

    cs.LG cs.CL

    Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

    Authors: Yihua Zhang, Pingzhi Li, Junyuan Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong, Zhangyang Wang, Sijia Liu, Tianlong Chen

    Abstract: In the evolving landscape of natural language processing (NLP), fine-tuning pre-trained Large Language Models (LLMs) with first-order (FO) optimizers like SGD and Adam has become standard. Yet, as LLMs grow {in size}, the substantial memory overhead from back-propagation (BP) for FO gradient computation presents a significant challenge. Addressing this issue is crucial, especially for applications… ▽ More

    Submitted 27 May, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  9. arXiv:2402.11272  [pdf, other

    cond-mat.dis-nn quant-ph

    Many-body localization properties of fully frustrated Heisenberg spin-1/2 ladder model with next-nearest-neighbor interaction

    Authors: Jiameng Hong, Taotao Hu

    Abstract: Many-body localization (MBL) is an intriguing physical phenomenon that arises from the interplay of interaction and disorder, allowing quantum systems to prevent thermalization. In this study, we investigate the MBL properties of the fully frustrated Heisenberg spin-1/2 ladder model with next-nearest-neighbor hopping interaction along the leg direction and compare it with the Heisenberg spin-1/2 s… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 9 pages, 6 figures

  10. arXiv:2402.10539  [pdf, ps, other

    astro-ph.SR

    Two-sided Loop Solar Jet Driven by the Eruption of a Small Filament in a Big Filament Channel

    Authors: Jiayan Yang, Hechao Chen, Junchao Hong, Bo Yang, Yi Bi

    Abstract: Similar to the cases of anemone jets, two-sided loop solar jets could also be produced by either flux emergence from the solar interior or small scale filament eruptions. Using the high-quality data from the Solar Dynamic Observatory (SDO), we analyzed a two-sided loop solar jet triggered by the eruption of a small filament in this paper. The jet was occurred in a pre-existing big filament channel… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  11. arXiv:2402.08138  [pdf, other

    cs.CV

    H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields

    Authors: Minyoung Park, Mirae Do, YeonJae Shin, Jaeseok Yoo, Jongkwang Hong, Joongrock Kim, Chul Lee

    Abstract: Advanced techniques using Neural Radiance Fields (NeRF), Signed Distance Fields (SDF), and Occupancy Fields have recently emerged as solutions for 3D indoor scene reconstruction. We introduce a novel two-phase learning approach, H2O-SDF, that discriminates between object and non-object regions within indoor environments. This method achieves a nuanced balance, carefully preserving the geometric in… ▽ More

    Submitted 8 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  12. arXiv:2402.06777  [pdf, other

    cs.HC cs.MM cs.SD eess.AS

    Capturing Cancer as Music: Cancer Mechanisms Expressed through Musification

    Authors: Rostyslav Hnatyshyn, Jiayi Hong, Ross Maciejewski, Christopher Norby, Carlo C. Maley

    Abstract: The development of cancer is difficult to express on a simple and intuitive level due to its complexity. Since cancer is so widespread, raising public awareness about its mechanisms can help those affected cope with its realities, as well as inspire others to make lifestyle adjustments and screen for the disease. Unfortunately, studies have shown that cancer literature is too technical for the gen… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  13. arXiv:2402.06332  [pdf, other

    cs.CL

    InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

    Authors: Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, Yichuan Ma, Jiawei Hong, Kuikun Liu, Ziyi Wang, Yudong Wang, Zijian Wu, Shuaibin Li, Fengzhe Zhou, Hongwei Liu, Songyang Zhang, Wenwei Zhang, Hang Yan, Xipeng Qiu, Jiayu Wang, Kai Chen, Dahua Lin

    Abstract: The math abilities of large language models can represent their abstract reasoning ability. In this paper, we introduce and open-source our math reasoning LLMs InternLM-Math which is continue pre-trained from InternLM2. We unify chain-of-thought reasoning, reward modeling, formal reasoning, data augmentation, and code interpreter in a unified seq2seq format and supervise our model to be a versatil… ▽ More

    Submitted 24 May, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  14. arXiv:2402.04159  [pdf, ps, other

    math.AP math.DS

    Optimal transport in the frame of abstract Lax-Oleinik operator revisited

    Authors: Wei Cheng, Jiahui Hong, Tianqi Shi

    Abstract: This is our first paper on the extension of our recent work on the Lax-Oleinik commutators and its applications to the intrinsic approach of propagation of singularities of the viscosity solutions of Hamilton-Jacobi equations. We reformulate Kantorovich-Rubinstein duality theorem in the theory of optimal transport in terms of abstract Lax-Oleinik operators, and analyze the relevant optimal transpo… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  15. arXiv:2402.03582  [pdf, other

    cs.HC cs.CR

    Matcha: An IDE Plugin for Creating Accurate Privacy Nutrition Labels

    Authors: Tianshi Li, Lorrie Faith Cranor, Yuvraj Agarwal, Jason I. Hong

    Abstract: Apple and Google introduced their versions of privacy nutrition labels to the mobile app stores to better inform users of the apps' data practices. However, these labels are self-reported by developers and have been found to contain many inaccuracies due to misunderstandings of the label taxonomy. In this work, we present Matcha, an IDE plugin that uses automated code analysis to help developers c… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 38 pages

  16. arXiv:2402.01137  [pdf, ps, other

    math.PR math.NA

    Long-time dynamics of stochastic wave equation with dissipative damping and its full discretization: exponential ergodicity and strong law of large numbers

    Authors: Meng Cai, Chuchu Chen, Jialin Hong, Tau Zhou

    Abstract: For stochastic wave equation, when the dissipative damping is a non-globally Lipschitz function of the velocity, there are few results on the long-time dynamics, in particular, the exponential ergodicity and strong law of large numbers, for the equation and its numerical discretization to our knowledge. Focus on this issue, the main contributions of this paper are as follows. First, based on const… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  17. arXiv:2402.00907  [pdf, other

    cs.LG stat.ME

    AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems

    Authors: Ruihan Zhou, L. Jeff Hong, Yijie Peng

    Abstract: We introduce AlphaRank, an artificial intelligence approach to address the fixed-budget ranking and selection (R&S) problems. We formulate the sequential sampling decision as a Markov decision process and propose a Monte Carlo simulation-based rollout policy that utilizes classic R&S procedures as base policies for efficiently learning the value function of stochastic dynamic programming. We accel… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  18. arXiv:2401.10660  [pdf, other

    cs.CL cs.AI

    Accelerating Multilingual Language Model for Excessively Tokenized Languages

    Authors: Jimin Hong, Gibbeum Lee, Jaewoong Cho

    Abstract: Recent advancements in large language models (LLMs) have remarkably enhanced performances on a variety of tasks in multiple languages. However, tokenizers in LLMs trained primarily on English-centric corpora often overly fragment a text into character or Unicode-level tokens in non-Roman alphabetic languages, leading to inefficient text generation. We introduce a simple yet effective framework to… ▽ More

    Submitted 6 August, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024 Findings

  19. arXiv:2401.08628  [pdf, other

    math.NA math-ph

    Monostatic imaging of an extended target with MCMC sampling

    Authors: Jiho Hong, Sangwoo Kang, Mikyoung Lim

    Abstract: We consider the imaging of a planar extended target from far-field data under a monostatic measurement configuration, in which the data is measured by a single moving transducer, as frequently encountered in practical application. In this paper, we develop a Bayesian approach to recover the shape of the extended target with MCMC sampling, where a new shape basis selection is proposed based on the… ▽ More

    Submitted 8 December, 2023; originally announced January 2024.

    Comments: 17 pages, 8 figures

  20. arXiv:2401.06315  [pdf

    physics.flu-dyn

    Stochastic modelling of the instantaneous velocity profile in rough-wall turbulent boundary layers

    Authors: Roozbeh Ehsani, Michael Heisel, Jiaqi Li, Vaughan Voller, Jiarong Hong, Michele Guala

    Abstract: The statistical properties of Uniform Momentum Zones (UMZs) are extracted from laboratory and field measurements in rough wall turbulent boundary layers to formulate a set of stochastic models for the simulation of instantaneous velocity profiles. A spatio-temporally resolved velocity dataset, covering a field of view of $8 \times 9$ m$^2$, was obtained in the atmospheric surface layer using super… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Journal ref: Journal of Fluid Mechanics , Volume 979 , 25 January 2024 , A12

  21. A Survey of Designs for Combined 2D+3D Visual Representations

    Authors: Jiayi Hong, Rostyslav Hnatyshyn, Ebrar A. D. Santos, Ross Maciejewski, Tobias Isenberg

    Abstract: We examine visual representations of data that make use of combinations of both 2D and 3D data mappings. Combining 2D and 3D representations is a common technique that allows viewers to understand multiple facets of the data with which they are interacting. While 3D representations focus on the spatial character of the data or the dedicated 3D data mapping, 2D representations often show abstract d… ▽ More

    Submitted 12 January, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Journal ref: IEEE Transactions on Visualization and Computer Graphics 2024

  22. arXiv:2401.04541  [pdf

    cond-mat.str-el

    Flexomagnetoelectric effect in Sr2IrO4 thin films

    Authors: Xin Liu, Ting Hu, Yujun Zhang, Xueli Xu, Biao Wu, Zongwei Ma, Peng Lv, Yuelin Zhang, Shih-Wen Huang, Jialu Wu, Jing Ma, Jiawang Hong, Zhigao Sheng, Chenglong Jia, Erjun Kan, Ce-Wen Nan, Jinxing Zhang

    Abstract: Symmetry engineering is explicitly effective to manipulate and even create phases and orderings in strongly correlated materials. Flexural stress is universally practical to break the space-inversion or time-reversal symmetry. Here, by introducing strain gradient in a centrosymmetric antiferromagnet Sr2IrO4, the space-inversion symmetry is broken accompanying a non-equivalent O p-Ir d orbital hybr… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  23. arXiv:2401.02656  [pdf, other

    cs.CV cs.LG

    GTA: Guided Transfer of Spatial Attention from Object-Centric Representations

    Authors: SeokHyun Seo, Jinwoo Hong, JungWoo Chae, Kyungyul Kim, Sangheum Hwang

    Abstract: Utilizing well-trained representations in transfer learning often results in superior performance and faster convergence compared to training from scratch. However, even if such good representations are transferred, a model can easily overfit the limited training dataset and lose the valuable properties of the transferred representations. This phenomenon is more severe in ViT due to its low induct… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  24. arXiv:2401.02437  [pdf, other

    cs.NE cs.CV cs.LG

    Randomly Weighted Neuromodulation in Neural Networks Facilitates Learning of Manifolds Common Across Tasks

    Authors: Jinyung Hong, Theodore P. Pavlic

    Abstract: Geometric Sensitive Hashing functions, a family of Local Sensitive Hashing functions, are neural network models that learn class-specific manifold geometry in supervised learning. However, given a set of supervised learning tasks, understanding the manifold geometries that can represent each task and the kinds of relationships between the tasks based on them has received little attention. We explo… ▽ More

    Submitted 17 November, 2023; originally announced January 2024.

    Comments: 10 pages, 7 figures, 1 table, Appear in NeurIPS 2023

  25. arXiv:2312.15603  [pdf, other

    cs.CL

    A Split-and-Privatize Framework for Large Language Model Fine-Tuning

    Authors: Xicong Shen, Yang Liu, Huiqi Liu, Jue Hong, Bing Duan, Zirui Huang, Yunlong Mao, Ye Wu, Di Wu

    Abstract: Fine-tuning is a prominent technique to adapt a pre-trained language model to downstream scenarios. In parameter-efficient fine-tuning, only a small subset of modules are trained over the downstream datasets, while leaving the rest of the pre-trained model frozen to save computation resources. In recent years, a popular productization form arises as Model-as-a-Service (MaaS), in which vendors prov… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  26. arXiv:2312.14201  [pdf, other

    cs.CV cs.AI

    Towards Better Visualizing the Decision Basis of Networks via Unfold and Conquer Attribution Guidance

    Authors: Jung-Ho Hong, Woo-Jeoung Nam, Kyu-Sung Jeon, Seong-Whan Lee

    Abstract: Revealing the transparency of Deep Neural Networks (DNNs) has been widely studied to describe the decision mechanisms of network inner structures. In this paper, we propose a novel post-hoc framework, Unfold and Conquer Attribution Guidance (UCAG), which enhances the explainability of the network decision by spatially scrutinizing the input features with respect to the model confidence. Addressing… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures, Accepted paper in AAAI Conference on Artificial Intelligence (AAAI), 2023

  27. arXiv:2312.12080  [pdf, other

    cs.CV cs.GR

    Learning Subject-Aware Cropping by Outpainting Professional Photos

    Authors: James Hong, Lu Yuan, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian

    Abstract: How to frame (or crop) a photo often depends on the image subject and its context; e.g., a human portrait. Recent works have defined the subject-aware image cropping task as a nuanced and practical version of image cropping. We propose a weakly-supervised approach (GenCrop) to learn what makes a high-quality, subject-aware crop from professional stock images. Unlike supervised prior work, GenCrop… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: AAAI 24. Extended version with supplemental materials

  28. arXiv:2312.09295  [pdf, other

    cs.NI cs.SI

    Networking for the Metaverse: The Standardization Landscape

    Authors: Cedric Westphal, Jungha Hong, Shin-Gak Kang, Leonardo Chiariglione, Tianji Jiang

    Abstract: New applications are being supported by current and future networks. In particular, it is expected that Metaverse applications will be deployed in the near future, as 5G and 6G network provide sufficient bandwidth and sufficiently low latency to provide a satisfying end-user experience. However, networks still need to evolve to better support this type of application. We present here a basic taxon… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: To appear in ITU Journal on Future and Evolving Technologies J-FET December 2023

  29. arXiv:2312.08909  [pdf, ps, other

    hep-th

    A compendium of logarithmic corrections in AdS/CFT

    Authors: Nikolay Bobev, Marina David, Junho Hong, Valentin Reys, Xuao Zhang

    Abstract: We study the logarithmic corrections to various CFT partition functions in the context of the AdS$_4$/CFT$_3$ correspondence for theories arising on the worldvolume of M2-branes. We utilize four-dimensional gauged supergravity and heat kernel methods and present general expressions for the logarithmic corrections to the gravitational on-shell action and black hole entropy for a number of different… ▽ More

    Submitted 5 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: v1: 87 pages, 10 tables; v2: Seeley-de Witt coefficients in the presence of non-minimal couplings dictated by supersymmetry are newly addressed in section 5 and following places

  30. arXiv:2312.06846  [pdf, ps, other

    math.AC

    Hilbert Coefficients and Sally Modules: A Survey of Vasconcelos' Contributions

    Authors: Jooyoun Hong, Susan Morey

    Abstract: This paper surveys and summarizes Wolmer Vasconcelos' results surrounding multiplicities, Hilbert coefficients, and their extensions. We particularly focus on Vasconcelos' results regarding multiplicities and Chern coefficients, and other invariants which they bound. The Sally module is an important instrument introduced by Vasconcelos for this study, which naturally relates Hilbert coefficients t… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 30 pages

    MSC Class: 13-02 (Primary) 13D40; 13H10; 13H15 (Secondary)

  31. arXiv:2312.06708  [pdf, other

    cs.CV

    Neutral Editing Framework for Diffusion-based Video Editing

    Authors: Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong, Chang D. Yoo

    Abstract: Text-conditioned image editing has succeeded in various types of editing based on a diffusion framework. Unfortunately, this success did not carry over to a video, which continues to be challenging. Existing video editing systems are still limited to rigid-type editing such as style transfer and object overlay. To this end, this paper proposes Neutral Editing (NeuEdit) framework to enable complex… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 18 pages, 14 figures

  32. arXiv:2312.05174  [pdf, other

    physics.optics cond-mat.mtrl-sci

    High Absorptivity Nanotextured Powders for Additive Manufacturing

    Authors: Ottman A. Tertuliano, Philip J. DePond, Andrew C. Lee, Jiho Hong, David Doan, Luc Capaldi, Mark Brongersma, X. Wendy Gu, Manyalibo J. Matthews, Wei Cai, Adrian J. Lew

    Abstract: The widespread application of metal additive manufacturing (AM) is limited by the ability to control the complex interactions between the energy source and the feedstock material. Here we develop a generalizable process to introduce nanoscale grooves to the surface of metal powders which increases the powder absorptivity by up to 70% during laser powder bed fusion. Absorptivity enhancements in cop… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  33. arXiv:2312.04594  [pdf, other

    cs.CR cs.AI cs.LG

    FedGeo: Privacy-Preserving User Next Location Prediction with Federated Learning

    Authors: Chung Park, Taekyoon Choi, Taesan Kim, Mincheol Cho, Junui Hong, Minsung Choi, Jaegul Choo

    Abstract: A User Next Location Prediction (UNLP) task, which predicts the next location that a user will move to given his/her trajectory, is an indispensable task for a wide range of applications. Previous studies using large-scale trajectory datasets in a single server have achieved remarkable performance in UNLP task. However, in real-world applications, legal and ethical issues have been raised regardin… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted at 31st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2023)

  34. arXiv:2312.03724  [pdf, other

    cs.CL cs.AI

    DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer

    Authors: Junyuan Hong, Jiachen T. Wang, Chenhui Zhang, Zhangheng Li, Bo Li, Zhangyang Wang

    Abstract: Large Language Models (LLMs) have emerged as dominant tools for various tasks, particularly when tailored for a specific target by prompt tuning. Nevertheless, concerns surrounding data privacy present obstacles due to the tuned prompts' dependency on sensitive private information. A practical solution is to host a local LLM and optimize a soft prompt privately using data. Yet, hosting a local mod… ▽ More

    Submitted 17 March, 2024; v1 submitted 26 November, 2023; originally announced December 2023.

    Comments: Accepted to ICLR'24 Splotlight (updated version)

  35. arXiv:2312.03205  [pdf, other

    cs.CR

    Who Leaked the Model? Tracking IP Infringers in Accountable Federated Learning

    Authors: Shuyang Yu, Junyuan Hong, Yi Zeng, Fei Wang, Ruoxi Jia, Jiayu Zhou

    Abstract: Federated learning (FL) emerges as an effective collaborative learning framework to coordinate data and computation resources from massive and distributed clients in training. Such collaboration results in non-trivial intellectual property (IP) represented by the model parameters that should be protected and shared by the whole party rather than an individual user. Meanwhile, the distributed natur… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  36. arXiv:2312.02669  [pdf, other

    physics.optics eess.IV

    Deep-learning-driven end-to-end metalens imaging

    Authors: Joonhyuk Seo, Jaegang Jo, Joohoon Kim, Joonho Kang, Chanik Kang, Seongwon Moon, Eunji Lee, Jehyeong Hong, Junsuk Rho, Haejun Chung

    Abstract: Recent advances in metasurface lenses (metalenses) have shown great potential for opening a new era in compact imaging, photography, light detection and ranging (LiDAR), and virtual reality/augmented reality (VR/AR) applications. However, the fundamental trade-off between broadband focusing efficiency and operating bandwidth limits the performance of broadband metalenses, resulting in chromatic ab… ▽ More

    Submitted 10 May, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 17 pages, 7 figures, 1 table

  37. arXiv:2312.00792  [pdf

    physics.flu-dyn

    Visualization and Characterization of Agricultural Sprays Using Machine Learning based Digital Inline Holography

    Authors: Shyam Kumar M, Christopher J. Hogan, Steven A. Fredericks, Jiarong Hong

    Abstract: Accurate characterization of agricultural sprays is crucial to predict in field performance of liquid applied crop protection products. Here we introduce a robust and efficient machine learning (ML) based Digital In-line Holography (DIH) to accurately characterize the droplet field for a wide range of agricultural spray nozzles. Compared to non-ML methods, our method enhances accuracy, generalizab… ▽ More

    Submitted 13 November, 2023; originally announced December 2023.

    Comments: 24 pages, 12 figures

  38. arXiv:2312.00407  [pdf, other

    cs.CL

    CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

    Authors: Kai Lv, Shuo Zhang, Tianle Gu, Shuhao Xing, Jiawei Hong, Keyu Chen, Xiaoran Liu, Yuqing Yang, Honglin Guo, Tengxiao Liu, Yu Sun, Qipeng Guo, Hang Yan, Xipeng Qiu

    Abstract: Large language models (LLMs) are increasingly pivotal in a wide range of natural language processing tasks. Access to pre-trained models, courtesy of the open-source community, has made it possible to adapt these models to specific applications for enhanced performance. However, the substantial resources required for training these models necessitate efficient solutions. This paper introduces CoLL… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: To appear at EMNLP 2023 Demo; Code is available at https://github.com/OpenLMLab/collie

  39. arXiv:2311.18232  [pdf, other

    cs.CL cs.AI cs.LG

    LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

    Authors: Marwa Abdulhai, Isadora White, Charlie Snell, Charles Sun, Joey Hong, Yuexiang Zhai, Kelvin Xu, Sergey Levine

    Abstract: Large language models (LLMs) provide excellent text-generation capabilities, but standard prompting and generation methods generally do not lead to intentional or goal-directed agents and might necessitate considerable prompt tuning. This becomes particularly apparent in multi-turn conversations: even the best current LLMs rarely ask clarifying questions, engage in explicit information gathering,… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  40. arXiv:2311.17797  [pdf, other

    cs.LG stat.ME

    Learning to Simulate: Generative Metamodeling via Quantile Regression

    Authors: L. Jeff Hong, Yanxi Hou, Qingkai Zhang, Xiaowei Zhang

    Abstract: Stochastic simulation models, while effective in capturing the dynamics of complex systems, are often too slow to run for real-time decision-making. Metamodeling techniques are widely used to learn the relationship between a summary statistic of the outputs (e.g., the mean or quantile) and the inputs of the simulator, so that it can be used in real time. However, this methodology requires the know… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Main body: 36 pages, 7 figures; supplemental material: 12 pages

  41. arXiv:2311.16951  [pdf, other

    physics.flu-dyn

    Three-dimensional internal flow evolution of an evaporating droplet and its role in particle deposition pattern

    Authors: Jiaqi Li, Jiarong Hong

    Abstract: The internal flow within an evaporating sessile droplet is one of the driving mechanisms that lead to the variety of particle deposition patterns seen in applications such as inkjet printing, surface patterning, and blood stain analysis. Despite decades of research, the causal link between droplet internal flow and particle deposition patterns has not been fully established. In this study, we empl… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  42. arXiv:2311.16901  [pdf, other

    astro-ph.SR astro-ph.GA

    Stellar Loci. VII. Photometric Metallicities of 5 Million FGK Stars Based on GALEX GR6+7 AIS and Gaia EDR3

    Authors: Xue Lu, Haibo Yuan, Shuai Xu, Ruoyi Zhang, Kai Xiao, Yang Huang, Timothy C. Beers, Jihye Hong

    Abstract: We combine photometric data from GALEX GR6+7 AIS and Gaia EDR3 with stellar parameters from the SAGA and PASTEL catalogs to construct high-quality training samples for dwarfs ($\rm 0.4< BP-RP<1.6$) and giants ($\rm 0.6< BP-RP <1.6$). We apply careful reddening corrections using empirical temperature- and extinction-dependent extinction coefficients. Using the two samples, we establish a relationsh… ▽ More

    Submitted 11 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 24 pages, 19 figures, accepted by ApJS

  43. arXiv:2311.13757   

    math.PR

    Exceptional times for the instantaneous propagation of superprocess

    Authors: Jieliang Hong, Leonid Mytnik

    Abstract: For a Dawson-Watanabe superprocess $X$ on $\mathbb{R}^d$, it is shown in Perkins (1990) that if the underlying spatial motion belongs to a certain class of Lévy processes that admit jumps, then with probability one the closed support of $X_t$ is the whole space for almost all $t>0$ before extinction, the so-called ``instantaneous propagation'' property. In this paper for superprocesses on… ▽ More

    Submitted 6 December, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: The latest version for this paper is in arXiv:2207.11705

    MSC Class: 60J68; 60G17

  44. arXiv:2311.13488  [pdf, other

    eess.SY

    Machine Learning based Post Event Analysis for Cybersecurity of Cyber-Physical System

    Authors: Kuchan Park, Junho Hong, Wencong Su, HyoJong Lee

    Abstract: As Information and Communication Technology (ICT) equipment continues to be integrated into power systems, issues related to cybersecurity are increasingly emerging. Particularly noteworthy is the transition to digital substations, which is shifting operations from traditional hardwired-based systems to communication-based Supervisory Control and Data Acquisition (SCADA) system operations. These c… ▽ More

    Submitted 7 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Submitted to 2024 IEEE Power and Energy Society General Meeting

  45. arXiv:2311.13188  [pdf, other

    cs.AI cs.LG

    Cracking the Code of Negative Transfer: A Cooperative Game Theoretic Approach for Cross-Domain Sequential Recommendation

    Authors: Chung Park, Taesan Kim, Taekyoon Choi, Junui Hong, Yelim Yu, Mincheol Cho, Kyunam Lee, Sungil Ryu, Hyungjun Yoon, Minsung Choi, Jaegul Choo

    Abstract: This paper investigates Cross-Domain Sequential Recommendation (CDSR), a promising method that uses information from multiple domains (more than three) to generate accurate and diverse recommendations, and takes into account the sequential nature of user interactions. The effectiveness of these systems often depends on the complex interplay among the multiple domains. In this dynamic landscape, th… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Accepted at 32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

  46. arXiv:2311.12205  [pdf, other

    cs.CR cs.CY

    SDN-Based Dynamic Cybersecurity Framework of IEC-61850 Communications in Smart Grid

    Authors: Mansi Girdhar, Junho Hong, Wencong Su, Akila Herath, Chen-Ching Liu

    Abstract: In recent years, critical infrastructure and power grids have experienced a series of cyber-attacks, leading to temporary, widespread blackouts of considerable magnitude. Since most substations are unmanned and have limited physical security protection, cyber breaches into power grid substations present a risk. Nowadays, software-defined network (SDN), a popular virtual network technology based on… ▽ More

    Submitted 7 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 5 pages, 6 figures, 1 table, conference paper, supported by DOE (CESER) program

  47. arXiv:2311.11279  [pdf, other

    math.PR

    Staffing under Taylor's Law: A Unifying Framework for Bridging Square-root and Linear Safety Rules

    Authors: L. Jeff Hong, Weihuan Huang, Jiheng Zhang, Xiaowei Zhang

    Abstract: Staffing rules serve as an essential management tool in service industries to attain target service levels. Traditionally, the square-root safety rule, based on the Poisson arrival assumption, has been commonly used. However, empirical findings suggest that arrival processes often exhibit an ``over-dispersion'' phenomenon, in which the variance of the arrival exceeds the mean. In this paper, we de… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: 55 pages

  48. arXiv:2311.11126  [pdf, other

    cs.LG cs.AI

    Bayesian Neural Networks: A Min-Max Game Framework

    Authors: Junping Hong, Ercan Engin Kuruoglu

    Abstract: This paper is a preliminary study of the robustness and noise analysis of deep neural networks via a game theory formulation Bayesian Neural Networks (BNN) and the maximal coding rate distortion loss. BNN has been shown to provide some robustness to deep learning, and the minimax method used to be a natural conservative way to assist the Bayesian method. Inspired by the recent closed-loop transcri… ▽ More

    Submitted 29 May, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: 6 pages, 8 figures,

  49. arXiv:2311.10263  [pdf, other

    cs.LG stat.ME

    Stable Differentiable Causal Discovery

    Authors: Achille Nazaret, Justin Hong, Elham Azizi, David Blei

    Abstract: Inferring causal relationships as directed acyclic graphs (DAGs) is an important but challenging problem. Differentiable Causal Discovery (DCD) is a promising approach to this problem, framing the search as a continuous optimization. But existing DCD methods are numerically unstable, with poor performance beyond tens of variables. In this paper, we propose Stable Differentiable Causal Discovery (S… ▽ More

    Submitted 27 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  50. arXiv:2311.07000  [pdf, ps, other

    math.AP math.DS

    Topological and control theoretic properties of Hamilton-Jacobi equations via Lax-Oleinik commutators

    Authors: Piermarco Cannarsa, Wei Cheng, Jiahui Hong

    Abstract: In the context of weak KAM theory, we discuss the commutators $\{T^-_t\circ T^+_t\}_{t\geqslant0}$ and $\{T^+_t\circ T^-_t\}_{t\geqslant0}$ of Lax-Oleinik operators. We characterize the relation $T^-_t\circ T^+_t=Id$ for both small time and arbitrary time $t$. We show this relation characterizes controllability for evolutionary Hamilton-Jacobi equation. Based on our previous work on the cut locus… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.