Utilizing Large Language Models in An Iterative Paradigm with Domain Feedback for Molecule Optimization

Abstract: Molecule optimization is a critical task in drug discovery to optimize desired properties of a given molecule through chemical modification. Despite Large Language Models (LLMs) holding the potential to efficiently simulate this task by using natural language to direct the optimization, utilizing them straightforwardly shows limited performance. In this work, we facilitate utilizing LLMs in an iterative paradigm by proposing a simple yet highly effective domain feedback provider, namely $\text{Re}^2$DF. In detail, $\text{Re}^2$DF harnesses an external toolkit, RDKit, to handle the molecule hallucination, if the modified molecule is chemically invalid. Otherwise, its desired properties are computed and compared to the original one, establishing reliable domain feedback with correct direction and distance towards the objective, followed by a retrieved example, to explicitly guide the LLM to refine the modified molecule. We conduct experiments across both single- and multi-property objectives with 2 thresholds, where $\text{Re}^2$DF shows significant improvements. Particularly, for 20 single-property objectives, $\text{Re}^2$DF enhances Hit ratio by 16.95% and 20.76% under loose and strict thresholds, respectively. For 32 multi-property objectives, $\text{Re}^2$DF enhances Hit ratio by 6.04% and 5.25%.

Submitted 20 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

arXiv:2410.09913 [pdf, other]

Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition

Authors: Kha Nhat Le, Hoang-Tuan Nguyen, Hung Tien Tran, Thanh Duc Ngo

Abstract: Unsupervised domain adaptation (UDA) has become increasingly prevalent in scene text recognition (STR), especially where training and testing data reside in different domains. The efficacy of existing UDA approaches tends to degrade when there is a large gap between the source and target domains. To deal with this problem, gradually shifting or progressively learning to shift from domain to domain is the key issue. In this paper, we introduce the Stratified Domain Adaptation (StrDA) approach, which examines the gradual escalation of the domain gap for the learning process. The objective is to partition the training data into subsets so that the progressively self-trained model can adapt to gradual changes. We stratify the training data by evaluating the proximity of each data sample to both the source and target domains. We propose a novel method for employing domain discriminators to estimate the out-of-distribution and domain discriminative levels of data samples. Extensive experiments on benchmark scene-text datasets show that our approach significantly improves the performance of baseline (source-trained) STR models.

Submitted 17 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

Comments: 15 pages, 12 figures, 5 tables, include supplementary materials

arXiv:2410.06599 [pdf, ps, other]

Analytically weak and mild solutions to stochastic heat equation with irregular drift

Authors: Siva Athreya, Oleg Butkovsky, Khoa Lê, Leonid Mytnik

Abstract: Consider the stochastic heat equation \begin{equation*} \partial_t u_t(x)=\frac12 \partial^2_{xx}u_t(x) +b(u_t(x))+\dot{W}_{t}(x),\quad t\in(0,T],\, x\in [0,1], \end{equation*} where $b$ is a generalized function, and $\dot W$ is space-time white noise on ${\mathbb R}_+\times[0,1]$. If the drift $b$ is a sufficiently regular function, then it is well-known that any analytically weak solution to this equation is also analytically mild, and vice versa. We extend this result to drifts that are generalized functions, with an appropriate adaptation of the notions of mild and weak solutions. As a corollary of our results, we show that for $b\in L_p({\mathbb R})$, $p\ge1$, this equation has a unique analytically weak and mild solution, thus extending the classical results of Gyöngy and Pardoux (1993).

Submitted 9 October, 2024; originally announced October 2024.

MSC Class: 60H15; 60H50; 60H17

arXiv:2410.02845 [pdf, other]

Towards Layer-Wise Personalized Federated Learning: Adaptive Layer Disentanglement via Conflicting Gradients

Authors: Minh Duong Nguyen, Khanh Le, Khoi Do, Nguyen H. Tran, Duc Nguyen, Chien Trinh, Zhaohui Yang

Abstract: In personalized Federated Learning (pFL), high data heterogeneity can cause significant gradient divergence across devices, adversely affecting the learning process. This divergence, especially when gradients from different users form an obtuse angle during aggregation, can negate progress, leading to severe weight and gradient update degradation. To address this issue, we introduce a new approach to pFL design, namely Federated Learning with Layer-wise Aggregation via Gradient Analysis (FedLAG), utilizing the concept of gradient conflict at the layer level. Specifically, when layer-wise gradients of different clients form acute angles, those gradients align in the same direction, enabling updates across different clients toward identifying client-invariant features. Conversely, when layer-wise gradient pairs form obtuse angles, the layers tend to focus on client-specific tasks. In hindsight, FedLAG assigns layers for personalization based on the extent of layer-wise gradient conflicts. Specifically, layers with gradient conflicts are excluded from the global aggregation process. The theoretical evaluation demonstrates that when integrated into other pFL baselines, FedLAG enhances pFL performance by a certain margin. Therefore, our proposed method achieves superior convergence behavior compared with other baselines. Extensive experiments show that our FedLAG outperforms several state-of-the-art methods and can be easily incorporated with many existing methods to further enhance performance.
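
The layer-level conflict test described in this abstract can be illustrated with a minimal sketch. It is not the authors' FedLAG implementation: the function names `cosine` and `split_layers`, the rule that any single negative pairwise cosine marks a layer as conflicting, and the toy gradients are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine of the angle between two flat gradient vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def split_layers(client_grads):
    """Partition layers into (shared, personalized).

    client_grads maps a layer name to a list of per-client flat gradients.
    Assumption: a layer counts as conflicting if ANY pair of client
    gradients forms an obtuse angle (negative cosine); such layers are
    kept out of global aggregation and treated as personalized.
    """
    shared, personalized = [], []
    for layer, grads in client_grads.items():
        conflict = any(
            cosine(grads[i], grads[j]) < 0.0
            for i in range(len(grads))
            for j in range(i + 1, len(grads))
        )
        (personalized if conflict else shared).append(layer)
    return shared, personalized


# Hypothetical two-client, two-layer example.
grads = {
    "conv1": [[1.0, 0.5], [0.8, 0.7]],   # acute angle: gradients agree
    "head": [[1.0, 0.0], [-1.0, 0.2]],   # obtuse angle: gradients conflict
}
shared, personalized = split_layers(grads)
print(shared, personalized)
```

Under these assumptions, `conv1` would be aggregated globally while `head` would stay client-specific; the paper may use a different conflict measure or threshold.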

Submitted 3 October, 2024; originally announced October 2024.

arXiv:2410.02221 [pdf, other]

doi 10.1038/s42256-023-00780-9

Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile gloves

Authors: Arvin Tashakori, Zenan Jiang, Amir Servati, Saeid Soltanian, Harishkumar Narayana, Katherine Le, Caroline Nakayama, Chieh-ling Yang, Z. Jane Wang, Janice J. Eng, Peyman Servati

Abstract: Accurate real-time tracking of dexterous hand movements and interactions has numerous applications in human-computer interaction, metaverse, robotics, and tele-health. Capturing realistic hand movements is challenging because of the large number of articulations and degrees of freedom. Here, we report accurate and dynamic tracking of articulated hand and finger movements using stretchable, washable smart gloves with embedded helical sensor yarns and inertial measurement units. The sensor yarns have a high dynamic range, responding to low 0.005 % to high 155 % strains, and show stability during extensive use and washing cycles. We use multi-stage machine learning to report average joint angle estimation root mean square errors of 1.21 and 1.45 degrees for intra- and inter-subjects cross-validation, respectively, matching accuracy of costly motion capture cameras without occlusion or field of view limitations. We report a data augmentation technique that enhances robustness to noise and variations of sensors. We demonstrate accurate tracking of dexterous hand movements during object interactions, opening new avenues of applications including accurate typing on a mock paper keyboard, recognition of complex dynamic and static gestures adapted from American Sign Language and object identification.

Submitted 3 October, 2024; originally announced October 2024.

Journal ref: Nature Machine Intelligence 6 (2024) 106-118

arXiv:2409.11130 [pdf, ps, other]

Regularisation by multiplicative noise for reaction-diffusion equations

Authors: Konstantinos Dareiotis, Teodor Holland, Khoa Lê

Abstract: We consider the stochastic reaction-diffusion equation in $1+1$ dimensions driven by multiplicative space-time white noise, with a distributional drift belonging to a Besov-Hölder space with any regularity index larger than $-1$. We assume that the diffusion coefficient is a regular function which is bounded away from zero. By using a combination of stochastic sewing techniques and Malliavin calculus, we show that the equation admits a unique solution.

Submitted 17 September, 2024; originally announced September 2024.

arXiv:2409.05706 [pdf, other]

Quantitative approximation of stochastic kinetic equations: from discrete to continuum

Authors: Zimo Hao, Khoa Lê, Chengcheng Ling

Abstract: We study the convergence of a generic tamed Euler-Maruyama (EM) scheme for the kinetic type stochastic differential equations (SDEs) (also known as second order SDEs) with singular coefficients in both weak and strong probabilistic senses. We show that when the drift exhibits a relatively low regularity compared to the state of the art, the singular system is well-defined both in the weak and strong probabilistic senses. Meanwhile, the corresponding tamed EM scheme is shown to converge at the rate of 1/2 in both the weak and the strong senses.

Submitted 9 September, 2024; originally announced September 2024.

Comments: 51 pages

MSC Class: Primary 60H35; 65C30; 60H10; Secondary 60H50; 60L90; 35K65; 35R05; 35B65

arXiv:2408.09139 [pdf, ps, other]

Explicit Convergence Rate of The Proximal Point Algorithm under R-Continuity

Authors: Ba Khiet Le, Michel Théra

Abstract: The paper provides a thorough comparison between R-continuity and other fundamental tools in optimization such as metric regularity, metric subregularity and calmness. We show that R-continuity has some advantages in the convergence rate analysis of algorithms solving optimization problems. We also present some properties of R-continuity and study the explicit convergence rate of the Proximal Point Algorithm (PPA) under the R-continuity.

Submitted 17 August, 2024; originally announced August 2024.

arXiv:2408.04854 [pdf, ps, other]

A propensity score weighting approach to integrate aggregated data in random-effect individual-level data meta-analysis

Authors: Tran Trong Khoi Le, Sivem Afach, Tat-Thang Vo

Abstract: In evidence synthesis, collecting individual participant data (IPD) across eligible studies is the most reliable way to investigate the treatment effects in different subgroups defined by participant characteristics. Nonetheless, access to all IPD from all studies might be very challenging due to privacy concerns. To overcome this, many approaches such as multilevel modeling have been proposed to incorporate the vast amount of aggregated data from the literature into IPD meta-analysis. These methods, however, often rely on specifying separate models for trial-level versus patient-level data, which likely suffers from ecological bias when there are non-linearities in the outcome generating mechanism. In this paper, we introduce a novel method to combine aggregated data and IPD in meta-analysis that is free from ecological bias. The proposed approach relies on modeling the study membership given covariates, then using inverse weighting to estimate the trial-specific coefficients in the individual-level outcome model of studies without IPD accessible. The weights derived from this approach also shed insights on the similarity in the case-mix across studies, which is useful to assess whether eligible trials are sufficiently similar to be meta-analyzed. We evaluate the proposed method by synthetic data, then apply it to a real-world meta-analysis comparing the chance of response between guselkumab and adalimumab among patients with psoriasis.

Submitted 9 August, 2024; originally announced August 2024.

arXiv:2407.03788 [pdf, other]

MAMA: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Authors: Thong Nguyen, Yi Bin, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

Abstract: Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering the downstream performance across unpopular subjects. To address these problems, we propose MAMA, a new approach to learning video-language representations by utilizing a contrastive objective with a subtractive angular margin to regularize cross-modal representations in their effort to reach perfect similarity. Furthermore, to adapt to the non-uniform concept distribution, MAMA utilizes a multi-layer perceptron (MLP)-parameterized weighting function that maps loss values to sample weights which enable dynamic adjustment of the model's focus throughout the training. With the training guided by a small amount of unbiased meta-data and augmented by video-text data generated by a large vision-language model, MAMA improves video-language representations and achieves superior performance on commonly used video question answering and text-video retrieval datasets. The code, model, and data have been made available at https://nguyentthong.github.io/MAMA.

Submitted 9 October, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

Comments: Accepted to ECCV 2024

arXiv:2406.09717 [pdf, other]

UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages

Authors: Trinh Pham, Khoi M. Le, Luu Anh Tuan

Abstract: In this paper, we introduce UniBridge (Cross-Lingual Transfer Learning with Optimized Embeddings and Vocabulary), a comprehensive approach developed to improve the effectiveness of Cross-Lingual Transfer Learning, particularly in languages with limited resources. Our approach tackles two essential elements of a language model: the initialization of embeddings and the optimal vocabulary size. Specifically, we propose a novel embedding initialization method that leverages both lexical and semantic alignment for a language. In addition, we present a method for systematically searching for the optimal vocabulary size, ensuring a balance between model complexity and linguistic coverage. Our experiments across multilingual datasets show that our approach greatly improves the F1-Score in several languages. UniBridge is a robust and adaptable solution for cross-lingual systems in various languages, highlighting the significance of initializing embeddings and choosing the right vocabulary size in cross-lingual environments.

Submitted 20 August, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

Comments: First two authors contribute equally. Accepted at ACL 2024

arXiv:2406.06777 [pdf, other]

MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension

Authors: Khiem Le, Zhichun Guo, Kaiwen Dong, Xiaobao Huang, Bozhao Nan, Roshni Iyer, Xiangliang Zhang, Olaf Wiest, Wei Wang, Nitesh V. Chawla

Abstract: Large Language Models (LLMs) with their strong task-handling capabilities have shown remarkable advancements across a spectrum of fields, moving beyond natural language understanding. However, their proficiency within the chemistry domain remains restricted, especially in solving professional molecule-related tasks. This challenge is attributed to their inherent limitations in comprehending molecules using only common textual representations, i.e., SMILES strings. In this study, we seek to enhance the ability of LLMs to comprehend molecules by equipping them with a multi-modal external module, namely MolX. In particular, instead of directly using a SMILES string to represent a molecule, we utilize specific encoders to extract fine-grained features from both SMILES string and 2D molecular graph representations for feeding into an LLM. Moreover, a handcrafted molecular fingerprint is incorporated to leverage its embedded domain knowledge. Then, to establish an alignment between MolX and the LLM's textual input space, the whole model, in which the LLM is frozen, is pre-trained with a versatile strategy including a diverse set of tasks. Experimental evaluations show that our proposed method outperforms baselines across 4 downstream molecule-related tasks ranging from molecule-to-text translation to retrosynthesis, with and without fine-tuning the LLM, while only introducing a small number of trainable parameters, 0.53% and 0.82%, respectively.

Submitted 21 August, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

arXiv:2406.05808 [pdf, other]

The Landau--Lifshitz--Bloch equation: Unique existence and finite element approximation

Authors: Kim-Ngan Le, Agus L. Soenjaya, Thanh Tran

Abstract: The Landau--Lifshitz--Bloch equation (LLBE) describes the evolution of magnetic spin field in a ferromagnet at high temperatures. We consider a viscous (pseudo-parabolic) regularisation of the LLBE for temperatures higher than the Curie temperature, which we call the $ε$-LLBE. Variants of the $ε$-LLBE are applicable to model pattern formation, phase transition, and heat conduction for non-simple materials, among other things. In this paper, we show well-posedness of the $ε$-LLBE and the convergence of the solution $\boldsymbol{u}^ε$ of the regularised equation to the solution $\boldsymbol{u}$ of the LLBE as $ε\to 0^+$. As a by-product of our analysis, we show the existence and uniqueness of regular solution to the LLBE for temperatures higher than the Curie temperature. Furthermore, we propose a linear fully discrete conforming finite element scheme to approximate the solution of the $ε$-LLBE. Error analysis is performed to show unconditional stability and optimal uniform-in-time convergence rate for the schemes. Several numerical simulations corroborate our theoretical results.

Submitted 9 June, 2024; originally announced June 2024.

MSC Class: 65M12; 65M60; 35K59; 35Q60

arXiv:2406.02624 [pdf, other]

Take a Step Further: Understanding Page Spray in Linux Kernel Exploitation

Authors: Ziyi Guo, Dang K Le, Zhenpeng Lin, Kyle Zeng, Ruoyu Wang, Tiffany Bao, Yan Shoshitaishvili, Adam Doupé, Xinyu Xing

Abstract: Recently, a novel method known as Page Spray has emerged, focusing on page-level exploitation for kernel vulnerabilities. Despite the advantages it offers in terms of exploitability, stability, and compatibility, comprehensive research on Page Spray remains scarce. Questions regarding its root causes, exploitation model, comparative benefits over other exploitation techniques, and possible mitigation strategies have largely remained unanswered. In this paper, we conduct a systematic investigation into Page Spray, providing an in-depth understanding of this exploitation technique. We introduce a comprehensive exploit model termed the \sys model, elucidating its fundamental principles. Additionally, we conduct a thorough analysis of the root causes underlying Page Spray occurrences within the Linux Kernel. We design an analyzer based on the Page Spray analysis model to identify Page Spray callsites. Subsequently, we evaluate the stability, exploitability, and compatibility of Page Spray through meticulously designed experiments. Finally, we propose mitigation principles for addressing Page Spray and introduce our own lightweight mitigation approach. This research aims to assist security researchers and developers in gaining insights into Page Spray, ultimately enhancing our collective understanding of this emerging exploitation technique and making improvements to the community.

Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.20431 [pdf, other]

Exploring the Practicality of Federated Learning: A Survey Towards the Communication Perspective

Authors: Khiem Le, Nhan Luong-Ha, Manh Nguyen-Duc, Danh Le-Phuoc, Cuong Do, Kok-Seng Wong

Abstract: Federated Learning (FL) is a promising paradigm that offers significant advancements in privacy-preserving, decentralized machine learning by enabling collaborative training of models across distributed devices without centralizing data. However, the practical deployment of FL systems faces a significant bottleneck: the communication overhead caused by frequently exchanging large model updates between numerous devices and a central server. This communication inefficiency can hinder training speed, model performance, and the overall feasibility of real-world FL applications. In this survey, we investigate various strategies and advancements made in communication-efficient FL, highlighting their impact and potential to overcome the communication challenges inherent in FL systems. Specifically, we define measures for communication efficiency, analyze sources of communication inefficiency in FL systems, and provide a taxonomy and comprehensive review of state-of-the-art communication-efficient FL methods. Additionally, we discuss promising future research directions for enhancing the communication efficiency of FL systems. By addressing the communication bottleneck, FL can be effectively applied and enable scalable and practical deployment across diverse applications that require privacy-preserving, decentralized machine learning, such as IoT, healthcare, or finance.

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.15308 [pdf, other]

Nudging Users to Change Breached Passwords Using the Protection Motivation Theory

Authors: Yixin Zou, Khue Le, Peter Mayer, Alessandro Acquisti, Adam J. Aviv, Florian Schaub

Abstract: We draw on the Protection Motivation Theory (PMT) to design nudges that encourage users to change breached passwords. Our online experiment ($n$=$1,386$) compared the effectiveness of a threat appeal (highlighting negative consequences of breached passwords) and a coping appeal (providing instructions on how to change the breached password) in a 2x2 factorial design. Compared to the control condition, participants receiving the threat appeal were more likely to intend to change their passwords, and participants receiving both appeals were more likely to end up changing their passwords; both comparisons have a small effect size. Participants' password change behaviors are further associated with other factors such as their security attitudes (SA-6) and time passed since the breach, suggesting that PMT-based nudges are useful but insufficient to fully motivate users to change their passwords. Our study contributes to PMT's application in security research and provides concrete design implications for improving compromised credential notifications.

Submitted 24 May, 2024; originally announced May 2024.

Comments: Manuscript under review at ACM Transactions on Computer-Human Interaction

arXiv:2405.08442 [pdf, ps, other]

Algorithmic aspects of left-orderings of solvable Baumslag--Solitar groups via its dynamical realization

Authors: Meng-Che "Turbo" Ho, Khanh Le, Dino Rossegger

Abstract: We answer a question of Calderoni and Clay by showing that the conjugation equivalence relation of left orderings of the Baumslag-Solitar groups $\mathrm{BS}(1,n)$ is hyperfinite for any $n$. Our proof relies on a classification of $\mathrm{BS}(1,n)$'s left-orderings via its one-dimensional dynamical realizations. We furthermore use the effectiveness of the dynamical realizations of $\mathrm{BS}(1,n)$ to study algorithmic properties of the left-orderings on $\mathrm{BS}(1,n)$.

Submitted 14 May, 2024; originally announced May 2024.

MSC Class: 03E15; 20F60; 03C57; 03D45

arXiv:2403.15605 [pdf, other]

Efficiently Assemble Normalization Layers and Regularization for Federated Domain Generalization

Authors: Khiem Le, Long Ho, Cuong Do, Danh Le-Phuoc, Kok-Seng Wong

Abstract: Domain shift is a formidable issue in Machine Learning that causes a model to suffer from performance degradation when tested on unseen domains. Federated Domain Generalization (FedDG) attempts to train a global model using collaborative clients in a privacy-preserving manner that can generalize well to unseen clients possibly with domain shift. However, most existing FedDG methods either cause additional privacy risks of data leakage or induce significant costs in client communication and computation, which are major concerns in the Federated Learning paradigm. To circumvent these challenges, here we introduce a novel architectural method for FedDG, namely gPerXAN, which relies on a normalization scheme working with a guiding regularizer. In particular, we carefully design Personalized eXplicitly Assembled Normalization to enforce client models selectively filtering domain-specific features that are biased towards local data while retaining discrimination of those features. Then, we incorporate a simple yet effective regularizer to guide these models in directly capturing domain-invariant representations that the global model's classifier can leverage. Extensive experimental results on two benchmark datasets, i.e., PACS and Office-Home, and a real-world medical dataset, Camelyon17, indicate that our proposed method outperforms other existing methods in addressing this particular problem.

Submitted 22 March, 2024; originally announced March 2024.

arXiv:2403.13193 [pdf, ps, other]

doi 10.1145/3589335.3651463

A Study of Vulnerability Repair in JavaScript Programs with Large Language Models

Authors: Tan Khang Le, Saba Alimadadi, Steven Y. Ko

Abstract: In recent years, JavaScript has become the most widely used programming language, especially in web development. However, writing secure JavaScript code is not trivial, and programmers often make mistakes that lead to security vulnerabilities in web applications. Large Language Models (LLMs) have demonstrated substantial advancements across multiple domains, and their evolving capabilities indicate their potential for automatic code generation based on a required specification, including automatic bug fixing. In this study, we explore the accuracy of LLMs, namely ChatGPT and Bard, in finding and fixing security vulnerabilities in JavaScript programs. We also investigate the impact of context in a prompt on directing LLMs to produce a correct patch of vulnerable JavaScript code. Our experiments on real-world software vulnerabilities show that while LLMs are promising in automatic program repair of JavaScript code, achieving a correct bug fix often requires an appropriate amount of context in the prompt.

Submitted 19 March, 2024; originally announced March 2024.

Comments: camera-ready version accepted to the short paper track at WWW'24

arXiv:2403.08876 [pdf, other]

ARtVista: Gateway To Empower Anyone Into Artist

Authors: Trong-Vu Hoang, Quang-Binh Nguyen, Duy-Nam Ly, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

Abstract: Drawing is an art that enables people to express their imagination and emotions. However, individuals usually face challenges in drawing, especially when translating conceptual ideas into visually coherent representations and bridging the gap between mental visualization and practical execution. In response, we propose ARtVista - a novel system integrating AR and generative AI technologies. ARtVista not only recommends reference images aligned with users' abstract ideas and generates sketches for users to draw but also goes beyond, crafting vibrant paintings in various painting styles. ARtVista also offers users an alternative approach to create striking paintings by simulating the paint-by-number concept on reference images, empowering users to create visually stunning artwork devoid of the necessity for advanced drawing skills. We perform a pilot study and reveal positive feedback on its usability, emphasizing its effectiveness in visualizing user ideas and aiding the painting process to achieve stunning pictures without requiring advanced drawing skills. The source code will be available at https://github.com/htrvu/ARtVista.

Submitted 13 March, 2024; originally announced March 2024.

Comments: CHI 2024

arXiv:2403.08746 [pdf, other]

iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer

Authors: Dinh-Khoi Vo, Duy-Nam Ly, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

Abstract: Creating thematic collections in industries demands innovative designs and cohesive concepts. Designers may face challenges in maintaining thematic consistency when drawing inspiration from existing objects, landscapes, or artifacts. While AI-powered graphic design tools offer help, they often fail to generate cohesive sets based on specific thematic concepts. In response, we introduce iCONTRA, an interactive CONcept TRAnsfer system. With a user-friendly interface, iCONTRA enables both experienced designers and novices to effortlessly explore creative design concepts and efficiently generate thematic collections. We also propose a zero-shot image editing algorithm, eliminating the need for fine-tuning models, which gradually integrates information from initial objects, ensuring consistency in the generation process without influencing the background. A pilot study suggests iCONTRA's potential to reduce designers' efforts. Experimental results demonstrate its effectiveness in producing consistent and high-quality object concept transfers. iCONTRA stands as a promising tool for innovation and creative exploration in thematic collection design. The source code will be available at: https://github.com/vdkhoi20/iCONTRA.

Submitted 13 March, 2024; originally announced March 2024.

Comments: CHI 2024

arXiv:2403.05454 [pdf, ps, other]

Quantitative Propagation of Chaos for Singular Interacting Particle Systems Driven by Fractional Brownian Motion

Authors: Lucio Galeati, Khoa Lê, Avi Mayorcas

Abstract: We consider interacting particle systems driven by i.i.d. fractional Brownian motions, subject to irregular, possibly distributional, pairwise interactions. We show propagation of chaos and mean field convergence to the law of the associated McKean--Vlasov equation, as the number of particles $N\to\infty$, with quantitative sharp rates of order $N^{-1/2}$. Our results hold for a wide class of possibly time-dependent interactions, which are only assumed to satisfy a Besov-type regularity, related to the Hurst parameter $H\in (0,+\infty)\setminus \mathbb{N}$ of the driving noises. In particular, as $H$ decreases to $0$, interaction kernels of arbitrary singularity can be considered, a phenomenon frequently observed in regularization by noise results. Our proofs rely on a combination of Sznitman's direct comparison argument with stochastic sewing techniques.

Submitted 8 March, 2024; originally announced March 2024.

MSC Class: 60H10; 82C22; 60H50; 60G22; 60L90

arXiv:2402.11539 [pdf, ps, other]

Zagier-Hoffman's conjectures in positive characteristic II

Authors: Bo-Hae Im, Hojin Kim, Khac Nhuan Le, Tuan Ngo Dac, Lan Huong Pham

Abstract: Zagier-Hoffman's conjectures predict the dimension and a basis for the $\mathbb Q$-vector spaces spanned by $N$th cyclotomic multiple zeta values (MZV's) of fixed weight where $N$ is a natural number. For $N=1$ (MZV's case), half of these conjectures have been solved by the work of Terasoma, Deligne-Goncharov and Brown with the help of Zagier's identity. The other half are completely open. For $N=2$ (alternating MZV's case) and $N=3,4,8$, Deligne-Goncharov and Deligne solved the same half of these conjectures for $N$th-cyclotomic MZV's. For other values of $N$, no sharp upper bound on the dimension is known. In this paper we completely establish, for all $N$, Zagier-Hoffman's conjectures for $N$th cyclotomic multiple zeta values in positive characteristic. By working with the tower of all cyclotomic extensions, we present a proof that is uniform on $N$ and give an effective algorithm to express any cyclotomic multiple zeta value in the chosen basis. This generalizes all previous work on these conjectures for MZV's and alternating MZV's in positive characteristic.

Submitted 18 February, 2024; originally announced February 2024.

Comments: 45 pages

MSC Class: Primary 11M32; Secondary 11G09; 11J93; 11M38; 11R58

arXiv:2402.08251 [pdf, other]

doi 10.1109/SII58957.2024.10417611

Object Detection in Thermal Images Using Deep Learning for Unmanned Aerial Vehicles

Authors: Minh Dang Tu, Kieu Trang Le, Manh Duong Phung

Abstract: This work presents a neural network model capable of recognizing small and tiny objects in thermal images collected by unmanned aerial vehicles. Our model consists of three parts: the backbone, the neck, and the prediction head. The backbone is developed based on the structure of YOLOv5 combined with the use of a transformer encoder at the end. The neck includes a BI-FPN block combined with the use of a sliding window and a transformer to increase the information fed into the prediction head. The prediction head carries out the detection by evaluating feature maps with the Sigmoid function. The use of transformers with attention and sliding windows increases recognition accuracy while keeping the model at a reasonable number of parameters and computation requirements for embedded systems. Experiments conducted on the public dataset VEDAI and our collected datasets show that our model has a higher accuracy than state-of-the-art methods such as ResNet, Faster RCNN, ComNet, ViT, YOLOv5, SMPNet, and DPNetV3. Experiments on the embedded computer Jetson AGX show that our model achieves a real-time computation speed with a stability rate of over 90%.

Submitted 13 February, 2024; originally announced February 2024.

Comments: Published in: 2024 IEEE/SICE International Symposium on System Integration (SII)

arXiv:2402.06139 [pdf, ps, other]

Sliding Mode Observers for Set-valued Lur'e Systems with Uncertainties Beyond Observational Range

Authors: Samir Adly, Jun Huang, Ba Khiet Le

Abstract: In this paper, we introduce a new sliding mode observer for Lur'e set-valued dynamical systems, particularly addressing challenges posed by uncertainties not within the standard range of observation. Traditionally, most Luenberger-like observers and sliding mode observers have been designed only for uncertainties in the range of observation. Central to our approach is the treatment of the uncertainty term, which we decompose into two components: the first part in the observation subspace and the second part in its complemented subspace. We establish that when the second part converges to zero, an exact sliding mode observer for the system can be obtained. In scenarios where this convergence does not occur, our methodology allows for the estimation of errors between the actual state and the observer state. This leads to a practical interval estimation technique, valuable in situations where part of the uncertainty lies outside the observable range. Finally, we show that our observer is also a T-observer as well as a strong H-infinity observer.

Submitted 25 April, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

arXiv:2401.07278 [pdf, other]

Semi-Supervised Semantic Segmentation using Redesigned Self-Training for White Blood Cells

Authors: Vinh Quoc Luu, Duy Khanh Le, Huy Thanh Nguyen, Minh Thanh Nguyen, Thinh Tien Nguyen, Vinh Quang Dinh

Abstract: Artificial Intelligence (AI) in healthcare, especially in white blood cell cancer diagnosis, is hindered by two primary challenges: the lack of large-scale labeled datasets for white blood cell (WBC) segmentation and outdated segmentation methods. These challenges inhibit the development of more accurate and modern techniques to diagnose cancer relating to white blood cells. To address the first challenge, a semi-supervised learning framework should be devised to make efficient use of the scarce data available. In this work, we address this issue by proposing a novel self-training pipeline with the incorporation of FixMatch. Self-training is a technique that utilizes the model trained on labeled data to generate pseudo-labels for the unlabeled data and then re-trains on both of them. FixMatch is a consistency-regularization algorithm that enforces the model's robustness against variations in the input image. We discover that by incorporating FixMatch in the self-training pipeline, the performance improves in the majority of cases. We achieved the best performance with the self-training scheme with consistency on the DeepLab-V3 architecture and ResNet-50, reaching 90.69%, 87.37%, and 76.49% on the Zheng 1, Zheng 2, and LISC datasets, respectively.

Submitted 23 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

arXiv:2401.04348 [pdf, other]

doi 10.1609/aaai.v38i16.29804

LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training

Authors: Khoi M. Le, Trinh Pham, Tho Quan, Anh Tuan Luu

Abstract: Paraphrases are texts that convey the same meaning while using different words or sentence structures. They can be used as an automatic data augmentation tool for many Natural Language Processing tasks, especially when dealing with low-resource languages, where data shortage is a significant problem. To generate a paraphrase in multilingual settings, previous studies have leveraged the knowledge from the machine translation field, i.e., forming a paraphrase through zero-shot machine translation in the same language. Despite good performance on human evaluation, those methods still require parallel translation datasets, thus making them inapplicable to languages that do not have parallel corpora. To mitigate that problem, we proposed the first unsupervised multilingual paraphrasing model, LAMPAT ($\textbf{L}$ow-rank $\textbf{A}$daptation for $\textbf{M}$ultilingual $\textbf{P}$araphrasing using $\textbf{A}$dversarial $\textbf{T}$raining), by which a monolingual dataset is sufficient to generate a human-like and diverse sentence. Throughout the experiments, we found that our method not only works well for English but can generalize to unseen languages as well. Data and code are available at https://github.com/VinAIResearch/LAMPAT.

Submitted 23 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: First two authors contribute equally. Accepted at AAAI 2024

arXiv:2401.03917 [pdf, other]

Toward a comprehensive simulation framework for hypergraphs: a Python-base approach

Authors: Quoc Chuong Nguyen, Trung Kien Le

Abstract: Hypergraphs, a generalization of graphs in which edges can contain more than two nodes, have become increasingly prominent in complex network analysis. Unlike graphs, hypergraphs have relatively few supporting platforms, and such dearth presents a barrier to more widespread adaptation of hypergraph computational toolboxes that could enable further research in several areas. Here, we introduce HyperRD, a Python package for hypergraph computation, simulation, and interoperability with other powerful Python packages in graph and hypergraph research. Then, we will introduce two models on hypergraphs, the general Schelling's model and the SIR model, and simulate them with HyperRD.

Submitted 8 January, 2024; originally announced January 2024.

Comments: 13 pages, 3 figures

arXiv:2312.11045 [pdf, other]

Decays of Standard Model like Higgs boson $h \rightarrowγγ, Z γ$ in a minimal left-right symmetric model

Authors: T. T. Hong, V. K. Le, L. T. T. Phuong, N. C. Hoi, N. T. K. Ngan, N. H. T. Nha

Abstract: Two decay channels $h\rightarrow γγ, Zγ$ of the Standard Model-like Higgs in a left-right symmetry model are investigated under recent experimental data. We will show there exist one-loop contributions that affect the $h\rightarrow Zγ$ amplitude, but not the $h\rightarrow γγ$ amplitude. From numerical investigations, we show that the signal strength $μ_{Z γ}$ of the decay $h\rightarrow Zγ$ is still constrained strictly by that of $h\rightarrow γγ$, namely $|Δμ_{γγ}|<38\%$ results in max $|Δμ_{Z γ}|<46\%$. On the other hand, the future experimental sensitivity $|Δμ_{γγ}|=4\%$ still allows $|Δμ_{Z γ}|$ to reach values larger than the expected sensitivity $|Δμ_{Z γ}|=23\%$.

Submitted 11 March, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

Comments: 23 pages, 6 figures

arXiv:2312.07035 [pdf, other]

HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts

Authors: Giang Do, Khiem Le, Quang Pham, TrungTin Nguyen, Thanh-Nam Doan, Bint T. Nguyen, Chenghao Liu, Savitha Ramasamy, Xiaoli Li, Steven Hoi

Abstract: By routing input tokens to only a few split experts, Sparse Mixture-of-Experts has enabled efficient training of large language models. Recent findings suggest that fixing the routers can achieve competitive performance by alleviating the collapsing problem, where all experts eventually learn similar representations. However, this strategy has two key limitations: (i) the policy derived from random routers might be sub-optimal, and (ii) it requires extensive resources during training and evaluation, leading to limited efficiency gains. This work introduces HyperRouter, which dynamically generates the router's parameters through a fixed hypernetwork and trainable embeddings to achieve a balance between training the routers and freezing them to learn an improved routing policy. Extensive experiments across a wide range of tasks demonstrate the superior performance and efficiency gains of HyperRouter compared to existing routing methods. Our implementation is publicly available at https://github.com/giangdip2410/HyperRouter.

Submitted 12 December, 2023; originally announced December 2023.

arXiv:2312.06950 [pdf, other]

READ: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling

Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Khoi Le, Zhiyuan Hu, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

Abstract: Fully fine-tuning pretrained large-scale transformer models has become a popular paradigm for video-language modeling tasks, such as temporal language grounding and video-language summarization. With a growing number of tasks and limited training data, such a full fine-tuning approach leads to costly model storage and unstable training. To overcome these shortcomings, we introduce lightweight adapters to the pre-trained model and only update them at fine-tuning time. However, existing adapters fail to capture intrinsic temporal relations among video frames or textual words. Moreover, they neglect the preservation of critical task-related information that flows from the raw video-language input into the adapter's low-dimensional space. To address these issues, we first propose a novel REcurrent ADapter (READ) that employs recurrent computation to enable temporal modeling capability. Second, we propose a Partial Video-Language Alignment (PVLA) objective via the use of partial optimal transport to maintain task-related information flowing into our READ modules. We validate our READ framework through extensive experiments where READ significantly outperforms all existing fine-tuning strategies on multiple low-resource temporal language grounding and video-language summarization benchmarks. The code, model, and data have been made available at https://nguyentthong.github.io/READ.

Submitted 5 October, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: Accepted at AAAI 2024

arXiv:2311.18225 [pdf, other]

Harnessing graph state resources for robust quantum magnetometry under noise

Authors: Phu Trong Nguyen, Trung Kien Le, Hung Q. Nguyen, Le Bin Ho

Abstract: Precise measurement of magnetic fields is essential for various applications, such as fundamental physics, space exploration, and biophysics. Although recent progress in quantum engineering has assisted in creating advanced quantum magnetometers, there are still ongoing challenges in improving their efficiency and noise resistance. This study focuses on using symmetric graph state resources for quantum magnetometry to enhance measurement precision by analyzing the estimation theory under time-homogeneous and time-inhomogeneous noise models. The results show a significant improvement in estimating both single and multiple Larmor frequencies. In single Larmor frequency estimation, the quantum Fisher information spans a spectrum from the standard quantum limit to the Heisenberg limit within a periodic range of the Larmor frequency, and in the case of multiple Larmor frequencies, it can exceed the standard quantum limit for both noisy cases. This study highlights the potential of graph state-based methods for improving magnetic field measurements under noisy environments.

Submitted 3 September, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

Comments: 11 pages, 7 figures

Journal ref: Scientific Reports (2024)

arXiv:2311.13096 [pdf, ps, other]

R-Continuity with Applications to Convergence Analysis of Tikhonov Regularization and DC Programming

Authors: Ba Khiet Le

Abstract: In the paper, we study the convergence analysis of Tikhonov regularization in finding a zero of a maximal monotone operator using the notion of R-continuity. Applications to convex minimization and DC programming are provided.

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.13092 [pdf, other]

State-Dependent Sweeping Processes: Asymptotic Behavior and Algorithmic Approaches

Authors: Samir Adly, Monica G. Cojocaru, Ba Khiet Le

Abstract: In this paper, we investigate the asymptotic properties of a particular class of state-dependent sweeping processes. While extensive research has been conducted on the existence and uniqueness of solutions for sweeping processes, there is a scarcity of studies addressing their behavior in the limit of large time. Additionally, we introduce novel algorithms designed for the resolution of quasi-variational inequalities. As a result, we introduce a new derivative-free algorithm to find zeros of nonsmooth Lipschitz continuous mappings with a linear convergence rate. This algorithm can be effectively used in nonsmooth and nonconvex optimization problems whose data do not necessarily satisfy second-order differentiability conditions.

Submitted 21 November, 2023; originally announced November 2023.

MSC Class: 28B05; 34A36; 34A60; 49J52; 49J53; 93D20

arXiv:2311.12283 [pdf, other]

A new twist on modular links from an old perspective

Authors: Khanh Le

Abstract: We show that the complement of arithmetic modular links found in arXiv:2307.09409 is homeomorphic to the complement of augmented chainlinks. In particular, these link complements arise as n-fold cyclic covers of the Whitehead link complement.

Submitted 20 November, 2023; originally announced November 2023.

Comments: 8 pages. 7 figures. Comments are welcome!

arXiv:2310.19443 [pdf, other]

Asymptotically accurate and locking-free finite element implementation of first order shear deformation theory for plates

Authors: Khanh Chau Le, Hoang Giang Bui

Abstract: A formulation of the asymptotically exact first-order shear deformation theory for linear-elastic homogeneous plates in the rescaled coordinates and rotation angles is considered. This allows the development of its asymptotically accurate and shear-locking-free finite element implementation. As applications, numerical simulations are performed for circular and rectangular plates, showing complete agreement between the analytical solution and the numerical solutions based on two-dimensional theory and three-dimensional elasticity theory.

Submitted 16 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

Comments: 32 pages, 11 figures

arXiv:2309.16339 [pdf, ps, other]

A central limit theorem for the Euler method for SDEs with irregular drifts

Authors: Konstantinos Dareiotis, Máté Gerencsér, Khoa Lê

Abstract: The goal of this article is to establish a central limit theorem for the Euler-Maruyama scheme approximating multidimensional SDEs with elliptic Brownian diffusion, under very mild regularity requirements on the drift coefficients. When the drift is Hölder continuous, we show that the limiting law of the rescaled fluctuations around the true solution is characterised as the unique solution of a hybrid Young-Itô differential equation. When the drift has positive Sobolev regularity, this limit is characterised by the solution of a transformed SDE. Our result is an extension of the results of Jacod-Kurtz-Protter (1991, 1998) in which SDEs with differentiable coefficients were considered. To compensate for the lack of regularity of the drifts, we utilize the regularisation effect from the non-degenerate noise.

Submitted 28 September, 2023; originally announced September 2023.

MSC Class: 60H10; 60H50; 60H35

arXiv:2309.03506 [pdf, other]

Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis

Authors: Thanh-Huy Nguyen, Quang Hien Kha, Thai Ngoc Toan Truong, Ba Thinh Lam, Ba Hung Ngo, Quang Vinh Dinh, Nguyen Quoc Khanh Le

Abstract: In recent years, many mammographic image analysis methods have been introduced for improving cancer classification tasks. Two major issues of mammogram classification tasks are leveraging multi-view mammographic information and class-imbalance handling. For the first issue, many multi-view methods have been released for concatenating features of two or more views for the training and inference stage. Having said that, most existing multi-view methods are not explainable in terms of feature fusion, and treat all views equally for diagnosing. Our work aims to propose a simple but novel method for enhancing the examined view (main view) by leveraging low-level feature information from the auxiliary view (ipsilateral view) before learning the high-level feature that contains the cancerous features. For the second issue, we also propose a simple but novel malignant mammogram synthesis framework for upsampling minor class samples. Our easy-to-implement and no-training framework has eliminated the current limitations of the CutMix algorithm, namely unreliable synthesized images with randomly pasted patches, hard-contour problems, and domain shift problems. Our results on the VinDr-Mammo and CMMD datasets show the effectiveness of our two new frameworks for both multi-view training and synthesizing mammographic images, outperforming the previous conventional methods in our experimental settings.

Submitted 7 September, 2023; originally announced September 2023.

arXiv:2308.13798 [pdf, other]

DM-VTON: Distilled Mobile Real-time Virtual Try-On

Authors: Khoi-Nguyen Nguyen-Ngoc, Thanh-Tung Phan-Nguyen, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

Abstract: The fashion e-commerce industry has witnessed significant growth in recent years, prompting exploring image-based virtual try-on techniques to incorporate Augmented Reality (AR) experiences into online shopping platforms. However, existing research has primarily overlooked a crucial aspect - the runtime of the underlying machine-learning model. While existing methods prioritize enhancing output quality, they often disregard the execution time, which restricts their applications on a limited range of devices. To address this gap, we propose Distilled Mobile Real-time Virtual Try-On (DM-VTON), a novel virtual try-on framework designed to achieve simplicity and efficiency. Our approach is based on a knowledge distillation scheme that leverages a strong Teacher network as supervision to guide a Student network without relying on human parsing. Notably, we introduce an efficient Mobile Generative Module within the Student network, significantly reducing the runtime while ensuring high-quality output. Additionally, we propose Virtual Try-on-guided Pose for Data Synthesis to address the limited pose variation observed in training images. Experimental results show that the proposed method can achieve 40 frames per second on a single Nvidia Tesla T4 GPU and only take up 37 MB of memory while producing almost the same output quality as other state-of-the-art methods. DM-VTON stands poised to facilitate the advancement of real-time AR applications, in addition to the generation of lifelike attired human figures tailored for diverse specialized training tasks.
https://sites.google.com/view/ltnghia/research/DMVTON

Submitted 26 August, 2023; originally announced August 2023.

Comments: Accepted to ISMAR 2023 (Poster paper)

arXiv:2308.13795 [pdf, other]

VIDES: Virtual Interior Design via Natural Language and Visual Guidance

Authors: Minh-Hien Le, Chi-Bien Chu, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

Abstract: Interior design is crucial in creating aesthetically pleasing and functional indoor spaces. However, developing and editing interior design concepts requires significant time and expertise. We propose Virtual Interior DESign (VIDES) system in response to this challenge. Leveraging cutting-edge technology in generative AI, our system can assist users in generating and editing indoor scene concepts quickly, given user text description and visual guidance. Using both visual guidance and language as the conditional inputs significantly enhances the accuracy and coherence of the generated scenes, resulting in visually appealing designs. Through extensive experimentation, we demonstrate the effectiveness of VIDES in developing new indoor concepts, changing indoor styles, and replacing and removing interior objects. The system successfully captures the essence of users' descriptions while providing flexibility for customization. Consequently, this system can potentially reduce the entry barrier for indoor design, making it more accessible to users with limited technical skills and reducing the time required to create high-quality images. Individuals who have a background in design can now easily communicate their ideas visually and effectively present their design concepts. https://sites.google.com/view/ltnghia/research/VIDES

Submitted 26 August, 2023; originally announced August 2023.

Comments: Accepted to ISMAR 2023 (Poster paper)

arXiv:2307.13253 [pdf, other]

doi 10.1016/j.spa.2024.104443

A class of space-time discretizations for the stochastic $p$-Stokes system

Authors: Kim-Ngan Le, Jörn Wichmann

Abstract: The main objective of the present paper is to construct a new class of space-time discretizations for the stochastic $p$-Stokes system and analyze its stability and convergence properties. We derive regularity results for the approximation that are similar to the natural regularity of solutions. One of the key arguments relies on discrete extrapolation that allows to relate lower moments of discrete maximal processes. We show that, if the generic spatial discretization is constraint conforming, then the velocity approximation satisfies a best-approximation property in the natural distance. Moreover, we present an example such that the resulting velocity approximation converges with rate $1/2$ in time and $1$ in space towards the (unknown) target velocity with respect to the natural distance. The theory is corroborated by numerical experiments.

Submitted 5 August, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

Comments: 52 pages, 3 figures

MSC Class: 35K55; 35K65; 35K67; 35R60; 34K28; 65C30; 60H15

arXiv:2307.01844 [pdf, other]

Advancing Wound Filling Extraction on 3D Faces: Auto-Segmentation and Wound Face Regeneration Approach

Authors: Duong Q. Nguyen, Thinh D. Le, Phuong D. Nguyen, Nga T. K. Le, H. Nguyen-Xuan

Abstract: Facial wound segmentation plays a crucial role in preoperative planning and optimizing patient outcomes in various medical applications. In this paper, we propose an efficient approach for automating 3D facial wound segmentation using a two-stream graph convolutional network. Our method leverages the Cir3D-FaIR dataset and addresses the challenge of data imbalance through extensive experimentation with different loss functions. To achieve accurate segmentation, we conducted thorough experiments and selected a high-performing model from the trained models. The selected model demonstrates exceptional segmentation performance for complex 3D facial wounds. Furthermore, based on the segmentation model, we propose an improved approach for extracting 3D facial wound fillers and compare it to the results of the previous study. Our method achieved a remarkable accuracy of 0.9999986% on the test suite, surpassing the performance of the previous method. From this result, we use 3D printing technology to illustrate the shape of the wound filling. The outcomes of this study have significant implications for physicians involved in preoperative planning and intervention design. By automating facial wound segmentation and improving the accuracy of wound-filling extraction, our approach can assist in carefully assessing and optimizing interventions, leading to enhanced patient outcomes. Additionally, it contributes to advancing facial reconstruction techniques by utilizing machine learning and 3D bioprinting for printing skin tissue implants.
Our source code is available at https://github.com/SIMOGroup/WoundFilling3D.

Submitted 12 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

arXiv:2306.12948 [pdf, ps, other]

On a Conjecture of Gezmis and Pellarin

Authors: Khac Nhuan Le, Kien Huu Nguyen

Abstract: In 2022, Gezmis and Pellarin introduced and studied the concept of trivial multiple zeta values, along with a map from the vector space spanned by these values to the vector space spanned by Thakur's multiple zeta values. Their construction allows us to generate some linear relations among the latter values using the former. In our work, we determine the structure of the kernel of the aforementioned map. As a consequence, we give an answer to a conjecture proposed by Gezmis and Pellarin regarding the injectivity of this specific map.

Submitted 22 June, 2023; originally announced June 2023.

arXiv:2306.12668 [pdf, ps, other]

Numerical analysis of the stochastic Stefan problem

Authors: Jerome Droniou, Muhammad Awais Khan, Kim Ngan Le

Abstract: The gradient discretisation method (GDM) -- a generic framework encompassing many numerical methods -- is studied for a general stochastic Stefan problem with multiplicative noise. The convergence of the numerical solutions is proved by compactness method using discrete functional analysis tools, Skorohod theorem and the martingale representation theorem. The generic convergence results established in the GDM framework are applicable to a range of different numerical methods, including for example mass-lumped finite elements, but also some finite volume methods, mimetic methods, lowest-order virtual element methods, etc. Theoretical results are complemented by numerical tests based on two methods that fit in GDM framework.

Submitted 8 August, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

arXiv:2306.10320 [pdf, other]

Collapsing molecular clouds with tracer particles: Part II, Collapse Histories

Authors: David C. Collins, Dan K. Le, Luz L. Jimenez Vela

Abstract: In order to develop a complete theory of star formation, one essentially needs to know two things: what collapses, and how long it takes. This is the second paper in a series, where we query how long a parcel of gas takes to collapse and the process it undergoes. We embed pseudo-Lagrangian tracer particles in simulations of collapsing molecular clouds, identify the particles that end in dense knots, and then examine the collapse history of the gas. We find a nearly universal behavior of cruise-then-collapse, wherein a core stays at intermediate densities for a significant fraction of its life before finally collapsing. We identify time immediately before each core collapses, $t_{\rm{sing}}$, and examine how it transitions to high density. We find that the time to collapse is uniformly distributed between $0.25 t_{\rm{ff}}$ and the end of the simulation at $\sim 1 t_{\rm{ff}}$, and that the duration of collapse is universally short, $Δt \sim 0.1 t_{\rm{ff}}$, where $t_{\rm{ff}}$ is the free-fall time at the mean density. We describe the collapse in three stages; collection, hardening, and singularity. Collection sweeps low density gas into moderate density. Hardening brings kinetic and gravitational energies into quasi-equipartition. Singularity is the free-fall collapse, forming an envelope in rough energy balance and central over density in $\sim 0.1 t_{\rm{ff}}$.

Submitted 12 June, 2024; v1 submitted 17 June, 2023; originally announced June 2023.

arXiv:2305.08289 [pdf, other]

Variational quantum metrology for multiparameter estimation under dephasing noise

Authors: Trung Kien Le, Hung Q. Nguyen, Le Bin Ho

Abstract: We present a hybrid quantum-classical variational scheme to enhance precision in quantum metrology. In the scheme, both the initial state and the measurement basis in the quantum part are parameterized and optimized via the classical part. It enables the maximization of information gained about the measured quantity. We discuss specific applications to 3D magnetic field sensing under several dephasing noise modes. Indeed, we demonstrate its ability to simultaneously estimate all parameters and surpass the standard quantum limit, making it a powerful tool for metrological applications.

Submitted 12 October, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

Comments: 8 pages, 8 figures

Journal ref: Scientific Reports (2023)

arXiv:2304.08274 [pdf, other]

An asymptotically exact first-order shear deformation theory for functionally graded plates

Authors: Khanh Chau Le

Abstract: An asymptotically exact first-order shear deformation theory for functionally graded elastic plates is derived using the variational-asymptotic method. As an application, an analytical solution to the problem of wave propagation in a sandwich plate is found in accordance with this refined theory. Comparison between the dispersion curves obtained by 2-D plate theory and 3-D elasticity theory reveals that the former is accurate up to the order of h^2/l^2, where h is the plate thickness and l the wavelength.

Submitted 5 May, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

Comments: 27 pages, 4 figures

arXiv:2304.06802 [pdf, other]

Path-by-path uniqueness for stochastic differential equations under Krylov-Röckner condition

Authors: Lukas Anzeletti, Khoa Lê, Chengcheng Ling

Abstract: We show that any stochastic differential equation (SDE) driven by Brownian motion with drift satisfying the Krylov-Röckner condition has exactly one solution in an ordinary sense for almost every trajectory of the Brownian motion. Additionally, we show that such SDE is strongly complete, i.e. for almost every trajectory of the Brownian motion, the family of solutions with different initial data forms a continuous semiflow for all nonnegative times.

Submitted 13 April, 2023; originally announced April 2023.

MSC Class: 60H10; 60H50; 60J60

arXiv:2304.06053 [pdf, other]

TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval

Authors: Trung-Nghia Le, Tam V. Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, Viet-Tham Huynh, Trong-Le Do, Khanh-Duy Le, Mai-Khiem Tran, Nhat Hoang-Xuan, Thang-Long Nguyen-Ho, Vinh-Tiep Nguyen, Tuong-Nghiem Diep, Khanh-Duy Ho, Xuan-Hieu Nguyen, Thien-Phuc Tran, Tuan-Anh Yang, Kim-Phat Tran, Nhu-Vinh Hoang, Minh-Quang Nguyen, E-Ro Nguyen, Minh-Khoi Nguyen-Nhat, Tuan-An To, Trung-Truc Huynh-Le, Nham-Tan Nguyen, Hoang-Chau Luong, et al. (8 additional authors not shown)

Abstract: 3D object retrieval is an important yet challenging task that has drawn more and more attention in recent years. While existing approaches have made strides in addressing this issue, they are often limited to restricted settings such as image and sketch queries, which are often unfriendly interactions for common users. In order to overcome these limitations, this paper presents a novel SHREC challenge track focusing on text-based fine-grained retrieval of 3D animal models. Unlike previous SHREC challenge tracks, the proposed task is considerably more challenging, requiring participants to develop innovative approaches to tackle the problem of text-based retrieval. Despite the increased difficulty, we believe this task can potentially drive useful applications in practice and facilitate more intuitive interactions with 3D objects. Five groups participated in our competition, submitting a total of 114 runs. While the results obtained in our competition are satisfactory, we note that the challenges presented by this task are far from fully solved. As such, we provide insights into potential areas for future research and improvements. We believe we can help push the boundaries of 3D object retrieval and facilitate more user-friendly interactions via vision-language technologies. https://aichallenge.hcmus.edu.vn/textanimar

Submitted 9 August, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

Comments: Accepted to Computers and Graphics (3DOR, Journal Track)

arXiv:2304.05731 [pdf, other]

SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval

Authors: Trung-Nghia Le, Tam V. Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, Viet-Tham Huynh, Trong-Le Do, Khanh-Duy Le, Mai-Khiem Tran, Nhat Hoang-Xuan, Thang-Long Nguyen-Ho, Vinh-Tiep Nguyen, Nhat-Quynh Le-Pham, Huu-Phuc Pham, Trong-Vu Hoang, Quang-Binh Nguyen, Trong-Hieu Nguyen-Mau, Tuan-Luc Huynh, Thanh-Danh Le, Ngoc-Linh Nguyen-Ha, Tuong-Vy Truong-Thuy, Truong Hoai Phong, Tuong-Nghiem Diep, Khanh-Duy Ho, Xuan-Hieu Nguyen, Thien-Phuc Tran, et al. (9 additional authors not shown)

Abstract: The retrieval of 3D objects has gained significant importance in recent years due to its broad range of applications in computer vision, computer graphics, virtual reality, and augmented reality. However, the retrieval of 3D objects presents significant challenges due to the intricate nature of 3D models, which can vary in shape, size, and texture, and have numerous polygons and vertices. To this end, we introduce a novel SHREC challenge track that focuses on retrieving relevant 3D animal models from a dataset using sketch queries and expedites accessing 3D models through available sketches. Furthermore, a new dataset named ANIMAR was constructed in this study, comprising a collection of 711 unique 3D animal models and 140 corresponding sketch queries. Our contest requires participants to retrieve 3D models based on complex and detailed sketches. We receive satisfactory results from eight teams and 204 runs. Although further improvement is necessary, the proposed task has the potential to incentivize additional research in the domain of 3D object retrieval, potentially yielding benefits for a wide range of applications. We also provide insights into potential areas of future research, such as improving techniques for feature extraction and matching and creating more diverse datasets to evaluate retrieval performance. https://aichallenge.hcmus.edu.vn/sketchanimar

Submitted 9 August, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

Comments: Accepted to Computers & Graphics (3DOR 2023, Journal track)

Showing 1–50 of 174 results for author: Le, K