-
Utilizing Large Language Models in An Iterative Paradigm with Domain Feedback for Molecule Optimization
Authors:
Khiem Le,
Nitesh V. Chawla
Abstract:
Molecule optimization is a critical task in drug discovery to optimize desired properties of a given molecule through chemical modification. Despite Large Language Models (LLMs) holding the potential to efficiently simulate this task by using natural language to direct the optimization, straightforwardly utilizing shows limited performance. In this work, we facilitate utilizing LLMs in an iterativ…
▽ More
Molecule optimization is a critical task in drug discovery to optimize desired properties of a given molecule through chemical modification. Despite Large Language Models (LLMs) holding the potential to efficiently simulate this task by using natural language to direct the optimization, straightforwardly utilizing shows limited performance. In this work, we facilitate utilizing LLMs in an iterative paradigm by proposing a simple yet highly effective domain feedback provider, namely $\text{Re}^2$DF. In detail, $\text{Re}^2$DF harnesses an external toolkit, RDKit, to handle the molecule hallucination, if the modified molecule is chemically invalid. Otherwise, its desired properties are computed and compared to the original one, establishing reliable domain feedback with correct direction and distance towards the objective, followed by a retrieved example, to explicitly guide the LLM to refine the modified molecule. We conduct experiments across both single- and multi-property objectives with 2 thresholds, where $\text{Re}^2$DF shows significant improvements. Particularly, for 20 single-property objectives, $\text{Re}^2$DF enhances Hit ratio by 16.95% and 20.76% under loose and strict thresholds, respectively. For 32 multi-property objectives, $\text{Re}^2$DF enhances Hit ratio by 6.04% and 5.25%.
△ Less
Submitted 20 October, 2024; v1 submitted 16 October, 2024;
originally announced October 2024.
-
Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition
Authors:
Kha Nhat Le,
Hoang-Tuan Nguyen,
Hung Tien Tran,
Thanh Duc Ngo
Abstract:
Unsupervised domain adaptation (UDA) has become increasingly prevalent in scene text recognition (STR), especially where training and testing data reside in different domains. The efficacy of existing UDA approaches tends to degrade when there is a large gap between the source and target domains. To deal with this problem, gradually shifting or progressively learning to shift from domain to domain…
▽ More
Unsupervised domain adaptation (UDA) has become increasingly prevalent in scene text recognition (STR), especially where training and testing data reside in different domains. The efficacy of existing UDA approaches tends to degrade when there is a large gap between the source and target domains. To deal with this problem, gradually shifting or progressively learning to shift from domain to domain is the key issue. In this paper, we introduce the Stratified Domain Adaptation (StrDA) approach, which examines the gradual escalation of the domain gap for the learning process. The objective is to partition the training data into subsets so that the progressively self-trained model can adapt to gradual changes. We stratify the training data by evaluating the proximity of each data sample to both the source and target domains. We propose a novel method for employing domain discriminators to estimate the out-of-distribution and domain discriminative levels of data samples. Extensive experiments on benchmark scene-text datasets show that our approach significantly improves the performance of baseline (source-trained) STR models.
△ Less
Submitted 17 October, 2024; v1 submitted 13 October, 2024;
originally announced October 2024.
-
Analytically weak and mild solutions to stochastic heat equation with irregular drift
Authors:
Siva Athreya,
Oleg Butkovsky,
Khoa Lê,
Leonid Mytnik
Abstract:
Consider the stochastic heat equation \begin{equation*} \partial_t u_t(x)=\frac12 \partial^2_{xx}u_t(x) +b(u_t(x))+\dot{W}_{t}(x),\quad t\in(0,T],\, x\in [0,1], \end{equation*} where $b$ is a generalized function, and $\dot W$ is space-time white noise on ${\mathbb R}_+\times[0,1]$. If the drift $b$ is a sufficiently regular function, then it is well-known that any analytically weak solution to th…
▽ More
Consider the stochastic heat equation \begin{equation*} \partial_t u_t(x)=\frac12 \partial^2_{xx}u_t(x) +b(u_t(x))+\dot{W}_{t}(x),\quad t\in(0,T],\, x\in [0,1], \end{equation*} where $b$ is a generalized function, and $\dot W$ is space-time white noise on ${\mathbb R}_+\times[0,1]$. If the drift $b$ is a sufficiently regular function, then it is well-known that any analytically weak solution to this equation is also analytically mild, and vice versa. We extend this result to drifts that are generalized functions, with an appropriate adaptation of the notions of mild and weak solutions. As a corollary of our results, we show that for $b\in L_p({\mathbb R})$, $p\ge1$, this equation has a unique analytically weak and mild solution, thus extending the classical results of Gyöngy and Pardoux (1993).
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
Towards Layer-Wise Personalized Federated Learning: Adaptive Layer Disentanglement via Conflicting Gradients
Authors:
Minh Duong Nguyen,
Khanh Le,
Khoi Do,
Nguyen H. Tran,
Duc Nguyen,
Chien Trinh,
Zhaohui Yang
Abstract:
In personalized Federated Learning (pFL), high data heterogeneity can cause significant gradient divergence across devices, adversely affecting the learning process. This divergence, especially when gradients from different users form an obtuse angle during aggregation, can negate progress, leading to severe weight and gradient update degradation. To address this issue, we introduce a new approach…
▽ More
In personalized Federated Learning (pFL), high data heterogeneity can cause significant gradient divergence across devices, adversely affecting the learning process. This divergence, especially when gradients from different users form an obtuse angle during aggregation, can negate progress, leading to severe weight and gradient update degradation. To address this issue, we introduce a new approach to pFL design, namely Federated Learning with Layer-wise Aggregation via Gradient Analysis (FedLAG), utilizing the concept of gradient conflict at the layer level. Specifically, when layer-wise gradients of different clients form acute angles, those gradients align in the same direction, enabling updates across different clients toward identifying client-invariant features. Conversely, when layer-wise gradient pairs make create obtuse angles, the layers tend to focus on client-specific tasks. In hindsights, FedLAG assigns layers for personalization based on the extent of layer-wise gradient conflicts. Specifically, layers with gradient conflicts are excluded from the global aggregation process. The theoretical evaluation demonstrates that when integrated into other pFL baselines, FedLAG enhances pFL performance by a certain margin. Therefore, our proposed method achieves superior convergence behavior compared with other baselines. Extensive experiments show that our FedLAG outperforms several state-of-the-art methods and can be easily incorporated with many existing methods to further enhance performance.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile gloves
Authors:
Arvin Tashakori,
Zenan Jiang,
Amir Servati,
Saeid Soltanian,
Harishkumar Narayana,
Katherine Le,
Caroline Nakayama,
Chieh-ling Yang,
Z. Jane Wang,
Janice J. Eng,
Peyman Servati
Abstract:
Accurate real-time tracking of dexterous hand movements and interactions has numerous applications in human-computer interaction, metaverse, robotics, and tele-health. Capturing realistic hand movements is challenging because of the large number of articulations and degrees of freedom. Here, we report accurate and dynamic tracking of articulated hand and finger movements using stretchable, washabl…
▽ More
Accurate real-time tracking of dexterous hand movements and interactions has numerous applications in human-computer interaction, metaverse, robotics, and tele-health. Capturing realistic hand movements is challenging because of the large number of articulations and degrees of freedom. Here, we report accurate and dynamic tracking of articulated hand and finger movements using stretchable, washable smart gloves with embedded helical sensor yarns and inertial measurement units. The sensor yarns have a high dynamic range, responding to low 0.005 % to high 155 % strains, and show stability during extensive use and washing cycles. We use multi-stage machine learning to report average joint angle estimation root mean square errors of 1.21 and 1.45 degrees for intra- and inter-subjects cross-validation, respectively, matching accuracy of costly motion capture cameras without occlusion or field of view limitations. We report a data augmentation technique that enhances robustness to noise and variations of sensors. We demonstrate accurate tracking of dexterous hand movements during object interactions, opening new avenues of applications including accurate typing on a mock paper keyboard, recognition of complex dynamic and static gestures adapted from American Sign Language and object identification.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
Regularisation by multiplicative noise for reaction-diffusion equations
Authors:
Konstantinos Dareiotis,
Teodor Holland,
Khoa Lê
Abstract:
We consider the stochastic reaction-diffusion equation in $1+1$ dimensions driven by multiplicative space-time white noise, with a distributional drift belonging to a Besov-Hölder space with any regularity index larger than $-1$. We assume that the diffusion coefficient is a regular function which is bounded away from zero. By using a combination of stochastic sewing techniques and Malliavin calcu…
▽ More
We consider the stochastic reaction-diffusion equation in $1+1$ dimensions driven by multiplicative space-time white noise, with a distributional drift belonging to a Besov-Hölder space with any regularity index larger than $-1$. We assume that the diffusion coefficient is a regular function which is bounded away from zero. By using a combination of stochastic sewing techniques and Malliavin calculus, we show that the equation admits a unique solution.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Quantitative approximation of stochastic kinetic equations: from discrete to continuum
Authors:
Zimo Hao,
Khoa Lê,
Chengcheng Ling
Abstract:
We study the convergence of a generic tamed Euler-Maruyama (EM) scheme for the kinetic type stochastic differential equations (SDEs) (also known as second order SDEs) with singular coefficients in both weak and strong probabilistic senses. We show that when the drift exhibits a relatively low regularity compared to the state of the art, the singular system is well-defined both in the weak and stro…
▽ More
We study the convergence of a generic tamed Euler-Maruyama (EM) scheme for the kinetic type stochastic differential equations (SDEs) (also known as second order SDEs) with singular coefficients in both weak and strong probabilistic senses. We show that when the drift exhibits a relatively low regularity compared to the state of the art, the singular system is well-defined both in the weak and strong probabilistic senses. Meanwhile, the corresponding tamed EM scheme is shown to converge at the rate of 1/2 in both the weak and the strong senses.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Explicit Convergence Rate of The Proximal Point Algorithm under R-Continuity
Authors:
Ba Khiet Le,
Michel Théra
Abstract:
The paper provides a thorough comparison between R-continuity and other fundamental tools in optimization such as metric regularity, metric subregularity and calmness. We show that R-continuity has some advantages in the convergence rate analysis of algorithms solving optimization problems. We also present some properties of R-continuity and study the explicit convergence rate of the Proximal Poin…
▽ More
The paper provides a thorough comparison between R-continuity and other fundamental tools in optimization such as metric regularity, metric subregularity and calmness. We show that R-continuity has some advantages in the convergence rate analysis of algorithms solving optimization problems. We also present some properties of R-continuity and study the explicit convergence rate of the Proximal Point Algorithm (PPA) under the R-continuity.
△ Less
Submitted 17 August, 2024;
originally announced August 2024.
-
A propensity score weighting approach to integrate aggregated data in random-effect individual-level data meta-analysis
Authors:
Tran Trong Khoi Le,
Sivem Afach,
Tat-Thang Vo
Abstract:
In evidence synthesis, collecting individual participant data (IPD) across eligible studies is the most reliable way to investigate the treatment effects in different subgroups defined by participant characteristics. Nonetheless, access to all IPD from all studies might be very challenging due to privacy concerns. To overcome this, many approaches such as multilevel modeling have been proposed to…
▽ More
In evidence synthesis, collecting individual participant data (IPD) across eligible studies is the most reliable way to investigate the treatment effects in different subgroups defined by participant characteristics. Nonetheless, access to all IPD from all studies might be very challenging due to privacy concerns. To overcome this, many approaches such as multilevel modeling have been proposed to incorporate the vast amount of aggregated data from the literature into IPD meta-analysis. These methods, however, often rely on specifying separate models for trial-level versus patient-level data, which likely suffers from ecological bias when there are non-linearities in the outcome generating mechanism. In this paper, we introduce a novel method to combine aggregated data and IPD in meta-analysis that is free from ecological bias. The proposed approach relies on modeling the study membership given covariates, then using inverse weighting to estimate the trial-specific coefficients in the individual-level outcome model of studies without IPD accessible. The weights derived from this approach also shed insights on the similarity in the case-mix across studies, which is useful to assess whether eligible trials are sufficiently similar to be meta-analyzed. We evaluate the proposed method by synthetic data, then apply it to a real-world meta-analysis comparing the chance of response between guselkumab and adalimumab among patients with psoriasis.
△ Less
Submitted 9 August, 2024;
originally announced August 2024.
-
MAMA: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning
Authors:
Thong Nguyen,
Yi Bin,
Xiaobao Wu,
Xinshuai Dong,
Zhiyuan Hu,
Khoi Le,
Cong-Duy Nguyen,
See-Kiong Ng,
Luu Anh Tuan
Abstract:
Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering t…
▽ More
Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering the downstream performance across unpopular subjects. To address these problems, we propose MAMA, a new approach to learning video-language representations by utilizing a contrastive objective with a subtractive angular margin to regularize cross-modal representations in their effort to reach perfect similarity. Furthermore, to adapt to the non-uniform concept distribution, MAMA utilizes a multi-layer perceptron (MLP)-parameterized weighting function that maps loss values to sample weights which enable dynamic adjustment of the model's focus throughout the training. With the training guided by a small amount of unbiased meta-data and augmented by video-text data generated by large vision-language model, MAMA improves video-language representations and achieve superior performances on commonly used video question answering and text-video retrieval datasets. The code, model, and data have been made available at https://nguyentthong.github.io/MAMA.
△ Less
Submitted 9 October, 2024; v1 submitted 4 July, 2024;
originally announced July 2024.
-
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
Authors:
Trinh Pham,
Khoi M. Le,
Luu Anh Tuan
Abstract:
In this paper, we introduce UniBridge (Cross-Lingual Transfer Learning with Optimized Embeddings and Vocabulary), a comprehensive approach developed to improve the effectiveness of Cross-Lingual Transfer Learning, particularly in languages with limited resources. Our approach tackles two essential elements of a language model: the initialization of embeddings and the optimal vocabulary size. Speci…
▽ More
In this paper, we introduce UniBridge (Cross-Lingual Transfer Learning with Optimized Embeddings and Vocabulary), a comprehensive approach developed to improve the effectiveness of Cross-Lingual Transfer Learning, particularly in languages with limited resources. Our approach tackles two essential elements of a language model: the initialization of embeddings and the optimal vocabulary size. Specifically, we propose a novel embedding initialization method that leverages both lexical and semantic alignment for a language. In addition, we present a method for systematically searching for the optimal vocabulary size, ensuring a balance between model complexity and linguistic coverage. Our experiments across multilingual datasets show that our approach greatly improves the F1-Score in several languages. UniBridge is a robust and adaptable solution for cross-lingual systems in various languages, highlighting the significance of initializing embeddings and choosing the right vocabulary size in cross-lingual environments.
△ Less
Submitted 20 August, 2024; v1 submitted 14 June, 2024;
originally announced June 2024.
-
MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension
Authors:
Khiem Le,
Zhichun Guo,
Kaiwen Dong,
Xiaobao Huang,
Bozhao Nan,
Roshni Iyer,
Xiangliang Zhang,
Olaf Wiest,
Wei Wang,
Nitesh V. Chawla
Abstract:
Large Language Models (LLMs) with their strong task-handling capabilities have shown remarkable advancements across a spectrum of fields, moving beyond natural language understanding. However, their proficiency within the chemistry domain remains restricted, especially in solving professional molecule-related tasks. This challenge is attributed to their inherent limitations in comprehending molecu…
▽ More
Large Language Models (LLMs) with their strong task-handling capabilities have shown remarkable advancements across a spectrum of fields, moving beyond natural language understanding. However, their proficiency within the chemistry domain remains restricted, especially in solving professional molecule-related tasks. This challenge is attributed to their inherent limitations in comprehending molecules using only common textual representations, i.e., SMILES strings. In this study, we seek to enhance the ability of LLMs to comprehend molecules by equipping them with a multi-modal external module, namely MolX. In particular, instead of directly using a SMILES string to represent a molecule, we utilize specific encoders to extract fine-grained features from both SMILES string and 2D molecular graph representations for feeding into an LLM. Moreover, a handcrafted molecular fingerprint is incorporated to leverage its embedded domain knowledge. Then, to establish an alignment between MolX and the LLM's textual input space, the whole model in which the LLM is frozen, is pre-trained with a versatile strategy including a diverse set of tasks. Experimental evaluations show that our proposed method outperforms baselines across 4 downstream molecule-related tasks ranging from molecule-to-text translation to retrosynthesis, with and without fine-tuning the LLM, while only introducing a small number of trainable parameters 0.53% and 0.82%, respectively.
△ Less
Submitted 21 August, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
The Landau--Lifshitz--Bloch equation: Unique existence and finite element approximation
Authors:
Kim-Ngan Le,
Agus L. Soenjaya,
Thanh Tran
Abstract:
The Landau--Lifshitz--Bloch equation (LLBE) describes the evolution of magnetic spin field in a ferromagnet at high temperatures. We consider a viscous (pseudo-parabolic) regularisation of the LLBE for temperatures higher than the Curie temperature, which we call the $ε$-LLBE. Variants of the $ε$-LLBE are applicable to model pattern formation, phase transition, and heat conduction for non-simple m…
▽ More
The Landau--Lifshitz--Bloch equation (LLBE) describes the evolution of magnetic spin field in a ferromagnet at high temperatures. We consider a viscous (pseudo-parabolic) regularisation of the LLBE for temperatures higher than the Curie temperature, which we call the $ε$-LLBE. Variants of the $ε$-LLBE are applicable to model pattern formation, phase transition, and heat conduction for non-simple materials, among other things. In this paper, we show well-posedness of the $ε$-LLBE and the convergence of the solution $\boldsymbol{u}^ε$ of the regularised equation to the solution $\boldsymbol{u}$ of the LLBE as $ε\to 0^+$. As a by-product of our analysis, we show the existence and uniqueness of regular solution to the LLBE for temperatures higher than the Curie temperature. Furthermore, we propose a linear fully discrete conforming finite element scheme to approximate the solution of the $ε$-LLBE. Error analysis is performed to show unconditional stability and optimal uniform-in-time convergence rate for the schemes. Several numerical simulations corroborate our theoretical results.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Take a Step Further: Understanding Page Spray in Linux Kernel Exploitation
Authors:
Ziyi Guo,
Dang K Le,
Zhenpeng Lin,
Kyle Zeng,
Ruoyu Wang,
Tiffany Bao,
Yan Shoshitaishvili,
Adam Doupé,
Xinyu Xing
Abstract:
Recently, a novel method known as Page Spray emerges, focusing on page-level exploitation for kernel vulnerabilities. Despite the advantages it offers in terms of exploitability, stability, and compatibility, comprehensive research on Page Spray remains scarce. Questions regarding its root causes, exploitation model, comparative benefits over other exploitation techniques, and possible mitigation…
▽ More
Recently, a novel method known as Page Spray emerges, focusing on page-level exploitation for kernel vulnerabilities. Despite the advantages it offers in terms of exploitability, stability, and compatibility, comprehensive research on Page Spray remains scarce. Questions regarding its root causes, exploitation model, comparative benefits over other exploitation techniques, and possible mitigation strategies have largely remained unanswered. In this paper, we conduct a systematic investigation into Page Spray, providing an in-depth understanding of this exploitation technique. We introduce a comprehensive exploit model termed the \sys model, elucidating its fundamental principles. Additionally, we conduct a thorough analysis of the root causes underlying Page Spray occurrences within the Linux Kernel. We design an analyzer based on the Page Spray analysis model to identify Page Spray callsites. Subsequently, we evaluate the stability, exploitability, and compatibility of Page Spray through meticulously designed experiments. Finally, we propose mitigation principles for addressing Page Spray and introduce our own lightweight mitigation approach. This research aims to assist security researchers and developers in gaining insights into Page Spray, ultimately enhancing our collective understanding of this emerging exploitation technique and making improvements to the community.
△ Less
Submitted 6 June, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Exploring the Practicality of Federated Learning: A Survey Towards the Communication Perspective
Authors:
Khiem Le,
Nhan Luong-Ha,
Manh Nguyen-Duc,
Danh Le-Phuoc,
Cuong Do,
Kok-Seng Wong
Abstract:
Federated Learning (FL) is a promising paradigm that offers significant advancements in privacy-preserving, decentralized machine learning by enabling collaborative training of models across distributed devices without centralizing data. However, the practical deployment of FL systems faces a significant bottleneck: the communication overhead caused by frequently exchanging large model updates bet…
▽ More
Federated Learning (FL) is a promising paradigm that offers significant advancements in privacy-preserving, decentralized machine learning by enabling collaborative training of models across distributed devices without centralizing data. However, the practical deployment of FL systems faces a significant bottleneck: the communication overhead caused by frequently exchanging large model updates between numerous devices and a central server. This communication inefficiency can hinder training speed, model performance, and the overall feasibility of real-world FL applications. In this survey, we investigate various strategies and advancements made in communication-efficient FL, highlighting their impact and potential to overcome the communication challenges inherent in FL systems. Specifically, we define measures for communication efficiency, analyze sources of communication inefficiency in FL systems, and provide a taxonomy and comprehensive review of state-of-the-art communication-efficient FL methods. Additionally, we discuss promising future research directions for enhancing the communication efficiency of FL systems. By addressing the communication bottleneck, FL can be effectively applied and enable scalable and practical deployment across diverse applications that require privacy-preserving, decentralized machine learning, such as IoT, healthcare, or finance.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Nudging Users to Change Breached Passwords Using the Protection Motivation Theory
Authors:
Yixin Zou,
Khue Le,
Peter Mayer,
Alessandro Acquisti,
Adam J. Aviv,
Florian Schaub
Abstract:
We draw on the Protection Motivation Theory (PMT) to design nudges that encourage users to change breached passwords. Our online experiment ($n$=$1,386$) compared the effectiveness of a threat appeal (highlighting negative consequences of breached passwords) and a coping appeal (providing instructions on how to change the breached password) in a 2x2 factorial design. Compared to the control condit…
▽ More
We draw on the Protection Motivation Theory (PMT) to design nudges that encourage users to change breached passwords. Our online experiment ($n$=$1,386$) compared the effectiveness of a threat appeal (highlighting negative consequences of breached passwords) and a coping appeal (providing instructions on how to change the breached password) in a 2x2 factorial design. Compared to the control condition, participants receiving the threat appeal were more likely to intend to change their passwords, and participants receiving both appeals were more likely to end up changing their passwords; both comparisons have a small effect size. Participants' password change behaviors are further associated with other factors such as their security attitudes (SA-6) and time passed since the breach, suggesting that PMT-based nudges are useful but insufficient to fully motivate users to change their passwords. Our study contributes to PMT's application in security research and provides concrete design implications for improving compromised credential notifications.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Algorithmic aspects of left-orderings of solvable Baumslag--Solitar groups via its dynamical realization
Authors:
Meng-Che "Turbo" Ho,
Khanh Le,
Dino Rossegger
Abstract:
We answer a question of Calderoni and Clay by showing that the conjugation equivalence relation of left orderings of the Baumslag-Solitar groups $\mathrm{BS}(1,n)$ is hyperfinite for any $n$. Our proof relies on a classification of $\mathrm{BS}(1,n)$'s left-orderings via its one-dimensional dynamical realizations. We furthermore use the effectiveness of the dynamical realizations of…
▽ More
We answer a question of Calderoni and Clay by showing that the conjugation equivalence relation of left orderings of the Baumslag-Solitar groups $\mathrm{BS}(1,n)$ is hyperfinite for any $n$. Our proof relies on a classification of $\mathrm{BS}(1,n)$'s left-orderings via its one-dimensional dynamical realizations. We furthermore use the effectiveness of the dynamical realizations of $\mathrm{BS}(1,n)$ to study algorithmic properties of the left-orderings on $\mathrm{BS}(1,n)$.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Efficiently Assemble Normalization Layers and Regularization for Federated Domain Generalization
Authors:
Khiem Le,
Long Ho,
Cuong Do,
Danh Le-Phuoc,
Kok-Seng Wong
Abstract:
Domain shift is a formidable issue in Machine Learning that causes a model to suffer from performance degradation when tested on unseen domains. Federated Domain Generalization (FedDG) attempts to train a global model using collaborative clients in a privacy-preserving manner that can generalize well to unseen clients possibly with domain shift. However, most existing FedDG methods either cause ad…
▽ More
Domain shift is a formidable issue in Machine Learning that causes a model to suffer from performance degradation when tested on unseen domains. Federated Domain Generalization (FedDG) attempts to train a global model using collaborative clients in a privacy-preserving manner that can generalize well to unseen clients possibly with domain shift. However, most existing FedDG methods either cause additional privacy risks of data leakage or induce significant costs in client communication and computation, which are major concerns in the Federated Learning paradigm. To circumvent these challenges, here we introduce a novel architectural method for FedDG, namely gPerXAN, which relies on a normalization scheme working with a guiding regularizer. In particular, we carefully design Personalized eXplicitly Assembled Normalization to enforce client models selectively filtering domain-specific features that are biased towards local data while retaining discrimination of those features. Then, we incorporate a simple yet effective regularizer to guide these models in directly capturing domain-invariant representations that the global model's classifier can leverage. Extensive experimental results on two benchmark datasets, i.e., PACS and Office-Home, and a real-world medical dataset, Camelyon17, indicate that our proposed method outperforms other existing methods in addressing this particular problem.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
A Study of Vulnerability Repair in JavaScript Programs with Large Language Models
Authors:
Tan Khang Le,
Saba Alimadadi,
Steven Y. Ko
Abstract:
In recent years, JavaScript has become the most widely used programming language, especially in web development. However, writing secure JavaScript code is not trivial, and programmers often make mistakes that lead to security vulnerabilities in web applications. Large Language Models (LLMs) have demonstrated substantial advancements across multiple domains, and their evolving capabilities indicat…
▽ More
In recent years, JavaScript has become the most widely used programming language, especially in web development. However, writing secure JavaScript code is not trivial, and programmers often make mistakes that lead to security vulnerabilities in web applications. Large Language Models (LLMs) have demonstrated substantial advancements across multiple domains, and their evolving capabilities indicate their potential for automatic code generation based on a required specification, including automatic bug fixing. In this study, we explore the accuracy of LLMs, namely ChatGPT and Bard, in finding and fixing security vulnerabilities in JavaScript programs. We also investigate the impact of context in a prompt on directing LLMs to produce a correct patch of vulnerable JavaScript code. Our experiments on real-world software vulnerabilities show that while LLMs are promising in automatic program repair of JavaScript code, achieving a correct bug fix often requires an appropriate amount of context in the prompt.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
ARtVista: Gateway To Empower Anyone Into Artist
Authors:
Trong-Vu Hoang,
Quang-Binh Nguyen,
Duy-Nam Ly,
Khanh-Duy Le,
Tam V. Nguyen,
Minh-Triet Tran,
Trung-Nghia Le
Abstract:
Drawing is an art that enables people to express their imagination and emotions. However, individuals usually face challenges in drawing, especially when translating conceptual ideas into visually coherent representations and bridging the gap between mental visualization and practical execution. In response, we propose ARtVista - a novel system integrating AR and generative AI technologies. ARtVis…
▽ More
Drawing is an art that enables people to express their imagination and emotions. However, individuals usually face challenges in drawing, especially when translating conceptual ideas into visually coherent representations and bridging the gap between mental visualization and practical execution. In response, we propose ARtVista - a novel system integrating AR and generative AI technologies. ARtVista not only recommends reference images aligned with users' abstract ideas and generates sketches for users to draw but also goes beyond, crafting vibrant paintings in various painting styles. ARtVista also offers users an alternative approach to create striking paintings by simulating the paint-by-number concept on reference images, empowering users to create visually stunning artwork devoid of the necessity for advanced drawing skills. We perform a pilot study and reveal positive feedback on its usability, emphasizing its effectiveness in visualizing user ideas and aiding the painting process to achieve stunning pictures without requiring advanced drawing skills. The source code will be available at https://github.com/htrvu/ARtVista.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer
Authors:
Dinh-Khoi Vo,
Duy-Nam Ly,
Khanh-Duy Le,
Tam V. Nguyen,
Minh-Triet Tran,
Trung-Nghia Le
Abstract:
Creating thematic collections in industries demands innovative designs and cohesive concepts. Designers may face challenges in maintaining thematic consistency when drawing inspiration from existing objects, landscapes, or artifacts. While AI-powered graphic design tools offer help, they often fail to generate cohesive sets based on specific thematic concepts. In response, we introduce iCONTRA, an…
▽ More
Creating thematic collections in industries demands innovative designs and cohesive concepts. Designers may face challenges in maintaining thematic consistency when drawing inspiration from existing objects, landscapes, or artifacts. While AI-powered graphic design tools offer help, they often fail to generate cohesive sets based on specific thematic concepts. In response, we introduce iCONTRA, an interactive CONcept TRAnsfer system. With a user-friendly interface, iCONTRA enables both experienced designers and novices to effortlessly explore creative design concepts and efficiently generate thematic collections. We also propose a zero-shot image editing algorithm, eliminating the need for fine-tuning models, which gradually integrates information from initial objects, ensuring consistency in the generation process without influencing the background. A pilot study suggests iCONTRA's potential to reduce designers' efforts. Experimental results demonstrate its effectiveness in producing consistent and high-quality object concept transfers. iCONTRA stands as a promising tool for innovation and creative exploration in thematic collection design. The source code will be available at: https://github.com/vdkhoi20/iCONTRA.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Quantitative Propagation of Chaos for Singular Interacting Particle Systems Driven by Fractional Brownian Motion
Authors:
Lucio Galeati,
Khoa Lê,
Avi Mayorcas
Abstract:
We consider interacting systems particle driven by i.i.d. fractional Brownian motions, subject to irregular, possibly distributional, pairwise interactions. We show propagation of chaos and mean field convergence to the law of the associated McKean--Vlasov equation, as the number of particles $N\to\infty$, with quantitative sharp rates of order $N^{-1/2}$. Our results hold for a wide class of poss…
▽ More
We consider interacting systems particle driven by i.i.d. fractional Brownian motions, subject to irregular, possibly distributional, pairwise interactions. We show propagation of chaos and mean field convergence to the law of the associated McKean--Vlasov equation, as the number of particles $N\to\infty$, with quantitative sharp rates of order $N^{-1/2}$. Our results hold for a wide class of possibly time-dependent interactions, which are only assumed to satisfy a Besov-type regularity, related to the Hurst parameter $H\in (0,+\infty)\setminus \mathbb{N}$ of the driving noises. In particular, as $H$ decreases to $0$, interaction kernels of arbitrary singularity can be considered, a phenomenon frequently observed in regularization by noise results. Our proofs rely on a combinations of Sznitman's direct comparison argument with stochastic sewing techniques.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Zagier-Hoffman's conjectures in positive characteristic II
Authors:
Bo-Hae Im,
Hojin Kim,
Khac Nhuan Le,
Tuan Ngo Dac,
Lan Huong Pham
Abstract:
Zagier-Hoffman's conjectures predict the dimension and a basis for the $\mathbb Q$-vector spaces spanned by $N$th cyclotomic multiple zeta values (MZV's) of fixed weight where $N$ is a natural number.
For $N=1$ (MZV's case), half of these conjectures have been solved by the work of Terasoma, Deligne-Goncharov and Brown with the help of Zagier's identity. The other half are completely open. For…
▽ More
Zagier-Hoffman's conjectures predict the dimension and a basis for the $\mathbb Q$-vector spaces spanned by $N$th cyclotomic multiple zeta values (MZV's) of fixed weight where $N$ is a natural number.
For $N=1$ (MZV's case), half of these conjectures have been solved by the work of Terasoma, Deligne-Goncharov and Brown with the help of Zagier's identity. The other half are completely open. For $N=2$ (alternating MZV's case) and $N=3,4,8$, Deligne-Goncharov and Deligne solved the same half of these conjectures for $N$th-cyclotomic MZV's. For other values of $N$, no sharp upper bound on the dimension is known.
In this paper we completely establish, for all $N$, Zagier-Hoffman's conjectures for $N$th cyclotomic multiple zeta values in positive characteristic. By working with the tower of all cyclotomic extensions, we present a proof that is uniform on $N$ and give an effective algorithm to express any cyclotomic multiple zeta value in the chosen basis. This generalizes all previous work on these conjectures for MZV's and alternating MZV's in positive characteristic.
△ Less
Submitted 18 February, 2024;
originally announced February 2024.
-
Object Detection in Thermal Images Using Deep Learning for Unmanned Aerial Vehicles
Authors:
Minh Dang Tu,
Kieu Trang Le,
Manh Duong Phung
Abstract:
This work presents a neural network model capable of recognizing small and tiny objects in thermal images collected by unmanned aerial vehicles. Our model consists of three parts, the backbone, the neck, and the prediction head. The backbone is developed based on the structure of YOLOv5 combined with the use of a transformer encoder at the end. The neck includes a BI-FPN block combined with the us…
▽ More
This work presents a neural network model capable of recognizing small and tiny objects in thermal images collected by unmanned aerial vehicles. Our model consists of three parts, the backbone, the neck, and the prediction head. The backbone is developed based on the structure of YOLOv5 combined with the use of a transformer encoder at the end. The neck includes a BI-FPN block combined with the use of a sliding window and a transformer to increase the information fed into the prediction head. The prediction head carries out the detection by evaluating feature maps with the Sigmoid function. The use of transformers with attention and sliding windows increases recognition accuracy while keeping the model at a reasonable number of parameters and computation requirements for embedded systems. Experiments conducted on public dataset VEDAI and our collected datasets show that our model has a higher accuracy than state-of-the-art methods such as ResNet, Faster RCNN, ComNet, ViT, YOLOv5, SMPNet, and DPNetV3. Experiments on the embedded computer Jetson AGX show that our model achieves a real-time computation speed with a stability rate of over 90%.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Sliding Mode Observers for Set-valued Lur'e Systems with Uncertainties Beyond Observational Range
Authors:
Samir Adly,
Jun Huang,
Ba Khiet Le
Abstract:
In this paper, we introduce a new sliding mode observer for Lur'e set-valued dynamical systems, particularly addressing challenges posed by uncertainties not within the standard range of observation. Traditionally, most of Luenberger-like observers and sliding mode observer have been designed only for uncertainties in the range of observation. Central to our approach is the treatment of the uncert…
▽ More
In this paper, we introduce a new sliding mode observer for Lur'e set-valued dynamical systems, particularly addressing challenges posed by uncertainties not within the standard range of observation. Traditionally, most of Luenberger-like observers and sliding mode observer have been designed only for uncertainties in the range of observation. Central to our approach is the treatment of the uncertainty term which we decompose into two components: the first part in the observation subspace and the second part in its complemented subspace. We establish that when the second part converges to zero, an exact sliding mode observer for the system can be obtained. In scenarios where this convergence does not occur, our methodology allows for the estimation of errors between the actual state and the observer state. This leads to a practical interval estimation technique, valuable in situations where part of the uncertainty lies outside the observable range. Finally, we show that our observer is also a T- observer as well as a strong H-infinity observer.
△ Less
Submitted 25 April, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Semi-Supervised Semantic Segmentation using Redesigned Self-Training for White Blood Cells
Authors:
Vinh Quoc Luu,
Duy Khanh Le,
Huy Thanh Nguyen,
Minh Thanh Nguyen,
Thinh Tien Nguyen,
Vinh Quang Dinh
Abstract:
Artificial Intelligence (AI) in healthcare, especially in white blood cell cancer diagnosis, is hindered by two primary challenges: the lack of large-scale labeled datasets for white blood cell (WBC) segmentation and outdated segmentation methods. These challenges inhibit the development of more accurate and modern techniques to diagnose cancer relating to white blood cells. To address the first c…
▽ More
Artificial Intelligence (AI) in healthcare, especially in white blood cell cancer diagnosis, is hindered by two primary challenges: the lack of large-scale labeled datasets for white blood cell (WBC) segmentation and outdated segmentation methods. These challenges inhibit the development of more accurate and modern techniques to diagnose cancer relating to white blood cells. To address the first challenge, a semi-supervised learning framework should be devised to efficiently capitalize on the scarcity of the dataset available. In this work, we address this issue by proposing a novel self-training pipeline with the incorporation of FixMatch. Self-training is a technique that utilizes the model trained on labeled data to generate pseudo-labels for the unlabeled data and then re-train on both of them. FixMatch is a consistency-regularization algorithm to enforce the model's robustness against variations in the input image. We discover that by incorporating FixMatch in the self-training pipeline, the performance improves in the majority of cases. Our performance achieved the best performance with the self-training scheme with consistency on DeepLab-V3 architecture and ResNet-50, reaching 90.69%, 87.37%, and 76.49% on Zheng 1, Zheng 2, and LISC datasets, respectively.
△ Less
Submitted 23 February, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training
Authors:
Khoi M. Le,
Trinh Pham,
Tho Quan,
Anh Tuan Luu
Abstract:
Paraphrases are texts that convey the same meaning while using different words or sentence structures. It can be used as an automatic data augmentation tool for many Natural Language Processing tasks, especially when dealing with low-resource languages, where data shortage is a significant problem. To generate a paraphrase in multilingual settings, previous studies have leveraged the knowledge fro…
▽ More
Paraphrases are texts that convey the same meaning while using different words or sentence structures. It can be used as an automatic data augmentation tool for many Natural Language Processing tasks, especially when dealing with low-resource languages, where data shortage is a significant problem. To generate a paraphrase in multilingual settings, previous studies have leveraged the knowledge from the machine translation field, i.e., forming a paraphrase through zero-shot machine translation in the same language. Despite good performance on human evaluation, those methods still require parallel translation datasets, thus making them inapplicable to languages that do not have parallel corpora. To mitigate that problem, we proposed the first unsupervised multilingual paraphrasing model, LAMPAT ($\textbf{L}$ow-rank $\textbf{A}$daptation for $\textbf{M}$ultilingual $\textbf{P}$araphrasing using $\textbf{A}$dversarial $\textbf{T}$raining), by which monolingual dataset is sufficient enough to generate a human-like and diverse sentence. Throughout the experiments, we found out that our method not only works well for English but can generalize on unseen languages as well. Data and code are available at https://github.com/VinAIResearch/LAMPAT.
△ Less
Submitted 23 June, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Toward a comprehensive simulation framework for hypergraphs: a Python-base approach
Authors:
Quoc Chuong Nguyen,
Trung Kien Le
Abstract:
Hypergraphs, or generalization of graphs such that edges can contain more than two nodes, have become increasingly prominent in understanding complex network analysis. Unlike graphs, hypergraphs have relatively few supporting platforms, and such dearth presents a barrier to more widespread adaptation of hypergraph computational toolboxes that could enable further research in several areas. Here, w…
▽ More
Hypergraphs, or generalization of graphs such that edges can contain more than two nodes, have become increasingly prominent in understanding complex network analysis. Unlike graphs, hypergraphs have relatively few supporting platforms, and such dearth presents a barrier to more widespread adaptation of hypergraph computational toolboxes that could enable further research in several areas. Here, we introduce HyperRD, a Python package for hypergraph computation, simulation, and interoperability with other powerful Python packages in graph and hypergraph research. Then, we will introduce two models on hypergraph, the general Schelling's model and the SIR model, and simulate them with HyperRD.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
Decays of Standard Model like Higgs boson $h \rightarrowγγ, Z γ$ in a minimal left-right symmetric model
Authors:
T. T. Hong,
V. K. Le,
L. T. T. Phuong,
N . C. Hoi,
N. T. K. Ngan,
N. H. T. Nha
Abstract:
Two decay channels $h\rightarrow γγ, Zγ$ of the Standard Model-like Higgs in a left-right symmetry model are investigated under recent experimental data. We will show there exist one-loop contributions that affect the $h\rightarrow Zγ$ amplitude, but not the $h\rightarrow γγ$ amplitude. From numerical investigations, we show that the signal strength $μ_{Z γ}$ of the decay $h\rightarrow Zγ$ is stil…
▽ More
Two decay channels $h\rightarrow γγ, Zγ$ of the Standard Model-like Higgs in a left-right symmetry model are investigated under recent experimental data. We will show there exist one-loop contributions that affect the $h\rightarrow Zγ$ amplitude, but not the $h\rightarrow γγ$ amplitude. From numerical investigations, we show that the signal strength $μ_{Z γ}$ of the decay $h\rightarrow Zγ$ is still constrained strictly by that of $h\rightarrow γγ$, namely $|Δμ_{γγ}|<38\%$ results in max $|Δμ_{Z γ}|<46\%$. On the other hand, the future experimental sensitivity $|Δμ_{γγ}|=4\%$ still allows $|Δμ_{Z γ}|$ reaches to values larger than the expected sensitivity $|Δμ_{Z γ}|=23\%$.
△ Less
Submitted 11 March, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Authors:
Giang Do,
Khiem Le,
Quang Pham,
TrungTin Nguyen,
Thanh-Nam Doan,
Bint T. Nguyen,
Chenghao Liu,
Savitha Ramasamy,
Xiaoli Li,
Steven Hoi
Abstract:
By routing input tokens to only a few split experts, Sparse Mixture-of-Experts has enabled efficient training of large language models. Recent findings suggest that fixing the routers can achieve competitive performance by alleviating the collapsing problem, where all experts eventually learn similar representations. However, this strategy has two key limitations: (i) the policy derived from rando…
▽ More
By routing input tokens to only a few split experts, Sparse Mixture-of-Experts has enabled efficient training of large language models. Recent findings suggest that fixing the routers can achieve competitive performance by alleviating the collapsing problem, where all experts eventually learn similar representations. However, this strategy has two key limitations: (i) the policy derived from random routers might be sub-optimal, and (ii) it requires extensive resources during training and evaluation, leading to limited efficiency gains. This work introduces \HyperRout, which dynamically generates the router's parameters through a fixed hypernetwork and trainable embeddings to achieve a balance between training the routers and freezing them to learn an improved routing policy. Extensive experiments across a wide range of tasks demonstrate the superior performance and efficiency gains of \HyperRouter compared to existing routing methods. Our implementation is publicly available at {\url{https://github.com/giangdip2410/HyperRouter}}.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
READ: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling
Authors:
Thong Nguyen,
Xiaobao Wu,
Xinshuai Dong,
Khoi Le,
Zhiyuan Hu,
Cong-Duy Nguyen,
See-Kiong Ng,
Luu Anh Tuan
Abstract:
Fully fine-tuning pretrained large-scale transformer models has become a popular paradigm for video-language modeling tasks, such as temporal language grounding and video-language summarization. With a growing number of tasks and limited training data, such full fine-tuning approach leads to costly model storage and unstable training. To overcome these shortcomings, we introduce lightweight adapte…
▽ More
Fully fine-tuning pretrained large-scale transformer models has become a popular paradigm for video-language modeling tasks, such as temporal language grounding and video-language summarization. With a growing number of tasks and limited training data, such full fine-tuning approach leads to costly model storage and unstable training. To overcome these shortcomings, we introduce lightweight adapters to the pre-trained model and only update them at fine-tuning time. However, existing adapters fail to capture intrinsic temporal relations among video frames or textual words. Moreover, they neglect the preservation of critical task-related information that flows from the raw video-language input into the adapter's low-dimensional space. To address these issues, we first propose a novel REcurrent ADapter (READ) that employs recurrent computation to enable temporal modeling capability. Second, we propose Partial Video-Language Alignment (PVLA) objective via the use of partial optimal transport to maintain task-related information flowing into our READ modules. We validate our READ framework through extensive experiments where READ significantly outperforms all existing fine-tuning strategies on multiple low-resource temporal language grounding and video-language summarization benchmarks. The code, model, and data have been made available at https://nguyentthong.github.io/READ.
△ Less
Submitted 5 October, 2024; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Harnessing graph state resources for robust quantum magnetometry under noise
Authors:
Phu Trong Nguyen,
Trung Kien Le,
Hung Q. Nguyen,
Le Bin Ho
Abstract:
Precise measurement of magnetic fields is essential for various applications, such as fundamental physics, space exploration, and biophysics. Although recent progress in quantum engineering has assisted in creating advanced quantum magnetometers, there are still ongoing challenges in improving their efficiency and noise resistance. This study focuses on using symmetric graph state resources for qu…
▽ More
Precise measurement of magnetic fields is essential for various applications, such as fundamental physics, space exploration, and biophysics. Although recent progress in quantum engineering has assisted in creating advanced quantum magnetometers, there are still ongoing challenges in improving their efficiency and noise resistance. This study focuses on using symmetric graph state resources for quantum magnetometry to enhance measurement precision by analyzing the estimation theory under time-homogeneous and time-inhomogeneous noise models. The results show a significant improvement in estimating both single and multiple Larmor frequencies. In single Larmor frequency estimation, the quantum Fisher information spans a spectrum from the standard quantum limit to the Heisenberg limit within a periodic range of the Larmor frequency, and in the case of multiple Larmor frequencies, it can exceed the standard quantum limit for both noisy cases. This study highlights the potential of graph state-based methods for improving magnetic field measurements under noisy environments.
△ Less
Submitted 3 September, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
R-Continuity with Applications to Convergence Analysis of Tikhonov Regularization and DC Programming
Authors:
Ba Khiet Le
Abstract:
In the paper, we study the convergence analysis of Tikhonov regularization in finding a zero of a maximal monotone operator using the notion of R-continuity. Applications to convex minimization and DC programming are provided.
In the paper, we study the convergence analysis of Tikhonov regularization in finding a zero of a maximal monotone operator using the notion of R-continuity. Applications to convex minimization and DC programming are provided.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
State-Dependent Sweeping Processes: Asymptotic Behavior and Algorithmic Approaches
Authors:
Samir Adly,
Monica G. Cojocaru,
Ba Khiet Le
Abstract:
In this paper, we investigate the asymptotic properties of a particular class of state-dependent sweeping processes. While extensive research has been conducted on the existence and uniqueness of solutions for sweeping processes, there is a scarcity of studies addressing their behavior in the limit of large time. Additionally, we introduce novel algorithms designed for the resolution of quasi-vari…
▽ More
In this paper, we investigate the asymptotic properties of a particular class of state-dependent sweeping processes. While extensive research has been conducted on the existence and uniqueness of solutions for sweeping processes, there is a scarcity of studies addressing their behavior in the limit of large time. Additionally, we introduce novel algorithms designed for the resolution of quasi-variational inequalities. As a result, we introduce a new derivative-free algorithm to find zeros of nonsmooth Lipschitz continuous mappings with a linear convergence rate. This algorithm can be effectively used in nonsmooth and nonconvex optimization problems that do not possess necessarily second-order differentiability conditions of the data.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
A new twist on modular links from an old perspective
Authors:
Khanh Le
Abstract:
We show that the complement of arithmetic modular links found in arXiv:2307.09409 is homeomorphic to the complement of augmented chainlinks. In particular, these link complements arise as n-fold cyclic covers of the Whitehead link complement.
We show that the complement of arithmetic modular links found in arXiv:2307.09409 is homeomorphic to the complement of augmented chainlinks. In particular, these link complements arise as n-fold cyclic covers of the Whitehead link complement.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Asymptotically accurate and locking-free finite element implementation of first order shear deformation theory for plates
Authors:
Khanh Chau Le,
Hoang Giang Bui
Abstract:
A formulation of the asymptotically exact first-order shear deformation theory for linear-elastic homogeneous plates in the rescaled coordinates and rotation angles is considered. This allows the development of its asymptotically accurate and shear-locking-free finite element implementation. As applications, numerical simulations are performed for circular and rectangular plates, showing complete…
▽ More
A formulation of the asymptotically exact first-order shear deformation theory for linear-elastic homogeneous plates in the rescaled coordinates and rotation angles is considered. This allows the development of its asymptotically accurate and shear-locking-free finite element implementation. As applications, numerical simulations are performed for circular and rectangular plates, showing complete agreement between the analytical solution and the numerical solutions based on two-dimensional theory and three-dimensional elasticity theory.
△ Less
Submitted 16 April, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
A central limit theorem for the Euler method for SDEs with irregular drifts
Authors:
Konstantinos Dareiotis,
Máté Gerencsér,
Khoa Lê
Abstract:
The goal of this article is to establish a central limit theorem for the Euler-Maruyama scheme approximating multidimensional SDEs with elliptic Brownian diffusion, under very mild regularity requirements on the drift coefficients. When the drift is Hölder continuous, we show that the limiting law of the rescaled fluctuations around the true solution is characterised as the unique solution of a hy…
▽ More
The goal of this article is to establish a central limit theorem for the Euler-Maruyama scheme approximating multidimensional SDEs with elliptic Brownian diffusion, under very mild regularity requirements on the drift coefficients. When the drift is Hölder continuous, we show that the limiting law of the rescaled fluctuations around the true solution is characterised as the unique solution of a hybrid Young-Itô differential equation. When the drift has positive Sobolev regularity, this limit is characterised by the solution of a transformed SDE. Our result is an extension of the results of Jacod-Kurtz-Protter (1991, 1998) in which SDEs with differentiable coefficients were considered. To compensate for the lack of regularity of the drifts, we utilize the regularisation effect from the non-degenerate noise.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis
Authors:
Thanh-Huy Nguyen,
Quang Hien Kha,
Thai Ngoc Toan Truong,
Ba Thinh Lam,
Ba Hung Ngo,
Quang Vinh Dinh,
Nguyen Quoc Khanh Le
Abstract:
In recent years, many mammographic image analysis methods have been introduced for improving cancer classification tasks. Two major issues of mammogram classification tasks are leveraging multi-view mammographic information and class-imbalance handling. In the first problem, many multi-view methods have been released for concatenating features of two or more views for the training and inference st…
▽ More
In recent years, many mammographic image analysis methods have been introduced for improving cancer classification tasks. Two major issues of mammogram classification tasks are leveraging multi-view mammographic information and class-imbalance handling. In the first problem, many multi-view methods have been released for concatenating features of two or more views for the training and inference stage. Having said that, most multi-view existing methods are not explainable in the meaning of feature fusion, and treat many views equally for diagnosing. Our work aims to propose a simple but novel method for enhancing examined view (main view) by leveraging low-level feature information from the auxiliary view (ipsilateral view) before learning the high-level feature that contains the cancerous features. For the second issue, we also propose a simple but novel malignant mammogram synthesis framework for upsampling minor class samples. Our easy-to-implement and no-training framework has eliminated the current limitation of the CutMix algorithm which is unreliable synthesized images with random pasted patches, hard-contour problems, and domain shift problems. Our results on VinDr-Mammo and CMMD datasets show the effectiveness of our two new frameworks for both multi-view training and synthesizing mammographic images, outperforming the previous conventional methods in our experimental settings.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
DM-VTON: Distilled Mobile Real-time Virtual Try-On
Authors:
Khoi-Nguyen Nguyen-Ngoc,
Thanh-Tung Phan-Nguyen,
Khanh-Duy Le,
Tam V. Nguyen,
Minh-Triet Tran,
Trung-Nghia Le
Abstract:
The fashion e-commerce industry has witnessed significant growth in recent years, prompting exploring image-based virtual try-on techniques to incorporate Augmented Reality (AR) experiences into online shopping platforms. However, existing research has primarily overlooked a crucial aspect - the runtime of the underlying machine-learning model. While existing methods prioritize enhancing output qu…
▽ More
The fashion e-commerce industry has witnessed significant growth in recent years, prompting exploring image-based virtual try-on techniques to incorporate Augmented Reality (AR) experiences into online shopping platforms. However, existing research has primarily overlooked a crucial aspect - the runtime of the underlying machine-learning model. While existing methods prioritize enhancing output quality, they often disregard the execution time, which restricts their applications on a limited range of devices. To address this gap, we propose Distilled Mobile Real-time Virtual Try-On (DM-VTON), a novel virtual try-on framework designed to achieve simplicity and efficiency. Our approach is based on a knowledge distillation scheme that leverages a strong Teacher network as supervision to guide a Student network without relying on human parsing. Notably, we introduce an efficient Mobile Generative Module within the Student network, significantly reducing the runtime while ensuring high-quality output. Additionally, we propose Virtual Try-on-guided Pose for Data Synthesis to address the limited pose variation observed in training images. Experimental results show that the proposed method can achieve 40 frames per second on a single Nvidia Tesla T4 GPU and only take up 37 MB of memory while producing almost the same output quality as other state-of-the-art methods. DM-VTON stands poised to facilitate the advancement of real-time AR applications, in addition to the generation of lifelike attired human figures tailored for diverse specialized training tasks. https://sites.google.com/view/ltnghia/research/DMVTON
△ Less
Submitted 26 August, 2023;
originally announced August 2023.
-
VIDES: Virtual Interior Design via Natural Language and Visual Guidance
Authors:
Minh-Hien Le,
Chi-Bien Chu,
Khanh-Duy Le,
Tam V. Nguyen,
Minh-Triet Tran,
Trung-Nghia Le
Abstract:
Interior design is crucial in creating aesthetically pleasing and functional indoor spaces. However, developing and editing interior design concepts requires significant time and expertise. We propose Virtual Interior DESign (VIDES) system in response to this challenge. Leveraging cutting-edge technology in generative AI, our system can assist users in generating and editing indoor scene concepts…
▽ More
Interior design is crucial in creating aesthetically pleasing and functional indoor spaces. However, developing and editing interior design concepts requires significant time and expertise. We propose Virtual Interior DESign (VIDES) system in response to this challenge. Leveraging cutting-edge technology in generative AI, our system can assist users in generating and editing indoor scene concepts quickly, given user text description and visual guidance. Using both visual guidance and language as the conditional inputs significantly enhances the accuracy and coherence of the generated scenes, resulting in visually appealing designs. Through extensive experimentation, we demonstrate the effectiveness of VIDES in developing new indoor concepts, changing indoor styles, and replacing and removing interior objects. The system successfully captures the essence of users' descriptions while providing flexibility for customization. Consequently, this system can potentially reduce the entry barrier for indoor design, making it more accessible to users with limited technical skills and reducing the time required to create high-quality images. Individuals who have a background in design can now easily communicate their ideas visually and effectively present their design concepts. https://sites.google.com/view/ltnghia/research/VIDES
△ Less
Submitted 26 August, 2023;
originally announced August 2023.
-
A class of space-time discretizations for the stochastic $p$-Stokes system
Authors:
Kim-Ngan Le,
Jörn Wichmann
Abstract:
The main objective of the present paper is to construct a new class of space-time discretizations for the stochastic $p$-Stokes system and analyze its stability and convergence properties.
We derive regularity results for the approximation that are similar to the natural regularity of solutions. One of the key arguments relies on discrete extrapolation that allows to relate lower moments of disc…
▽ More
The main objective of the present paper is to construct a new class of space-time discretizations for the stochastic $p$-Stokes system and analyze its stability and convergence properties.
We derive regularity results for the approximation that are similar to the natural regularity of solutions. One of the key arguments relies on discrete extrapolation that allows to relate lower moments of discrete maximal processes.
We show that, if the generic spatial discretization is constraint conforming, then the velocity approximation satisfies a best-approximation property in the natural distance.
Moreover, we present an example such that the resulting velocity approximation converges with rate $1/2$ in time and $1$ in space towards the (unknown) target velocity with respect to the natural distance. The theory is corroborated by numerical experiments.
△ Less
Submitted 5 August, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Advancing Wound Filling Extraction on 3D Faces: Auto-Segmentation and Wound Face Regeneration Approach
Authors:
Duong Q. Nguyen,
Thinh D. Le,
Phuong D. Nguyen,
Nga T. K. Le,
H. Nguyen-Xuan
Abstract:
Facial wound segmentation plays a crucial role in preoperative planning and optimizing patient outcomes in various medical applications. In this paper, we propose an efficient approach for automating 3D facial wound segmentation using a two-stream graph convolutional network. Our method leverages the Cir3D-FaIR dataset and addresses the challenge of data imbalance through extensive experimentation…
▽ More
Facial wound segmentation plays a crucial role in preoperative planning and optimizing patient outcomes in various medical applications. In this paper, we propose an efficient approach for automating 3D facial wound segmentation using a two-stream graph convolutional network. Our method leverages the Cir3D-FaIR dataset and addresses the challenge of data imbalance through extensive experimentation with different loss functions. To achieve accurate segmentation, we conducted thorough experiments and selected a high-performing model from the trained models. The selected model demonstrates exceptional segmentation performance for complex 3D facial wounds. Furthermore, based on the segmentation model, we propose an improved approach for extracting 3D facial wound fillers and compare it to the results of the previous study. Our method achieved a remarkable accuracy of 0.9999986\% on the test suite, surpassing the performance of the previous method. From this result, we use 3D printing technology to illustrate the shape of the wound filling. The outcomes of this study have significant implications for physicians involved in preoperative planning and intervention design. By automating facial wound segmentation and improving the accuracy of wound-filling extraction, our approach can assist in carefully assessing and optimizing interventions, leading to enhanced patient outcomes. Additionally, it contributes to advancing facial reconstruction techniques by utilizing machine learning and 3D bioprinting for printing skin tissue implants. Our source code is available at \url{https://github.com/SIMOGroup/WoundFilling3D}.
△ Less
Submitted 12 July, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
On a Conjecture of Gezmis and Pellarin
Authors:
Khac Nhuan Le,
Kien Huu Nguyen
Abstract:
In 2022, Gezmis and Pellarin introduced and studied the concept of trivial multiple zeta values, along with a map from the vector space spanned by these values to the vector space spanned by Thakur's multiple zeta values. Their construction allows us to generate some linear relations among the latter values using the former. In our work, we determine the structure of the kernel of the aforemention…
▽ More
In 2022, Gezmis and Pellarin introduced and studied the concept of trivial multiple zeta values, along with a map from the vector space spanned by these values to the vector space spanned by Thakur's multiple zeta values. Their construction allows us to generate some linear relations among the latter values using the former. In our work, we determine the structure of the kernel of the aforementioned map. As a consequence, we give an answer to a conjecture proposed by Gezmis and Pellarin regarding the injectivity of this specific map.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
Numerical analysis of the stochastic Stefan problem
Authors:
Jerome Droniou,
Muhammad Awais Khan,
Kim Ngan Le
Abstract:
The gradient discretisation method (GDM) -- a generic framework encompassing many numerical methods -- is studied for a general stochastic Stefan problem with multiplicative noise. The convergence of the numerical solutions is proved by compactness method using discrete functional analysis tools, Skorohod theorem and the martingale representation theorem. The generic convergence results establishe…
▽ More
The gradient discretisation method (GDM) -- a generic framework encompassing many numerical methods -- is studied for a general stochastic Stefan problem with multiplicative noise. The convergence of the numerical solutions is proved by compactness method using discrete functional analysis tools, Skorohod theorem and the martingale representation theorem. The generic convergence results established in the GDM framework are applicable to a range of different numerical methods, including for example mass-lumped finite elements, but also some finite volume methods, mimetic methods, lowest-order virtual element methods, etc. Theoretical results are complemented by numerical tests based on two methods that fit in GDM framework.
△ Less
Submitted 8 August, 2024; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Collapsing molecular clouds with tracer particles: Part II, Collapse Histories
Authors:
David C. Collins,
Dan K. Le,
Luz L. Jimenez Vela
Abstract:
In order to develop a complete theory of star formation, one essentially needs to know two things: what collapses, and how long it takes. This is the second paper in a series, where we query how long a parcel of gas takes to collapse and the process it undergoes. We embed pseudo-Lagrangian tracer particles in simulations of collapsing molecular clouds, identify the particles that end in dense knot…
▽ More
In order to develop a complete theory of star formation, one essentially needs to know two things: what collapses, and how long it takes. This is the second paper in a series, where we query how long a parcel of gas takes to collapse and the process it undergoes. We embed pseudo-Lagrangian tracer particles in simulations of collapsing molecular clouds, identify the particles that end in dense knots, and then examine the collapse history of the gas. We find a nearly universal behavior of cruise-then-collapse, wherein a core stays at intermediate densities for a significant fraction of its life before finally collapsing. We identify time immediately before each core collapses, $t_{\rm{sing}}$, and examine how it transitions to high density. We find that the time to collapse is uniformly distributed between $0.25 t_{\rm{ff}}$ and the end of the simulation at $\sim 1 t_{\rm{ff}}$, and that the duration of collapse is universally short, $Δt \sim 0.1 t_{\rm{ff}}$, where $t_{\rm{ff}}$ is the free-fall time at the mean density. We describe the collapse in three stages; collection, hardening, and singularity. Collection sweeps low density gas into moderate density. Hardening brings kinetic and gravitational energies into quasi-equipartition. Singularity is the free-fall collapse, forming an envelope in rough energy balance and central over density in $\sim 0.1 t_{\rm{ff}}$.
△ Less
Submitted 12 June, 2024; v1 submitted 17 June, 2023;
originally announced June 2023.
-
Variational quantum metrology for multiparameter estimation under dephasing noise
Authors:
Trung Kien Le,
Hung Q. Nguyen,
Le Bin Ho
Abstract:
We present a hybrid quantum-classical variational scheme to enhance precision in quantum metrology. In the scheme, both the initial state and the measurement basis in the quantum part are parameterized and optimized via the classical part. It enables the maximization of information gained about the measured quantity. We discuss specific applications to 3D magnetic field sensing under several depha…
▽ More
We present a hybrid quantum-classical variational scheme to enhance precision in quantum metrology. In the scheme, both the initial state and the measurement basis in the quantum part are parameterized and optimized via the classical part. It enables the maximization of information gained about the measured quantity. We discuss specific applications to 3D magnetic field sensing under several dephasing noise modes. Indeed, we demonstrate its ability to simultaneously estimate all parameters and surpass the standard quantum limit, making it a powerful tool for metrological applications.
△ Less
Submitted 12 October, 2023; v1 submitted 14 May, 2023;
originally announced May 2023.
-
An asymptotically exact first-order shear deformation theory for functionally graded plates
Authors:
Khanh Chau Le
Abstract:
An asymptotically exact first-order shear deformation theory for functionally graded elastic plates is derived using the variational-asymptotic method. As an application, an analytical solution to the problem of wave propagation in a sandwich plate is found in accordance with this refined theory. Comparison between the dispersion curves obtained by 2-D plate theory and 3-D elasticity theory reveal…
▽ More
An asymptotically exact first-order shear deformation theory for functionally graded elastic plates is derived using the variational-asymptotic method. As an application, an analytical solution to the problem of wave propagation in a sandwich plate is found in accordance with this refined theory. Comparison between the dispersion curves obtained by 2-D plate theory and 3-D elasticity theory reveals that the former is accurate up to the order of h^2/l^2, where h is the plate thickness and l the wavelength.
△ Less
Submitted 5 May, 2023; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Path-by-path uniqueness for stochastic differential equations under Krylov-Röckner condition
Authors:
Lukas Anzeletti,
Khoa Lê,
Chengcheng Ling
Abstract:
We show that any stochastic differential equation (SDE) driven by Brownian motion with drift satisfying the Krylov-Röckner condition has exactly one solution in an ordinary sense for almost every trajectory of the Brownian motion. Additionally, we show that such SDE is strongly complete, i.e. for almost every trajectory of the Brownian motion, the family of solutions with different initial data fo…
▽ More
We show that any stochastic differential equation (SDE) driven by Brownian motion with drift satisfying the Krylov-Röckner condition has exactly one solution in an ordinary sense for almost every trajectory of the Brownian motion. Additionally, we show that such SDE is strongly complete, i.e. for almost every trajectory of the Brownian motion, the family of solutions with different initial data forms a continuous semiflow for all nonnegative times.
△ Less
Submitted 13 April, 2023;
originally announced April 2023.
-
TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval
Authors:
Trung-Nghia Le,
Tam V. Nguyen,
Minh-Quan Le,
Trong-Thuan Nguyen,
Viet-Tham Huynh,
Trong-Le Do,
Khanh-Duy Le,
Mai-Khiem Tran,
Nhat Hoang-Xuan,
Thang-Long Nguyen-Ho,
Vinh-Tiep Nguyen,
Tuong-Nghiem Diep,
Khanh-Duy Ho,
Xuan-Hieu Nguyen,
Thien-Phuc Tran,
Tuan-Anh Yang,
Kim-Phat Tran,
Nhu-Vinh Hoang,
Minh-Quang Nguyen,
E-Ro Nguyen,
Minh-Khoi Nguyen-Nhat,
Tuan-An To,
Trung-Truc Huynh-Le,
Nham-Tan Nguyen,
Hoang-Chau Luong
, et al. (8 additional authors not shown)
Abstract:
3D object retrieval is an important yet challenging task that has drawn more and more attention in recent years. While existing approaches have made strides in addressing this issue, they are often limited to restricted settings such as image and sketch queries, which are often unfriendly interactions for common users. In order to overcome these limitations, this paper presents a novel SHREC chall…
▽ More
3D object retrieval is an important yet challenging task that has drawn more and more attention in recent years. While existing approaches have made strides in addressing this issue, they are often limited to restricted settings such as image and sketch queries, which are often unfriendly interactions for common users. In order to overcome these limitations, this paper presents a novel SHREC challenge track focusing on text-based fine-grained retrieval of 3D animal models. Unlike previous SHREC challenge tracks, the proposed task is considerably more challenging, requiring participants to develop innovative approaches to tackle the problem of text-based retrieval. Despite the increased difficulty, we believe this task can potentially drive useful applications in practice and facilitate more intuitive interactions with 3D objects. Five groups participated in our competition, submitting a total of 114 runs. While the results obtained in our competition are satisfactory, we note that the challenges presented by this task are far from fully solved. As such, we provide insights into potential areas for future research and improvements. We believe we can help push the boundaries of 3D object retrieval and facilitate more user-friendly interactions via vision-language technologies. https://aichallenge.hcmus.edu.vn/textanimar
△ Less
Submitted 9 August, 2023; v1 submitted 12 April, 2023;
originally announced April 2023.
-
SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval
Authors:
Trung-Nghia Le,
Tam V. Nguyen,
Minh-Quan Le,
Trong-Thuan Nguyen,
Viet-Tham Huynh,
Trong-Le Do,
Khanh-Duy Le,
Mai-Khiem Tran,
Nhat Hoang-Xuan,
Thang-Long Nguyen-Ho,
Vinh-Tiep Nguyen,
Nhat-Quynh Le-Pham,
Huu-Phuc Pham,
Trong-Vu Hoang,
Quang-Binh Nguyen,
Trong-Hieu Nguyen-Mau,
Tuan-Luc Huynh,
Thanh-Danh Le,
Ngoc-Linh Nguyen-Ha,
Tuong-Vy Truong-Thuy,
Truong Hoai Phong,
Tuong-Nghiem Diep,
Khanh-Duy Ho,
Xuan-Hieu Nguyen,
Thien-Phuc Tran
, et al. (9 additional authors not shown)
Abstract:
The retrieval of 3D objects has gained significant importance in recent years due to its broad range of applications in computer vision, computer graphics, virtual reality, and augmented reality. However, the retrieval of 3D objects presents significant challenges due to the intricate nature of 3D models, which can vary in shape, size, and texture, and have numerous polygons and vertices. To this…
▽ More
The retrieval of 3D objects has gained significant importance in recent years due to its broad range of applications in computer vision, computer graphics, virtual reality, and augmented reality. However, the retrieval of 3D objects presents significant challenges due to the intricate nature of 3D models, which can vary in shape, size, and texture, and have numerous polygons and vertices. To this end, we introduce a novel SHREC challenge track that focuses on retrieving relevant 3D animal models from a dataset using sketch queries and expedites accessing 3D models through available sketches. Furthermore, a new dataset named ANIMAR was constructed in this study, comprising a collection of 711 unique 3D animal models and 140 corresponding sketch queries. Our contest requires participants to retrieve 3D models based on complex and detailed sketches. We receive satisfactory results from eight teams and 204 runs. Although further improvement is necessary, the proposed task has the potential to incentivize additional research in the domain of 3D object retrieval, potentially yielding benefits for a wide range of applications. We also provide insights into potential areas of future research, such as improving techniques for feature extraction and matching and creating more diverse datasets to evaluate retrieval performance. https://aichallenge.hcmus.edu.vn/sketchanimar
△ Less
Submitted 9 August, 2023; v1 submitted 12 April, 2023;
originally announced April 2023.