Skip to main content

Showing 1–50 of 1,002 results for author: Nguyen, A

  1. arXiv:2410.11774  [pdf, other

    cs.CV

    Fractal Calibration for long-tailed object detection

    Authors: Konstantinos Panagiotis Alexandridis, Ismail Elezi, Jiankang Deng, Anh Nguyen, Shan Luo

    Abstract: Real-world datasets follow an imbalanced distribution, which poses significant challenges in rare-category object detection. Recent studies tackle this problem by developing re-weighting and re-sampling methods, that utilise the class frequencies of the dataset. However, these techniques focus solely on the frequency statistics and ignore the distribution of the classes in image space, missing imp… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  2. arXiv:2410.11765  [pdf, other

    cs.LG

    ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification

    Authors: Bishal Thapaliya, Anh Nguyen, Yao Lu, Tian Xie, Igor Grudetskyi, Fudong Lin, Antonios Valkanas, Jingyu Liu, Deepayan Chakraborty, Bilel Fehri

    Abstract: Classifying nodes in a graph is a common problem. The ideal classifier must adapt to any imbalances in the class distribution. It must also use information in the clustering structure of real-world graphs. Existing Graph Neural Networks (GNNs) have not addressed both problems together. We propose the Enhanced Cluster-aware Graph Network (ECGN), a novel method that addresses these issues by integra… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 17 pages, 3 figures

  3. arXiv:2410.06557  [pdf, other

    quant-ph cond-mat.dis-nn cond-mat.str-el hep-lat

    Observation of disorder-free localization and efficient disorder averaging on a quantum processor

    Authors: Gaurav Gyawali, Tyler Cochran, Yuri Lensky, Eliott Rosenberg, Amir H. Karamlou, Kostyantyn Kechedzhi, Julia Berndtsson, Tom Westerhout, Abraham Asfaw, Dmitry Abanin, Rajeev Acharya, Laleh Aghababaie Beni, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Nikita Astrakhantsev, Juan Atalaya, Ryan Babbush, Brian Ballard, Joseph C. Bardin, Andreas Bengtsson, Alexander Bilmes, Gina Bortoli, Alexandre Bourassa , et al. (195 additional authors not shown)

    Abstract: One of the most challenging problems in the computational study of localization in quantum manybody systems is to capture the effects of rare events, which requires sampling over exponentially many disorder realizations. We implement an efficient procedure on a quantum processor, leveraging quantum parallelism, to efficiently sample over all disorder realizations. We observe localization without d… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  4. arXiv:2410.06173  [pdf, other

    cs.CL cs.AI cs.LG

    Manual Verbalizer Enrichment for Few-Shot Text Classification

    Authors: Quang Anh Nguyen, Nadi Tomeh, Mustapha Lebbah, Thierry Charnois, Hanene Azzag, Santiago Cordoba Muñoz

    Abstract: With the continuous development of pre-trained language models, prompt-based training becomes a well-adopted paradigm that drastically improves the exploitation of models for many natural language processing tasks. Prompting also shows great performance compared to traditional fine-tuning when adapted to zero-shot or few-shot scenarios where the number of annotated data is limited. In this framewo… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  5. arXiv:2410.05789  [pdf, other

    cs.RO

    Hybrid Gripper with Passive Pneumatic Soft Joints for Grasping Deformable Thin Objects

    Authors: Ngoc-Duy Tran, Hoang-Hiep Ly, Xuan-Thuan Nguyen, Thi-Thoa Mac, Anh Nguyen, Tung D. Ta

    Abstract: Grasping a variety of objects remains a key challenge in the development of versatile robotic systems. The human hand is remarkably dexterous, capable of grasping and manipulating objects with diverse shapes, mechanical properties, and textures. Inspired by how humans use two fingers to pick up thin and large objects such as fabric or sheets of paper, we aim to develop a gripper optimized for gras… ▽ More

    Submitted 10 October, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

  6. VPI-Mlogs: A web-based machine learning solution for applications in petrophysics

    Authors: Anh Tuan Nguyen

    Abstract: Machine learning is an important part of the data science field. In petrophysics, machine learning algorithms and applications have been widely approached. In this context, Vietnam Petroleum Institute (VPI) has researched and deployed several effective prediction models, namely missing log prediction, fracture zone and fracture density forecast, etc. As one of our solutions, VPI-MLogs is a web-bas… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

  7. arXiv:2410.04700  [pdf, other

    stat.ME stat.CO

    Nonparametric tests for interaction in two-way ANOVA with balanced replications

    Authors: Bao Khue Tran, Amy S. Wagaman, Andrew Nguyen, David Jacobson, Bradley Hartlaub

    Abstract: Nonparametric procedures are more powerful for detecting interaction in two-way ANOVA when the data are non-normal. In this paper, we compute null critical values for the aligned rank-based tests (APCSSA/APCSSM) where the levels of the factors are between 2 and 6. We compare the performance of these new procedures with the ANOVA F-test for interaction, the adjusted rank transform test (ART), Conov… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

  8. arXiv:2410.04492  [pdf, other

    cs.CV cs.AI cs.LG

    Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification

    Authors: Zhaorui Tan, Xi Yang, Qiufeng Wang, Anh Nguyen, Kaizhu Huang

    Abstract: Vision models excel in image classification but struggle to generalize to unseen data, such as classifying images from unseen domains or discovering novel categories. In this paper, we explore the relationship between logical reasoning and deep learning generalization in visual classification. A logical regularization termed L-Reg is derived which bridges a logical analysis framework to image clas… ▽ More

    Submitted 16 October, 2024; v1 submitted 6 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS2024 as Spotlight

  9. The Ni isotopic composition of Ryugu reveals a common accretion region for carbonaceous chondrites

    Authors: Fridolin Spitzer, Thorsten Kleine, Christoph Burkhardt, Timo Hopp, Tetsuya Yokoyama, Yoshinari Abe, Jérôme Aléon, Conel M. O'D. Alexander, Sachiko Amari, Yuri Amelin, Ken-ichi Bajo, Martin Bizzarro, Audrey Bouvier, Richard W. Carlson, Marc Chaussidon, Byeon-Gak Choi, Nicolas Dauphas, Andrew M. Davis, Tommaso Di Rocco, Wataru Fujiya, Ryota Fukai, Ikshu Gautam, Makiko K. Haba, Yuki Hibiya, Hiroshi Hidaka , et al. (66 additional authors not shown)

    Abstract: The isotopic compositions of samples returned from Cb-type asteroid Ryugu and Ivuna-type (CI) chondrites are distinct from other carbonaceous chondrites, which has led to the suggestion that Ryugu and CI chondrites formed in a different region of the accretion disk, possibly around the orbits of Uranus and Neptune. We show that, like for Fe, Ryugu and CI chondrites also have indistinguishable Ni i… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: Published open access in Science Advances

    Journal ref: Science Advances 10, 39, eadp2426 (2024)

  10. arXiv:2410.03890  [pdf, other

    eess.SY cs.RO

    Safe Reference Tracking and Collision Avoidance for Taxiing Aircraft Using an MPC-CBF Framework

    Authors: Brooks A. Butler, Zarif Cabrera, Andy Nguyen, Philip E. Paré

    Abstract: In this paper, we develop a framework for the automatic taxiing of aircraft between hangar and take-off given a graph-based model of an airport. We implement a high-level path-planning algorithm that models taxiway intersections as nodes in an undirected graph, algorithmically constructs a directed graph according to the physical limitations of the aircraft, and finds the shortest valid taxi path… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: This work is under review to be presented at the 2025 American Control Conference

  11. arXiv:2410.03507  [pdf, other

    astro-ph.CO astro-ph.IM

    LOO-PIT: A sensitive posterior test

    Authors: Alan B. H. Nguyen, Marco Bonici, Glen McGee, Will J. Percival

    Abstract: With the advent of the next generation of astrophysics experiments, the volume of data available to researchers will be greater than ever. As these projects will significantly drive down statistical uncertainties in measurements, it is crucial to develop novel tools to assess the ability of our models to fit these data within the specified errors. We introduce to astronomy the Leave One Out-Probab… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 28 pages, 13 figures. Prepared for submission to JCAP. Comments welcomed

  12. arXiv:2410.00203  [pdf, other

    math.NA math.AP

    Multilevel Picard approximations overcome the curse of dimensionality when approximating semilinear heat equations with gradient-dependent nonlinearities in $L^p$-sense

    Authors: Tuan Anh Nguyen

    Abstract: We prove that multilevel Picard approximations are capable of approximating solutions of semilinear heat equations in $L^{p}$-sense, ${p}\in [2,\infty)$, in the case of gradient-dependent, Lipschitz-continuous nonlinearities, in the sense that the computational effort of the multilevel Picard approximations grow at most polynomially in both the dimension $d$ and the reciprocal $1/ε$ of the prescri… ▽ More

    Submitted 12 October, 2024; v1 submitted 30 September, 2024; originally announced October 2024.

    Comments: 23 pages, 2 figures

    MSC Class: 65C99; 68T05

  13. arXiv:2409.20467  [pdf, other

    cs.CL cs.AI

    A Weakly Supervised Data Labeling Framework for Machine Lexical Normalization in Vietnamese Social Media

    Authors: Dung Ha Nguyen, Anh Thi Hoang Nguyen, Kiet Van Nguyen

    Abstract: This study introduces an innovative automatic labeling framework to address the challenges of lexical normalization in social media texts for low-resource languages like Vietnamese. Social media data is rich and diverse, but the evolving and varied language used in these contexts makes manual labeling labor-intensive and expensive. To tackle these issues, we propose a framework that integrates sem… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

  14. arXiv:2409.20431  [pdf, ps, other

    math.NA cs.LG math.PR

    Multilevel Picard approximations and deep neural networks with ReLU, leaky ReLU, and softplus activation overcome the curse of dimensionality when approximating semilinear parabolic partial differential equations in $L^p$-sense

    Authors: Ariel Neufeld, Tuan Anh Nguyen

    Abstract: We prove that multilevel Picard approximations and deep neural networks with ReLU, leaky ReLU, and softplus activation are capable of approximating solutions of semilinear Kolmogorov PDEs in $L^\mathfrak{p}$-sense, $\mathfrak{p}\in [2,\infty)$, in the case of gradient-independent, Lipschitz-continuous nonlinearities, while the computational effort of the multilevel Picard approximations and the re… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

  15. arXiv:2409.19749  [pdf, other

    cs.CL

    NeuroMax: Enhancing Neural Topic Modeling via Maximizing Mutual Information and Group Topic Regularization

    Authors: Duy-Tung Pham, Thien Trang Nguyen Vu, Tung Nguyen, Linh Ngo Van, Duc Anh Nguyen, Thien Huu Nguyen

    Abstract: Recent advances in neural topic models have concentrated on two primary directions: the integration of the inference network (encoder) with a pre-trained language model (PLM) and the modeling of the relationship between words and topics in the generative model (decoder). However, the use of large PLMs significantly increases inference costs, making them less practical for situations requiring low… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: Findings of EMNLP 2024

  16. arXiv:2409.19117  [pdf, other

    cs.LG eess.SP

    Range-aware Positional Encoding via High-order Pretraining: Theory and Practice

    Authors: Viet Anh Nguyen, Nhat Khang Ngo, Truong Son Hy

    Abstract: Unsupervised pre-training on vast amounts of graph data is critical in real-world applications wherein labeled data is limited, such as molecule properties prediction or materials science. Existing approaches pre-train models for specific graph domains, neglecting the inherent connections within networks. This limits their ability to transfer knowledge to various supervised tasks. In this work, we… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  17. arXiv:2409.17727  [pdf, other

    cs.RO cs.CV

    Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications

    Authors: Nghia Nguyen, Minh Nhat Vu, Tung D. Ta, Baoru Huang, Thieu Vo, Ngan Le, Anh Nguyen

    Abstract: Vision language models have played a key role in extracting meaningful features for various robotic applications. Among these, Contrastive Language-Image Pretraining (CLIP) is widely used in robotic tasks that require both vision and natural language understanding. However, CLIP was trained solely on static images paired with text prompts and has not yet been fully adapted for robotic tasks involv… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 7 pages

  18. arXiv:2409.17189  [pdf, other

    math.OC cs.LG

    Decentralized Federated Learning with Gradient Tracking over Time-Varying Directed Networks

    Authors: Duong Thuy Anh Nguyen, Su Wang, Duong Tung Nguyen, Angelia Nedich, H. Vincent Poor

    Abstract: We investigate the problem of agent-to-agent interaction in decentralized (federated) learning over time-varying directed graphs, and, in doing so, propose a consensus-based algorithm called DSGTm-TV. The proposed algorithm incorporates gradient tracking and heavy-ball momentum to distributively optimize a global objective function, while preserving local data privacy. Under DSGTm-TV, agents will… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

  19. arXiv:2409.17142  [pdf, other

    quant-ph cond-mat.str-el hep-lat

    Visualizing Dynamics of Charges and Strings in (2+1)D Lattice Gauge Theories

    Authors: Tyler A. Cochran, Bernhard Jobst, Eliott Rosenberg, Yuri D. Lensky, Gaurav Gyawali, Norhan Eassa, Melissa Will, Dmitry Abanin, Rajeev Acharya, Laleh Aghababaie Beni, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Abraham Asfaw, Juan Atalaya, Ryan Babbush, Brian Ballard, Joseph C. Bardin, Andreas Bengtsson, Alexander Bilmes, Alexandre Bourassa, Jenna Bovaird, Michael Broughton, David A. Browne , et al. (167 additional authors not shown)

    Abstract: Lattice gauge theories (LGTs) can be employed to understand a wide range of phenomena, from elementary particle scattering in high-energy physics to effective descriptions of many-body interactions in materials. Studying dynamical properties of emergent phases can be challenging as it requires solving many-body problems that are generally beyond perturbative limits. We investigate the dynamics of… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

  20. arXiv:2409.15582  [pdf, other

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    Generalization vs. Specialization under Concept Shift

    Authors: Alex Nguyen, David J. Schwab, Vudtiwat Ngampruetikorn

    Abstract: Machine learning models are often brittle under distribution shift, i.e., when data distributions at test time differ from those during training. Understanding this failure mode is central to identifying and mitigating safety risks of mass adoption of machine learning. Here we analyze ridge regression under concept shift -- a form of distribution shift in which the input-label relationship changes… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 8 pages, 3 figures

  21. arXiv:2409.14403  [pdf, other

    cs.RO cs.CV

    GraspMamba: A Mamba-based Language-driven Grasp Detection Framework with Hierarchical Feature Learning

    Authors: Huy Hoang Nguyen, An Vuong, Anh Nguyen, Ian Reid, Minh Nhat Vu

    Abstract: Grasp detection is a fundamental robotic task critical to the success of many industrial applications. However, current language-driven models for this task often struggle with cluttered images, lengthy textual descriptions, or slow inference speed. We introduce GraspMamba, a new language-driven grasp detection method that employs hierarchical feature fusion with Mamba vision to tackle these chall… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: 8 pages. Project page: https://airvlab.github.io/grasp-anything/

  22. arXiv:2409.11697  [pdf, other

    cs.LG

    Monomial Matrix Group Equivariant Neural Functional Networks

    Authors: Hoang V. Tran, Thieu N. Vo, Tho H. Tran, An T. Nguyen, Tan Minh Nguyen

    Abstract: Neural functional networks (NFNs) have recently gained significant attention due to their diverse applications, ranging from predicting network generalization and network editing to classifying implicit neural representation. Previous NFN designs often depend on permutation symmetries in neural networks' weights, which traditionally arise from the unordered arrangement of neurons in hidden layers.… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  23. arXiv:2409.04418  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci

    Charting new regions of Cobalt's chemical space with maximally large magnetic anisotropy: A computational high-throughput study

    Authors: Lorenzo A. Mariano, Vu Ha Anh Nguyen, Valerio Briganti, Alessandro Lunghi

    Abstract: Magnetic anisotropy slows down magnetic relaxation and plays a prominent role in the design of permanent magnets. Coordination compounds of Co(II) in particular exhibit large magnetic anisotropy in the presence of low-coordination environments and have been used as single-molecule magnet prototypes. However, only a limited sampling of Cobalt's vast chemical space has been performed, potentially ob… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

  24. arXiv:2409.04383  [pdf, other

    physics.bio-ph cond-mat.dis-nn cond-mat.mtrl-sci cond-mat.soft

    Origin of yield stress and mechanical plasticity in biological tissues

    Authors: Anh Q. Nguyen, Junxiang Huang, Dapeng Bi

    Abstract: During development and under normal physiological conditions, biological tissues are continuously subjected to substantial mechanical stresses. In response to large deformations cells in a tissue must undergo multicellular rearrangements in order to maintain integrity and robustness. However, how these events are connected in time and space remains unknown. Here, using computational and theoretica… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

  25. arXiv:2409.04367  [pdf, other

    cs.LG cs.AI stat.ML

    Provable Hyperparameter Tuning for Structured Pfaffian Settings

    Authors: Maria-Florina Balcan, Anh Tuan Nguyen, Dravyansh Sharma

    Abstract: Data-driven algorithm design automatically adapts algorithms to specific application domains, achieving better performance. In the context of parameterized algorithms, this approach involves tuning the algorithm parameters using problem instances drawn from the problem distribution of the target application domain. While empirical evidence supports the effectiveness of data-driven algorithm design… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

  26. arXiv:2408.17391  [pdf, other

    nucl-ex physics.ins-det

    Two-neutrino double electron capture of $^{124}$Xe in the first LUX-ZEPLIN exposure

    Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, E. E. Barillier, K. Beattie, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger, B. Boxer, C. A. J. Brew , et al. (180 additional authors not shown)

    Abstract: The broad physics reach of the LUX-ZEPLIN (LZ) experiment covers rare phenomena beyond the direct detection of dark matter. We report precise measurements of the extremely rare decay of $^{124}$Xe through the process of two-neutrino double electron capture (2$ν$2EC), utilizing a $1.39\,\mathrm{kg} \times \mathrm{yr}$ isotopic exposure from the first LZ science run. A half-life of… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 15 pages, 3 figures

  27. arXiv:2408.15619  [pdf

    cs.LG

    Large-Scale Demand Prediction in Urban Rail using Multi-Graph Inductive Representation Learning

    Authors: Dang Viet Anh Nguyen, J. Victor Flensburg, Fabrizio Cerreto, Bianca Pascariu, Paola Pellegrini, Carlos Lima Azevedo, Filipe Rodrigues

    Abstract: With the expansion of cities over time, URT (Urban Rail Transit) networks have also grown significantly. Demand prediction plays an important role in supporting planning, scheduling, fleet management, and other operational decisions. In this study, we propose an Origin-Destination (OD) demand prediction model called Multi-Graph Inductive Representation Learning (mGraphSAGE) for large-scale URT net… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 18 pages, 3 figures

    MSC Class: 68T07; 90B06 ACM Class: H.4.2; I.2.6; J.4; C.2.1

  28. arXiv:2408.13687  [pdf, other

    quant-ph

    Quantum error correction below the surface code threshold

    Authors: Rajeev Acharya, Laleh Aghababaie-Beni, Igor Aleiner, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Abraham Asfaw, Nikita Astrakhantsev, Juan Atalaya, Ryan Babbush, Dave Bacon, Brian Ballard, Joseph C. Bardin, Johannes Bausch, Andreas Bengtsson, Alexander Bilmes, Sam Blackwell, Sergio Boixo, Gina Bortoli, Alexandre Bourassa, Jenna Bovaird, Leon Brill, Michael Broughton, David A. Browne , et al. (224 additional authors not shown)

    Abstract: Quantum error correction provides a path to reach practical quantum computing by combining multiple physical qubits into a logical qubit, where the logical error rate is suppressed exponentially as more qubits are added. However, this exponential suppression only occurs if the physical error rate is below a critical threshold. In this work, we present two surface code memories operating below this… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 10 pages, 4 figures, Supplementary Information

  29. arXiv:2408.13126  [pdf, other

    cs.CV

    CathAction: A Benchmark for Endovascular Intervention Understanding

    Authors: Baoru Huang, Tuan Vo, Chayun Kongtongvattana, Giulio Dagnino, Dennis Kundrat, Wenqiang Chi, Mohamed Abdelaziz, Trevor Kwok, Tudor Jianu, Tuong Do, Hieu Le, Minh Nguyen, Hoan Nguyen, Erman Tjiputra, Quang Tran, Jianyang Xie, Yanda Meng, Binod Bhattarai, Zhaorui Tan, Hongbin Liu, Hong Seng Gan, Wei Wang, Xi Yang, Qiufeng Wang, Jionglong Su , et al. (13 additional authors not shown)

    Abstract: Real-time visual feedback from catheterization analysis is crucial for enhancing surgical safety and efficiency during endovascular interventions. However, existing datasets are often limited to specific tasks, small scale, and lack the comprehensive annotations necessary for broader endovascular intervention understanding. To tackle these limitations, we introduce CathAction, a large-scale datase… ▽ More

    Submitted 30 August, 2024; v1 submitted 23 August, 2024; originally announced August 2024.

    Comments: 10 pages. Webpage: https://airvlab.github.io/cathaction/

  30. arXiv:2408.11753  [pdf, other

    math.ST stat.ME

    Small Sample Behavior of Wasserstein Projections, Connections to Empirical Likelihood, and Other Applications

    Authors: Sirui Lin, Jose Blanchet, Peter Glynn, Viet Anh Nguyen

    Abstract: The empirical Wasserstein projection (WP) distance quantifies the Wasserstein distance from the empirical distribution to a set of probability measures satisfying given expectation constraints. The WP is a powerful tool because it mitigates the curse of dimensionality inherent in the Wasserstein distance, making it valuable for various tasks, including constructing statistics for hypothesis testin… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  31. arXiv:2408.11747  [pdf, other

    cs.CV cs.AI

    Open-Ended 3D Point Cloud Instance Segmentation

    Authors: Phuc D. A. Nguyen, Minh Luu, Anh Tran, Cuong Pham, Khoi Nguyen

    Abstract: Open-Vocab 3D Instance Segmentation methods (OV-3DIS) have recently demonstrated their ability to generalize to unseen objects. However, these methods still depend on predefined class names during testing, restricting the autonomy of agents. To mitigate this constraint, we propose a novel problem termed Open-Ended 3D Instance Segmentation (OE-3DIS), which eliminates the necessity for predefined cl… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  32. arXiv:2408.11133  [pdf, other

    cs.IR cs.CL

    Public Health in Disaster: Emotional Health and Life Incidents Extraction during Hurricane Harvey

    Authors: Thomas Hoang, Quynh Anh Nguyen, Long Nguyen

    Abstract: Countless disasters have resulted from climate change, causing severe damage to infrastructure and the economy. These disasters have significant societal impacts, necessitating mental health services for the millions affected. To prepare for and respond effectively to such events, it is important to understand people's emotions and the life incidents they experience before and after a disaster str… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  33. arXiv:2408.06925  [pdf, other

    physics.ins-det

    Enhancing Material Screening at Boulby Underground Laboratory with XIA UltraLo-1800 Alpha Particle Detectors

    Authors: Sid El Moctar Ahmed Maouloud, XinRan Liu, Anh Nguyen, James Edward Young Dobson, Chamkaur Ghag, Léna Le Floch, Emma Meehan, Alexander St. John Murphy, Sean Paling, Ruben Saakyan, Paul Robert Scovell, Christopher Toth

    Abstract: The Boulby UnderGround Screening (BUGS) facility, located at the Boulby Underground Laboratory, has significantly advanced its material screening capabilities by installing two XIA UltraLo-1800 alpha particle detectors. This study presents a comprehensive evaluation of one of these detectors, operated 1,100 meters underground at the Boulby Underground Laboratory, which provides significant shieldi… ▽ More

    Submitted 27 September, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  34. arXiv:2408.05822  [pdf, other

    cs.LG cs.CV

    Sampling Foundational Transformer: A Theoretical Perspective

    Authors: Viet Anh Nguyen, Minh Lenhat, Khoa Nguyen, Duong Duc Hieu, Dao Huu Hung, Truong Son Hy

    Abstract: The versatility of self-attention mechanism earned transformers great success in almost all data modalities, with limitations on the quadratic complexity and difficulty of training. To apply transformers across different data modalities, practitioners have to make specific clever data-modality-dependent constructions. In this paper, we propose Sampling Foundational Transformer (SFT) that can work… ▽ More

    Submitted 17 August, 2024; v1 submitted 11 August, 2024; originally announced August 2024.

  35. arXiv:2408.05391  [pdf, other

    cs.LG

    SAMSA: Efficient Transformer for Many Data Modalities

    Authors: Minh Lenhat, Viet Anh Nguyen, Khoa Nguyen, Duong Duc Hieu, Dao Huu Hung, Truong Son Hy

    Abstract: The versatility of self-attention mechanism earned transformers great success in almost all data modalities, with limitations on the quadratic complexity and difficulty of training. Efficient transformers, on the other hand, often rely on clever data-modality-dependent construction to get over the quadratic complexity of transformers. This greatly hinders their applications on different data modal… ▽ More

    Submitted 18 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  36. arXiv:2408.04660  [pdf, other

    cs.CL cs.AI

    XMainframe: A Large Language Model for Mainframe Modernization

    Authors: Anh T. V. Dau, Hieu Trung Dao, Anh Tuan Nguyen, Hieu Trung Tran, Phong X. Nguyen, Nghi D. Q. Bui

    Abstract: Mainframe operating systems, despite their inception in the 1940s, continue to support critical sectors like finance and government. However, these systems are often viewed as outdated, requiring extensive maintenance and modernization. Addressing this challenge necessitates innovative tools that can understand and interact with legacy codebases. To this end, we introduce XMainframe, a state-of-th… ▽ More

    Submitted 26 August, 2024; v1 submitted 5 August, 2024; originally announced August 2024.

  37. arXiv:2408.03500  [pdf, other

    cs.CV

    e-Health CSIRO at RRG24: Entropy-Augmented Self-Critical Sequence Training for Radiology Report Generation

    Authors: Aaron Nicolson, Jinghui Liu, Jason Dowling, Anthony Nguyen, Bevan Koopman

    Abstract: The Shared Task on Large-Scale Radiology Report Generation (RRG24) aims to expedite the development of assistive systems for interpreting and reporting on chest X-ray (CXR) images. This task challenges participants to develop models that generate the findings and impression sections of radiology reports from CXRs from a patient's study, using five different datasets. This paper outlines the e-Heal… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  38. arXiv:2408.00239  [pdf, other

    astro-ph.GA

    Simulating intermediate black hole mass measurements for a sample of galaxies with nuclear star clusters using ELT/HARMONI high spatial resolution integral-field stellar kinematics

    Authors: Dieu D. Nguyen, Michele Cappellari, Hai N. Ngo, Tinh Q. T. Le, Khue N . H. Ho, An K. Nguyen, Huy G . Tong, Phong T. On, Tuan N. Le, Miguel Pereira-Santaella

    Abstract: The fraction of low-mass galaxies hosting an intermediate-mass black hole (IMBH, with masses $M_{\rm BH} \approx 10^2-10^5$ M$_\odot$), is sensitive to how black hole seeds formed in the early Universe but is observationally still unconstrained. In this paper, we assemble a sample of dwarf galaxies within 10 Mpc hosting bright nuclear star clusters (NSCs) that could host IMBHs. For a subset of the… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: 33 pages, 19 figures, 9 tables, submitted to MNRAS

  39. arXiv:2407.19877  [pdf, other

    cs.RO cs.CV

    Language-driven Grasp Detection with Mask-guided Attention

    Authors: Tuan Van Vo, Minh Nhat Vu, Baoru Huang, An Vuong, Ngan Le, Thieu Vo, Anh Nguyen

    Abstract: Grasp detection is an essential task in robotics with various industrial applications. However, traditional methods often struggle with occlusions and do not utilize language for grasping. Incorporating natural language into grasp detection remains a challenging task and largely unexplored. To address this gap, we propose a new method for language-driven grasp detection with mask-guided attention… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted at IROS 2024

  40. arXiv:2407.18892  [pdf, other

    cs.RO cs.AI eess.SY

    SHANGUS: Deep Reinforcement Learning Meets Heuristic Optimization for Speedy Frontier-Based Exploration of Autonomous Vehicles in Unknown Spaces

    Authors: Seunghyeop Nam, Tuan Anh Nguyen, Eunmi Choi, Dugki Min

    Abstract: This paper introduces SHANGUS, an advanced framework combining Deep Reinforcement Learning (DRL) with heuristic optimization to improve frontier-based exploration efficiency in unknown environments, particularly for intelligent vehicles in autonomous air services, search and rescue operations, and space exploration robotics. SHANGUS harnesses DRL's adaptability and heuristic prioritization, marked… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  41. arXiv:2407.18839  [pdf, other

    cs.CV

    Scalable Group Choreography via Variational Phase Manifold Learning

    Authors: Nhat Le, Khoa Do, Xuan Bui, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

    Abstract: Generating group dance motion from the music is a challenging task with several industrial applications. Although several methods have been proposed to tackle this problem, most of them prioritize optimizing the fidelity in dancing movement, constrained by predetermined dancer counts in datasets. This limitation impedes adaptability to real-world applications. Our study addresses the scalability p… ▽ More

    Submitted 31 July, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  42. arXiv:2407.17967  [pdf, other

    cs.RO cs.CV

    Lightweight Language-driven Grasp Detection using Conditional Consistency Model

    Authors: Nghia Nguyen, Minh Nhat Vu, Baoru Huang, An Vuong, Ngan Le, Thieu Vo, Anh Nguyen

    Abstract: Language-driven grasp detection is a fundamental yet challenging task in robotics with various industrial applications. In this work, we present a new approach for language-driven grasp detection that leverages the concept of lightweight diffusion models to achieve fast inference time. By integrating diffusion processes with grasping prompts in natural language, our method can effectively encode v… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted at IROS 2024

  43. arXiv:2407.17053  [pdf, other

    cs.SE cs.CR cs.LG

    Automated Code-centric Software Vulnerability Assessment: How Far Are We? An Empirical Study in C/C++

    Authors: Anh The Nguyen, Triet Huynh Minh Le, M. Ali Babar

    Abstract: Background: The C and C++ languages hold significant importance in Software Engineering research because of their widespread use in practice. Numerous studies have utilized Machine Learning (ML) and Deep Learning (DL) techniques to detect software vulnerabilities (SVs) in the source code written in these languages. However, the application of these techniques in function-level SV assessment has be… ▽ More

    Submitted 3 August, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: Accepted as a full paper in the technical track at The International Symposium on Empirical Software Engineering and Measurement (ESEM) 2024

  44. arXiv:2407.16803  [pdf, other

    cs.CV cs.AI cs.HC cs.LG eess.SP

    Fusion and Cross-Modal Transfer for Zero-Shot Human Action Recognition

    Authors: Abhi Kamboj, Anh Duy Nguyen, Minh Do

    Abstract: Despite living in a multi-sensory world, most AI models are limited to textual and visual interpretations of human motion and behavior. Inertial measurement units (IMUs) provide a salient signal to understand human motion; however, they are challenging to use due to their uninterpretability and scarcity of their data. We investigate a method to transfer knowledge between visual and inertial modali… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  45. arXiv:2407.13842  [pdf, other

    cs.RO cs.CV

    Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance

    Authors: Toan Nguyen, Minh Nhat Vu, Baoru Huang, An Vuong, Quan Vuong, Ngan Le, Thieu Vo, Anh Nguyen

    Abstract: 6-DoF grasp detection has been a fundamental and challenging problem in robotic vision. While previous works have focused on ensuring grasp stability, they often do not consider human intention conveyed through natural language, hindering effective collaboration between robots and users in complex 3D environments. In this paper, we present a new approach for language-driven 6-DoF grasp detection i… ▽ More

    Submitted 25 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  46. arXiv:2407.12064  [pdf, other

    eess.IV cs.CL cs.CV cs.LG cs.MM

    LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

    Authors: Khai Le-Duc, Ryan Zhang, Ngoc Son Nguyen, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy

    Abstract: Vision-language models have been extensively explored across a wide range of tasks, achieving satisfactory performance; however, their application in medical imaging remains underexplored. In this work, we propose a unified framework - LiteGPT - for the medical imaging. We leverage multiple pre-trained visual encoders to enrich information and enhance the performance of vision-language models. To… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Preprint, 19 pages

  47. arXiv:2407.10709  [pdf, other

    cs.CV

    Detecting Omissions in Geographic Maps through Computer Vision

    Authors: Phuc D. A. Nguyen, Anh Do, Minh Hoai

    Abstract: This paper explores the application of computer vision technologies to the analysis of maps, an area with substantial historical, cultural, and political significance. Our focus is on developing and evaluating a method for automatically identifying maps that depict specific regions and feature landmarks with designated names, a task that involves complex challenges due to the diverse styles and me… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: VinMap dataset: https://github.com/VinAIResearch/VinMap

  48. arXiv:2407.09035  [pdf, other

    eess.IV cs.CV

    GPC: Generative and General Pathology Image Classifier

    Authors: Anh Tien Nguyen, Jin Tae Kwak

    Abstract: Deep learning has been increasingly incorporated into various computational pathology applications to improve its efficiency, accuracy, and robustness. Although successful, most previous approaches for image classification have crucial drawbacks. There exist numerous tasks in pathology, but one needs to build a model per task, i.e., a task-specific model, thereby increasing the number of models, t… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: MICCAI-MedAGI 2023 (Best Paper Honorable Mention)

  49. arXiv:2407.09030  [pdf, other

    eess.IV cs.CV

    CAMP: Continuous and Adaptive Learning Model in Pathology

    Authors: Anh Tien Nguyen, Keunho Byeon, Kyungeun Kim, Boram Song, Seoung Wan Chae, Jin Tae Kwak

    Abstract: There exist numerous diagnostic tasks in pathology. Conventional computational pathology formulates and tackles them as independent and individual image classification problems, thereby resulting in computational inefficiency and high costs. To address the challenges, we propose a generic, unified, and universal framework, called a continuous and adaptive learning model in pathology (CAMP), for pa… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Under review

  50. arXiv:2407.08567  [pdf, other

    cs.CV cs.LG

    Adaptive Parametric Activation

    Authors: Konstantinos Panagiotis Alexandridis, Jiankang Deng, Anh Nguyen, Shan Luo

    Abstract: The activation function plays a crucial role in model optimisation, yet the optimal choice remains unclear. For example, the Sigmoid activation is the de-facto activation in balanced classification tasks, however, in imbalanced classification, it proves inappropriate due to bias towards frequent classes. In this work, we delve deeper in this phenomenon by performing a comprehensive statistical ana… ▽ More

    Submitted 9 October, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: ECCV2024 Oral