Skip to main content

Showing 1–17 of 17 results for author: Nguyen, D M H

  1. arXiv:2410.02615  [pdf, other

    cs.LG

    LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model

    Authors: Duy M. H. Nguyen, Nghiem T. Diep, Trung Q. Nguyen, Hoang-Bao Le, Tai Nguyen, Tien Nguyen, TrungTin Nguyen, Nhat Ho, Pengtao Xie, Roger Wattenhofer, James Zhou, Daniel Sonntag, Mathias Niepert

    Abstract: State-of-the-art medical multi-modal large language models (med-MLLM), like LLaVA-Med or BioMedGPT, leverage instruction-following data in pre-training. However, those models primarily focus on scaling the model size and data volume to boost performance while mainly relying on the autoregressive learning objectives. Surprisingly, we reveal that such learning schemes might result in a weak alignmen… ▽ More

    Submitted 6 October, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: First version, fixed typo

  2. arXiv:2407.04489  [pdf, other

    cs.CV

    Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model

    Authors: Duy M. H. Nguyen, An T. Le, Trung Q. Nguyen, Nghiem T. Diep, Tai Nguyen, Duy Duong-Tran, Jan Peters, Li Shen, Mathias Niepert, Daniel Sonntag

    Abstract: Prompt learning methods are gaining increasing attention due to their ability to customize large vision-language models to new domains using pre-trained contextual knowledge and minimal training data. However, existing works typically rely on optimizing unified prompt inputs, often struggling with fine-grained classification tasks due to insufficient discriminative attributes. To tackle this, we c… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Version 1

  3. arXiv:2406.06239  [pdf, other

    cs.CV

    I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data

    Authors: Hoang H. Le, Duy M. H. Nguyen, Omair Shahzad Bhatti, Laszlo Kopacsi, Thinh P. Ngo, Binh T. Nguyen, Michael Barz, Daniel Sonntag

    Abstract: Comprehending how humans process visual information in dynamic settings is crucial for psychology and designing user-centered interactions. While mobile eye-tracking systems combining egocentric video and gaze signals can offer valuable insights, manual analysis of these recordings is time-intensive. In this work, we present a novel human-centered learning algorithm designed for automated object r… ▽ More

    Submitted 7 July, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Updated version

  4. arXiv:2405.16148  [pdf, other

    cs.LG

    Accelerating Transformers with Spectrum-Preserving Token Merging

    Authors: Hoai-Chau Tran, Duy M. H. Nguyen, Duy M. Nguyen, Trung-Tin Nguyen, Ngan Le, Pengtao Xie, Daniel Sonntag, James Y. Zou, Binh T. Nguyen, Mathias Niepert

    Abstract: Increasing the throughput of the Transformer architecture, a foundational component used in numerous state-of-the-art models for vision and language tasks (e.g., GPT, LLaVa), is an important problem in machine learning. One recent and effective strategy is to merge token representations within Transformer models, aiming to reduce computational and memory requirements while maintaining accuracy. Pr… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Version 1

  5. arXiv:2402.01975  [pdf, other

    cs.LG

    Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks

    Authors: Duy M. H. Nguyen, Nina Lukashina, Tai Nguyen, An T. Le, TrungTin Nguyen, Nhat Ho, Jan Peters, Daniel Sonntag, Viktor Zaverkin, Mathias Niepert

    Abstract: A molecule's 2D representation consists of its atoms, their attributes, and the molecule's covalent bonds. A 3D (geometric) representation of a molecule is called a conformer and consists of its atom types and Cartesian coordinates. Every conformer has a potential energy, and the lower this energy, the more likely it occurs in nature. Most existing machine learning methods for molecular property p… ▽ More

    Submitted 19 August, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024 (updated version)

  6. arXiv:2311.11096  [pdf, other

    eess.IV cs.CV

    On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation

    Authors: Duy Minh Ho Nguyen, Tan Ngoc Pham, Nghiem Tuong Diep, Nghi Quoc Phan, Quang Pham, Vinh Tong, Binh T. Nguyen, Ngan Hoang Le, Nhat Ho, Pengtao Xie, Daniel Sonntag, Mathias Niepert

    Abstract: Constructing a robust model that can effectively generalize to test samples under distribution shifts remains a significant challenge in the field of medical imaging. The foundational models for vision and language, pre-trained on extensive sets of natural image and text data, have emerged as a promising approach. It showcases impressive learning abilities across different tasks with the need for… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Advances in Neural Information Processing Systems (NeurIPS) 2023, Workshop on robustness of zero/few-shot learning in foundation models

  7. arXiv:2306.11925  [pdf, other

    cs.CV

    LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching

    Authors: Duy M. H. Nguyen, Hoang Nguyen, Nghiem T. Diep, Tan N. Pham, Tri Cao, Binh T. Nguyen, Paul Swoboda, Nhat Ho, Shadi Albarqouni, Pengtao Xie, Daniel Sonntag, Mathias Niepert

    Abstract: Obtaining large pre-trained models that can be fine-tuned to new tasks with limited annotated samples has remained an open challenge for medical imaging data. While pre-trained deep networks on ImageNet and vision-language foundation models trained on web-scale data are prevailing approaches, their effectiveness on medical tasks is limited due to the significant domain shift between natural and me… ▽ More

    Submitted 18 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPS 2023

  8. arXiv:2302.13270  [pdf, other

    math.SG math-ph math.DS

    Integrable Systems Arising from Separation of Variables on $S^{3}$

    Authors: Diana M. H. Nguyen, Sean R. Dawson, Holger R. Dullin

    Abstract: We show that the space of orthogonally separable coordinates on the sphere $S^3$ induces a natural family of integrable systems, which after symplectic reduction leads to a family of integrable systems on $S^2 \times S^2$. The generic member of the family corresponds to ellipsoidal coordinates. We use the theory of compatible Poisson structures to study the critical points and critical values of t… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

    Comments: 37 pages, 11 figures

    MSC Class: 37J15; 37J35; 53D20; 53D22; 81Q20

  9. arXiv:2212.14615  [pdf, other

    cs.CV

    DRG-Net: Interactive Joint Learning of Multi-lesion Segmentation and Classification for Diabetic Retinopathy Grading

    Authors: Hasan Md Tusfiqur, Duy M. H. Nguyen, Mai T. N. Truong, Triet A. Nguyen, Binh T. Nguyen, Michael Barz, Hans-Juergen Profitlich, Ngoc T. T. Than, Ngan Le, Pengtao Xie, Daniel Sonntag

    Abstract: Diabetic Retinopathy (DR) is a leading cause of vision loss in the world, and early DR detection is necessary to prevent vision loss and support an appropriate treatment. In this work, we leverage interactive machine learning and introduce a joint learning framework, termed DRG-Net, to effectively learn both disease grading and multi-lesion segmentation. Our DRG-Net consists of two modules: (i) DR… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

    Comments: First version

  10. arXiv:2212.01893  [pdf, other

    cs.CV

    Joint Self-Supervised Image-Volume Representation Learning with Intra-Inter Contrastive Clustering

    Authors: Duy M. H. Nguyen, Hoang Nguyen, Mai T. N. Truong, Tri Cao, Binh T. Nguyen, Nhat Ho, Paul Swoboda, Shadi Albarqouni, Pengtao Xie, Daniel Sonntag

    Abstract: Collecting large-scale medical datasets with fully annotated samples for training of deep networks is prohibitively expensive, especially for 3D volume data. Recent breakthroughs in self-supervised learning (SSL) offer the ability to overcome the lack of labeled training samples by learning feature representations from unlabeled data. However, most current SSL techniques in the medical field have… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: Accepted at AAAI 2023

  11. The Harmonic Lagrange Top and the Confluent Heun Equation

    Authors: Sean R. Dawson, Holger R. Dullin, Diana M. H. Nguyen

    Abstract: The harmonic Lagrange top is the Lagrange top plus a quadratic (harmonic) potential term. We describe the top in the space fixed frame using a global description with a Poisson structure on $T^*S^3$. This global description naturally leads to a rational parametrisation of the set of critical values of the energy-momentum map. We show that there are 4 different topological types for generic paramet… ▽ More

    Submitted 10 December, 2021; v1 submitted 28 November, 2021; originally announced November 2021.

    Comments: 14 pages, 3 figures. Revision 1: Corrected inconsistent notation in Section 7 and fixed Theorem 4

    MSC Class: 70E40; 70E17

  12. arXiv:2111.11892  [pdf, other

    cs.CV

    LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking

    Authors: Duy M. H. Nguyen, Roberto Henschel, Bodo Rosenhahn, Daniel Sonntag, Paul Swoboda

    Abstract: Multi-Camera Multi-Object Tracking is currently drawing attention in the computer vision field due to its superior performance in real-world applications such as video surveillance in crowded scenes or in wide spaces. In this work, we propose a mathematically elegant multi-camera multiple object tracking approach based on a spatial-temporal lifted multicut formulation. Our model utilizes state-of-… ▽ More

    Submitted 3 May, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: Official version for CVPR 2022

  13. arXiv:2107.09372  [pdf, other

    cs.CV

    Self-Supervised Domain Adaptation for Diabetic Retinopathy Grading using Vessel Image Reconstruction

    Authors: Duy M. H. Nguyen, Truong T. N. Mai, Ngoc T. T. Than, Alexander Prange, Daniel Sonntag

    Abstract: This paper investigates the problem of domain adaptation for diabetic retinopathy (DR) grading. We learn invariant target-domain features by defining a novel self-supervised task based on retinal vessel image reconstructions, inspired by medical domain knowledge. Then, a benchmark of current state-of-the-art unsupervised domain adaptation methods on the DR problem is provided. It can be shown that… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  14. arXiv:2104.01641  [pdf, other

    cs.CV

    TATL: Task Agnostic Transfer Learning for Skin Attributes Detection

    Authors: Duy M. H. Nguyen, Thu T. Nguyen, Huong Vu, Quang Pham, Manh-Duy Nguyen, Binh T. Nguyen, Daniel Sonntag

    Abstract: Existing skin attributes detection methods usually initialize with a pre-trained Imagenet network and then fine-tune on a medical target task. However, we argue that such approaches are suboptimal because medical datasets are largely different from ImageNet and often contain limited training samples. In this work, we propose \emph{Task Agnostic Transfer Learning (TATL)}, a novel framework motivate… ▽ More

    Submitted 27 January, 2022; v1 submitted 4 April, 2021; originally announced April 2021.

    Comments: This version has been accepted at Medical Image Analysis

  15. arXiv:2009.11008  [pdf, other

    eess.IV cs.CV cs.HC

    An Attention Mechanism with Multiple Knowledge Sources for COVID-19 Detection from CT Images

    Authors: Duy M. H. Nguyen, Duy M. Nguyen, Huong Vu, Binh T. Nguyen, Fabrizio Nunnari, Daniel Sonntag

    Abstract: Until now, Coronavirus SARS-CoV-2 has caused more than 850,000 deaths and infected more than 27 million individuals in over 120 countries. Besides principal polymerase chain reaction (PCR) tests, automatically identifying positive samples based on computed tomography (CT) scans can present a promising option in the early diagnosis of COVID-19. Recently, there have been increasing efforts to utiliz… ▽ More

    Submitted 1 December, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: In AAAI 2021 Workshop: Trustworthy AI for Healthcare

  16. arXiv:2001.11270  [pdf, other

    math-ph math.DS

    Monodromy in Prolate Spheroidal Harmonics

    Authors: Sean R. Dawson, Holger R. Dullin, Diana M. H. Nguyen

    Abstract: We show that spheroidal wave functions viewed as the essential part of the joint eigenfunction of two commuting operators of $L_2(S^2)$ has a defect in the joint spectrum that makes a global labelling of the joint eigenfunctions by quantum numbers impossible. To our knowledge this is the first explicit demonstration that quantum monodromy exists in a class of classically known special functions. U… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

    Comments: 26 pages, 11 figures

    MSC Class: 37J15; 37J35; 53D20; 53D22; 81Q20

    Journal ref: Stud. Appl. Math.146 (2021) 953-982

  17. The Lie-Poisson Structure of the Symmetry Reduced Regularised n-Body Problem

    Authors: Suntharan Arunasalam, Holger R. Dullin, Diana M. H. Nguyen

    Abstract: This paper investigates the symmetry reduction of the regularised n-body problem. The three body problem, regularised through quaternions, is examined in detail. We show that for a suitably chosen symmetry group action the space of quadratic invariants is closed and the Hamiltonian can be written in terms of the quadratic invariants. The corresponding Lie-Poisson structure is isomorphic to the Lie… ▽ More

    Submitted 13 August, 2014; originally announced August 2014.

    Journal ref: J. Phys. A, 48(6):065202, 2015