Skip to main content

Showing 1–50 of 82 results for author: Ngo, D

  1. arXiv:2410.09913  [pdf, other

    cs.CV

    Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition

    Authors: Kha Nhat Le, Hoang-Tuan Nguyen, Hung Tien Tran, Thanh Duc Ngo

    Abstract: Unsupervised domain adaptation (UDA) has become increasingly prevalent in scene text recognition (STR), especially where training and testing data reside in different domains. The efficacy of existing UDA approaches tends to degrade when there is a large gap between the source and target domains. To deal with this problem, gradually shifting or progressively learning to shift from domain to domain… ▽ More

    Submitted 17 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

    Comments: 15 pages, 12 figures, 5 tables, include supplementary materials

  2. arXiv:2409.14700  [pdf, other

    cs.CR

    Adaptive and Robust Watermark for Generative Tabular Data

    Authors: Dung Daniel Ngo, Daniel Scott, Saheed Obitayo, Vamsi K. Potluru, Manuela Veloso

    Abstract: Recent developments in generative models have demonstrated its ability to create high-quality synthetic data. However, the pervasiveness of synthetic content online also brings forth growing concerns that it can be used for malicious purposes. To ensure the authenticity of the data, watermarking techniques have recently emerged as a promising solution due to their strong statistical guarantees. In… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 12 pages of main body, 2 figures, 5 tables

  3. arXiv:2408.16408  [pdf

    physics.app-ph

    High-yield large-scale suspended graphene membranes over closed cavities for sensor applications

    Authors: Sebastian Lukas, Ardeshir Esteki, Nico Rademacher, Vikas Jangra, Michael Gross, Zhenxing Wang, Ha Duong Ngo, Manuel Bäuscher, Piotr Mackowiak, Katrin Höppner, Dominique Wehenkel, Richard van Rijn, Max C. Lemme

    Abstract: Suspended membranes of monoatomic graphene exhibit great potential for applications in electronic and nanoelectromechanical devices. In this work, a "hot and dry" transfer process is demonstrated to address the fabrication and patterning challenges of large-area graphene membranes on top of closed, sealed cavities. Here, "hot" refers to the use of high temperature during transfer, promoting the ad… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 30 pages of manuscript plus 17 pages of Supporting Information

  4. arXiv:2407.01963  [pdf, other

    eess.AS

    Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders

    Authors: Phat Lam, Lam Pham, Truong Nguyen, Dat Ngo, Thinh Pham, Tin Nguyen, Loi Khanh Nguyen, Alexander Schindler

    Abstract: Existing speaker diarization systems typically rely on large amounts of manually annotated data, which is labor-intensive and difficult to obtain, especially in real-world scenarios. Additionally, language-specific constraints in these systems significantly hinder their effectiveness and scalability in multilingual settings. In this paper, we propose a cluster-based speaker diarization system desi… ▽ More

    Submitted 12 September, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Preprint, 14 pages, 6 figures

  5. arXiv:2405.19667  [pdf, other

    cs.LG cs.AI

    Reconciling Model Multiplicity for Downstream Decision Making

    Authors: Ally Yalei Du, Dung Daniel Ngo, Zhiwei Steven Wu

    Abstract: We consider the problem of model multiplicity in downstream decision-making, a setting where two predictive models of equivalent accuracy cannot agree on the best-response action for a downstream loss function. We show that even when the two predictive models approximately agree on their individual predictions almost everywhere, it is still possible for their induced best-response actions to diffe… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 16 pages main body, 6 figures

  6. arXiv:2405.08843  [pdf, other

    cs.LG cs.AI cs.NI

    FLEXIBLE: Forecasting Cellular Traffic by Leveraging Explicit Inductive Graph-Based Learning

    Authors: Duc Thinh Ngo, Kandaraj Piamrat, Ons Aouedi, Thomas Hassan, Philippe Raipin-Parvédy

    Abstract: From a telecommunication standpoint, the surge in users and services challenges next-generation networks with escalating traffic demands and limited resources. Accurate traffic prediction can offer network operators valuable insights into network conditions and suggest optimal allocation policies. Recently, spatio-temporal forecasting, employing Graph Neural Networks (GNNs), has emerged as a promi… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  7. arXiv:2403.00379  [pdf, other

    eess.AS cs.SD

    The Impact of Frequency Bands on Acoustic Anomaly Detection of Machines using Deep Learning Based Model

    Authors: Tin Nguyen, Lam Pham, Phat Lam, Dat Ngo, Hieu Tang, Alexander Schindler

    Abstract: In this paper, we propose a deep learning based model for Acoustic Anomaly Detection of Machines, the task for detecting abnormal machines by analysing the machine sound. By conducting extensive experiments, we indicate that multiple techniques of pseudo audios, audio segment, data augmentation, Mahalanobis distance, and narrow frequency bands, which mainly focus on feature engineering, are effect… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  8. arXiv:2402.12179  [pdf, other

    cs.CV cs.AI cs.CY

    Examining Monitoring System: Detecting Abnormal Behavior In Online Examinations

    Authors: Dinh An Ngo, Thanh Dat Nguyen, Thi Le Chi Dang, Huy Hoan Le, Ton Bao Ho, Vo Thanh Khang Nguyen, Truong Thanh Hung Nguyen

    Abstract: Cheating in online exams has become a prevalent issue over the past decade, especially during the COVID-19 pandemic. To address this issue of academic dishonesty, our "Exam Monitoring System: Detecting Abnormal Behavior in Online Examinations" is designed to assist proctors in identifying unusual student behavior. Our system demonstrates high accuracy and speed in detecting cheating in real-time s… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  9. arXiv:2312.16307  [pdf, other

    econ.EM cs.GT cs.LG stat.ME

    Incentive-Aware Synthetic Control: Accurate Counterfactual Estimation via Incentivized Exploration

    Authors: Daniel Ngo, Keegan Harris, Anish Agarwal, Vasilis Syrgkanis, Zhiwei Steven Wu

    Abstract: We consider the setting of synthetic control methods (SCMs), a canonical approach used to estimate the treatment effect on the treated in a panel data setting. We shed light on a frequently overlooked but ubiquitous assumption made in SCMs of "overlap": a treated unit can be written as some combination -- typically, convex or linear combination -- of the units that remain under control. We show th… ▽ More

    Submitted 13 February, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  10. arXiv:2312.10671  [pdf, other

    cs.CV

    Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance

    Authors: Phuc D. A. Nguyen, Tuan Duc Ngo, Evangelos Kalogerakis, Chuang Gan, Anh Tran, Cuong Pham, Khoi Nguyen

    Abstract: We introduce Open3DIS, a novel solution designed to tackle the problem of Open-Vocabulary Instance Segmentation within 3D scenes. Objects within 3D environments exhibit diverse shapes, scales, and colors, making precise instance-level identification a challenging task. Recent advancements in Open-Vocabulary scene understanding have made significant strides in this area by employing class-agnostic… ▽ More

    Submitted 5 April, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Project page: https://open3dis.github.io/

  11. arXiv:2310.00467  [pdf, ps, other

    cs.IT cs.DC

    New results on Erasure Combinatorial Batch Codes

    Authors: Phuc-Lu Le, Son Hoang Dau, Hy Dinh Ngo, Thuc D. Nguyen

    Abstract: We investigate in this work the problem of Erasure Combinatorial Batch Codes, in which $n$ files are stored on $m$ servers so that every set of $n-r$ servers allows a client to retrieve at most $k$ distinct files by downloading at most $t$ files from each server. Previous studies have solved this problem for the special case of $t=1$ using Combinatorial Batch Codes. We tackle the general case… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: Allerton conference

  12. arXiv:2307.13251  [pdf, other

    cs.CV cs.AI

    GaPro: Box-Supervised 3D Point Cloud Instance Segmentation Using Gaussian Processes as Pseudo Labelers

    Authors: Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen

    Abstract: Instance segmentation on 3D point clouds (3DIS) is a longstanding challenge in computer vision, where state-of-the-art methods are mainly based on full supervision. As annotating ground truth dense instance masks is tedious and expensive, solving 3DIS with weak supervision has become more practical. In this paper, we propose GaPro, a new instance segmentation for 3D point clouds using axis-aligned… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023

  13. arXiv:2306.14929  [pdf, other

    cs.SD eess.AS

    A Deep Learning Architecture with Spatio-Temporal Focusing for Detecting Respiratory Anomalies

    Authors: Dat Ngo, Lam Pham, Huy Phan, Minh Tran, Delaram Jarchi

    Abstract: This paper presents a deep learning system applied for detecting anomalies from respiratory sound recordings. Our system initially performs audio feature extraction using Continuous Wavelet transformation. This transformation converts the respiratory sound input into a two-dimensional spectrogram where both spectral and temporal features are presented. Then, our proposed deep learning architecture… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2303.04104

  14. arXiv:2305.09463  [pdf, other

    cs.SD cs.AI eess.AS

    Low-complexity deep learning frameworks for acoustic scene classification using teacher-student scheme and multiple spectrograms

    Authors: Lam Pham, Dat Ngo, Cam Le, Anahid Jalali, Alexander Schindler

    Abstract: In this technical report, a low-complexity deep learning system for acoustic scene classification (ASC) is presented. The proposed system comprises two main phases: (Phase I) Training a teacher network; and (Phase II) training a student network using distilled knowledge from the teacher. In the first phase, the teacher, which presents a large footprint model, is trained. After training the teacher… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2206.06057

  15. arXiv:2305.06827  [pdf, other

    cs.LG

    A Generic Approach to Integrating Time into Spatial-Temporal Forecasting via Conditional Neural Fields

    Authors: Minh-Thanh Bui, Duc-Thinh Ngo, Demin Lu, Zonghua Zhang

    Abstract: Self-awareness is the key capability of autonomous systems, e.g., autonomous driving network, which relies on highly efficient time series forecasting algorithm to enable the system to reason about the future state of the environment, as well as its effect on the system behavior as time progresses. Recently, a large number of forecasting algorithms using either convolutional neural networks or gra… ▽ More

    Submitted 17 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  16. arXiv:2305.01476  [pdf, other

    cs.SD cs.MM eess.AS

    Deep Learning Based Multimodal with Two-phase Training Strategy for Daily Life Video Classification

    Authors: Lam Pham, Trang Le, Cam Le, Dat Ngo, Weissenfeld Axel, Alexander Schindler

    Abstract: In this paper, we present a deep learning based multimodal system for classifying daily life videos. To train the system, we propose a two-phase training strategy. In the first training phase (Phase I), we extract the audio and visual (image) data from the original video. We then train the audio data and the visual data with independent deep learning based models. After the training processes, we… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  17. Instance-level Few-shot Learning with Class Hierarchy Mining

    Authors: Anh-Khoa Nguyen Vu, Thanh-Toan Do, Nhat-Duy Nguyen, Vinh-Tiep Nguyen, Thanh Duc Ngo, Tam V. Nguyen

    Abstract: Few-shot learning is proposed to tackle the problem of scarce training data in novel classes. However, prior works in instance-level few-shot learning have paid less attention to effectively utilizing the relationship between categories. In this paper, we exploit the hierarchical information to leverage discriminative and relevant features of base classes to effectively classify novel objects. The… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: accepted by IEEE Transactions on Image Processing

  18. The Art of Camouflage: Few-Shot Learning for Animal Detection and Segmentation

    Authors: Thanh-Danh Nguyen, Anh-Khoa Nguyen Vu, Nhat-Duy Nguyen, Vinh-Tiep Nguyen, Thanh Duc Ngo, Thanh-Toan Do, Minh-Triet Tran, Tam V. Nguyen

    Abstract: Camouflaged object detection and segmentation is a new and challenging research topic in computer vision. There is a serious issue of lacking data on concealed objects such as camouflaged animals in natural scenes. In this paper, we address the problem of few-shot learning for camouflaged object detection and segmentation. To this end, we first collect a new dataset, CAMO-FS, for the benchmark. As… ▽ More

    Submitted 5 August, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: IEEE Access 2024

  19. arXiv:2303.04104  [pdf, other

    cs.SD cs.LG eess.AS q-bio.QM

    An Inception-Residual-Based Architecture with Multi-Objective Loss for Detecting Respiratory Anomalies

    Authors: Dat Ngo, Lam Pham, Huy Phan, Minh Tran, Delaram Jarchi, Sefki Kolozali

    Abstract: This paper presents a deep learning system applied for detecting anomalies from respiratory sound recordings. Initially, our system begins with audio feature extraction using Gammatone and Continuous Wavelet transformation. This step aims to transform the respiratory sound input into a two-dimensional spectrogram where both spectral and temporal features are presented. Then, our proposed system in… ▽ More

    Submitted 19 June, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  20. arXiv:2303.00246  [pdf, other

    cs.CV

    ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution

    Authors: Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen

    Abstract: Existing 3D instance segmentation methods are predominated by the bottom-up design -- manually fine-tuned algorithm to group points into clusters followed by a refinement network. However, by relying on the quality of the clusters, these methods generate susceptible results when (1) nearby objects with the same semantic class are packed together, or (2) large objects with loosely connected regions… ▽ More

    Submitted 26 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  21. arXiv:2302.13028  [pdf, other

    cs.CV cs.AI cs.LG

    A Light-weight Deep Learning Model for Remote Sensing Image Classification

    Authors: Lam Pham, Cam Le, Dat Ngo, Anh Nguyen, Jasmin Lampert, Alexander Schindler, Ian McLoughlin

    Abstract: In this paper, we present a high-performance and light-weight deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the aerial scene of a remote sensing image. To this end, we first valuate various benchmark convolutional neural network (CNN) architectures: MobileNet V1/V2, ResNet 50/151V2, InceptionV3/InceptionResNetV2, EfficientNet B0/B7, DenseNet 121/201, C… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

  22. arXiv:2302.08533  [pdf, other

    cs.LG cs.DC

    Federated Learning as a Network Effects Game

    Authors: Shengyuan Hu, Dung Daniel Ngo, Shuran Zheng, Virginia Smith, Zhiwei Steven Wu

    Abstract: Federated Learning (FL) aims to foster collaboration among a population of clients to improve the accuracy of machine learning without directly sharing local data. Although there has been rich literature on designing federated learning algorithms, most prior works implicitly assume that all clients are willing to participate in a FL scheme. In practice, clients may not benefit from joining in FL,… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 14 pages of main text, 26 pages in total

  23. Optimal sizing of renewable energy storage: A comparative study of hydrogen and battery system considering degradation and seasonal storage

    Authors: Son Tay Le, Tuan Ngoc Nguyen, Dac-Khuong Bui, Tuan Duc Ngo

    Abstract: Renewable energy storage (RES) is essential to address the intermittence issues of renewable energy systems, thereby enhancing the system stability and reliability. This study presents an optimisation study of sizing and operational strategy parameters of a grid-connected photovoltaic (PV)-hydrogen/battery systems using a Multi-Objective Modified Firefly Algorithm (MOMFA). An operational strategy… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

  24. arXiv:2209.03672  [pdf

    cond-mat.str-el

    Observation of strange metal in hole-doped valley-spin insulator

    Authors: Tuan Dung Nguyen, Baithi Mallesh, Seon Je Kim, Houcine Bouzid, Byeongwook Cho, Xuan Phu Le, Tien Dat Ngo, Won Jong Yoo, Young-Min Kim, Dinh Loc Duong, Young Hee Lee

    Abstract: Temperature-linear resistance at low temperatures in strange metals is an exotic characteristic of strong correlation systems, as observed in high-TC superconducting cuprates, heavy fermions, Fe-based superconductors, ruthenates, and twisted bilayer graphene. Here, we introduce a hole-doped valley-spin insulator, V-doped WSe2, with hole pockets in the valence band. The strange metal characteristic… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: 8 pages, 4 figures + Supplemental Material

  25. arXiv:2208.03403  [pdf, other

    cs.CV

    Slice-level Detection of Intracranial Hemorrhage on CT Using Deep Descriptors of Adjacent Slices

    Authors: Dat T. Ngo, Thao T. B. Nguyen, Hieu T. Nguyen, Dung B. Nguyen, Ha Q. Nguyen, Hieu H. Pham

    Abstract: The rapid development in representation learning techniques such as deep neural networks and the availability of large-scale, well-annotated medical imaging datasets have to a rapid increase in the use of supervised machine learning in the 3D medical image analysis and diagnosis. In particular, deep convolutional neural networks (D-CNNs) have been key players and were adopted by the medical imagin… ▽ More

    Submitted 17 April, 2023; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: Accepted for presentation at the 22nd IEEE Statistical Signal Processing (SSP) workshop

  26. arXiv:2206.13392  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Remote Sensing Image Classification using Transfer Learning and Attention Based Deep Neural Network

    Authors: Lam Pham, Khoa Tran, Dat Ngo, Jasmin Lampert, Alexander Schindler

    Abstract: The task of remote sensing image scene classification (RSISC), which aims at classifying remote sensing images into groups of semantic categories based on their contents, has taken the important role in a wide range of applications such as urban planning, natural hazards detection, environment monitoring,vegetation mapping, or geospatial object detection. During the past years, research community… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  27. arXiv:2206.06057  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Low-complexity deep learning frameworks for acoustic scene classification

    Authors: Lam Pham, Dat Ngo, Anahid Jalali, Alexander Schindler

    Abstract: In this report, we presents low-complexity deep learning frameworks for acoustic scene classification (ASC). The proposed frameworks can be separated into four main steps: Front-end spectrogram extraction, online data augmentation, back-end classification, and late fusion of predicted probabilities. In particular, we initially transform audio recordings into Mel, Gammatone, and CQT spectrograms. N… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  28. arXiv:2206.00494  [pdf, ps, other

    cs.LG

    Incentivizing Combinatorial Bandit Exploration

    Authors: Xinyan Hu, Dung Daniel Ngo, Aleksandrs Slivkins, Zhiwei Steven Wu

    Abstract: Consider a bandit algorithm that recommends actions to self-interested users in a recommendation system. The users are free to choose other actions and need to be incentivized to follow the algorithm's recommendations. While the users prefer to exploit, the algorithm can incentivize them to explore by leveraging the information collected from the previous users. All published work on this problem,… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 9 pages of main text, 21 pages in total

  29. arXiv:2203.12314  [pdf, other

    cs.SD cs.LG eess.AS

    Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording Devices

    Authors: Lam Pham, Khoa Dinh, Dat Ngo, Hieu Tang, Alexander Schindler

    Abstract: In this paper, we present a robust and low complexity system for Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording. We first construct an ASC baseline system in which a novel inception-residual-based network architecture is proposed to deal with the mismatched recording device issue. To further improve the performance but still satisfy the low complexity… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: This paper was submitted to INTERSPEECH 2022

  30. arXiv:2203.05281  [pdf, other

    eess.SY

    Multi-Agent Task Assignment in Vehicular Edge Computing: A Regret-Matching Learning-Based Approach

    Authors: Bach Long Nguyen, Duong D. Nguyen, Hung X. Nguyen, Duy T. Ngo, Markus Wagner

    Abstract: Vehicular edge computing has recently been proposed to support computation-intensive applications in Intelligent Transportation Systems (ITS) such as self-driving cars and augmented reality. Despite progress in this area, significant challenges remain to efficiently allocate limited computation resources to a range of time-critical ITS tasks. To this end, the current paper develops a new task assi… ▽ More

    Submitted 16 December, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: 10 pages, 12 figures, and 1 table

  31. arXiv:2202.05626  [pdf, other

    cs.SD eess.AS

    Audio-Based Deep Learning Frameworks for Detecting COVID-19

    Authors: Dat Ngo, Lam Pham, Truong Hoang, Sefki Kolozali, Delaram Jarchi

    Abstract: This paper evaluates a wide range of audio-based deep learning frameworks applied to the breathing, cough, and speech sounds for detecting COVID-19. In general, the audio recording inputs are transformed into low-level spectrogram features, then they are fed into pre-trained deep learning models to extract high-level embedding features. Next, the dimension of these high-level embedding features ar… ▽ More

    Submitted 2 March, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

  32. arXiv:2202.01292  [pdf, other

    cs.LG

    Improved Regret for Differentially Private Exploration in Linear MDP

    Authors: Dung Daniel Ngo, Giuseppe Vietri, Zhiwei Steven Wu

    Abstract: We study privacy-preserving exploration in sequential decision-making for environments that rely on sensitive data such as medical records. In particular, we focus on solving the problem of reinforcement learning (RL) subject to the constraint of (joint) differential privacy in the linear MDP setting, where both dynamics and rewards are given by linear functions. Prior work on this problem due to… ▽ More

    Submitted 22 June, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 13 pages of main text, 30 pages in total; typo corrected, references added

  33. arXiv:2201.03054  [pdf, ps, other

    cs.SD eess.AS

    An Ensemble of Deep Learning Frameworks Applied For Predicting Respiratory Anomalies

    Authors: Lam Pham, Dat Ngo, Truong Hoang, Alexander Schindler, Ian McLoughlin

    Abstract: In this paper, we evaluate various deep learning frameworks for detecting respiratory anomalies from input audio recordings. To this end, we firstly transform audio respiratory cycles collected from patients into spectrograms where both temporal and spectral features are presented, referred to as the front-end feature extraction. We then feed the spectrograms into back-end deep learning networks f… ▽ More

    Submitted 9 January, 2022; originally announced January 2022.

  34. arXiv:2201.00118  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Semantic Search for Large Scale Clinical Ontologies

    Authors: Duy-Hoa Ngo, Madonna Kemp, Donna Truran, Bevan Koopman, Alejandro Metke-Jimenez

    Abstract: Finding concepts in large clinical ontologies can be challenging when queries use different vocabularies. A search algorithm that overcomes this problem is useful in applications such as concept normalisation and ontology matching, where concepts can be referred to in different ways, using different synonyms. In this paper, we present a deep learning based approach to build a semantic search syste… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

  35. arXiv:2112.11723  [pdf, other

    cs.IT

    Energy-Efficient Massive MIMO for Federated Learning: Transmission Designs and Resource Allocations

    Authors: Tung T. Vu, Hien Q. Ngo, Minh N. Dao, Duy T. Ngo, Erik G. Larsson, Tho Le-Ngoc

    Abstract: This work proposes novel synchronous, asynchronous, and session-based designs for energy-efficient massive multiple-input multiple-output networks to support federated learning (FL). The synchronous design relies on strict synchronization among users when executing each FL communication round, while the asynchronous design allows more flexibility for users to save energy by using lower computing f… ▽ More

    Submitted 15 November, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

    Comments: accepted to appear

  36. arXiv:2112.09172  [pdf, ps, other

    cs.CV cs.LG eess.IV

    An Audio-Visual Dataset and Deep Learning Frameworks for Crowded Scene Classification

    Authors: Lam Pham, Dat Ngo, Phu X. Nguyen, Truong Hoang, Alexander Schindler

    Abstract: This paper presents a task of audio-visual scene classification (SC) where input videos are classified into one of five real-life crowded scenes: 'Riot', 'Noise-Street', 'Firework-Event', 'Music-Event', and 'Sport-Atmosphere'. To this end, we firstly collect an audio-visual dataset (videos) of these five crowded contexts from Youtube (in-the-wild scenes). Then, a wide range of deep learning framew… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

  37. A Cough-based deep learning framework for detecting COVID-19

    Authors: Truong Hoang, Lam Pham, Dat Ngo, Hoang D. Nguyen

    Abstract: This paper presents a deep learning framework for detecting COVID-19 positive subjects from their cough sounds. In particular, the proposed approach comprises two main steps. In the first step, we generate a feature representing the cough sound by combining an embedding extracted from a pre-trained model and handcrafted features extracted from draw audio recording, referred to as the front-end fea… ▽ More

    Submitted 30 September, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: COVID-19, EMBC-2022, DiCOVA, top 2nd, benchmark on Spec > 0.95%

    MSC Class: 92-05; 68Txx ACM Class: J.3; I.5.4; I.5.2; H.5.5; C.3; K.5

    Journal ref: EMBC 44 (2022) 3422-3425

  38. arXiv:2108.13512  [pdf, ps, other

    cs.IT

    Energy-Efficient Massive MIMO for Serving Multiple Federated Learning Groups

    Authors: Tung T. Vu, Hien Quoc Ngo, Duy T. Ngo, Minh N Dao, Erik G. Larsson

    Abstract: With its privacy preservation and communication efficiency, federated learning (FL) has emerged as a learning framework that suits beyond 5G and towards 6G systems. This work looks into a future scenario in which there are multiple groups with different learning purposes and participating in different FL processes. We give energy-efficient solutions to demonstrate that this scenario can be realist… ▽ More

    Submitted 17 October, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted to appear in Proc. IEEE Global Communications Conference (GLOBECOM), Madrid, Spain, Dec. 2021. (v2). arXiv admin note: text overlap with arXiv:2107.09577

  39. arXiv:2107.10093  [pdf, other

    cs.LG cs.GT

    Incentivizing Compliance with Algorithmic Instruments

    Authors: Daniel Ngo, Logan Stapleton, Vasilis Syrgkanis, Zhiwei Steven Wu

    Abstract: Randomized experiments can be susceptible to selection bias due to potential non-compliance by the participants. While much of the existing work has studied compliance as a static behavior, we propose a game-theoretic model to study compliance as dynamic behavior that may change over time. In rounds, a social planner interacts with a sequence of heterogeneous agents who arrive with their unobserve… ▽ More

    Submitted 28 July, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: In Proceedings of the Thirty-eighth International Conference on Machine Learning (ICML 2021), 17 pages of main text, 53 pages total, 3 figures

  40. arXiv:2107.09725  [pdf, other

    cs.CV

    Registration of 3D Point Sets Using Correntropy Similarity Matrix

    Authors: Ashutosh Singandhupe, Hung La, Trung Dung Ngo, Van Ho

    Abstract: This work focuses on Registration or Alignment of 3D point sets. Although the Registration problem is a well established problem and it's solved using multiple variants of Iterative Closest Point (ICP) Algorithm, most of the approaches in the current state of the art still suffers from misalignment when the \textit{Source} and the \textit{Target} point sets are separated by large rotations and tra… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  41. arXiv:2107.05762  [pdf, other

    cs.LG

    Strategic Instrumental Variable Regression: Recovering Causal Relationships From Strategic Responses

    Authors: Keegan Harris, Daniel Ngo, Logan Stapleton, Hoda Heidari, Zhiwei Steven Wu

    Abstract: In settings where Machine Learning (ML) algorithms automate or inform consequential decisions about people, individual decision subjects are often incentivized to strategically modify their observable attributes to receive more favorable predictions. As a result, the distribution the assessment rule is trained on may differ from the one it operates on in deployment. While such distribution shifts,… ▽ More

    Submitted 8 June, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: In the 39th International Conference on Machine Learning (ICML 2022)

  42. arXiv:2104.02523  [pdf, other

    cs.LG

    An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network

    Authors: Anh Nguyen, Khoa Pham, Dat Ngo, Thanh Ngo, Lam Pham

    Abstract: This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification of deep neural network. These activation functions comprise of Rectified Linear Units (ReLU), Exponential Linear Unit (ELU), Scaled Exponential Linear Unit (SELU), Gaussian Error Linear Unit (GELU), and the Inverse Square Root Linear Unit (ISRLU). To evaluate, experiments over two dee… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: 6 pages, 5 figures

  43. arXiv:2012.15029  [pdf, other

    eess.IV

    VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations

    Authors: Ha Q. Nguyen, Khanh Lam, Linh T. Le, Hieu H. Pham, Dat Q. Tran, Dung B. Nguyen, Dung D. Le, Chi M. Pham, Hang T. T. Tong, Diep H. Dinh, Cuong D. Do, Luu T. Doan, Cuong N. Nguyen, Binh T. Nguyen, Que V. Nguyen, Au D. Hoang, Hien N. Phan, Anh T. Nguyen, Phuong H. Ho, Dat T. Ngo, Nghia T. Nguyen, Nhan T. Nguyen, Minh Dao, Van Vu

    Abstract: Most of the existing chest X-ray datasets include labels from a list of findings without specifying their locations on the radiographs. This limits the development of machine learning algorithms for the detection and localization of chest abnormalities. In this work, we describe a dataset of more than 100,000 chest X-ray scans that were retrospectively collected from two major hospitals in Vietnam… ▽ More

    Submitted 20 March, 2022; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: 11 pages, under review by Nature Scientific Data

  44. arXiv:2012.13668  [pdf, other

    cs.LG cs.CV cs.SD eess.AS

    Deep Learning Framework Applied for Predicting Anomaly of Respiratory Sounds

    Authors: Dat Ngo, Lam Pham, Anh Nguyen, Ben Phan, Khoa Tran, Truong Nguyen

    Abstract: This paper proposes a robust deep learning framework used for classifying anomaly of respiratory cycles. Initially, our framework starts with front-end feature extraction step. This step aims to transform the respiratory input sound into a two-dimensional spectrogram where both spectral and temporal features are well presented. Next, an ensemble of C- DNN and Autoencoder networks is then applied t… ▽ More

    Submitted 25 December, 2020; originally announced December 2020.

    Comments: 5 pages, 2 figures, 8 tables

  45. Automated, Cost-effective, and Update-driven App Testing

    Authors: Chanh Duc Ngo, Fabrizio Pastore, Lionel Briand

    Abstract: Apps' pervasive role in our society led to the definition of test automation approaches to ensure their dependability. However, state-of-the-art approaches tend to generate large numbers of test inputs and are unlikely to achieve more than 50% method coverage. In this paper, we propose a strategy to achieve significantly higher coverage of the code affected by updates with a much smaller number of… ▽ More

    Submitted 6 December, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

  46. arXiv:2009.09619  [pdf, other

    cs.NI

    Economic Theoretic LEO Satellite Coverage Control: An Auction-based Framework

    Authors: Junghyun Kim, Thong D. Ngo, Paul S. Oh, Sean S. -C. Kwon, Changhee Han, Joongheon Kim

    Abstract: Recently, ultra-dense low earth orbit (LEO) satelliteconstellation over high-frequency bands has considered as one ofpromising solutions to supply coverage all over the world. Givensatellite constellations, efficient beam coverage schemes should beemployed at satellites to provide seamless services and full-viewcoverage. In LEO systems, hybrid wide and spot beam coverageschemes are generally used,… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: 3 pages

    ACM Class: C.2.1

  47. arXiv:2009.02031  [pdf, ps, other

    cs.IT

    Joint Resource Allocation to Minimize Execution Time of Federated Learning in Cell-Free Massive MIMO

    Authors: Tung T. Vu, Duy T. Ngo, Hien Quoc Ngo, Minh N. Dao, Nguyen H. Tran, Richard H. Middleton

    Abstract: Due to its communication efficiency and privacy-preserving capability, federated learning (FL) has emerged as a promising framework for machine learning in 5G-and-beyond wireless networks. Of great interest is the design and optimization of new wireless network structures that support the stable and fast operation of FL. Cell-free massive multiple-input multiple-output (CFmMIMO) turns out to be a… ▽ More

    Submitted 10 June, 2022; v1 submitted 4 September, 2020; originally announced September 2020.

    Comments: accepted to appear in IEEE Internet of Things Journal, Jun. 2022

  48. arXiv:2005.12779  [pdf, ps, other

    cs.SD eess.AS

    Sound Context Classification Basing on Join Learning Model and Multi-Spectrogram Features

    Authors: Dat Ngo, Hao Hoang, Anh Nguyen, Tien Ly, Lam Pham

    Abstract: In this paper, we present a deep learning framework applied for Acoustic Scene Classification (ASC), the task of classifying scene contexts from environmental input sounds. An ASC system generally comprises of two main steps, referred to as front-end feature extraction and back-end classification. In the first step, an extractor is used to extract low-level features from raw audio signals. Next, t… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

  49. arXiv:2005.12734  [pdf, other

    cs.CV

    Interpreting Chest X-rays via CNNs that Exploit Hierarchical Disease Dependencies and Uncertainty Labels

    Authors: Hieu H. Pham, Tung T. Le, Dat T. Ngo, Dat Q. Tran, Ha Q. Nguyen

    Abstract: The chest X-rays (CXRs) is one of the views most commonly ordered by radiologists (NHS),which is critical for diagnosis of many different thoracic diseases. Accurately detecting thepresence of multiple diseases from CXRs is still a challenging task. We present a multi-labelclassification framework based on deep convolutional neural networks (CNNs) for diagnos-ing the presence of 14 common thoracic… ▽ More

    Submitted 25 May, 2020; originally announced May 2020.

    Comments: MIDL 2020 Accepted Short Paper. arXiv admin note: substantial text overlap with arXiv:1911.06475

    Report number: MIDL/2020/ExtendedAbstract/4o1GLIIHlh

  50. arXiv:2005.09707  [pdf, other

    physics.app-ph

    New Way of Generating Electromagnetic Waves

    Authors: Ali Hosseini-Fahraji, Majid Manteghi, Khai d. t. Ngo

    Abstract: This paper presents a new method for generating low-frequency electromagnetic waves for navigation and communication in challenging environments, such as underwater and underground. The main idea is to store magnetic energy in two different spaces using the interaction between a permanent magnet and a magnetic material. The magnetic reluctance of the medium around the permanent magnet is modulated… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: 8 pages, 9 figures