Skip to main content

Showing 1–50 of 249 results for author: Mahmood, A

  1. arXiv:2410.14606  [pdf, other

    cs.LG cs.AI

    Streaming Deep Reinforcement Learning Finally Works

    Authors: Mohamed Elsayed, Gautham Vasan, A. Rupam Mahmood

    Abstract: Natural intelligence processes experience as a continuous stream, sensing, acting, and learning moment-by-moment in real time. Streaming learning, the modus operandi of classic reinforcement learning (RL) algorithms like Q-learning and TD, mimics natural learning by using the most recent sample without storing it. This approach is also ideal for resource-constrained, communication-limited, and pri… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  2. arXiv:2410.14242  [pdf, other

    cs.CV cs.LG

    Pseudo-label Refinement for Improving Self-Supervised Learning Systems

    Authors: Zia-ur-Rehman, Arif Mahmood, Wenxiong Kang

    Abstract: Self-supervised learning systems have gained significant attention in recent years by leveraging clustering-based pseudo-labels to provide supervision without the need for human annotations. However, the noise in these pseudo-labels caused by the clustering methods poses a challenge to the learning process leading to degraded performance. In this work, we propose a pseudo-label refinement (SLR) al… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  3. arXiv:2410.09968  [pdf

    cs.LG q-bio.CB

    Deep-Ace: LSTM-based Prokaryotic Lysine Acetylation Site Predictor

    Authors: Maham Ilyas, Abida Yasmeen, Yaser Daanial Khan, Arif Mahmood

    Abstract: Acetylation of lysine residues (K-Ace) is a post-translation modification occurring in both prokaryotes and eukaryotes. It plays a crucial role in disease pathology and cell biology hence it is important to identify these K-Ace sites. In the past, many machine learning-based models using hand-crafted features and encodings have been used to find and analyze the characteristics of K-Ace sites howev… ▽ More

    Submitted 20 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

  4. arXiv:2410.09964  [pdf, other

    cs.LG cs.AI q-bio.GN

    Lower-dimensional projections of cellular expression improves cell type classification from single-cell RNA sequencing

    Authors: Muhammad Umar, Muhammad Asif, Arif Mahmood

    Abstract: Single-cell RNA sequencing (scRNA-seq) enables the study of cellular diversity at single cell level. It provides a global view of cell-type specification during the onset of biological mechanisms such as developmental processes and human organogenesis. Various statistical, machine and deep learning-based methods have been proposed for cell-type classification. Most of the methods utilizes unsuperv… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

  5. arXiv:2410.09399  [pdf, other

    cs.CL cs.LG

    Text Classification using Graph Convolutional Networks: A Comprehensive Survey

    Authors: Syed Mustafa Haider Rizvi, Ramsha Imran, Arif Mahmood

    Abstract: Text classification is a quintessential and practical problem in natural language processing with applications in diverse domains such as sentiment analysis, fake news detection, medical diagnosis, and document classification. A sizable body of recent works exists where researchers have studied and tackled text classification from different angles with varying degrees of success. Graph convolution… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

  6. arXiv:2410.04574  [pdf, other

    cs.CV cs.LG

    Enhancing 3D Human Pose Estimation Amidst Severe Occlusion with Dual Transformer Fusion

    Authors: Mehwish Ghafoor, Arif Mahmood, Muhammad Bilal

    Abstract: In the field of 3D Human Pose Estimation from monocular videos, the presence of diverse occlusion types presents a formidable challenge. Prior research has made progress by harnessing spatial and temporal cues to infer 3D poses from 2D joint observations. This paper introduces a Dual Transformer Fusion (DTF) algorithm, a novel approach to obtain a holistic 3D pose estimation, even in the presence… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

  7. arXiv:2410.04256  [pdf, other

    cs.CV cs.AI

    Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels

    Authors: Maria Marrium, Arif Mahmood, Mohammed Bennamoun

    Abstract: Automatic annotation of large-scale datasets can introduce noisy training data labels, which adversely affect the learning process of deep neural networks (DNNs). Consequently, Noisy Labels Learning (NLL) has become a critical research field for Convolutional Neural Networks (CNNs), though it remains less explored for Vision Transformers (ViTs). In this study, we evaluate the vulnerability of ViT… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

  8. arXiv:2409.15495  [pdf, other

    cs.HC

    From Our Lab to Their Homes: Learnings from Longitudinal Field Research with Older Adults

    Authors: Amama Mahmood, Chien-Ming Huang

    Abstract: Conducting research with older adults in their home environments presents unique opportunities and challenges that differ significantly from traditional lab-based studies. In this paper, we share our experiences from year-long research activities aiming to design and evaluate conversational voice assistants for older adults through longitudinal deployment, interviews, co-design workshops, and eval… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  9. arXiv:2409.15488  [pdf, other

    cs.HC

    Voice Assistants for Health Self-Management: Designing for and with Older Adults

    Authors: Amama Mahmood, Shiye Cao, Maia Stiber, Victor Nikhil Antony, Chien-Ming Huang

    Abstract: Supporting older adults in health self-management is crucial for promoting independent aging, particularly given the growing strain on healthcare systems. While voice assistants (VAs) hold the potential to support aging in place, they often lack tailored assistance and present usability challenges. We addressed these issues through a five-stage design process with older adults to develop a persona… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  10. arXiv:2409.14345  [pdf

    q-bio.OT

    Evaluation of drought tolerance of some almond genotypes by morphological, phytochemical and molecular markers in Sulaymaniyah governorate

    Authors: Anwar Mohammed Raouf Mahmood

    Abstract: The study was carried out during 2017 to 2019 growing seasons at four locations in Sulaimani governorate and one location in Halabja governorate, in the Iraqi Kurdistan region including SH, M, Q, B and H. A huge number almond trees were observed for all locations, among them 38 trees were selected with the best morphological characteristics which were chosen 9,3,5,7 and 14 trees depending on the l… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

  11. arXiv:2409.06073  [pdf, other

    eess.SP cs.ET

    Integration of Beyond Diagonal RIS and UAVs in 6G NTNs: Enhancing Aerial Connectivity

    Authors: Wali Ullah Khan, Eva Lagunas, Asad Mahmood, Muhammad Asif, Manzoor Ahmed, Symeon Chatzinotas

    Abstract: The reconfigurable intelligent surface (RIS) technology shows great potential in sixth-generation (6G) terrestrial and non-terrestrial networks (NTNs) since it can effectively change wireless settings to improve connectivity. Extensive research has been conducted on traditional RIS systems with diagonal phase response matrices. The straightforward RIS architecture, while cost-effective, has restri… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: 7,4

  12. arXiv:2408.15084  [pdf, other

    cs.ET eess.SP

    CR-Enabled NOMA Integrated Non-Terrestrial IoT Networks with Transmissive RIS

    Authors: Wali Ullah Khan, Zain Ali, Asad Mahmood, Eva Lagunas, Syed Tariq Shah, Symeon Chatzinotas

    Abstract: This work proposes a T-RIS-equipped LEO satellite communication in cognitive radio-enabled integrated NTNs. In the proposed system, a GEO satellite operates as a primary network, and a T-RIS-equipped LEO satellite operates as a secondary IoT network. The objective is to maximize the sum rate of T-RIS-equipped LEO satellite communication using downlink NOMA while ensuring the service quality of GEO… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 7,5

  13. arXiv:2408.12926  [pdf, other

    cs.IT eess.SP

    Balancing AoI and Rate for Mission-Critical and eMBB Coexistence with Puncturing, NOMA,and RSMA in Cellular Uplink

    Authors: Farnaz Khodakhah, Aamir Mahmood, Čedomir Stefanović, Hossam Farag, Patrik Österberg, Mikael Gidlund

    Abstract: Through the lens of average and peak age-of-information (AoI), this paper takes a fresh look into the uplink medium access solutions for mission-critical (MC) communication coexisting with enhanced mobile broadband (eMBB) service. Considering the stochastic packet arrivals from an MC user, we study three access schemes: orthogonal multiple access (OMA) with eMBB preemption (puncturing), non-orthog… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: 14 pages, 9 figures, under review for possible publication in IEEE TVT

  14. arXiv:2407.15879  [pdf, other

    cs.CR cs.AI cs.DC cs.LG

    Decentralized Federated Anomaly Detection in Smart Grids: A P2P Gossip Approach

    Authors: Muhammad Akbar Husnoo, Adnan Anwar, Md Enamul Haque, A. N. Mahmood

    Abstract: The increasing security and privacy concerns in the Smart Grid sector have led to a significant demand for robust intrusion detection systems within critical smart grid infrastructure. To address the challenges posed by privacy preservation and decentralized power system zones with distinct data ownership, Federated Learning (FL) has emerged as a promising privacy-preserving solution which facilit… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  15. arXiv:2407.15707  [pdf, other

    cs.CV cs.AI eess.IV

    Predicting the Best of N Visual Trackers

    Authors: Basit Alawode, Sajid Javed, Arif Mahmood, Jiri Matas

    Abstract: We observe that the performance of SOTA visual trackers surprisingly strongly varies across different video attributes and datasets. No single tracker remains the best performer across all tracking attributes and datasets. To bridge this gap, for a given video sequence, we predict the "Best of the N Trackers", called the BofN meta-tracker. At its core, a Tracking Performance Prediction Network (TP… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  16. arXiv:2407.13355  [pdf, other

    cs.CR

    EarlyMalDetect: A Novel Approach for Early Windows Malware Detection Based on Sequences of API Calls

    Authors: Pascal Maniriho, Abdun Naser Mahmood, Mohammad Jabed Morshed Chowdhury

    Abstract: In this work, we propose EarlyMalDetect, a novel approach for early Windows malware detection based on sequences of API calls. Our approach leverages generative transformer models and attention-guided deep recurrent neural networks to accurately identify and detect patterns of malicious behaviors in the early stage of malware execution. By analyzing the sequences of API calls invoked during execut… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  17. arXiv:2407.10240  [pdf

    cs.LG cs.AI

    xLSTMTime : Long-term Time Series Forecasting With xLSTM

    Authors: Musleh Alharthi, Ausif Mahmood

    Abstract: In recent years, transformer-based models have gained prominence in multivariate long-term time series forecasting (LTSF), demonstrating significant advancements despite facing challenges such as high computational demands, difficulty in capturing temporal dynamics, and managing long-term dependencies. The emergence of LTSF-Linear, with its straightforward linear architecture, has notably outperfo… ▽ More

    Submitted 11 August, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

  18. arXiv:2407.05260  [pdf, other

    cs.IT

    Improved Channel Coding Performance Through Cost Variability

    Authors: Adeel Mahmood, Aaron B. Wagner

    Abstract: Channel coding for discrete memoryless channels (DMCs) with mean and variance cost constraints has been recently introduced. We show that there is an improvement in coding performance due to cost variability, both with and without feedback. We demonstrate this improvement over the traditional almost-sure cost constraint (also called the peak-power constraint) that prohibits any cost variation abov… ▽ More

    Submitted 17 September, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

  19. arXiv:2407.03316  [pdf, other

    nucl-ex hep-ex

    An Upper Limit on the Photoproduction Cross Section of the Spin-Exotic $π_1(1600)$

    Authors: F. Afzal, C. S. Akondi, M. Albrecht, M. Amaryan, S. Arrigo, V. Arroyave, A. Asaturyan, A. Austregesilo, Z. Baldwin, F. Barbosa, J. Barlow, E. Barriga, R. Barsotti, D. Barton, V. Baturin, V. V. Berdnikov, T. Black, W. Boeglin, M. Boer, W. J. Briscoe, T. Britton, S. Cao, E. Chudakov, G. Chung, P. L. Cole , et al. (124 additional authors not shown)

    Abstract: The spin-exotic hybrid meson $π_{1}(1600)$ is predicted to have a large decay rate to the $ωππ$ final state. Using 76.6~pb$^{-1}$ of data collected with the GlueX detector, we measure the cross sections for the reactions $γp \to ωπ^+ π^- p$, $γp \to ωπ^0 π^0 p$, and $γp\toωπ^-π^0Δ^{++}$ in the range $E_γ=$ 8-10 GeV. Using isospin conservation, we set the first upper limits on the photoproduction c… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures plus supplemental materials

  20. arXiv:2407.01704  [pdf, other

    cs.LG cs.AI

    Weight Clipping for Deep Continual and Reinforcement Learning

    Authors: Mohamed Elsayed, Qingfeng Lan, Clare Lyle, A. Rupam Mahmood

    Abstract: Many failures in deep continual and reinforcement learning are associated with increasing magnitudes of the weights, making them hard to change and potentially causing overfitting. While many methods address these learning failures, they often change the optimizer or the architecture, a complexity that hinders widespread adoption in various systems. In this paper, we focus on learning failures tha… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Published in the First Reinforcement Learning Conference (RLC 2024). Code is available at https://github.com/mohmdelsayed/weight-clipping

  21. arXiv:2407.00324  [pdf, other

    cs.RO cs.LG

    Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning

    Authors: Gautham Vasan, Yan Wang, Fahim Shahriar, James Bergstra, Martin Jagersand, A. Rupam Mahmood

    Abstract: Many real-world robot learning problems, such as pick-and-place or arriving at a destination, can be seen as a problem of reaching a goal state as soon as possible. These problems, when formulated as episodic reinforcement learning tasks, can easily be specified to align well with our intended goal: -1 reward every time step with termination upon reaching the goal state, called minimum-time tasks.… ▽ More

    Submitted 8 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: In Proceedings of Reinforcement Learning Conference 2024. For a video demo, see https://youtu.be/a6zlVUuKzBc

  22. arXiv:2407.00148  [pdf, other

    cs.CV cs.LG

    Localizing Anomalies via Multiscale Score Matching Analysis

    Authors: Ahsan Mahmood, Junier Oliva, Martin Styner

    Abstract: Anomaly detection and localization in medical imaging remain critical challenges in healthcare. This paper introduces Spatial-MSMA (Multiscale Score Matching Analysis), a novel unsupervised method for anomaly localization in volumetric brain MRIs. Building upon the MSMA framework, our approach incorporates spatial information and conditional likelihoods to enhance anomaly detection capabilities. W… ▽ More

    Submitted 18 July, 2024; v1 submitted 28 June, 2024; originally announced July 2024.

  23. arXiv:2406.12829  [pdf, other

    nucl-ex

    Measurement of Spin-Density Matrix Elements in $Δ^{++}(1232)$ photoproduction

    Authors: F. Afzal, C. S. Akondi, M. Albrecht, M. Amaryan, S. Arrigo, V. Arroyave, A. Asaturyan, A. Austregesilo, Z. Baldwin, F. Barbosa, J. Barlow, E. Barriga, R. Barsotti, D. Barton, V. Baturin, V. V. Berdnikov, T. Black, W. Boeglin, M. Boer, W. J. Briscoe, T. Britton, S. Cao, E. Chudakov, G. Chung, P. L. Cole , et al. (124 additional authors not shown)

    Abstract: We measure the spin-density matrix elements (SDMEs) of the $Δ^{++}(1232)$ in the photoproduction reaction $γp \to π^-Δ^{++}(1232)$ with the GlueX experiment in Hall D at Jefferson Lab. The measurement uses a linearly--polarized photon beam with energies from $8.2$ to $8.8$~GeV and the statistical precision of the SDMEs exceeds the previous measurement by three orders of magnitude for the momentum… ▽ More

    Submitted 26 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  24. arXiv:2406.12241  [pdf, other

    cs.LG cs.AI

    More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling

    Authors: Haque Ishfaq, Yixin Tan, Yu Yang, Qingfeng Lan, Jianfeng Lu, A. Rupam Mahmood, Doina Precup, Pan Xu

    Abstract: Thompson sampling (TS) is one of the most popular exploration techniques in reinforcement learning (RL). However, most TS algorithms with theoretical guarantees are difficult to implement and not generalizable to Deep RL. While the emerging approximate sampling-based exploration schemes are promising, most existing algorithms are specific to linear Markov Decision Processes (MDP) with suboptimal r… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: First two authors contributed equally. Accepted to the Reinforcement Learning Conference (RLC) 2024

  25. arXiv:2406.10691  [pdf, other

    cs.ET

    Beyond Diagonal RIS for 6G Non-Terrestrial Networks: Potentials and Challenges

    Authors: Wali Ullah Khan, Asad Mahmood, Muhammad Ali Jamshed, Eva Lagunas, Manzoor Ahmed, Symeon Chatzinotas

    Abstract: Reconfigurable intelligent surface (RIS) has emerged as a promising technology in both terrestrial and non-terrestrial networks (NTNs) due to its ability to manipulate wireless environments for better connectivity. Significant studies have been focused on conventional RIS with diagonal phase response matrices. This simple RIS architecture, though less expensive, has limited flexibility in engineer… ▽ More

    Submitted 22 September, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: 10,4

  26. arXiv:2406.05205  [pdf, other

    cs.CV cs.CL cs.LG cs.MM eess.IV

    CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

    Authors: Sajid Javed, Arif Mahmood, Iyyakutti Iyappan Ganapathi, Fayaz Ali Dharejo, Naoufel Werghi, Mohammed Bennamoun

    Abstract: This paper proposes Comprehensive Pathology Language Image Pre-training (CPLIP), a new unsupervised technique designed to enhance the alignment of images and text in histopathology for tasks such as classification and segmentation. This methodology enriches vision-language models by leveraging extensive data without needing ground truth annotations. CPLIP involves constructing a pathology-specific… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  27. arXiv:2406.03276  [pdf, other

    cs.LG cs.AI

    Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning

    Authors: Mohamed Elsayed, Homayoon Farrahi, Felix Dangel, A. Rupam Mahmood

    Abstract: Second-order information is valuable for many applications but challenging to compute. Several works focus on computing or approximating Hessian diagonals, but even this simplification introduces significant additional costs compared to computing a gradient. In the absence of efficient exact computation schemes for Hessian diagonals, we revisit an early approximation scheme proposed by Becker and… ▽ More

    Submitted 3 July, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Published in the Proceedings of the 41st International Conference on Machine Learning (ICML 2024). Code is available at https://github.com/mohmdelsayed/HesScale. arXiv admin note: substantial text overlap with arXiv:2210.11639

  28. arXiv:2405.21043  [pdf, other

    cs.LG cs.AI

    Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

    Authors: Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A Ramirez, Christopher K Harris, A. Rupam Mahmood, Dale Schuurmans

    Abstract: We prove that the combination of a target network and over-parameterized linear function approximation establishes a weaker convergence condition for bootstrapped value estimation in certain cases, even with off-policy data. Our condition is naturally satisfied for expected updates over the entire state-action space or learning with a batch of complete trajectories from episodic Markov decision pr… ▽ More

    Submitted 4 October, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 41 st International Conference on Machine Learning, 2024

  29. arXiv:2405.14881  [pdf, other

    cs.CV

    DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models

    Authors: Khawar Islam, Muhammad Zaigham Zaheer, Arif Mahmood, Karthik Nandakumar

    Abstract: Recently, a number of image-mixing-based augmentation techniques have been introduced to improve the generalization of deep neural networks. In these techniques, two or more randomly selected natural images are mixed together to generate an augmented image. Such methods may not only omit important portions of the input images but also introduce label ambiguities by mixing images across labels resu… ▽ More

    Submitted 5 April, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024

  30. arXiv:2405.11122  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Imaging Local Effects of Voltage and Boron Doping on Spin Reversal in Antiferromagnetic Magnetoelectric Cr2O3 Thin Films and Devices

    Authors: Adam Erickson, Syed Qamar Abbas Shah, Ather Mahmood, Pratyush Buragohain, Ilja Fescenko, Alexei Gruverman, Christian Binek, Abdelghani Laraoui

    Abstract: Chromia (Cr2O3) is a magnetoelectric oxide which permits voltage-control of the antiferromagnetic (AFM) order, but it suffers technological constraints due to its low Neel Temperature (TN ~307 K) and the need of a symmetry breaking applied magnetic field to achieve reversal of the Neel vector. Recently, boron (B) doping of Cr2O3 films led to an increase TN > 400 K and allowed the realization of vo… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Journal ref: Advanced Functional Materials 2408542 (2024)

  31. arXiv:2404.00781  [pdf, other

    cs.LG cs.AI

    Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning

    Authors: Mohamed Elsayed, A. Rupam Mahmood

    Abstract: Deep representation learning methods struggle with continual learning, suffering from both catastrophic forgetting of useful units and loss of plasticity, often due to rigid and unuseful units. While many methods address these two issues separately, only a few currently deal with both simultaneously. In this paper, we introduce Utility-based Perturbed Gradient Descent (UPGD) as a novel approach fo… ▽ More

    Submitted 30 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: Published in the Proceedings of the 12th International Conference on Learning Representations (ICLR 2024). Code is available at https://github.com/mohmdelsayed/upgd

  32. arXiv:2403.17913  [pdf, ps, other

    eess.SP

    Enhancing Indoor and Outdoor THz Communications with Beyond Diagonal-IRS: Optimization and Performance Analysis

    Authors: Asad Mahmood, Thang X. Vu, Symeon Chatzinotas, Björn Ottersten

    Abstract: This work investigates the application of Beyond Diagonal Intelligent Reflective Surface (BD-IRS) to enhance THz downlink communication systems, operating in a hybrid: reflective and transmissive mode, to simultaneously provide services to indoor and outdoor users. We propose an optimization framework that jointly optimizes the beamforming vectors and phase shifts in the hybrid reflective/transmis… ▽ More

    Submitted 9 July, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  33. arXiv:2403.16194  [pdf, other

    cs.CV

    Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery

    Authors: Siddharth Tourani, Ahmed Alwheibi, Arif Mahmood, Muhammad Haris Khan

    Abstract: Unsupervised landmarks discovery (ULD) for an object category is a challenging computer vision problem. In pursuit of developing a robust ULD framework, we explore the potential of a recent paradigm of self-supervised learning algorithms, known as diffusion models. Some recent works have shown that these models implicitly contain important correspondence cues. Towards harnessing the potential of d… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024

  34. arXiv:2403.14743  [pdf, other

    cs.CV

    VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding

    Authors: Ahmad Mahmood, Ashmal Vayani, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan

    Abstract: Recent studies have demonstrated the effectiveness of Large Language Models (LLMs) as reasoning modules that can deconstruct complex tasks into more manageable sub-tasks, particularly when applied to visual reasoning tasks for images. In contrast, this paper introduces a Video Understanding and Reasoning Framework (VURF) based on the reasoning power of LLMs. Ours is a novel approach to extend the… ▽ More

    Submitted 24 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  35. arXiv:2403.02421  [pdf, other

    cs.HC

    Situated Understanding of Errors in Older Adults' Interactions with Voice Assistants: A Month-Long, In-Home Study

    Authors: Amama Mahmood, Junxiang Wang, Chien-Ming Huang

    Abstract: Our work addresses the challenges older adults face with commercial Voice Assistants (VAs), notably in conversation breakdowns and error handling. Traditional methods of collecting user experiences-usage logs and post-hoc interviews-do not fully capture the intricacies of older adults' interactions with VAs, particularly regarding their reactions to errors. To bridge this gap, we equipped 15 older… ▽ More

    Submitted 23 September, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  36. arXiv:2401.16417  [pdf, ps, other

    cs.IT

    Channel Coding with Mean and Variance Cost Constraints

    Authors: Adeel Mahmood, Aaron B. Wagner

    Abstract: We consider channel coding for discrete memoryless channels (DMCs) with a novel cost constraint that constrains both the mean and the variance of the cost of the codewords. We show that the maximum (asymptotically) achievable rate under the new cost formulation is equal to the capacity-cost function; in particular, the strong converse holds. We further characterize the optimal second-order coding… ▽ More

    Submitted 12 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  37. arXiv:2401.06193  [pdf, other

    gr-qc

    Dark Energy Compact Stars in Extended Teleparallel Gravity

    Authors: Allah Ditta, Xia Tiecheng, G. Mustafa, Değer Sofuoğlu, Asif Mahmood

    Abstract: This paper presents the study of dark-energy compact stars in the context of modified Rastall teleparallel gravity. It is the first time that dark energy celestial phenomena have been explored in this modified gravitational theory. Employing the torsion-based functions, $f(T)$ and $h(T)$, we analyzed their effects in a spherically symmetric spacetime chosen as the interior geometry, while using th… ▽ More

    Submitted 26 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 18 pages, 9 Figures, 2tables

  38. arXiv:2312.15339  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning

    Authors: Bram Grooten, Tristan Tomilin, Gautham Vasan, Matthew E. Taylor, A. Rupam Mahmood, Meng Fang, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: The visual world provides an abundance of information, but many input pixels received by agents often contain distracting stimuli. Autonomous agents need the ability to distinguish useful information from task-irrelevant perceptions, enabling them to generalize to unseen environments with new distractions. Existing works approach this problem using data augmentation or large auxiliary networks wit… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: Accepted as full-paper (oral) at AAMAS 2024. Code is available at https://github.com/bramgrooten/mask-distractions and see our 40-second video at https://youtu.be/2oImF0h1k48

  39. arXiv:2312.07454  [pdf, other

    cs.HC cs.RO

    "You Might Like It": How People Respond to Small Talk During Human-Robot Collaboration

    Authors: Kaitlynn Taylor Pineda, Amama Mahmood, Juo-Tung Chen, Chien-Ming Huang

    Abstract: Social communication between people and social robots has been studied extensively and found to have various notable benefits, including the enhancement of human-robot team cohesion and the development of rapport and trust. However, the potential of social communication between people and non-social robots, such as non-anthropomorphic robot manipulators commonly used in work settings (\eg warehous… ▽ More

    Submitted 8 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: 25 pages, 6 figures, 7 tables,

    ACM Class: I.2.9

  40. arXiv:2311.07199  [pdf, ps, other

    eess.SP

    Joint Computation and Communication Resource Optimization for Beyond Diagonal UAV-IRS Empowered MEC Networks

    Authors: Asad Mahmood, Thang X. Vu, Wali Ullah Khan, Symeon Chatzinotas, Björn Ottersten

    Abstract: Recent advancements in 6G systems signal a leap towards universal connectivity and ultra-reliable, low-latency communications for real-time data devices. Yet, these advancements encounter obstacles such as limited device battery life and computational power, along with urban signal blockages. To counter these, Intelligent Reconfigurable Surfaces (IRS) within Mobile Edge Cloud (MEC) infrastructures… ▽ More

    Submitted 15 March, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  41. arXiv:2310.19173  [pdf, other

    cs.CR cs.SI

    Can we Quantify Trust? Towards a Trust-based Resilient SIoT Network

    Authors: Subhash Sagar, Adnan Mahmood, Quan Z. Sheng, Munazza Zaib, Farhan Sufyan

    Abstract: The emerging yet promising paradigm of the Social Internet of Things (SIoT) integrates the notion of the Internet of Things with human social networks. In SIoT, objects, i.e., things, have the capability to socialize with the other objects in the SIoT network and can establish their social network autonomously by modeling human behaviour. The notion of trust is imperative in realizing these charac… ▽ More

    Submitted 12 May, 2023; originally announced October 2023.

    Comments: 18 Pages

  42. Gender Biases in Error Mitigation by Voice Assistants

    Authors: Amama Mahmood, Chien-Ming Huang

    Abstract: Commercial voice assistants are largely feminized and associated with stereotypically feminine traits such as warmth and submissiveness. As these assistants continue to be adopted for everyday uses, it is imperative to understand how the portrayed gender shapes the voice assistant's ability to mitigate errors, which are still common in voice interactions. We report a study (N=40) that examined the… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the ACM on Human-Computer Interaction, Volume 8, Issue CSCW1, 2024; Article No.: 60, Pages 1 - 27

  43. arXiv:2310.05853  [pdf, other

    cs.HC

    "Mango Mango, How to Let The Lettuce Dry Without A Spinner?'': Exploring User Perceptions of Using An LLM-Based Conversational Assistant Toward Cooking Partner

    Authors: Szeyi Chan, Jiachen Li, Bingsheng Yao, Amama Mahmood, Chien-Ming Huang, Holly Jimison, Elizabeth D Mynatt, Dakuo Wang

    Abstract: The rapid advancement of the Large Language Model (LLM) has created numerous potentials for integration with conversational assistants (CAs) assisting people in their daily tasks, particularly due to their extensive flexibility. However, users' real-world experiences interacting with these assistants remain unexplored. In this research, we chose cooking, a complex daily task, as a scenario to inve… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Under submission to CHI2024

  44. arXiv:2310.01365  [pdf, other

    cs.LG cs.AI

    Elephant Neural Networks: Born to Be a Continual Learner

    Authors: Qingfeng Lan, A. Rupam Mahmood

    Abstract: Catastrophic forgetting remains a significant challenge to continual learning for decades. While recent works have proposed effective methods to mitigate this problem, they mainly focus on the algorithmic side. Meanwhile, we do not fully understand what architectural properties of neural networks lead to catastrophic forgetting. This study aims to fill this gap by studying the role of activation f… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  45. arXiv:2309.13879  [pdf, other

    cs.HC

    LLM-Powered Conversational Voice Assistants: Interaction Patterns, Opportunities, Challenges, and Design Guidelines

    Authors: Amama Mahmood, Junxiang Wang, Bingsheng Yao, Dakuo Wang, Chien-Ming Huang

    Abstract: Conventional Voice Assistants (VAs) rely on traditional language models to discern user intent and respond to their queries, leading to interactions that often lack a broader contextual understanding, an area in which Large Language Models (LLMs) excel. However, current LLMs are largely designed for text-based interactions, thus making it unclear how user interactions will evolve if their modality… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  46. arXiv:2309.12507  [pdf, other

    eess.SP

    Deep Reinforcement Learning for Backscatter Communications: Augmenting Intelligence in Future Internet of Things

    Authors: Wali Ullah Khan, Eva Lagunas, Zain Ali, Asad Mahmood, Chandan Kumar Sheemar, Manzoor Ahmed, Symeon Chatzinotas, Björn Ottersten

    Abstract: Backscatter communication (BC) technology offers sustainable solutions for next-generation Internet-of-Things (IoT) networks, where devices can transmit data by reflecting and adjusting incident radio frequency signals. In parallel to BC, deep reinforcement learning (DRL) has recently emerged as a promising tool to augment intelligence and optimize low-powered IoT devices. This article commences b… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 7, 3

  47. arXiv:2309.12493  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Post deposition interfacial Néel temperature tuning in magnetoelectric B:Cr2O3

    Authors: Ather Mahmood, Jamie Weaver, Syed Qamar Abbas Shah, Will Echtenkamp, Jeffrey W. Lynn, Peter A. Dowben, Christian Binek

    Abstract: Boron (B) alloying transforms the magnetoelectric antiferromagnet Cr2O3 into a multifunctional single-phase material which enables electric field driven π/2 rotation of the Néel vector. Nonvolatile, voltage-controlled Néel vector rotation is a much-desired material property in the context of antiferromagnetic spintronics enabling ultra-low power, ultra-fast, nonvolatile memory, and logic device ap… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  48. arXiv:2309.10518  [pdf, other

    cs.CV

    Unsupervised Landmark Discovery Using Consistency Guided Bottleneck

    Authors: Mamona Awan, Muhammad Haris Khan, Sanoojan Baliah, Muhammad Ahmad Waseem, Salman Khan, Fahad Shahbaz Khan, Arif Mahmood

    Abstract: We study a challenging problem of unsupervised discovery of object landmarks. Many recent methods rely on bottlenecks to generate 2D Gaussian heatmaps however, these are limited in generating informed heatmaps while training, presumably due to the lack of effective structural cues. Also, it is assumed that all predicted landmarks are semantically relevant despite having no ground truth supervision… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted ORAL at BMVC 2023 ; Code: https://github.com/MamonaAwan/CGB_ULD

    ACM Class: I.4

  49. arXiv:2309.09727  [pdf, other

    cs.DL cs.CL

    When Large Language Models Meet Citation: A Survey

    Authors: Yang Zhang, Yufei Wang, Kai Wang, Quan Z. Sheng, Lina Yao, Adnan Mahmood, Wei Emma Zhang, Rongying Zhao

    Abstract: Citations in scholarly work serve the essential purpose of acknowledging and crediting the original sources of knowledge that have been incorporated or referenced. Depending on their surrounding textual context, these citations are used for different motivations and purposes. Large Language Models (LLMs) could be helpful in capturing these fine-grained citation information via the corresponding te… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  50. arXiv:2309.09236  [pdf, other

    cs.CV cs.AI cs.CY cs.LG

    Detection and Localization of Firearm Carriers in Complex Scenes for Improved Safety Measures

    Authors: Arif Mahmood, Abdul Basit, M. Akhtar Munir, Mohsen Ali

    Abstract: Detecting firearms and accurately localizing individuals carrying them in images or videos is of paramount importance in security, surveillance, and content customization. However, this task presents significant challenges in complex environments due to clutter and the diverse shapes of firearms. To address this problem, we propose a novel approach that leverages human-firearm interaction informat… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: This paper is accepted in IEEE Transactions on Computational Social Systems