-
Advancing Web Browser Forensics: Critical Evaluation of Emerging Tools and Techniques
Authors:
Rishal Ravikesh Chand,
Neeraj Anand Sharma,
Muhammad Ashad Kabir
Abstract:
As the use of web browsers continues to grow, the potential for cybercrime and web-related criminal activities also increases. Digital forensic investigators must understand how different browsers function and the critical areas to consider during web forensic analysis. Web forensics, a subfield of digital forensics, involves collecting and analyzing browser artifacts, such as browser history, sea…
▽ More
As the use of web browsers continues to grow, the potential for cybercrime and web-related criminal activities also increases. Digital forensic investigators must understand how different browsers function and the critical areas to consider during web forensic analysis. Web forensics, a subfield of digital forensics, involves collecting and analyzing browser artifacts, such as browser history, search keywords, and downloads, which serve as potential evidence. While existing research has provided valuable insights, many studies focus on individual browsing modes or limited forensic scenarios, leaving gaps in understanding the full scope of data retention and recovery across different modes and browsers. This paper addresses these gaps by defining four browsing scenarios and critically analyzing browser artifacts across normal, private, and portable modes using various forensic tools. We define four browsing scenarios to perform a comprehensive evaluation of popular browsers -- Google Chrome, Mozilla Firefox, Brave, Tor, and Microsoft Edge -- by monitoring changes in key data storage areas such as cache files, cookies, browsing history, and local storage across different browsing modes. Overall, this paper contributes to a deeper understanding of browser forensic analysis and identifies key areas for enhancing privacy protection and forensic methodologies.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
From Lab to Pocket: A Novel Continual Learning-based Mobile Application for Screening COVID-19
Authors:
Danny Falero,
Muhammad Ashad Kabir,
Nusrat Homaira
Abstract:
Artificial intelligence (AI) has emerged as a promising tool for predicting COVID-19 from medical images. In this paper, we propose a novel continual learning-based approach and present the design and implementation of a mobile application for screening COVID-19. Our approach demonstrates the ability to adapt to evolving datasets, including data collected from different locations or hospitals, var…
▽ More
Artificial intelligence (AI) has emerged as a promising tool for predicting COVID-19 from medical images. In this paper, we propose a novel continual learning-based approach and present the design and implementation of a mobile application for screening COVID-19. Our approach demonstrates the ability to adapt to evolving datasets, including data collected from different locations or hospitals, varying virus strains, and diverse clinical presentations, without retraining from scratch. We have evaluated state-of-the-art continual learning methods for detecting COVID-19 from chest X-rays and selected the best-performing model for our mobile app. We evaluated various deep learning architectures to select the best-performing one as a foundation model for continual learning. Both regularization and memory-based methods for continual learning were tested, using different memory sizes to develop the optimal continual learning model for our app. DenseNet161 emerged as the best foundation model with 96.87\% accuracy, and Learning without Forgetting (LwF) was the top continual learning method with an overall performance of 71.99\%. The mobile app design considers both patient and doctor perspectives. It incorporates the continual learning DenseNet161 LwF model on a cloud server, enabling the model to learn from new instances of chest X-rays and their classifications as they are submitted. The app is designed, implemented, and evaluated to ensure it provides an efficient tool for COVID-19 screening. The app is available to download from https://github.com/DannyFGitHub/COVID-19PneumoCheckApp.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Self-DenseMobileNet: A Robust Framework for Lung Nodule Classification using Self-ONN and Stacking-based Meta-Classifier
Authors:
Md. Sohanur Rahman,
Muhammad E. H. Chowdhury,
Hasib Ryan Rahman,
Mosabber Uddin Ahmed,
Muhammad Ashad Kabir,
Sanjiban Sekhar Roy,
Rusab Sarmun
Abstract:
In this study, we propose a novel and robust framework, Self-DenseMobileNet, designed to enhance the classification of nodules and non-nodules in chest radiographs (CXRs). Our approach integrates advanced image standardization and enhancement techniques to optimize the input quality, thereby improving classification accuracy. To enhance predictive accuracy and leverage the strengths of multiple mo…
▽ More
In this study, we propose a novel and robust framework, Self-DenseMobileNet, designed to enhance the classification of nodules and non-nodules in chest radiographs (CXRs). Our approach integrates advanced image standardization and enhancement techniques to optimize the input quality, thereby improving classification accuracy. To enhance predictive accuracy and leverage the strengths of multiple models, the prediction probabilities from Self-DenseMobileNet were transformed into tabular data and used to train eight classical machine learning (ML) models; the top three performers were then combined via a stacking algorithm, creating a robust meta-classifier that integrates their collective insights for superior classification performance. To enhance the interpretability of our results, we employed class activation mapping (CAM) to visualize the decision-making process of the best-performing model. Our proposed framework demonstrated remarkable performance on internal validation data, achieving an accuracy of 99.28\% using a Meta-Random Forest Classifier. When tested on an external dataset, the framework maintained strong generalizability with an accuracy of 89.40\%. These results highlight a significant improvement in the classification of CXRs with lung nodules.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
The BRAM is the Limit: Shattering Myths, Shaping Standards, and Building Scalable PIM Accelerators
Authors:
MD Arafat Kabir,
Tendayi Kamucheka,
Nathaniel Fredricks,
Joel Mandebi,
Jason Bakos,
Miaoqing Huang,
David Andrews
Abstract:
Many recent FPGA-based Processor-in-Memory (PIM) architectures have appeared with promises of impressive levels of parallelism but with performance that falls short of expectations due to reduced maximum clock frequencies, an inability to scale processing elements up to the maximum BRAM capacity, and minimal hardware support for large reduction operations. In this paper, we first establish what we…
▽ More
Many recent FPGA-based Processor-in-Memory (PIM) architectures have appeared with promises of impressive levels of parallelism but with performance that falls short of expectations due to reduced maximum clock frequencies, an inability to scale processing elements up to the maximum BRAM capacity, and minimal hardware support for large reduction operations. In this paper, we first establish what we believe should be a "Gold Standard" set of design objectives for PIM-based FPGA designs. This Gold Standard was established to serve as an absolute metric for comparing PIMs developed on different technology nodes and vendor families as well as an aspirational goal for designers.
We then present IMAGine, an In-Memory Accelerated GEMV engine used as a case study to show the Gold Standard can be realized in practice. IMAGine serves as an existence proof that dispels several myths surrounding what is normally accepted as clocking and scaling FPGA performance limitations. Specifically, IMAGine clocks at the maximum frequency of the BRAM and scales to 100% of the available BRAMs. Comparative analyses are presented showing execution speeds over existing PIM-based GEMV engines on FPGAs and achieving a 2.65x - 3.2x faster clock. An AMD Alveo U55 implementation is presented that achieves a system clock speed of 737 MHz, providing 64K bit-serial multiply-accumulate (MAC) units for GEMV operation. This establishes IMAGine as the fastest PIM-based GEMV overlay, outperforming even the custom PIM-based FPGA accelerators reported to date. Additionally, it surpasses TPU v1-v2 and Alibaba Hanguang 800 in clock speed while offering an equal or greater number of MAC units.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
IMAGine: An In-Memory Accelerated GEMV Engine Overlay
Authors:
MD Arafat Kabir,
Tendayi Kamucheka,
Nathaniel Fredricks,
Joel Mandebi,
Jason Bakos,
Miaoqing Huang,
David Andrews
Abstract:
Processor-in-Memory (PIM) overlays and new redesigned reconfigurable tile fabrics have been proposed to eliminate the von Neumann bottleneck and enable processing performance to scale with BRAM capacity. The performance of these FPGA-based PIM architectures has been limited due to a reduction of the BRAMs maximum clock frequencies and less than ideal scaling of processing elements with increased B…
▽ More
Processor-in-Memory (PIM) overlays and new redesigned reconfigurable tile fabrics have been proposed to eliminate the von Neumann bottleneck and enable processing performance to scale with BRAM capacity. The performance of these FPGA-based PIM architectures has been limited due to a reduction of the BRAMs maximum clock frequencies and less than ideal scaling of processing elements with increased BRAM capacity. This paper presents IMAGine, an In-Memory Accelerated GEMV engine, a PIM-array accelerator that clocks at the maximum frequency of the BRAM and scales to 100% of the available BRAMs. Comparative analyses are presented showing execution speeds over existing PIM-based GEMV engines on FPGAs and achieving a 2.65x - 3.2x faster clock. An AMD Alveo U55 implementation is presented that achieves a system clock speed of 737 MHz, providing 64K bit-serial multiply-accumulate (MAC) units for GEMV operation. This establishes IMAGine as the fastest PIM-based GEMV overlay, outperforming even the custom PIM-based FPGA accelerators reported to date. Additionally, it surpasses TPU v1-v2 and Alibaba Hanguang 800 in clock speed while offering an equal or greater number of MAC units.
△ Less
Submitted 6 October, 2024;
originally announced October 2024.
-
New Measurements of the Deuteron to Proton F2 Structure Function Ratio
Authors:
Debaditya Biswas,
Fernando Araiza Gonzalez,
William Henry,
Abishek Karki,
Casey Morean,
Sooriyaarachchilage Nadeeshani,
Abel Sun,
Daniel Abrams,
Zafar Ahmed,
Bashar Aljawrneh,
Sheren Alsalmi,
George Ambrose,
Whitney Armstrong,
Arshak Asaturyan,
Kofi Assumin-Gyimah,
Carlos Ayerbe Gayoso,
Anashe Bandari,
Samip Basnet,
Vladimir Berdnikov,
Hem Bhatt,
Deepak Bhetuwal,
Werner Boeglin,
Peter Bosted,
Edward Brash,
Masroor Bukhari
, et al. (67 additional authors not shown)
Abstract:
Nucleon structure functions, as measured in lepton-nucleon scattering, have historically provided a critical observable in the study of partonic dynamics within the nucleon. However, at very large parton momenta it is both experimentally and theoretically challenging to extract parton distributions due to the probable onset of non-perturbative contributions and the unavailability of high precision…
▽ More
Nucleon structure functions, as measured in lepton-nucleon scattering, have historically provided a critical observable in the study of partonic dynamics within the nucleon. However, at very large parton momenta it is both experimentally and theoretically challenging to extract parton distributions due to the probable onset of non-perturbative contributions and the unavailability of high precision data at critical kinematics. Extraction of the neutron structure and the d-quark distribution have been further challenging due to the necessity of applying nuclear corrections when utilizing scattering data from a deuteron target to extract free neutron structure. However, a program of experiments has been carried out recently at the energy-upgraded Jefferson Lab electron accelerator aimed at significantly reducing the nuclear correction uncertainties on the d-quark distribution function at large partonic momentum. This allows leveraging the vast body of deuterium data covering a large kinematic range to be utilized for d-quark parton distribution function extraction. We present new data from experiment E12-10-002 carried out in Jefferson Lab Hall C on the deuteron to proton cross-section ratio at large BJorken-x. These results significantly improve the precision of existing data, and provide a first look at the expected impact on quark distributions extracted from global parton distribution function fits.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
FUSED-Net: Enhancing Few-Shot Traffic Sign Detection with Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation
Authors:
Md. Atiqur Rahman,
Nahian Ibn Asad,
Md. Mushfiqul Haque Omi,
Md. Bakhtiar Hasan,
Sabbir Ahmed,
Md. Hasanul Kabir
Abstract:
Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In th…
▽ More
Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs
Authors:
Ehsan Kabir,
Md. Arafat Kabir,
Austin R. J. Downey,
Jason D. Bakos,
David Andrews,
Miaoqing Huang
Abstract:
Transformer neural networks (TNNs) are being applied across a widening range of application domains, including natural language processing (NLP), machine translation, and computer vision (CV). Their popularity is largely attributed to the exceptional performance of their multi-head self-attention blocks when analyzing sequential data and extracting features. To date, there are limited hardware acc…
▽ More
Transformer neural networks (TNNs) are being applied across a widening range of application domains, including natural language processing (NLP), machine translation, and computer vision (CV). Their popularity is largely attributed to the exceptional performance of their multi-head self-attention blocks when analyzing sequential data and extracting features. To date, there are limited hardware accelerators tailored for this mechanism, which is the first step before designing an accelerator for a complete model. This paper proposes \textit{FAMOUS}, a flexible hardware accelerator for dense multi-head attention (MHA) computation of TNNs on field-programmable gate arrays (FPGAs). It is optimized for high utilization of processing elements and on-chip memories to improve parallelism and reduce latency. An efficient tiling of large matrices has been employed to distribute memory and computing resources across different modules on various FPGA platforms. The design is evaluated on Xilinx Alveo U55C and U200 data center cards containing Ultrascale+ FPGAs. Experimental results are presented that show that it can attain a maximum throughput, number of parallel attention heads, embedding dimension and tile size of 328 (giga operations/second (GOPS)), 8, 768 and 64 respectively on the U55C. Furthermore, it is 3.28$\times$ and 2.6$\times$ faster than the Intel Xeon Gold 5220R CPU and NVIDIA V100 GPU respectively. It is also 1.3$\times$ faster than the fastest state-of-the-art FPGA-based accelerator.
△ Less
Submitted 21 October, 2024; v1 submitted 21 September, 2024;
originally announced September 2024.
-
Minimal Model Counting via Knowledge Compilation
Authors:
Mohimenul Kabir
Abstract:
Counting the number of models of a Boolean formula is a fundamental problem in artificial intelligence and reasoning. Minimal models of a Boolean formula are critical in various reasoning systems, making the counting of minimal models essential for detailed inference tasks. Existing research primarily focused on decision problems related to minimal models. In this work, we extend beyond decision p…
▽ More
Counting the number of models of a Boolean formula is a fundamental problem in artificial intelligence and reasoning. Minimal models of a Boolean formula are critical in various reasoning systems, making the counting of minimal models essential for detailed inference tasks. Existing research primarily focused on decision problems related to minimal models. In this work, we extend beyond decision problems to address the challenge of counting minimal models. Specifically, we propose a novel knowledge compilation form that facilitates the efficient counting of minimal models. Our approach leverages the idea of justification and incorporates theories from answer set counting.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
A Double-Difference Doppler Shift-Based Positioning Framework with Ephemeris Error Correction of LEO Satellites
Authors:
Md. Ali Hasan,
M. Humayun Kabir,
Md. Shafiqul Islam,
Sangmin Han,
Wonjae Shin
Abstract:
In signals of opportunity (SOPs)-based positioning utilizing low Earth orbit (LEO) satellites, ephemeris data derived from two-line element files can introduce increasing error over time. To handle the erroneous measurement, an additional base receiver with a known position is often used to compensate for the effect of ephemeris error when positioning the user terminal (UT). However, this approach…
▽ More
In signals of opportunity (SOPs)-based positioning utilizing low Earth orbit (LEO) satellites, ephemeris data derived from two-line element files can introduce increasing error over time. To handle the erroneous measurement, an additional base receiver with a known position is often used to compensate for the effect of ephemeris error when positioning the user terminal (UT). However, this approach is insufficient for the long baseline (the distance between the base receiver and UT) as it fails to adequately correct Doppler shift measurement errors caused by ephemeris inaccuracies, resulting in degraded positioning performance. Moreover, the lack of clock synchronization between the base receiver and UT exacerbates erroneous Doppler shift measurements. To address these challenges, we put forth a robust double-difference Doppler shift-based positioning framework, coined 3DPose, to handle the clock synchronization issue between the base receiver and UT, and positioning degradation due to the long baseline. The proposed 3DPose framework leverages double-difference Doppler shift measurements to eliminate the clock synchronization issue and incorporates a novel ephemeris error correction algorithm to enhance UT positioning accuracy in case of the long baseline. The algorithm specifically characterizes and corrects the Doppler shift measurement errors arising from erroneous ephemeris data, focusing on satellite position errors in the tangential direction. To validate the effectiveness of the proposed framework, we conduct comparative analyses across three different scenarios, contrasting its performance with the existing differential Doppler positioning method. The results demonstrate that the proposed 3DPose framework achieves an average reduction of 90% in 3-dimensional positioning errors compared to the existing differential Doppler approach.
△ Less
Submitted 8 September, 2024;
originally announced September 2024.
-
Flavor Dependence of Charged Pion Fragmentation Functions
Authors:
H. Bhatt,
P. Bosted,
S. Jia,
W. Armstrong,
D. Dutta,
R. Ent,
D. Gaskell,
E. Kinney,
H. Mkrtchyan,
S. Ali,
R. Ambrose,
D. Androic,
C. Ayerbe Gayoso,
A. Bandari,
V. Berdnikov,
D. Bhetuwal,
D. Biswas,
M. Boer,
E. Brash,
A. Camsonne,
J. P. Chen,
J. Chen,
M. Chen,
E. M. Christy,
S. Covrig
, et al. (45 additional authors not shown)
Abstract:
We have measured the flavor dependence of multiplicities for pi^+ and pi^- production in semi-inclusive deep-inelastic scattering (SIDIS) on proton and deuteron targets to explore a possible charge symmetry violation in fragmentation functions. The experiment used an electron beam with energies of 10.2 and 10.6 GeV at Jefferson Lab and the Hall-C spectrometers. The electron kinematics spanned the…
▽ More
We have measured the flavor dependence of multiplicities for pi^+ and pi^- production in semi-inclusive deep-inelastic scattering (SIDIS) on proton and deuteron targets to explore a possible charge symmetry violation in fragmentation functions. The experiment used an electron beam with energies of 10.2 and 10.6 GeV at Jefferson Lab and the Hall-C spectrometers. The electron kinematics spanned the range 0.3<x<0.6, 2<Q^2<5.5 GeV^2, and 4<W^2<11 GeV^2. The pion fractional momentum range was 0.3< z <0.7, and the transverse momentum range was 0<p_T<0.25 GeV/c. Assuming factorization at low p_T and allowing for isospin breaking, we find that the results can be described by two "favored" and two "un-favored" effective low $p_T$ fragmentation functions that are flavor-dependent. However, they converge to a common flavor-independent value at the lowest x or highest W of this experiment.
△ Less
Submitted 5 September, 2024; v1 submitted 29 August, 2024;
originally announced August 2024.
-
Analysis of child development facts and myths using text mining techniques and classification models
Authors:
Mehedi Tajrian,
Azizur Rahman,
Muhammad Ashad Kabir,
Md Rafiqul Islam
Abstract:
The rapid dissemination of misinformation on the internet complicates the decision-making process for individuals seeking reliable information, particularly parents researching child development topics. This misinformation can lead to adverse consequences, such as inappropriate treatment of children based on myths. While previous research has utilized text-mining techniques to predict child abuse…
▽ More
The rapid dissemination of misinformation on the internet complicates the decision-making process for individuals seeking reliable information, particularly parents researching child development topics. This misinformation can lead to adverse consequences, such as inappropriate treatment of children based on myths. While previous research has utilized text-mining techniques to predict child abuse cases, there has been a gap in the analysis of child development myths and facts. This study addresses this gap by applying text mining techniques and classification models to distinguish between myths and facts about child development, leveraging newly gathered data from publicly available websites. The research methodology involved several stages. First, text mining techniques were employed to pre-process the data, ensuring enhanced accuracy. Subsequently, the structured data was analysed using six robust Machine Learning (ML) classifiers and one Deep Learning (DL) model, with two feature extraction techniques applied to assess their performance across three different training-testing splits. To ensure the reliability of the results, cross-validation was performed using both k-fold and leave-one-out methods. Among the classification models tested, Logistic Regression (LR) demonstrated the highest accuracy, achieving a 90% accuracy with the Bag-of-Words (BoW) feature extraction technique. LR stands out for its exceptional speed and efficiency, maintaining low testing time per statement (0.97 microseconds). These findings suggest that LR, when combined with BoW, is effective in accurately classifying child development information, thus providing a valuable tool for combating misinformation and assisting parents in making informed decisions.
△ Less
Submitted 23 August, 2024;
originally announced August 2024.
-
Beyond Labels: Aligning Large Language Models with Human-like Reasoning
Authors:
Muhammad Rafsan Kabir,
Rafeed Mohammad Sultan,
Ihsanul Haque Asif,
Jawad Ibn Ahad,
Fuad Rahman,
Mohammad Ruhul Amin,
Nabeel Mohammed,
Shafin Rahman
Abstract:
Aligning large language models (LLMs) with a human reasoning approach ensures that LLMs produce morally correct and human-like decisions. Ethical concerns are raised because current models are prone to generating false positives and providing malicious responses. To contribute to this issue, we have curated an ethics dataset named Dataset for Aligning Reasons (DFAR), designed to aid in aligning la…
▽ More
Aligning large language models (LLMs) with a human reasoning approach ensures that LLMs produce morally correct and human-like decisions. Ethical concerns are raised because current models are prone to generating false positives and providing malicious responses. To contribute to this issue, we have curated an ethics dataset named Dataset for Aligning Reasons (DFAR), designed to aid in aligning language models to generate human-like reasons. The dataset comprises statements with ethical-unethical labels and their corresponding reasons. In this study, we employed a unique and novel fine-tuning approach that utilizes ethics labels and their corresponding reasons (L+R), in contrast to the existing fine-tuning approach that only uses labels (L). The original pre-trained versions, the existing fine-tuned versions, and our proposed fine-tuned versions of LLMs were then evaluated on an ethical-unethical classification task and a reason-generation task. Our proposed fine-tuning strategy notably outperforms the others in both tasks, achieving significantly higher accuracy scores in the classification task and lower misalignment rates in the reason-generation task. The increase in classification accuracies and decrease in misalignment rates indicate that the L+R fine-tuned models align more with human ethics. Hence, this study illustrates that injecting reasons has substantially improved the alignment of LLMs, resulting in more human-like responses. We have made the DFAR dataset and corresponding codes publicly available at https://github.com/apurba-nsu-rnd-lab/DFAR.
△ Less
Submitted 20 August, 2024;
originally announced August 2024.
-
Exploring the Impact of Word Prediction Assistive Features on Smartphone Keyboards for Blind Users
Authors:
Mrim M. Alnfiai,
Muhammad Ashad Kabir
Abstract:
Assistive technologies have been developed to enhance blind users' typing performance, focusing on speed, accuracy, and effort reduction. One such technology is word prediction software, designed to minimize keystrokes required for text input. This study investigates the impact of word prediction on typing performance among blind users using an on-screen QWERTY keyboard. We conducted a comparative…
▽ More
Assistive technologies have been developed to enhance blind users' typing performance, focusing on speed, accuracy, and effort reduction. One such technology is word prediction software, designed to minimize keystrokes required for text input. This study investigates the impact of word prediction on typing performance among blind users using an on-screen QWERTY keyboard. We conducted a comparative study involving eleven blind participants, evaluating both standard QWERTY input and word prediction-assisted typing. Our findings reveal that while word prediction slightly improves typing speed, it does not enhance typing accuracy and increases both physical and temporal workload compared to the default keyboard. We conclude with recommendations for improving word prediction systems, including more efficient editing methods and the integration of voice pitch variations to aid error recognition.
△ Less
Submitted 20 August, 2024;
originally announced August 2024.
-
A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities
Authors:
Jungpil Shin,
Abu Saleh Musa Miah,
Md. Humaun Kabir,
Md. Abdur Rahim,
Abdullah Al Shiam
Abstract:
Researchers have been developing Hand Gesture Recognition (HGR) systems to enhance natural, efficient, and authentic human-computer interaction, especially benefiting those who rely solely on hand gestures for communication. Despite significant progress, the automatic and precise identification of hand gestures remains a considerable challenge in computer vision. Recent studies have focused on spe…
▽ More
Researchers have been developing Hand Gesture Recognition (HGR) systems to enhance natural, efficient, and authentic human-computer interaction, especially benefiting those who rely solely on hand gestures for communication. Despite significant progress, the automatic and precise identification of hand gestures remains a considerable challenge in computer vision. Recent studies have focused on specific modalities like RGB images, skeleton data, and spatiotemporal interest points. This paper provides a comprehensive review of HGR techniques and data modalities from 2014 to 2024, exploring advancements in sensor technology and computer vision. We highlight accomplishments using various modalities, including RGB, Skeleton, Depth, Audio, EMG, EEG, and Multimodal approaches and identify areas needing further research. We reviewed over 200 articles from prominent databases, focusing on data collection, data settings, and gesture representation. Our review assesses the efficacy of HGR systems through their recognition accuracy and identifies a gap in research on continuous gesture recognition, indicating the need for improved vision-based gesture systems. The field has experienced steady research progress, including advancements in hand-crafted features and deep learning (DL) techniques. Additionally, we report on the promising developments in HGR methods and the area of multimodal approaches. We hope this survey will serve as a potential guideline for diverse data modality-based HGR research.
△ Less
Submitted 10 August, 2024;
originally announced August 2024.
-
Machine Learning Models for the Identification of Cardiovascular Diseases Using UK Biobank Data
Authors:
Sheikh Mohammed Shariful Islam,
Moloud Abrar,
Teketo Tegegne,
Liliana Loranjo,
Chandan Karmakar,
Md Abdul Awal,
Md. Shahadat Hossain,
Muhammad Ashad Kabir,
Mufti Mahmud,
Abbas Khosravi,
George Siopis,
Jeban C Moses,
Ralph Maddison
Abstract:
Machine learning models have the potential to identify cardiovascular diseases (CVDs) early and accurately in primary healthcare settings, which is crucial for delivering timely treatment and management. Although population-based CVD risk models have been used traditionally, these models often do not consider variations in lifestyles, socioeconomic conditions, or genetic predispositions. Therefore…
▽ More
Machine learning models have the potential to identify cardiovascular diseases (CVDs) early and accurately in primary healthcare settings, which is crucial for delivering timely treatment and management. Although population-based CVD risk models have been used traditionally, these models often do not consider variations in lifestyles, socioeconomic conditions, or genetic predispositions. Therefore, we aimed to develop machine learning models for CVD detection using primary healthcare data, compare the performance of different models, and identify the best models. We used data from the UK Biobank study, which included over 500,000 middle-aged participants from different primary healthcare centers in the UK. Data collected at baseline (2006--2010) and during imaging visits after 2014 were used in this study. Baseline characteristics, including sex, age, and the Townsend Deprivation Index, were included. Participants were classified as having CVD if they reported at least one of the following conditions: heart attack, angina, stroke, or high blood pressure. Cardiac imaging data such as electrocardiogram and echocardiography data, including left ventricular size and function, cardiac output, and stroke volume, were also used. We used 9 machine learning models (LSVM, RBFSVM, GP, DT, RF, NN, AdaBoost, NB, and QDA), which are explainable and easily interpretable. We reported the accuracy, precision, recall, and F-1 scores; confusion matrices; and area under the curve (AUC) curves.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
On Lower Bounding Minimal Model Count
Authors:
Mohimenul Kabir,
Kuldeep S Meel
Abstract:
Minimal models of a Boolean formula play a pivotal role in various reasoning tasks. While previous research has primarily focused on qualitative analysis over minimal models; our study concentrates on the quantitative aspect, specifically counting of minimal models. Exact counting of minimal models is strictly harder than #P, prompting our investigation into establishing a lower bound for their qu…
▽ More
Minimal models of a Boolean formula play a pivotal role in various reasoning tasks. While previous research has primarily focused on qualitative analysis over minimal models; our study concentrates on the quantitative aspect, specifically counting of minimal models. Exact counting of minimal models is strictly harder than #P, prompting our investigation into establishing a lower bound for their quantity, which is often useful in related applications. In this paper, we introduce two novel techniques for counting minimal models, leveraging the expressive power of answer set programming: the first technique employs methods from knowledge compilation, while the second one draws on recent advancements in hashing-based approximate model counting. Through empirical evaluations, we demonstrate that our methods significantly improve the lower bound estimates of the number of minimal models, surpassing the performance of existing minimal model reasoning systems in terms of runtime.
△ Less
Submitted 16 July, 2024; v1 submitted 12 July, 2024;
originally announced July 2024.
-
The 3He(\vec n,p)3H parity-conserving asymmetry
Authors:
M. Viviani,
S. Baeßler,
L. Barrón-Palos,
N. Birge,
J. D. Bowman,
J. Calarco,
V. Cianciolo,
C. E. Coppola,
C. B. Crawford,
G. Dodson,
N. Fomin,
I. Garishvili,
M. T. Gericke,
L. Girlanda,
G. L. Greene,
G. M. Hale,
J. Hamblen,
C. Hayes,
E. B. Iverson,
M. L. Kabir,
A. Kievsky,
L. E. Marcucci,
M. McCrea,
E. Plemons,
A. Ramírez-Morales
, et al. (6 additional authors not shown)
Abstract:
Recently, the n$^3$He collaboration reported a measurement of the parity-violating (PV) proton directional asymmetry $A_{\mathrm {PV}} = (1.55\pm 0.97~\mathrm {(st\ at)} \pm 0.24~\mathrm {(sys)})\times 10^{-8}$ in the capture reaction of ${}^3$He$(\vec {n},{\mathrm p}){}^3$H at meV incident neutron energies. The result increased the limited inventory of precisely measured and calculable PV observa…
▽ More
Recently, the n$^3$He collaboration reported a measurement of the parity-violating (PV) proton directional asymmetry $A_{\mathrm {PV}} = (1.55\pm 0.97~\mathrm {(st\ at)} \pm 0.24~\mathrm {(sys)})\times 10^{-8}$ in the capture reaction of ${}^3$He$(\vec {n},{\mathrm p}){}^3$H at meV incident neutron energies. The result increased the limited inventory of precisely measured and calculable PV observables in few-body systems required to further understand the structure of hadronic weak interaction. In this letter, we report the experimental and theoretical investigation of a parity conserving (PC) asymmetry $A_{\mathrm {PC}}$ in the same reaction (the first ever measured PC observable at meV neutron energies). As a result of S- and P-wave mixing in the reaction, the $A_{\mathrm {PC}}$ is inversely proportional to the neutron wavelength $λ$. The experimental value is $(λ\times A_{\mathrm {PC}})\equivβ= (-1.97 \pm 0.28~\mathrm{(stat)}\pm 0.12~\mathrm{(sys)}) \times 10^{-6}$ Amstrongs. We present results for a theoretical analysis of this reaction by solving the four-body scattering problem within the hyperspherical harmonic method. We find that in the ${}^3$He$(\vec {n},{\mathrm p}){}^3$H reaction, $A_{\mathrm {PC}}$ depends critically on the energy and width of the close $0^-$ resonant state of ${}^4$He, resulting in a large sensitivity to the spin-orbit components of the nucleon-nucleon force and even to the three-nucleon force. The analysis of the accurately measured $A_{\mathrm {PC}}$ and $A_{\mathrm {PV}}$ using the same few-body theoretical models gives essential information needed to interpret the PV asymmetry in the ${}^3$He$(\vec {n}, {\mathrm p}){}^3$H reaction.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Deep SIMO Auto-Encoder and Radio Frequency Hardware Impairments Modeling for Physical Layer Security
Authors:
Abdullahi Mohammad,
Mahmoud Tukur Kabir,
Mikko Valkama,
Bo Tan
Abstract:
This paper presents a novel approach to achieving secure wireless communication by leveraging the inherent characteristics of wireless channels through end-to-end learning using a single-input-multiple-output (SIMO) autoencoder (AE). To ensure a more realistic signal transmission, we derive the signal model that captures all radio frequency (RF) hardware impairments to provide reliable and secure…
▽ More
This paper presents a novel approach to achieving secure wireless communication by leveraging the inherent characteristics of wireless channels through end-to-end learning using a single-input-multiple-output (SIMO) autoencoder (AE). To ensure a more realistic signal transmission, we derive the signal model that captures all radio frequency (RF) hardware impairments to provide reliable and secure communication. Performance evaluations against traditional linear decoders, such as zero-forcing (ZR) and linear minimum mean square error (LMMSE), and the optimal nonlinear decoder, maximum likelihood (ML), demonstrate that the AE-based SIMO model exhibits superior bit error rate (BER) performance, but with a substantial gap even in the presence of RF hardware impairments. Additionally, the proposed model offers enhanced security features, preventing potential eavesdroppers from intercepting transmitted information and leveraging RF impairments for augmented physical layer security and device identification. These findings underscore the efficacy of the proposed end-to-end learning approach in achieving secure and robust wireless communication.
△ Less
Submitted 10 August, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
COVIDHealth: A Benchmark Twitter Dataset and Machine Learning based Web Application for Classifying COVID-19 Discussions
Authors:
Mahathir Mohammad Bishal,
Md. Rakibul Hassan Chowdory,
Anik Das,
Muhammad Ashad Kabir
Abstract:
The COVID-19 pandemic has had adverse effects on both physical and mental health. During this pandemic, numerous studies have focused on gaining insights into health-related perspectives from social media. In this study, our primary objective is to develop a machine learning-based web application for automatically classifying COVID-19-related discussions on social media. To achieve this, we label…
▽ More
The COVID-19 pandemic has had adverse effects on both physical and mental health. During this pandemic, numerous studies have focused on gaining insights into health-related perspectives from social media. In this study, our primary objective is to develop a machine learning-based web application for automatically classifying COVID-19-related discussions on social media. To achieve this, we label COVID-19-related Twitter data, provide benchmark classification results, and develop a web application. We collected data using the Twitter API and labeled a total of 6,667 tweets into five different classes: health risks, prevention, symptoms, transmission, and treatment. We extracted features using various feature extraction methods and applied them to seven different traditional machine learning algorithms, including Decision Tree, Random Forest, Stochastic Gradient Descent, Adaboost, K-Nearest Neighbour, Logistic Regression, and Linear SVC. Additionally, we used four deep learning algorithms: LSTM, CNN, RNN, and BERT, for classification. Overall, we achieved a maximum F1 score of 90.43% with the CNN algorithm in deep learning. The Linear SVC algorithm exhibited the highest F1 score at 86.13%, surpassing other traditional machine learning approaches. Our study not only contributes to the field of health-related data analysis but also provides a valuable resource in the form of a web-based tool for efficient data classification, which can aid in addressing public health challenges and increasing awareness during pandemics. We made the dataset and application publicly available, which can be downloaded from this link https://github.com/Bishal16/COVID19-Health-Related-Data-Classification-Website.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model
Authors:
Murad Hasan,
Shahriar Iqbal,
Md. Billal Hossain Faisal,
Md. Musnad Hossin Neloy,
Md. Tonmoy Kabir,
Md. Tanzim Reza,
Md. Golam Rabiul Alam,
Md Zia Uddin
Abstract:
Criminal and suspicious activity detection has become a popular research topic in recent years. The rapid growth of computer vision technologies has had a crucial impact on solving this issue. However, physical stalking detection is still a less explored area despite the evolution of modern technology. Nowadays, stalking in public places has become a common occurrence with women being the most aff…
▽ More
Criminal and suspicious activity detection has become a popular research topic in recent years. The rapid growth of computer vision technologies has had a crucial impact on solving this issue. However, physical stalking detection is still a less explored area despite the evolution of modern technology. Nowadays, stalking in public places has become a common occurrence with women being the most affected. Stalking is a visible action that usually occurs before any criminal activity begins as the stalker begins to follow, loiter, and stare at the victim before committing any criminal activity such as assault, kidnapping, rape, and so on. Therefore, it has become a necessity to detect stalking as all of these criminal activities can be stopped in the first place through stalking detection. In this research, we propose a novel deep learning-based hybrid fusion model to detect potential stalkers from a single video with a minimal number of frames. We extract multiple relevant features, such as facial landmarks, head pose estimation, and relative distance, as numerical values from video frames. This data is fed into a multilayer perceptron (MLP) to perform a classification task between a stalking and a non-stalking scenario. Simultaneously, the video frames are fed into a combination of convolutional and LSTM models to extract the spatio-temporal features. We use a fusion of these numerical and spatio-temporal features to build a classifier to detect stalking incidents. Additionally, we introduce a dataset consisting of stalking and non-stalking videos gathered from various feature films and television series, which is also used to train the model. The experimental results show the efficiency and dynamism of our proposed stalker detection system, achieving 89.58% testing accuracy with a significant improvement as compared to the state-of-the-art approaches.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Islamic Lifestyle Applications: Meeting the Spiritual Needs of Modern Muslims
Authors:
Mohsinul Kabir,
Mohammad Ridwan Kabir,
Riasat Siam Islam
Abstract:
We evaluated contemporary Islamic lifestyle applications supporting religious practices and motivation among Muslims. We reviewed 11 popular applications using self-determination theory and the technology-as-experience framework to assess their support for motivation and affective needs. Most applications lack features that foster autonomy, competence, and relatedness. We also interviewed ten devo…
▽ More
We evaluated contemporary Islamic lifestyle applications supporting religious practices and motivation among Muslims. We reviewed 11 popular applications using self-determination theory and the technology-as-experience framework to assess their support for motivation and affective needs. Most applications lack features that foster autonomy, competence, and relatedness. We also interviewed ten devoted Muslim application users to gain insights into their experiences and unmet needs. Our findings indicate that existing applications fall short in providing comprehensive learning, social connections, and scholar consultations. We propose design implications based on our results, including guided religious information, shareability, virtual community engagement, scholarly question-answering, and personalized reminders. We aim to inform the design of Islamic lifestyle applications that better facilitate ritual practices, benefitting application designers and Muslim communities. Our research provides valuable insights into the untapped potential for lifestyle applications to act as religious companions supporting Muslims' spiritual journey.
△ Less
Submitted 3 February, 2024;
originally announced February 2024.
-
Enhancing quantum utility: simulating large-scale quantum spin chains on superconducting quantum computers
Authors:
Talal Ahmed Chowdhury,
Kwangmin Yu,
Mahmud Ashraf Shamim,
M. L. Kabir,
Raza Sabbir Sufian
Abstract:
We present the quantum simulation of the frustrated quantum spin-$\frac{1}{2}$ antiferromagnetic Heisenberg spin chain with competing nearest-neighbor $(J_1)$ and next-nearest-neighbor $(J_2)$ exchange interactions in the real superconducting quantum computer with qubits ranging up to 100. In particular, we implement, for the first time, the Hamiltonian with the next-nearest neighbor exchange inte…
▽ More
We present the quantum simulation of the frustrated quantum spin-$\frac{1}{2}$ antiferromagnetic Heisenberg spin chain with competing nearest-neighbor $(J_1)$ and next-nearest-neighbor $(J_2)$ exchange interactions in the real superconducting quantum computer with qubits ranging up to 100. In particular, we implement, for the first time, the Hamiltonian with the next-nearest neighbor exchange interaction in conjunction with the nearest neighbor interaction on IBM's superconducting quantum computer and carry out the time evolution of the spin chain by employing first-order Trotterization. Furthermore, our novel implementation of second-order Trotterization for the isotropic Heisenberg spin chain, involving only nearest-neighbor exchange interaction, enables precise measurement of the expectation values of staggered magnetization observable across a range of up to 100 qubits. Notably, in both cases, our approach results in a constant circuit depth in each Trotter step, independent of the initial number of qubits. Our demonstration of the accurate measurement of expectation values for the large-scale quantum system using superconducting quantum computers designates the quantum utility of these devices for investigating various properties of many-body quantum systems. This will be a stepping stone to achieving the quantum advantage over classical ones in simulating quantum systems before the fault tolerance quantum era.
△ Less
Submitted 18 March, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Exact ASP Counting with Compact Encodings
Authors:
Mohimenul Kabir,
Supratik Chakraborty,
Kuldeep S Meel
Abstract:
Answer Set Programming (ASP) has emerged as a promising paradigm in knowledge representation and automated reasoning owing to its ability to model hard combinatorial problems from diverse domains in a natural way. Building on advances in propositional SAT solving, the past two decades have witnessed the emergence of well-engineered systems for solving the answer set satisfiability problem, i.e., f…
▽ More
Answer Set Programming (ASP) has emerged as a promising paradigm in knowledge representation and automated reasoning owing to its ability to model hard combinatorial problems from diverse domains in a natural way. Building on advances in propositional SAT solving, the past two decades have witnessed the emergence of well-engineered systems for solving the answer set satisfiability problem, i.e., finding models or answer sets for a given answer set program. In recent years, there has been growing interest in problems beyond satisfiability, such as model counting, in the context of ASP. Akin to the early days of propositional model counting, state-of-the-art exact answer set counters do not scale well beyond small instances. Exact ASP counters struggle with handling larger input formulas. The primary contribution of this paper is a new ASP counting framework, called sharpASP, which counts answer sets avoiding larger input formulas. This relies on an alternative way of defining answer sets that allows for the lifting of key techniques developed in the context of propositional model counting. Our extensive empirical analysis over 1470 benchmarks demonstrates significant performance gain over current state-of-the-art exact answer set counters. Specifically, by using sharpASP, we were able to solve 1062 benchmarks with PAR2 score of 3082 whereas using prior state-of-the-art, we could only solve 895 benchmarks with a PAR2 score of 4205, all other experimental conditions being the same.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
An Interpretable Deep Learning Approach for Skin Cancer Categorization
Authors:
Faysal Mahmud,
Md. Mahin Mahfiz,
Md. Zobayer Ibna Kabir,
Yusha Abdullah
Abstract:
Skin cancer is a serious worldwide health issue, precise and early detection is essential for better patient outcomes and effective treatment. In this research, we use modern deep learning methods and explainable artificial intelligence (XAI) approaches to address the problem of skin cancer detection. To categorize skin lesions, we employ four cutting-edge pre-trained models: XceptionNet, Efficien…
▽ More
Skin cancer is a serious worldwide health issue, precise and early detection is essential for better patient outcomes and effective treatment. In this research, we use modern deep learning methods and explainable artificial intelligence (XAI) approaches to address the problem of skin cancer detection. To categorize skin lesions, we employ four cutting-edge pre-trained models: XceptionNet, EfficientNetV2S, InceptionResNetV2, and EfficientNetV2M. Image augmentation approaches are used to reduce class imbalance and improve the generalization capabilities of our models. Our models decision-making process can be clarified because of the implementation of explainable artificial intelligence (XAI). In the medical field, interpretability is essential to establish credibility and make it easier to implement AI driven diagnostic technologies into clinical workflows. We determined the XceptionNet architecture to be the best performing model, achieving an accuracy of 88.72%. Our study shows how deep learning and explainable artificial intelligence (XAI) can improve skin cancer diagnosis, laying the groundwork for future developments in medical image analysis. These technologies ability to allow for early and accurate detection could enhance patient care, lower healthcare costs, and raise the survival rates for those with skin cancer. Source Code: https://github.com/Faysal-MD/An-Interpretable-Deep-Learning?Approach-for-Skin-Cancer-Categorization-IEEE2023
△ Less
Submitted 17 December, 2023;
originally announced December 2023.
-
LEI: Livestock Event Information Schema for Enabling Data Sharing
Authors:
Mahir Habib,
Muhammad Ashad Kabir,
Lihong Zheng,
Shawn McGrath
Abstract:
Data-driven advances have resulted in significant improvements in dairy production. However, the meat industry has lagged behind in adopting data-driven approaches, underscoring the crucial need for data standardisation to facilitate seamless data transmission to maximise productivity, save costs, and increase market access. To address this gap, we propose a novel data schema, Livestock Event Info…
▽ More
Data-driven advances have resulted in significant improvements in dairy production. However, the meat industry has lagged behind in adopting data-driven approaches, underscoring the crucial need for data standardisation to facilitate seamless data transmission to maximise productivity, save costs, and increase market access. To address this gap, we propose a novel data schema, Livestock Event Information (LEI) schema, designed to accurately and uniformly record livestock events. LEI complies with the International Committee for Animal Recording (ICAR) and Integrity System Company (ISC) schemas to deliver this data standardisation and enable data sharing between producers and consumers. To validate the superiority of LEI, we conducted a structural metrics analysis and a comprehensive case study. The analysis demonstrated that LEI outperforms the ICAR and ISC schemas in terms of design, while the case study confirmed its superior ability to capture livestock event information. Our findings lay the foundation for the implementation of the LEI schema, unlocking the potential for data-driven advances in livestock management. Moreover, LEI's versatility opens avenues for future expansion into other agricultural domains, encompassing poultry, fisheries, and crops. The adoption of LEI promises substantial benefits, including improved data accuracy, reduced costs, and increased productivity, heralding a new era of sustainability in the meat industry.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
LEI2JSON: Schema-based Validation and Conversion of Livestock Event Information
Authors:
Mahir Habib,
Muhammad Ashad Kabir,
Lihong Zheng
Abstract:
Livestock producers often need help in standardising (i.e., converting and validating) their livestock event data. This article introduces a novel solution, LEI2JSON (Livestock Event Information To JSON). The tool is an add-on for Google Sheets, adhering to the livestock event information (LEI) schema. The core objective of LEI2JSON is to provide livestock producers with an efficient mechanism to…
▽ More
Livestock producers often need help in standardising (i.e., converting and validating) their livestock event data. This article introduces a novel solution, LEI2JSON (Livestock Event Information To JSON). The tool is an add-on for Google Sheets, adhering to the livestock event information (LEI) schema. The core objective of LEI2JSON is to provide livestock producers with an efficient mechanism to standardise their data, leading to substantial savings in time and resources. This is achieved by building the spreadsheet template with the appropriate column headers, notes, and validation rules, converting the spreadsheet data into JSON format, and validating the output against the schema. LEI2JSON facilitates the seamless storage of livestock event information locally or on Google Drive in JSON. Additionally, we have conducted an extensive experimental evaluation to assess the effectiveness of the tool.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Towards Automated Recipe Genre Classification using Semi-Supervised Learning
Authors:
Nazmus Sakib,
G. M. Shahariar,
Md. Mohsinul Kabir,
Md. Kamrul Hasan,
Hasan Mahmud
Abstract:
Sharing cooking recipes is a great way to exchange culinary ideas and provide instructions for food preparation. However, categorizing raw recipes found online into appropriate food genres can be challenging due to a lack of adequate labeled data. In this study, we present a dataset named the ``Assorted, Archetypal, and Annotated Two Million Extended (3A2M+) Cooking Recipe Dataset" that contains t…
▽ More
Sharing cooking recipes is a great way to exchange culinary ideas and provide instructions for food preparation. However, categorizing raw recipes found online into appropriate food genres can be challenging due to a lack of adequate labeled data. In this study, we present a dataset named the ``Assorted, Archetypal, and Annotated Two Million Extended (3A2M+) Cooking Recipe Dataset" that contains two million culinary recipes labeled in respective categories with extended named entities extracted from recipe descriptions. This collection of data includes various features such as title, NER, directions, and extended NER, as well as nine different labels representing genres including bakery, drinks, non-veg, vegetables, fast food, cereals, meals, sides, and fusions. The proposed pipeline named 3A2M+ extends the size of the Named Entity Recognition (NER) list to address missing named entities like heat, time or process from the recipe directions using two NER extraction tools. 3A2M+ dataset provides a comprehensive solution to the various challenging recipe-related tasks, including classification, named entity recognition, and recipe generation. Furthermore, we have demonstrated traditional machine learning, deep learning and pre-trained language models to classify the recipes into their corresponding genre and achieved an overall accuracy of 98.6\%. Our investigation indicates that the title feature played a more significant role in classifying the genre.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
MMTF-DES: A Fusion of Multimodal Transformer Models for Desire, Emotion, and Sentiment Analysis of Social Media Data
Authors:
Abdul Aziz,
Nihad Karim Chowdhury,
Muhammad Ashad Kabir,
Abu Nowshed Chy,
Md. Jawad Siddique
Abstract:
Desire is a set of human aspirations and wishes that comprise verbal and cognitive aspects that drive human feelings and behaviors, distinguishing humans from other animals. Understanding human desire has the potential to be one of the most fascinating and challenging research domains. It is tightly coupled with sentiment analysis and emotion recognition tasks. It is beneficial for increasing huma…
▽ More
Desire is a set of human aspirations and wishes that comprise verbal and cognitive aspects that drive human feelings and behaviors, distinguishing humans from other animals. Understanding human desire has the potential to be one of the most fascinating and challenging research domains. It is tightly coupled with sentiment analysis and emotion recognition tasks. It is beneficial for increasing human-computer interactions, recognizing human emotional intelligence, understanding interpersonal relationships, and making decisions. However, understanding human desire is challenging and under-explored because ways of eliciting desire might be different among humans. The task gets more difficult due to the diverse cultures, countries, and languages. Prior studies overlooked the use of image-text pairwise feature representation, which is crucial for the task of human desire understanding. In this research, we have proposed a unified multimodal transformer-based framework with image-text pair settings to identify human desire, sentiment, and emotion. The core of our proposed method lies in the encoder module, which is built using two state-of-the-art multimodal transformer models. These models allow us to extract diverse features. To effectively extract visual and contextualized embedding features from social media image and text pairs, we conducted joint fine-tuning of two pre-trained multimodal transformer models: Vision-and-Language Transformer (ViLT) and Vision-and-Augmented-Language Transformer (VAuLT). Subsequently, we use an early fusion strategy on these embedding features to obtain combined diverse feature representations of the image-text pair. This consolidation incorporates diverse information about this task, enabling us to robustly perceive the context and image pair from multiple perspectives.
△ Less
Submitted 21 October, 2023;
originally announced October 2023.
-
COVIDFakeExplainer: An Explainable Machine Learning based Web Application for Detecting COVID-19 Fake News
Authors:
Dylan Warman,
Muhammad Ashad Kabir
Abstract:
Fake news has emerged as a critical global issue, magnified by the COVID-19 pandemic, underscoring the need for effective preventive tools. Leveraging machine learning, including deep learning techniques, offers promise in combatting fake news. This paper goes beyond by establishing BERT as the superior model for fake news detection and demonstrates its utility as a tool to empower the general pop…
▽ More
Fake news has emerged as a critical global issue, magnified by the COVID-19 pandemic, underscoring the need for effective preventive tools. Leveraging machine learning, including deep learning techniques, offers promise in combatting fake news. This paper goes beyond by establishing BERT as the superior model for fake news detection and demonstrates its utility as a tool to empower the general populace. We have implemented a browser extension, enhanced with explainability features, enabling real-time identification of fake news and delivering easily interpretable explanations. To achieve this, we have employed two publicly available datasets and created seven distinct data configurations to evaluate three prominent machine learning architectures. Our comprehensive experiments affirm BERT's exceptional accuracy in detecting COVID-19-related fake news. Furthermore, we have integrated an explainability component into the BERT model and deployed it as a service through Amazon's cloud API hosting (AWS). We have developed a browser extension that interfaces with the API, allowing users to select and transmit data from web pages, receiving an intelligible classification in return. This paper presents a practical end-to-end solution, highlighting the feasibility of constructing a holistic system for fake news detection, which can significantly benefit society.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
An empirical study of ChatGPT-3.5 on question answering and code maintenance
Authors:
Md Mahir Asef Kabir,
Sk Adnan Hassan,
Xiaoyin Wang,
Ying Wang,
Hai Yu,
Na Meng
Abstract:
Ever since the launch of ChatGPT in 2022, a rising concern is whether ChatGPT will replace programmers and kill jobs. Motivated by this widespread concern, we conducted an empirical study to systematically compare ChatGPT against programmers in question-answering and software-maintaining. We reused a dataset introduced by prior work, which includes 130 StackOverflow (SO) discussion threads referre…
▽ More
Ever since the launch of ChatGPT in 2022, a rising concern is whether ChatGPT will replace programmers and kill jobs. Motivated by this widespread concern, we conducted an empirical study to systematically compare ChatGPT against programmers in question-answering and software-maintaining. We reused a dataset introduced by prior work, which includes 130 StackOverflow (SO) discussion threads referred to by the Java developers of 357 GitHub projects. We mainly investigated three research questions (RQs). First, how does ChatGPT compare with programmers when answering technical questions? Second, how do developers perceive the differences between ChatGPT's answers and SO answers? Third, how does ChatGPT compare with humans when revising code for maintenance requests?
For RQ1, we provided the 130 SO questions to ChatGPT, and manually compared ChatGPT answers with the accepted/most popular SO answers in terms of relevance, readability, informativeness, comprehensiveness, and reusability. For RQ2, we conducted a user study with 30 developers, asking each developer to assess and compare 10 pairs of answers, without knowing the information source (i.e., ChatGPT or SO). For RQ3, we distilled 48 software maintenance tasks from 48 GitHub projects citing the studied SO threads. We queried ChatGPT to revise a given Java file, and to incorporate the code implementation for any prescribed maintenance requirement. Our study reveals interesting phenomena: For the majority of SO questions (97/130), ChatGPT provided better answers; in 203 of 300 ratings, developers preferred ChatGPT answers to SO answers; ChatGPT revised code correctly for 22 of the 48 tasks. Our research will expand people's knowledge of ChatGPT capabilities, and shed light on future adoption of ChatGPT by the software industry.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
Longitudinal and transverse spin transfer to $Λ$ and $\overlineΛ$ hyperons in polarized $p$+$p$ collisions at $\sqrt{s} = 200$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (357 additional authors not shown)
Abstract:
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and…
▽ More
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and the transverse spin transfer coefficient, $D_{TT}$, to $Λ$ and $\overlineΛ$ in polarized proton-proton collisions at $\sqrt{s}$ = 200 GeV by the STAR experiment at RHIC. The data set includes longitudinally polarized proton-proton collisions with an integrated luminosity of 52 pb$^{-1}$, and transversely polarized proton-proton collisions with a similar integrated luminosity. Both data sets have about twice the statistics of previous results and cover a kinematic range of $|η_{Λ(\overlineΛ)}|$ $<$ 1.2 and transverse momentum $p_{T,{Λ(\overlineΛ)}}$ up to 8 GeV/$c$. We also report the first measurements of the hyperon spin transfer coefficients $D_{LL}$ and $D_{TT}$ as a function of the fractional jet momentum $z$ carried by the hyperon, which can provide more direct constraints on the polarized fragmentation functions.
△ Less
Submitted 7 December, 2023; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Nutritional composition and bioactive compounds of mini watermelon genotypes in Bangladesh
Authors:
Hasina Sultana,
Sharmila Rani Mallick,
Jahidul Hassan,
Joydeb Gomasta,
Md. Humayun Kabir,
Md. Sakibul Alam Sakib,
Mahmuda Hossen,
Muhammad Mustakim Billah,
Emrul Kayesh
Abstract:
Given the present rising trends in changing lifestyle and consumption patterns, watermelon production has shifted from big to small-sized fruits having desirable quality attributes. Hence, analyses of fruit quality traits of mini watermelon are crucial to develop improved cultivars with enhanced nutritional compositions, consumer-preferred traits and extended storage life. In this context, fruit m…
▽ More
Given the present rising trends in changing lifestyle and consumption patterns, watermelon production has shifted from big to small-sized fruits having desirable quality attributes. Hence, analyses of fruit quality traits of mini watermelon are crucial to develop improved cultivars with enhanced nutritional compositions, consumer-preferred traits and extended storage life. In this context, fruit morphological and nutritional attributes of five mini watermelon genotypes namely BARI watermelon 1 (W1), BARI watermelon 2 (W2), L-32468 (W3), L-32236 (W4) and L-32394 (W5) were evaluated to appraise promising genotypes with better fruit quality. The evaluated genotypes expressed different levels of diversity for fruit physical qualitative traits including differences in shape, rind and flesh color and texture. The study also revealed significant variability among the genotypes regarding all observed fruit morphological and nutritional aspects as well as bioactive compounds. Among the studied genotypes, W1 stood out with the highest TSS as well as rind vitamin C and total phenolic content accompanied by higher fruit weight and thick rind. On the other hand, W3 genotype was featured with higher amount of \b{eta} carotene, total phenolic and flavonoid content in its flesh along with rind enriched with \b{eta} carotene and minerals. However, comparatively higher amount of sugar and total flavonoid content was recorded in the rind of W5 genotype. Therefore, W1 and W3 could be exploited for table purpose and using in breeding program to develop mini watermelon cultivars with more attractive fruits in terms of quality acceptance and nutritional value in Bangladesh. Furthermore, rind of BARI watermelon 1 and L-32394 could be considered as the potential cheap source of bioactive compounds to be used for dietary and industrial purpose which would decrease the solid waste in the environment.
△ Less
Submitted 24 September, 2023;
originally announced September 2023.
-
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP
Authors:
Mohsinul Kabir,
Mohammed Saidul Islam,
Md Tahmid Rahman Laskar,
Mir Tafseer Nayeem,
M Saiful Bari,
Enamul Hoque
Abstract:
Large Language Models (LLMs) have emerged as one of the most important breakthroughs in NLP for their impressive skills in language generation and other language-specific tasks. Though LLMs have been evaluated in various tasks, mostly in English, they have not yet undergone thorough evaluation in under-resourced languages such as Bengali (Bangla). To this end, this paper introduces BenLLM-Eval, wh…
▽ More
Large Language Models (LLMs) have emerged as one of the most important breakthroughs in NLP for their impressive skills in language generation and other language-specific tasks. Though LLMs have been evaluated in various tasks, mostly in English, they have not yet undergone thorough evaluation in under-resourced languages such as Bengali (Bangla). To this end, this paper introduces BenLLM-Eval, which consists of a comprehensive evaluation of LLMs to benchmark their performance in the Bengali language that has modest resources. In this regard, we select various important and diverse Bengali NLP tasks, such as text summarization, question answering, paraphrasing, natural language inference, transliteration, text classification, and sentiment analysis for zero-shot evaluation of popular LLMs, namely, GPT-3.5, LLaMA-2-13b-chat, and Claude-2. Our experimental results demonstrate that while in some Bengali NLP tasks, zero-shot LLMs could achieve performance on par, or even better than current SOTA fine-tuned models; in most tasks, their performance is quite poor (with the performance of open-source LLMs like LLaMA-2-13b-chat being significantly bad) in comparison to the current SOTA results. Therefore, it calls for further efforts to develop a better understanding of LLMs in modest-resourced languages like Bengali.
△ Less
Submitted 19 March, 2024; v1 submitted 22 September, 2023;
originally announced September 2023.
-
AI-Driven Personalised Offloading Device Prescriptions: A Cutting-Edge Approach to Preventing Diabetes-Related Plantar Forefoot Ulcers and Complications
Authors:
Sayed Ahmed,
Muhammad Ashad Kabir,
Muhammad E. H. Chowdhury,
Susan Nancarrow
Abstract:
Diabetes-related foot ulcers and complications are a significant concern for individuals with diabetes, leading to severe health implications such as lower-limb amputation and reduced quality of life. This chapter discusses applying AI-driven personalised offloading device prescriptions as an advanced solution for preventing such conditions. By harnessing the capabilities of artificial intelligenc…
▽ More
Diabetes-related foot ulcers and complications are a significant concern for individuals with diabetes, leading to severe health implications such as lower-limb amputation and reduced quality of life. This chapter discusses applying AI-driven personalised offloading device prescriptions as an advanced solution for preventing such conditions. By harnessing the capabilities of artificial intelligence, this cutting-edge approach enables the prescription of offloading devices tailored to each patient's specific requirements. This includes the patient's preferences on offloading devices such as footwear and foot orthotics and their adaptations that suit the patient's intention of use and lifestyle. Through a series of studies, real-world data analysis and machine learning algorithms, high-risk areas can be identified, facilitating the recommendation of precise offloading strategies, including custom orthotic insoles, shoe adaptations, or specialised footwear. By including patient-specific factors to promote adherence, proactively addressing pressure points and promoting optimal foot mechanics, these personalised offloading devices have the potential to minimise the occurrence of foot ulcers and associated complications. This chapter proposes an AI-powered Clinical Decision Support System (CDSS) to recommend personalised prescriptions of offloading devices (footwear and insoles) for patients with diabetes who are at risk of foot complications. This innovative approach signifies a transformative leap in diabetic foot care, offering promising opportunities for preventive healthcare interventions.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Unveiling the frontiers of deep learning: innovations shaping diverse domains
Authors:
Shams Forruque Ahmed,
Md. Sakib Bin Alam,
Maliha Kabir,
Shaila Afrin,
Sabiha Jannat Rafa,
Aanushka Mehjabin,
Amir H. Gandomi
Abstract:
Deep learning (DL) enables the development of computer models that are capable of learning, visualizing, optimizing, refining, and predicting data. In recent years, DL has been applied in a range of fields, including audio-visual data processing, agriculture, transportation prediction, natural language, biomedicine, disaster management, bioinformatics, drug design, genomics, face recognition, and…
▽ More
Deep learning (DL) enables the development of computer models that are capable of learning, visualizing, optimizing, refining, and predicting data. In recent years, DL has been applied in a range of fields, including audio-visual data processing, agriculture, transportation prediction, natural language, biomedicine, disaster management, bioinformatics, drug design, genomics, face recognition, and ecology. To explore the current state of deep learning, it is necessary to investigate the latest developments and applications of deep learning in these disciplines. However, the literature is lacking in exploring the applications of deep learning in all potential sectors. This paper thus extensively investigates the potential applications of deep learning across all major fields of study as well as the associated benefits and challenges. As evidenced in the literature, DL exhibits accuracy in prediction and analysis, makes it a powerful computational tool, and has the ability to articulate itself and optimize, making it effective in processing data with no prior training. Given its independence from training data, deep learning necessitates massive amounts of data for effective analysis and processing, much like data volume. To handle the challenge of compiling huge amounts of medical, scientific, healthcare, and environmental data for use in deep learning, gated architectures like LSTMs and GRUs can be utilized. For multimodal learning, shared neurons in the neural network for all activities and specialized neurons for particular tasks are necessary.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing
Authors:
Jonathan Cui,
David A. Araujo,
Suman Saha,
Md. Faisal Kabir
Abstract:
Despite their simpler information fusion designs compared with Vision Transformers and Convolutional Neural Networks, Vision MLP architectures have demonstrated strong performance and high data efficiency in recent research. However, existing works such as CycleMLP and Vision Permutator typically model spatial information in equal-size spatial regions and do not consider cross-scale spatial intera…
▽ More
Despite their simpler information fusion designs compared with Vision Transformers and Convolutional Neural Networks, Vision MLP architectures have demonstrated strong performance and high data efficiency in recent research. However, existing works such as CycleMLP and Vision Permutator typically model spatial information in equal-size spatial regions and do not consider cross-scale spatial interactions. Further, their token mixers only model 1- or 2-axis correlations, avoiding 3-axis spatial-channel mixing due to its computational demands. We therefore propose CS-Mixer, a hierarchical Vision MLP that learns dynamic low-rank transformations for spatial-channel mixing through cross-scale local and global aggregation. The proposed methodology achieves competitive results on popular image recognition benchmarks without incurring substantially more compute. Our largest model, CS-Mixer-L, reaches 83.2% top-1 accuracy on ImageNet-1k with 13.7 GFLOPs and 94 M parameters.
△ Less
Submitted 14 January, 2024; v1 submitted 25 August, 2023;
originally announced August 2023.
-
FPGA Processor In Memory Architectures (PIMs): Overlay or Overhaul ?
Authors:
MD Arafat Kabir,
Ehsan Kabir,
Joshua Hollis,
Eli Levy-Mackay,
Atiyehsadat Panahi,
Jason Bakos,
Miaoqing Huang,
David Andrews
Abstract:
The dominance of machine learning and the ending of Moore's law have renewed interests in Processor in Memory (PIM) architectures. This interest has produced several recent proposals to modify an FPGA's BRAM architecture to form a next-generation PIM reconfigurable fabric. PIM architectures can also be realized within today's FPGAs as overlays without the need to modify the underlying FPGA archite…
▽ More
The dominance of machine learning and the ending of Moore's law have renewed interests in Processor in Memory (PIM) architectures. This interest has produced several recent proposals to modify an FPGA's BRAM architecture to form a next-generation PIM reconfigurable fabric. PIM architectures can also be realized within today's FPGAs as overlays without the need to modify the underlying FPGA architecture. To date, there has been no study to understand the comparative advantages of the two approaches. In this paper, we present a study that explores the comparative advantages between two proposed custom architectures and a PIM overlay running on a commodity FPGA. We created PiCaSO, a Processor in/near Memory Scalable and Fast Overlay architecture as a representative PIM overlay. The results of this study show that the PiCaSO overlay achieves up to 80% of the peak throughput of the custom designs with 2.56x shorter latency and 25% - 43% better BRAM memory utilization efficiency. We then show how several key features of the PiCaSO overlay can be integrated into the custom PIM designs to further improve their throughput by 18%, latency by 19.5%, and memory efficiency by 6.2%.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
Electron-induced non-monotonic pressure dependence of the lattice thermal conductivity of θ-TaN
Authors:
Ashis Kundu,
Yani Chen,
Xiaolong Yang,
Fanchen Meng,
Jesús Carrete,
Mukul Kabir,
Georg K. H. Madsen,
Wu Li
Abstract:
Recent theoretical and experimental research suggests that $θ$-TaN is a semimetal with high thermal conductivity ($κ$), primarily due to the contribution of phonons ($κ_\texttt{ph}$). By using first-principles calculations, we show a non-monotonic pressure dependence of the $κ$ of $θ$-TaN. $κ_\texttt{ph}$ first increases until it reaches a maximum at around 60~GPa, and then decreases. This anomalo…
▽ More
Recent theoretical and experimental research suggests that $θ$-TaN is a semimetal with high thermal conductivity ($κ$), primarily due to the contribution of phonons ($κ_\texttt{ph}$). By using first-principles calculations, we show a non-monotonic pressure dependence of the $κ$ of $θ$-TaN. $κ_\texttt{ph}$ first increases until it reaches a maximum at around 60~GPa, and then decreases. This anomalous behaviour is a consequence of the competing pressure responses of phonon-phonon and phonon-electron interactions, in contrast to the known materials BAs and BP, where the non-monotonic pressure dependence is caused by the interplay between different phonon-phonon scattering channels. Although TaN has phonon dispersion features similar to BAs at ambient pressure, its response to pressure is different and an overall stiffening of the phonon branches takes place. Consequently, the relevant phonon-phonon scattering weakens as pressure increases. However, the increased electronic density of states near the Fermi level, and specifically the emergence of additional pockets of the Fermi surface at the high-symmetry L point in the Brillouin zone, leads to a substantial increase in phonon-electron scattering at high pressures, driving a decrease in $κ_{\mathrm{ph}}$. At intermediate pressures ($\sim$~20$-$70~GPa), the $κ$ of TaN surpasses that of BAs. Our work provides deeper insight into phonon transport in semimetals and metals where phonon-electron scattering is relevant.
△ Less
Submitted 19 March, 2024; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Enhanced Magnetism and Phase Transitions in Ultrathin Quantum Spin Liquid Na2IrO3 Flakes
Authors:
Deepak K Roy,
Mukul Kabir
Abstract:
The quest for quantum spin liquids has garnered significant attention due to their rich physics and disruptive prospects in quantum communication and computation. Spin-orbit coupling, electron correlation, and structural distortion play critical roles in the candidate materials that eventually order antiferromagnetically at low temperatures. We introduce quantum electron confinement to the existin…
▽ More
The quest for quantum spin liquids has garnered significant attention due to their rich physics and disruptive prospects in quantum communication and computation. Spin-orbit coupling, electron correlation, and structural distortion play critical roles in the candidate materials that eventually order antiferromagnetically at low temperatures. We introduce quantum electron confinement to the existing complexity and explore the interplay between Heisenberg and Kitaev interactions in ultrathin \ce{Na2IrO3} layers using first-principles calculations. The zigzag antiferromagnetic state in the monolayer is reinforced and pushed further away from the Kitaev spin liquid state due to the increased strength of Heisenberg and off-diagonal exchange interactions. In contrast, the carrier-doped flakes undergo a Mott insulator-to-metal transition accompanied by an antiferromagnetic to ferromagnetic transition. These findings present exciting prospects for comprehending magnetism in a novel two-dimensional framework of non-van der Waals correlated oxide flakes.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
Beam Spin Asymmetry Measurements of Deeply Virtual $π^0$ Production with CLAS12
Authors:
A. Kim,
S. Diehl,
K. Joo,
V. Kubarovsky,
P. Achenbach,
Z. Akbar,
J. S. Alvarado,
Whitney R. Armstrong,
H. Atac,
H. Avakian,
C. Ayerbe Gayoso,
L. Barion,
M. Battaglieri,
I. Bedlinskiy,
B. Benkel,
A. Bianconi,
A. S. Biselli,
M. Bondi,
F. Bossù,
S. Boiarinov,
K. T. Brinkmann,
W. J. Briscoe,
W. K. Brooks,
S. Bueltmann,
V. D. Burkert
, et al. (132 additional authors not shown)
Abstract:
The new experimental measurements of beam spin asymmetry were performed for the deeply virtual exclusive $π^0$ production in a wide kinematic region with the photon virtualities $Q^2$ up to 8 GeV$^2$ and the Bjorken scaling variable $x_B$ in the valence regime. The data were collected by the CEBAF Large Acceptance Spectrometer (CLAS12) at Jefferson Lab with longitudinally polarized 10.6 GeV electr…
▽ More
The new experimental measurements of beam spin asymmetry were performed for the deeply virtual exclusive $π^0$ production in a wide kinematic region with the photon virtualities $Q^2$ up to 8 GeV$^2$ and the Bjorken scaling variable $x_B$ in the valence regime. The data were collected by the CEBAF Large Acceptance Spectrometer (CLAS12) at Jefferson Lab with longitudinally polarized 10.6 GeV electrons scattered on an unpolarized liquid-hydrogen target. Sizable asymmetry values indicate a substantial contribution from transverse virtual photon amplitudes to the polarized structure functions.The interpretation of these measurements in terms of the Generalized Parton Distributions (GPDs) demonstrates their sensitivity to the chiral-odd GPD $\bar E_T$, which contains information on quark transverse spin densities in unpolarized and polarized nucleons and provides access to the proton's transverse anomalous magnetic moment. Additionally, the data were compared to a theoretical model based on a Regge formalism that was extended to the high photon virtualities.
△ Less
Submitted 15 July, 2023;
originally announced July 2023.
-
AttResDU-Net: Medical Image Segmentation Using Attention-based Residual Double U-Net
Authors:
Akib Mohammed Khan,
Alif Ashrafee,
Fahim Shahriar Khan,
Md. Bakhtiar Hasan,
Md. Hasanul Kabir
Abstract:
Manually inspecting polyps from a colonoscopy for colorectal cancer or performing a biopsy on skin lesions for skin cancer are time-consuming, laborious, and complex procedures. Automatic medical image segmentation aims to expedite this diagnosis process. However, numerous challenges exist due to significant variations in the appearance and sizes of objects with no distinct boundaries. This paper…
▽ More
Manually inspecting polyps from a colonoscopy for colorectal cancer or performing a biopsy on skin lesions for skin cancer are time-consuming, laborious, and complex procedures. Automatic medical image segmentation aims to expedite this diagnosis process. However, numerous challenges exist due to significant variations in the appearance and sizes of objects with no distinct boundaries. This paper proposes an attention-based residual Double U-Net architecture (AttResDU-Net) that improves on the existing medical image segmentation networks. Inspired by the Double U-Net, this architecture incorporates attention gates on the skip connections and residual connections in the convolutional blocks. The attention gates allow the model to retain more relevant spatial information by suppressing irrelevant feature representation from the down-sampling path for which the model learns to focus on target regions of varying shapes and sizes. Moreover, the residual connections help to train deeper models by ensuring better gradient flow. We conducted experiments on three datasets: CVC Clinic-DB, ISIC 2018, and the 2018 Data Science Bowl datasets and achieved Dice Coefficient scores of 94.35%, 91.68% and 92.45% respectively. Our results suggest that AttResDU-Net can be facilitated as a reliable method for automatic medical image segmentation in practice.
△ Less
Submitted 25 June, 2023;
originally announced June 2023.
-
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements
Authors:
Syed Rifat Raiyan,
Md. Nafis Faiyaz,
Shah Md. Jawad Kabir,
Mohsinul Kabir,
Hasan Mahmud,
Md Kamrul Hasan
Abstract:
The art of mathematical reasoning stands as a fundamental pillar of intellectual progress and is a central catalyst in cultivating human ingenuity. Researchers have recently published a plethora of works centered around the task of solving Math Word Problems (MWP) $-$ a crucial stride towards general AI. These existing models are susceptible to dependency on shallow heuristics and spurious correla…
▽ More
The art of mathematical reasoning stands as a fundamental pillar of intellectual progress and is a central catalyst in cultivating human ingenuity. Researchers have recently published a plethora of works centered around the task of solving Math Word Problems (MWP) $-$ a crucial stride towards general AI. These existing models are susceptible to dependency on shallow heuristics and spurious correlations to derive the solution expressions. In order to ameliorate this issue, in this paper, we propose a framework for MWP solvers based on the generation of linguistic variants of the problem text. The approach involves solving each of the variant problems and electing the predicted expression with the majority of the votes. We use DeBERTa (Decoding-enhanced BERT with disentangled attention) as the encoder to leverage its rich textual representations and enhanced mask decoder to construct the solution expressions. Furthermore, we introduce a challenging dataset, $\mathrm{P\small{ARA}\normalsize{MAWPS}}$, consisting of paraphrased, adversarial, and inverse variants of selectively sampled MWPs from the benchmark $\mathrm{M\small{AWPS}}$ dataset. We extensively experiment on this dataset along with other benchmark datasets using some baseline MWP solver models. We show that training on linguistic variants of problem statements and voting on candidate predictions improve the mathematical reasoning and robustness of the model. We make our code and data publicly available.
△ Less
Submitted 24 June, 2023;
originally announced June 2023.
-
High transport spin polarization in the van der Waals ferromagnet Fe$_4$GeTe$_2$
Authors:
Deepti Rana,
Monika Bhakar,
Basavaraja G.,
Satyabrata Bera,
Neeraj Saini,
Suman Kalyan Pradhan,
Mintu Mondal,
Mukul Kabir,
Goutam Sheet
Abstract:
The challenging task of scaling-down the size of the power saving electronic devices can be accomplished by exploiting the spin degree of freedom of the conduction electrons in van der Waals (vdW) spintronic architectures built with 2D materials. One of the key components of such a device is a near-room temperature 2D ferromagnet with good metallicity that can generate a highly spin-polarized elec…
▽ More
The challenging task of scaling-down the size of the power saving electronic devices can be accomplished by exploiting the spin degree of freedom of the conduction electrons in van der Waals (vdW) spintronic architectures built with 2D materials. One of the key components of such a device is a near-room temperature 2D ferromagnet with good metallicity that can generate a highly spin-polarized electronic transport current. However, most of the known 2D ferromagnets have either a very low temperature ordering, poor conductivity, or low spin polarization. In this context, the Fe$_n$GeTe$_2$ (with $n\geq3$) family of ferromagnets stand out due to their near-room temperature ferromagnetism and good metallicity. We have performed spin-resolved Andreev reflection spectroscopy on Fe$_4$GeTe$_2$ ($T_{Curie} \sim$ 273 K) and demonstrated that the ferromagnet is capable of generating a very high transport spin polarization, exceeding 50$\%$. This makes Fe$_4$GeTe$_2$ a strong candidate for application in all-vdW power-saving spintronic devices.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
BanglaBook: A Large-scale Bangla Dataset for Sentiment Analysis from Book Reviews
Authors:
Mohsinul Kabir,
Obayed Bin Mahfuz,
Syed Rifat Raiyan,
Hasan Mahmud,
Md Kamrul Hasan
Abstract:
The analysis of consumer sentiment, as expressed through reviews, can provide a wealth of insight regarding the quality of a product. While the study of sentiment analysis has been widely explored in many popular languages, relatively less attention has been given to the Bangla language, mostly due to a lack of relevant data and cross-domain adaptability. To address this limitation, we present Ban…
▽ More
The analysis of consumer sentiment, as expressed through reviews, can provide a wealth of insight regarding the quality of a product. While the study of sentiment analysis has been widely explored in many popular languages, relatively less attention has been given to the Bangla language, mostly due to a lack of relevant data and cross-domain adaptability. To address this limitation, we present BanglaBook, a large-scale dataset of Bangla book reviews consisting of 158,065 samples classified into three broad categories: positive, negative, and neutral. We provide a detailed statistical analysis of the dataset and employ a range of machine learning models to establish baselines including SVM, LSTM, and Bangla-BERT. Our findings demonstrate a substantial performance advantage of pre-trained models over models that rely on manually crafted features, emphasizing the necessity for additional training resources in this domain. Additionally, we conduct an in-depth error analysis by examining sentiment unigrams, which may provide insight into common classification errors in under-resourced languages like Bangla. Our codes and data are publicly available at https://github.com/mohsinulkabir14/BanglaBook.
△ Less
Submitted 8 June, 2023; v1 submitted 11 May, 2023;
originally announced May 2023.
-
"When Words Fail, Emojis Prevail": Generating Sarcastic Utterances with Emoji Using Valence Reversal and Semantic Incongruity
Authors:
Faria Binte Kader,
Nafisa Hossain Nujat,
Tasmia Binte Sogir,
Mohsinul Kabir,
Hasan Mahmud,
Kamrul Hasan
Abstract:
Sarcasm is a form of figurative language that serves as a humorous tool for mockery and ridicule. We present a novel architecture for sarcasm generation with emoji from a non-sarcastic input sentence in English. We divide the generation task into two sub tasks: one for generating textual sarcasm and another for collecting emojis associated with those sarcastic sentences. Two key elements of sarcas…
▽ More
Sarcasm is a form of figurative language that serves as a humorous tool for mockery and ridicule. We present a novel architecture for sarcasm generation with emoji from a non-sarcastic input sentence in English. We divide the generation task into two sub tasks: one for generating textual sarcasm and another for collecting emojis associated with those sarcastic sentences. Two key elements of sarcasm are incorporated into the textual sarcasm generation task: valence reversal and semantic incongruity with context, where the context may involve shared commonsense or general knowledge between the speaker and their audience. The majority of existing sarcasm generation works have focused on this textual form. However, in the real world, when written texts fall short of effectively capturing the emotional cues of spoken and face-to-face communication, people often opt for emojis to accurately express their emotions. Due to the wide range of applications of emojis, incorporating appropriate emojis to generate textual sarcastic sentences helps advance sarcasm generation. We conclude our study by evaluating the generated sarcastic sentences using human judgement. All the codes and data used in this study has been made publicly available.
△ Less
Submitted 16 June, 2023; v1 submitted 6 May, 2023;
originally announced May 2023.
-
Reduction of Class Activation Uncertainty with Background Information
Authors:
H M Dipu Kabir
Abstract:
Multitask learning is a popular approach to training high-performing neural networks with improved generalization. In this paper, we propose a background class to achieve improved generalization at a lower computation compared to multitask learning to help researchers and organizations with limited computation power. We also present a methodology for selecting background images and discuss potenti…
▽ More
Multitask learning is a popular approach to training high-performing neural networks with improved generalization. In this paper, we propose a background class to achieve improved generalization at a lower computation compared to multitask learning to help researchers and organizations with limited computation power. We also present a methodology for selecting background images and discuss potential future improvements. We apply our approach to several datasets and achieve improved generalization with much lower computation. Through the class activation mappings (CAMs) of the trained models, we observed the tendency towards looking at a bigger picture with the proposed model training methodology. Applying the vision transformer with the proposed background class, we receive state-of-the-art (SOTA) performance on STL-10, Caltech-101, and CINIC-10 datasets. Example scripts are available in the 'CAM' folder of the following GitHub Repository: github.com/dipuk0506/UQ
△ Less
Submitted 14 July, 2024; v1 submitted 4 May, 2023;
originally announced May 2023.
-
Uncertainty Aware Neural Network from Similarity and Sensitivity
Authors:
H M Dipu Kabir,
Subrota Kumar Mondal,
Sadia Khanam,
Abbas Khosravi,
Shafin Rahman,
Mohammad Reza Chalak Qazani,
Roohallah Alizadehsani,
Houshyar Asadi,
Shady Mohamed,
Saeid Nahavandi,
U Rajendra Acharya
Abstract:
Researchers have proposed several approaches for neural network (NN) based uncertainty quantification (UQ). However, most of the approaches are developed considering strong assumptions. Uncertainty quantification algorithms often perform poorly in an input domain and the reason for poor performance remains unknown. Therefore, we present a neural network training method that considers similar sampl…
▽ More
Researchers have proposed several approaches for neural network (NN) based uncertainty quantification (UQ). However, most of the approaches are developed considering strong assumptions. Uncertainty quantification algorithms often perform poorly in an input domain and the reason for poor performance remains unknown. Therefore, we present a neural network training method that considers similar samples with sensitivity awareness in this paper. In the proposed NN training method for UQ, first, we train a shallow NN for the point prediction. Then, we compute the absolute differences between prediction and targets and train another NN for predicting those absolute differences or absolute errors. Domains with high average absolute errors represent a high uncertainty. In the next step, we select each sample in the training set one by one and compute both prediction and error sensitivities. Then we select similar samples with sensitivity consideration and save indexes of similar samples. The ranges of an input parameter become narrower when the output is highly sensitive to that parameter. After that, we construct initial uncertainty bounds (UB) by considering the distribution of sensitivity aware similar samples. Prediction intervals (PIs) from initial uncertainty bounds are larger and cover more samples than required. Therefore, we train bound correction NN. As following all the steps for finding UB for each sample requires a lot of computation and memory access, we train a UB computation NN. The UB computation NN takes an input sample and provides an uncertainty bound. The UB computation NN is the final product of the proposed approach. Scripts of the proposed method are available in the following GitHub repository: github.com/dipuk0506/UQ
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
Event-by-event correlations between $Λ$ ($\barΛ$) hyperon global polarization and handedness with charged hadron azimuthal separation in Au+Au collisions at $\sqrt{s_{\text{NN}}} = 27 \text{ GeV}$ from STAR
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
D. M. Anderson,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
Global polarizations ($P$) of $Λ$ ($\barΛ$) hyperons have been observed in non-central heavy-ion collisions. The strong magnetic field primarily created by the spectator protons in such collisions would split the $Λ$ and $\barΛ$ global polarizations ($ΔP = P_Λ - P_{\barΛ} < 0$). Additionally, quantum chromodynamics (QCD) predicts topological charge fluctuations in vacuum, resulting in a chirality…
▽ More
Global polarizations ($P$) of $Λ$ ($\barΛ$) hyperons have been observed in non-central heavy-ion collisions. The strong magnetic field primarily created by the spectator protons in such collisions would split the $Λ$ and $\barΛ$ global polarizations ($ΔP = P_Λ - P_{\barΛ} < 0$). Additionally, quantum chromodynamics (QCD) predicts topological charge fluctuations in vacuum, resulting in a chirality imbalance or parity violation in a local domain. This would give rise to an imbalance ($Δn = \frac{N_{\text{L}} - N_{\text{R}}}{\langle N_{\text{L}} + N_{\text{R}} \rangle} \neq 0$) between left- and right-handed $Λ$ ($\barΛ$) as well as a charge separation along the magnetic field, referred to as the chiral magnetic effect (CME). This charge separation can be characterized by the parity-even azimuthal correlator ($Δγ$) and parity-odd azimuthal harmonic observable ($Δa_{1}$). Measurements of $ΔP$, $Δγ$, and $Δa_{1}$ have not led to definitive conclusions concerning the CME or the magnetic field, and $Δn$ has not been measured previously. Correlations among these observables may reveal new insights. This paper reports measurements of correlation between $Δn$ and $Δa_{1}$, which is sensitive to chirality fluctuations, and correlation between $ΔP$ and $Δγ$ sensitive to magnetic field in Au+Au collisions at 27 GeV. For both measurements, no correlations have been observed beyond statistical fluctuations.
△ Less
Submitted 22 July, 2023; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Interpretable Multi Labeled Bengali Toxic Comments Classification using Deep Learning
Authors:
Tanveer Ahmed Belal,
G. M. Shahariar,
Md. Hasanul Kabir
Abstract:
This paper presents a deep learning-based pipeline for categorizing Bengali toxic comments, in which at first a binary classification model is used to determine whether a comment is toxic or not, and then a multi-label classifier is employed to determine which toxicity type the comment belongs to. For this purpose, we have prepared a manually labeled dataset consisting of 16,073 instances among wh…
▽ More
This paper presents a deep learning-based pipeline for categorizing Bengali toxic comments, in which at first a binary classification model is used to determine whether a comment is toxic or not, and then a multi-label classifier is employed to determine which toxicity type the comment belongs to. For this purpose, we have prepared a manually labeled dataset consisting of 16,073 instances among which 8,488 are Toxic and any toxic comment may correspond to one or more of the six toxic categories - vulgar, hate, religious, threat, troll, and insult simultaneously. Long Short Term Memory (LSTM) with BERT Embedding achieved 89.42% accuracy for the binary classification task while as a multi-label classifier, a combination of Convolutional Neural Network and Bi-directional Long Short Term Memory (CNN-BiLSTM) with attention mechanism achieved 78.92% accuracy and 0.86 as weighted F1-score. To explain the predictions and interpret the word feature importance during classification by the proposed models, we utilized Local Interpretable Model-Agnostic Explanations (LIME) framework. We have made our dataset public and can be accessed at - https://github.com/deepu099cse/Multi-Labeled-Bengali-Toxic-Comments-Classification
△ Less
Submitted 8 April, 2023;
originally announced April 2023.