Skip to main content

Showing 1–50 of 95 results for author: Sultan, M

  1. arXiv:2410.08938  [pdf, other

    q-bio.QM cs.LG

    KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors

    Authors: Benson Chen, Tomasz Danel, Patrick J. McEnaney, Nikhil Jain, Kirill Novikov, Spurti Umesh Akki, Joshua L. Turnbull, Virja Atul Pandya, Boris P. Belotserkovskii, Jared Bryce Weaver, Ankita Biswas, Dat Nguyen, Gabriel H. S. Dreiman, Mohammad Sultan, Nathaniel Stanley, Daniel M Whalen, Divya Kanichar, Christoph Klein, Emily Fox, R. Edward Watts

    Abstract: DNA-Encoded Libraries (DEL) are combinatorial small molecule libraries that offer an efficient way to characterize diverse chemical spaces. Selection experiments using DELs are pivotal to drug discovery efforts, enabling high-throughput screens for hit finding. However, limited availability of public DEL datasets hinders the advancement of computational techniques designed to process such data. To… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  2. arXiv:2409.09595  [pdf, other

    hep-ph nucl-th

    Neutral pion to two-photons transition form factor revisited

    Authors: M. Atif Sultan, Jiayin Kang, Adnan Bashir, Lei Chang

    Abstract: Based upon a combined formalism of Schwinger-Dyson and Bethe-Salpeter equations in quantum chromodynamics (QCD), we propose a QCD kindred algebraic model for the dressed quark propagator, for the Bethe-Salpeter amplitude of the pion and the electromagnetic quark-photon interaction vertex. We then compute the $γ^{*}π^0γ$ transition form factor $G^{γ^{*}π^0γ}(Q^2)$ for a wide range of photon momentu… ▽ More

    Submitted 14 September, 2024; originally announced September 2024.

    Comments: 11 pages, 4 figures

  3. arXiv:2409.04996  [pdf, ps, other

    hep-ph

    Contact interaction treatment of $\mathcal{V}\to\mathcal{P}γ$ for light-quark mesons

    Authors: Yehan Xu, M. Atif Sultan, Khépani Raya, Lei Chang

    Abstract: The $\mathcal{V}\to\mathcal{P}γ$ and $η(η^\prime) \to γγ$ decays are evaluated within a Dyson-Schwinger and Bethe-Salpeter equations framework (here $\mathcal{V}=\{ρ^{\pm},K^{\star\pm},φ\}$ and $\mathcal{P}=\{π^{\pm},K^{\pm},η,η^{\prime}\}$). The so-called impulse approximation (IA) is employed in the computation of the decay constants involved and decay widths, and so in the estimation of the ass… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 9 pages

  4. arXiv:2408.11879  [pdf, other

    cs.CL cs.AI cs.LG

    Beyond Labels: Aligning Large Language Models with Human-like Reasoning

    Authors: Muhammad Rafsan Kabir, Rafeed Mohammad Sultan, Ihsanul Haque Asif, Jawad Ibn Ahad, Fuad Rahman, Mohammad Ruhul Amin, Nabeel Mohammed, Shafin Rahman

    Abstract: Aligning large language models (LLMs) with a human reasoning approach ensures that LLMs produce morally correct and human-like decisions. Ethical concerns are raised because current models are prone to generating false positives and providing malicious responses. To contribute to this issue, we have curated an ethics dataset named Dataset for Aligning Reasons (DFAR), designed to aid in aligning la… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: Accepted in ICPR 2024

  5. arXiv:2408.08388  [pdf, other

    stat.ML cs.LG stat.ME

    Classification of High-dimensional Time Series in Spectral Domain using Explainable Features

    Authors: Sarbojit Roy, Malik Shahid Sultan, Hernando Ombao

    Abstract: Interpretable classification of time series presents significant challenges in high dimensions. Traditional feature selection methods in the frequency domain often assume sparsity in spectral density matrices (SDMs) or their inverses, which can be restrictive for real-world applications. In this article, we propose a model-based approach for classifying high-dimensional stationary time series by a… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  6. arXiv:2407.10437  [pdf, other

    hep-ph

    Gravitational form factors of pseudoscalar mesons in a contact interaction

    Authors: M. Atif Sultan, Zanbin Xing, Khépani Raya, Adnan Bashir, Lei Chang

    Abstract: Given the unique role played by the gravitational form factors (GFFs) in unraveling the internal mechanics of hadrons, we examine the GFFs of ground state pseudoscalar mesons $π$, $η_c$, $η_b$ and the hypothetical {\em strangeonium} $η_s(s\bar{s})$. We adopt the coupled framework of Dyson-Schwinger and Bethe-Salpeter equations within a contact interaction, and employ a novel approach to the dresse… ▽ More

    Submitted 29 August, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures

  7. arXiv:2407.07254  [pdf, other

    eess.IV cs.CV

    HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment

    Authors: K M Arefeen Sultan, Md Hasibul Husain Hisham, Benjamin Orkild, Alan Morris, Eugene Kholmovski, Erik Bieging, Eugene Kwan, Ravi Ranjan, Ed DiBella, Shireen Elhabian

    Abstract: The accurate evaluation of left atrial fibrosis via high-quality 3D Late Gadolinium Enhancement (LGE) MRI is crucial for atrial fibrillation management but is hindered by factors like patient movement and imaging variability. The pursuit of automated LGE MRI quality assessment is critical for enhancing diagnostic accuracy, standardizing evaluations, and improving patient outcomes. The deep learnin… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted to MICCAI2024, 10 pages, 2 figures

  8. arXiv:2407.06062  [pdf, ps, other

    gr-qc

    Stellar structure in $f(R,T)$ gravity: some exact solutions

    Authors: Aliya Batool, Abdul Malik Sultan, Gonzalo J. Olmo, Diego Rubiera-Garcia

    Abstract: We find some exact solutions for constant-density and quark matter equations of state in stellar structure models framed within the $f(R,T)=R+λκ^2 T$ theory of gravity, where $R$ is the curvature scalar, $T$ the trace of the stress-energy tensor, and $λ$ some constant. These solutions correspond to specific values of the constant $λ$, and represent different compactness states of the corresponding… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 6 pages, 1 figure, revtex style

  9. arXiv:2406.15657  [pdf, other

    cs.IR

    FIRST: Faster Improved Listwise Reranking with Single Token Decoding

    Authors: Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji

    Abstract: Large Language Models (LLMs) have significantly advanced the field of information retrieval, particularly for reranking. Listwise LLM rerankers have showcased superior performance and generalizability compared to existing supervised approaches. However, conventional listwise LLM reranking methods lack efficiency as they provide ranking output in the form of a generated ordered sequence of candidat… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Preprint

  10. arXiv:2406.11706  [pdf, other

    cs.IR cs.CL cs.LG

    Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels

    Authors: Jasper Xian, Saron Samuel, Faraz Khoubsirat, Ronak Pradeep, Md Arafat Sultan, Radu Florian, Salim Roukos, Avirup Sil, Christopher Potts, Omar Khattab

    Abstract: We develop a method for training small-scale (under 100M parameter) neural information retrieval models with as few as 10 gold relevance labels. The method depends on generating synthetic queries for documents using a language model (LM), and the key step is that we automatically optimize the LM prompt that is used to generate these queries based on training quality. In experiments with the BIRCO… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  11. arXiv:2406.06779  [pdf, other

    physics.ins-det

    Synergistic Sensing: Application of SiNWs-PANI:MO$_x$ Heterostructures for Human Respiratory Monitoring

    Authors: M. T. Sultan, A. Dumitru, E. A. Fakhri, R. E. Brophy, S. T. Ingvarsson, A. Manolescu, H. G. Svavarsson

    Abstract: In this study we investigate novel hybrid structure of silicon nanowires (SiNWs) coated with PANI:metaloxide(MO$_x$) nanoparticles i.e., WO$_3$ and TiO$_2$. The SiNWs were fabricated using MACE, whereas PANI:MO$_x$ were deposited using chemical oxidative polymerization method on SiNWs. To this date little attempts has been done to utilize such hybrid structures for respiratory sensing. The structu… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 figures, 16 pages, 44 references

  12. Automating Patch Set Generation from Code Review Comments Using Large Language Models

    Authors: Tajmilur Rahman, Rahul Singh, Mir Yousuf Sultan

    Abstract: The advent of Large Language Models (LLMs) has revolutionized various domains of artificial intelligence, including the realm of software engineering. In this research, we evaluate the efficacy of pre-trained LLMs in replicating the tasks traditionally performed by developers in response to code review comments. We provide code contexts to five popular LLMs and obtain the suggested code-changes (p… ▽ More

    Submitted 9 April, 2024; originally announced June 2024.

    Comments: 2 pages

  13. arXiv:2403.00827  [pdf, other

    cs.CL cs.AI cs.LG

    Self-Refinement of Language Models from External Proxy Metrics Feedback

    Authors: Keshav Ramji, Young-Suk Lee, Ramón Fernandez Astudillo, Md Arafat Sultan, Tahira Naseem, Asim Munawar, Radu Florian, Salim Roukos

    Abstract: It is often desirable for Large Language Models (LLMs) to capture multiple objectives when providing a response. In document-grounded response generation, for example, agent responses are expected to be relevant to a user's query while also being grounded in a given document. In this paper, we introduce Proxy Metric-based Self-Refinement (ProMiSe), which enables an LLM to refine its own initial re… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

  14. arXiv:2402.11770  [pdf, other

    cs.CL

    Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations

    Authors: Md Arafat Sultan, Jatin Ganhotra, Ramón Fernandez Astudillo

    Abstract: We introduce a structured chain-of-thought (SCoT) prompting approach to generating content-grounded multi-turn question-answer conversations using a pre-trained large language model (LLM). At the core of our proposal is a structured breakdown of the complex task into a number of states in a state machine, so that actions corresponding to various subtasks, e.g., content reading and utterance genera… ▽ More

    Submitted 19 February, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  15. arXiv:2401.06356  [pdf, other

    cs.LG

    An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation

    Authors: Md Arafat Sultan, Aashka Trivedi, Parul Awasthy, Avirup Sil

    Abstract: We present a large-scale empirical study of how choices of configuration parameters affect performance in knowledge distillation (KD). An example of such a KD parameter is the measure of distance between the predictions of the teacher and the student, common choices for which include the mean squared error (MSE) and the KL-divergence. Although scattered efforts have been made to understand the dif… ▽ More

    Submitted 18 February, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  16. arXiv:2401.03169  [pdf, other

    hep-ph

    QCD anomalies in electromagnetic processes: A solution to the $γ\to3π$ puzzle

    Authors: Zanbin Xing, Hao Dang, M. Atif Sultan, Khépani Raya, Lei Chang

    Abstract: In this work, the $γ\to3π$ form factor is calculated within the Dyson-Schwinger equations framework using a contact interaction model within the so-called modified rainbow ladder truncation. The present calculation takes into account the pseudovector component in the pion Bethe-Salpeter amplitude (BSA) and $π-π$ scattering effects, producing a $γ\to3π$ anomaly which is $1+6\mathcal{R}_π^2$ larger… ▽ More

    Submitted 11 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: 10 pages, 3 figures, references added

  17. arXiv:2312.00953  [pdf, other

    eess.IV

    Deep Image prior with StruCtUred Sparsity (DISCUS) for dynamic MRI reconstruction

    Authors: Muhammad A. Sultan, Chong Chen, Yingmin Liu, Xuan Lei, Rizwan Ahmad

    Abstract: High-quality training data are not always available in dynamic MRI. To address this, we propose a self-supervised deep learning method called deep image prior with structured sparsity (DISCUS) for reconstructing dynamic images. DISCUS is inspired by deep image prior (DIP) and recovers a series of images through joint optimization of network parameters and input code vectors. However, DISCUS additi… ▽ More

    Submitted 24 May, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: To appear in 2024 ISBI

  18. arXiv:2312.00936  [pdf, other

    eess.IV

    Surface Coil Intensity Correction for MRI

    Authors: Xuan Lei, Philip Schniter, Chong Chen, Muhammad A. Sultan, Rizwan Ahmad

    Abstract: Modern MRI scanners utilize one or more arrays of small receive-only coils to collect k-space data. The sensitivity maps of the coils, when estimated using traditional methods, differ from the true sensitivity maps, which are generally unknown. Consequently, the reconstructed MR images exhibit undesired spatial variation in intensity. These intensity variations can be at least partially corrected… ▽ More

    Submitted 24 May, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

  19. arXiv:2311.08640  [pdf, other

    cs.CL cs.LG

    Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation

    Authors: Jiachen Zhao, Wenlong Zhao, Andrew Drozdov, Benjamin Rozonoyer, Md Arafat Sultan, Jay-Yoon Lee, Mohit Iyyer, Andrew McCallum

    Abstract: We study semi-supervised sequence generation tasks, where the few labeled examples are too scarce to finetune a model, and meanwhile, few-shot prompted large language models (LLMs) exhibit room for improvement. In this paper, we present the discovery that a student model distilled from a few-shot prompted LLM can commonly generalize better than its teacher to unseen examples on such tasks. We find… ▽ More

    Submitted 3 August, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: ACL 2024

  20. arXiv:2310.13961  [pdf, other

    cs.CL cs.AI

    Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs

    Authors: Young-Suk Lee, Md Arafat Sultan, Yousef El-Kurdi, Tahira Naseem Asim Munawar, Radu Florian, Salim Roukos, Ramón Fernandez Astudillo

    Abstract: Using in-context learning (ICL) for data generation, techniques such as Self-Instruct (Wang et al., 2023) or the follow-up Alpaca (Taori et al., 2023) can train strong conversational agents with only a small amount of human supervision. One limitation of these approaches is that they resort to very large language models (around 175B parameters) that are also proprietary and non-public. Here we exp… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Journal ref: EMNLP 2023

  21. arXiv:2310.13769  [pdf, other

    q-bio.QM stat.ML

    Compositional Deep Probabilistic Models of DNA Encoded Libraries

    Authors: Benson Chen, Mohammad M. Sultan, Theofanis Karaletsos

    Abstract: DNA-Encoded Library (DEL) has proven to be a powerful tool that utilizes combinatorially constructed small molecules to facilitate highly-efficient screening assays. These selection experiments, involving multiple stages of washing, elution, and identification of potent binders via unique DNA barcodes, often generate complex data. This complexity can potentially mask the underlying signals, necess… ▽ More

    Submitted 13 February, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  22. Two-Stage Deep Learning Framework for Quality Assessment of Left Atrial Late Gadolinium Enhanced MRI Images

    Authors: K M Arefeen Sultan, Benjamin Orkild, Alan Morris, Eugene Kholmovski, Erik Bieging, Eugene Kwan, Ravi Ranjan, Ed DiBella, Shireen Elhabian

    Abstract: Accurate assessment of left atrial fibrosis in patients with atrial fibrillation relies on high-quality 3D late gadolinium enhancement (LGE) MRI images. However, obtaining such images is challenging due to patient motion, changing breathing patterns, or sub-optimal choice of pulse sequence parameters. Automated assessment of LGE-MRI image diagnostic quality is clinically significant as it would en… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted to STACOM 2023. 11 pages, 3 figures

  23. arXiv:2307.16275  [pdf, other

    cs.CV

    Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation

    Authors: Md Nurul Muttakin, Malik Shahid Sultan, Robert Hoehndorf, Hernando Ombao

    Abstract: Generative Adversarial Networks are used for generating the data using a generator and a discriminator, GANs usually produce high-quality images, but training GANs in an adversarial setting is a difficult task. GANs require high computation power and hyper-parameter regularization for converging. Projected GANs tackle the training difficulty of GANs by using transfer learning to project the genera… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: We present a new architecture for generating realistic images by combining mapping network of Style GANs and Projected GANs

  24. arXiv:2306.04307  [pdf, other

    hep-ph

    The chiral anomaly and the pion transition form factor: beyond the cutoff

    Authors: Hao Dang, Zanbin Xing, M. Atif Sultan, Khépani Raya, Lei Chang

    Abstract: In the presence of a momentum cutoff, effective theories seem unable to faithfully reproduce the so called chiral anomaly in the Standard Model. A novel prospect to overcome this related issue is discussed herein via the calculation of the $γ^{*}π^0γ$ transition form factor, $G^{γ^* π^0 γ}(Q^2)$, whose normalization is intimately connected with the chiral anomaly and dynamical chiral symmetry brea… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 9 pages, 2 figures

  25. arXiv:2305.11744  [pdf, other

    cs.IR cs.CL

    ReFIT: Relevance Feedback from a Reranker during Inference

    Authors: Revanth Gangi Reddy, Pradeep Dasigi, Md Arafat Sultan, Arman Cohan, Avirup Sil, Heng Ji, Hannaneh Hajishirzi

    Abstract: Retrieve-and-rerank is a prevalent framework in neural information retrieval, wherein a bi-encoder network initially retrieves a pre-defined number of candidates (e.g., K=100), which are then reranked by a more powerful cross-encoder model. While the reranker often yields improved candidate scores compared to the retriever, its scope is confined to only the top K retrieved candidates. As a result,… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Preprint

  26. An Intrusion Detection Mechanism for MANETs Based on Deep Learning Artificial Neural Networks (ANNs)

    Authors: Mohamad T Sultan, Hesham El Sayed, Manzoor Ahmed Khan

    Abstract: Mobile Ad-hoc Network (MANET) is a distributed, decentralized network of wireless portable nodes connecting directly without any fixed communication base station or centralized administration. Nodes in MANET move continuously in random directions and follow an arbitrary manner, which presents numerous challenges to these networks and make them more susceptible to different security threats. Due to… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  27. arXiv:2303.00807  [pdf, other

    cs.IR cs.CL

    UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

    Authors: Jon Saad-Falcon, Omar Khattab, Keshav Santhanam, Radu Florian, Martin Franz, Salim Roukos, Avirup Sil, Md Arafat Sultan, Christopher Potts

    Abstract: Many information retrieval tasks require large labeled datasets for fine-tuning. However, such datasets are often unavailable, and their utility for real-world applications can diminish quickly due to domain shifts. To address this challenge, we develop and motivate a method for using large language models (LLMs) to generate large numbers of synthetic queries cheaply. The method begins by generati… ▽ More

    Submitted 13 October, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Long Paper at Empirical Methods in Natural Language Processing (EMNLP) 2023

  28. arXiv:2302.08970  [pdf, other

    physics.ins-det physics.med-ph

    Ge coated silicon nanowires as human respiratory sensing device

    Authors: E. Fakhri, M. T. Sultan, A. Manolescu, S. Ingvarsson, H. G. Svavarsson

    Abstract: We report on Ge coated silicon nanowires (SiNWs) sensors synthesized with metal assisted chemical etching and qualify their functionality as human respiratory sensor. The sensors were made from p-type single-crystalline (100) silicon wafers using a silver catalysed top-down etching, afterwards coated by 50 nm Ge thin layer using a magnetron sputtering. The Ge post-treatment were performed by rapid… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: International Semiconductor Conference - CAS 2022, Romania

  29. arXiv:2301.12609  [pdf, other

    cs.LG cs.CL

    Knowledge Distillation $\approx$ Label Smoothing: Fact or Fallacy?

    Authors: Md Arafat Sultan

    Abstract: Originally proposed as a method for knowledge transfer from one model to another, some recent studies have suggested that knowledge distillation (KD) is in fact a form of regularization. Perhaps the strongest argument of all for this new perspective comes from its apparent similarities with label smoothing (LS). Here we re-examine this stated equivalence between the two methods by comparing the pr… ▽ More

    Submitted 24 October, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: EMNLP 2023

  30. arXiv:2301.09715  [pdf, other

    cs.CL cs.IR cs.LG

    PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development

    Authors: Avirup Sil, Jaydeep Sen, Bhavani Iyer, Martin Franz, Kshitij Fadnis, Mihaela Bornea, Sara Rosenthal, Scott McCarley, Rong Zhang, Vishwajeet Kumar, Yulong Li, Md Arafat Sultan, Riyaz Bhat, Radu Florian, Salim Roukos

    Abstract: The field of Question Answering (QA) has made remarkable progress in recent years, thanks to the advent of large pre-trained language models, newer realistic benchmark datasets with leaderboards, and novel algorithms for key components such as retrievers and readers. In this paper, we introduce PRIMEQA: a one-stop and open-source QA repository with an aim to democratize QA re-search and facilitate… ▽ More

    Submitted 25 January, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

  31. arXiv:2212.01340  [pdf, other

    cs.IR cs.CL

    Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

    Authors: Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts

    Abstract: Neural information retrieval (IR) systems have progressed rapidly in recent years, in large part due to the release of publicly available benchmarking tasks. Unfortunately, some dimensions of this progress are illusory: the majority of the popular IR benchmarks today focus exclusively on downstream task accuracy and thus conceal the costs incurred by systems that trade away efficiency for quality.… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  32. arXiv:2212.00136  [pdf, other

    q-bio.QM cs.LG

    DEL-Dock: Molecular Docking-Enabled Modeling of DNA-Encoded Libraries

    Authors: Kirill Shmilovich, Benson Chen, Theofanis Karaletsos, Mohammad M. Sultan

    Abstract: DNA-Encoded Library (DEL) technology has enabled significant advances in hit identification by enabling efficient testing of combinatorially-generated molecular libraries. DEL screens measure protein binding affinity though sequencing reads of molecules tagged with unique DNA-barcodes that survive a series of selection experiments. Computational models have been deployed to learn the latent bindin… ▽ More

    Submitted 14 December, 2022; v1 submitted 30 November, 2022; originally announced December 2022.

  33. arXiv:2211.16634  [pdf, other

    cs.CL cs.AI cs.LG

    SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers

    Authors: Ameet Deshpande, Md Arafat Sultan, Anthony Ferritto, Ashwin Kalyan, Karthik Narasimhan, Avirup Sil

    Abstract: Fine-tuning pre-trained language models (PLMs) achieves impressive performance on a range of downstream tasks, and their sizes have consequently been getting bigger. Since a different copy of the model is required for each task, this paradigm is infeasible for storage-constrained edge devices like mobile phones. In this paper, we propose SPARTAN, a parameter efficient (PE) and computationally fast… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  34. arXiv:2210.01792  [pdf, other

    cs.LG cs.DC

    Sampling Streaming Data with Parallel Vector Quantization -- PVQ

    Authors: Mujahid Sultan

    Abstract: Accumulation of corporate data in the cloud has attracted more enterprise applications to the cloud creating data gravity. As a consequence, network traffic has become more cloud centric. This increase in cloud centric traffic poses new challenges in designing learning systems for streaming data due to class imbalance. The number of classes plays a vital role in the accuracy of the classifiers bui… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: 9 pages

    MSC Class: F.2.2; I.2.7

  35. arXiv:2209.12991  [pdf

    physics.ins-det hep-ex

    Quality Control (QC) of FBK Preproduction 3D Si Sensors for ATLAS HL-LHC Upgrades

    Authors: D M S Sultan, Md Arif Abdulla Samy, J. X. Ye, M. Boscardin, F. Ficorella, S. Ronchin, G. -F. Dalla Betta

    Abstract: The challenging demands of the ATLAS High Luminosity (HL-LHC) Upgrade aim for a complete swap of new generation sensors that should cope with the ultimate radiation hardness. FBK has been one of the prime foundries to develop and fabricate such radiation-hard 3D silicon (Si) sensors. These sensors are chosen to be deployed into the innermost layer of the ATLAS Inner Tracker (ITk). Recently, a pre-… ▽ More

    Submitted 28 September, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: 8 pages, prepared for iWoRiD 2022 Proceeding

  36. arXiv:2209.03607  [pdf, ps, other

    physics.ins-det hep-ex

    Solid State Detectors and Tracking for Snowmass

    Authors: A. Affolder, A. Apresyan, S. Worm, M. Albrow, D. Ally, D. Ambrose, E. Anderssen, N. Apadula, P. Asenov, W. Armstrong, M. Artuso, A. Barbier, P. Barletta, L. Bauerdick, D. Berry, M. Bomben, M. Boscardin, J. Brau, W. Brooks, M. Breidenbach, J. Buckley, V. Cairo, R. Caputo, L. Carpenter, M. Centis-Vignali , et al. (110 additional authors not shown)

    Abstract: Tracking detectors are of vital importance for collider-based high energy physics (HEP) experiments. The primary purpose of tracking detectors is the precise reconstruction of charged particle trajectories and the reconstruction of secondary vertices. The performance requirements from the community posed by the future collider experiments require an evolution of tracking systems, necessitating the… ▽ More

    Submitted 19 October, 2022; v1 submitted 8 September, 2022; originally announced September 2022.

    Comments: for the Snowmass Instrumentation Frontier Solid State Detector and Tracking community

  37. arXiv:2208.03703  [pdf, other

    stat.ML cs.LG

    Granger Causality using Neural Networks

    Authors: Malik Shahid Sultan, Samuel Horvath, Hernando Ombao

    Abstract: Dependence between nodes in a network is an important concept that pervades many areas including finance, politics, sociology, genomics and the brain sciences. One way to characterize dependence between components of a multivariate time series data is via Granger Causality (GC). Standard traditional approaches to GC estimation / inference commonly assume linear dynamics, however such simplificatio… ▽ More

    Submitted 7 August, 2024; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: To be Submitted to a Journal work Presented at JSM. arXiv admin note: text overlap with arXiv:1802.05842 by other authors

  38. arXiv:2206.12615  [pdf

    cs.NI cs.PF

    Effects of MAC Parameters on the Performance of IEEE 802.11 DCF in NS-3

    Authors: Md. Abubakar Siddik, Jakia Akter Nitu, Natasha Islam, Most. Anju Ara Hasi, Jannatun Ferdous, Md. Mizanur Rahman, Md. Nahid Sultan

    Abstract: This paper presents the design procedure of the NS-3 script for WLAN that is organized according to the hierarchical manner of TCP/IP model. We configure all layers by using NS-3 model objects and set and modify the values used by objects to investigate the effects of MAC parameters (access mechanism, CWmin, CWmax and retry limit) on the performance metrics viz. packet delivery ratio, packet lost… ▽ More

    Submitted 25 June, 2022; originally announced June 2022.

    Comments: 20 pages

  39. arXiv:2206.08441  [pdf, other

    cs.CL

    GAAMA 2.0: An Integrated System that Answers Boolean and Extractive Questions

    Authors: Scott McCarley, Mihaela Bornea, Sara Rosenthal, Anthony Ferritto, Md Arafat Sultan, Avirup Sil, Radu Florian

    Abstract: Recent machine reading comprehension datasets include extractive and boolean questions but current approaches do not offer integrated support for answering both question types. We present a multilingual machine reading comprehension system and front-end demo that handles boolean questions by providing both a YES/NO answer and highlighting supporting evidence, and handles extractive questions by hi… ▽ More

    Submitted 21 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

  40. arXiv:2206.05006  [pdf, other

    cond-mat.mtrl-sci

    Structure and electrical behavior of silicon nanowires prepared by MACE process

    Authors: R. Plugaru, E. Fakhri, C. Romanitan, I. Mihalache, G. Craciun, N. Plugaru, H. Ö. Árnason, M. T. Sultan, G. A. Nemnes, S. Ingvarsson, H. G. Svavarsson, A. Manolescu

    Abstract: We report on the structure and electrical characteristics of silicon nanowire arrays prepared by metal assisted chemical etching (MACE) method, investigated by cross-sectional scanning electron microscopy (SEM) and high resolution X-ray diffraction (HR-XRD) methods. SEM micrographs show arrays of merged parallel nanowires, with lengths of 700 nm and 1000 nm, resulted after 1.5 min and 5 min etchin… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 23 pages 18 figures

    Journal ref: Surfaces and Interfaces Volume 33, October 2022, 102167

  41. arXiv:2206.04991  [pdf, other

    physics.app-ph

    Piezoresistance characterization of silicon nanowires in uniaxial and isostatic pressure variation

    Authors: Elham Fakhri, Rodica Plugaru, Muhammad Taha Sultan, Thorsteinn Hanning Kristinsson, Hákon Örn Árnason, Neculai Plugaru, Andrei Manolescu, Snorri Ingvarsson, Halldor Gudfinnur Svavarsson

    Abstract: Silicon nanowires (SiNWs) are known to exhibit large piezoresistance (PZR) effect, making it suitable for various sensing applications. Here, we report the results of a PZR investigation on randomly distributed and interconnected vertical silicon nanowire arrays as a pressure sensor. The samples were produced from p-type (100) Si wafers using a silver catalysed top-down etching process. The piezor… ▽ More

    Submitted 23 June, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: 7 pages 10 figures

    Journal ref: Sensors 2022, 22(17), 6340

  42. arXiv:2205.07257  [pdf, other

    cs.CL cs.LG

    Not to Overfit or Underfit the Source Domains? An Empirical Study of Domain Generalization in Question Answering

    Authors: Md Arafat Sultan, Avirup Sil, Radu Florian

    Abstract: Machine learning models are prone to overfitting their training (source) domains, which is commonly believed to be the reason why they falter in novel target domains. Here we examine the contrasting view that multi-source domain generalization (DG) is first and foremost a problem of mitigating source domain underfitting: models not adequately learning the signal already present in their multi-doma… ▽ More

    Submitted 24 October, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: Accepted at EMNLP 2022

  43. Entity-Conditioned Question Generation for Robust Attention Distribution in Neural Information Retrieval

    Authors: Revanth Gangi Reddy, Md Arafat Sultan, Martin Franz, Avirup Sil, Heng Ji

    Abstract: We show that supervised neural information retrieval (IR) models are prone to learning sparse attention patterns over passage tokens, which can result in key phrases including named entities receiving low attention weights, eventually leading to model under-performance. Using a novel targeted synthetic data generation method that identifies poorly attended entities and conditions the generation ep… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: Published at SIGIR 2022

  44. arXiv:2204.09248  [pdf, ps, other

    cs.CL cs.IR

    Synthetic Target Domain Supervision for Open Retrieval QA

    Authors: Revanth Gangi Reddy, Bhavani Iyer, Md Arafat Sultan, Rong Zhang, Avirup Sil, Vittorio Castelli, Radu Florian, Salim Roukos

    Abstract: Neural passage retrieval is a new and promising approach in open retrieval question answering. In this work, we stress-test the Dense Passage Retriever (DPR) -- a state-of-the-art (SOTA) open domain neural retrieval model -- on closed and specialized target domains such as COVID-19, and find that it lags behind standard BM25 in this important real-world setting. To make DPR more robust under domai… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: Published at SIGIR 2021

  45. arXiv:2202.11828  [pdf, other

    physics.ins-det hep-ex

    Novel Sensors for Particle Tracking: a Contribution to the Snowmass Community Planning Exercise of 2021

    Authors: M. R. Hoeferkamp, S. Seidel, S. Kim, J. Metcalfe, A. Sumant, H. Kagan, W. Trischuk, M. Boscardin, G. -F. Dalla Betta, D. M. S. Sultan, N. T. Fourches, C. Renard, A. Barbier, T. Mahajan, A. Minns, V. Tokranov, M. Yakimov, S. Oktyabrsky, C. Gingu, P. Murat, M. T. Hedges

    Abstract: Five contemporary technologies are discussed in the context of their potential roles in particle tracking for future high energy physics applications. These include sensors of the 3D configuration, in both diamond and silicon, submicron-dimension pixels, thin film detectors, and scintillating quantum dots in gallium arsenide. Drivers of the technologies include radiation hardness, excellent positi… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 15 pages, 6 figures

  46. arXiv:2112.08185  [pdf, other

    cs.CL cs.AI

    Learning Cross-Lingual IR from an English Retriever

    Authors: Yulong Li, Martin Franz, Md Arafat Sultan, Bhavani Iyer, Young-Suk Lee, Avirup Sil

    Abstract: We present DR.DECR (Dense Retrieval with Distillation-Enhanced Cross-Lingual Representation), a new cross-lingual information retrieval (CLIR) system trained using multi-stage knowledge distillation (KD). The teacher of DR.DECR relies on a highly effective but computationally expensive two-stage inference process consisting of query translation and monolingual IR, while the student, DR.DECR, execu… ▽ More

    Submitted 31 July, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Presented at NAACL 2022 main conference Code can be found at: https://github.com/primeqa/primeqa

  47. arXiv:2104.07800  [pdf, other

    cs.CL cs.AI cs.IR

    Towards Robust Neural Retrieval Models with Synthetic Pre-Training

    Authors: Revanth Gangi Reddy, Vikas Yadav, Md Arafat Sultan, Martin Franz, Vittorio Castelli, Heng Ji, Avirup Sil

    Abstract: Recent work has shown that commonly available machine reading comprehension (MRC) datasets can be used to train high-performance neural information retrieval (IR) systems. However, the evaluation of neural IR has so far been limited to standard supervised learning settings, where they have outperformed traditional term matching baselines. We conduct in-domain and out-of-domain evaluations of neura… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  48. toon2real: Translating Cartoon Images to Realistic Images

    Authors: K. M. Arefeen Sultan, Mohammad Imrul Jubair, MD. Nahidul Islam, Sayed Hossain Khan

    Abstract: In terms of Image-to-image translation, Generative Adversarial Networks (GANs) has achieved great success even when it is used in the unsupervised dataset. In this work, we aim to translate cartoon images to photo-realistic images using GAN. We apply several state-of-the-art models to perform this task; however, they fail to perform good quality translations. We observe that the shallow difference… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted as a short paper at ICTAI 2020

  49. arXiv:2012.01414  [pdf, other

    cs.CL cs.AI cs.IR

    End-to-End QA on COVID-19: Domain Adaptation with Synthetic Training

    Authors: Revanth Gangi Reddy, Bhavani Iyer, Md Arafat Sultan, Rong Zhang, Avi Sil, Vittorio Castelli, Radu Florian, Salim Roukos

    Abstract: End-to-end question answering (QA) requires both information retrieval (IR) over a large document collection and machine reading comprehension (MRC) on the retrieved passages. Recent work has successfully trained neural IR systems using only supervised question answering (QA) examples from open-domain datasets. However, despite impressive performance on Wikipedia, neural IR lags behind traditional… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: Preprint

  50. arXiv:2011.03435  [pdf, other

    cs.CL cs.AI cs.LG

    Answer Span Correction in Machine Reading Comprehension

    Authors: Revanth Gangi Reddy, Md Arafat Sultan, Efsun Sarioglu Kayi, Rong Zhang, Vittorio Castelli, Avirup Sil

    Abstract: Answer validation in machine reading comprehension (MRC) consists of verifying an extracted answer against an input context and question pair. Previous work has looked at re-assessing the "answerability" of the question given the extracted answer. Here we address a different problem: the tendency of existing MRC systems to produce partially correct answers when presented with answerable questions.… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: Accepted in Findings of EMNLP 2020