subscribe to arXiv mailings

arXiv:2409.14583 [pdf]

Evaluating Gender, Racial, and Age Biases in Large Language Models: A Comparative Analysis of Occupational and Crime Scenarios

Authors: Vishal Mirza, Rahul Kulkarni, Aakanksha Jadhav

Abstract: Recent advancements in Large Language Models(LLMs) have been notable, yet widespread enterprise adoption remains limited due to various constraints. This paper examines bias in LLMs-a crucial issue affecting their usability, reliability, and fairness. Researchers are developing strategies to mitigate bias, including debiasing layers, specialized reference datasets like Winogender and Winobias, and… ▽ More Recent advancements in Large Language Models(LLMs) have been notable, yet widespread enterprise adoption remains limited due to various constraints. This paper examines bias in LLMs-a crucial issue affecting their usability, reliability, and fairness. Researchers are developing strategies to mitigate bias, including debiasing layers, specialized reference datasets like Winogender and Winobias, and reinforcement learning with human feedback (RLHF). These techniques have been integrated into the latest LLMs. Our study evaluates gender bias in occupational scenarios and gender, age, and racial bias in crime scenarios across four leading LLMs released in 2024: Gemini 1.5 Pro, Llama 3 70B, Claude 3 Opus, and GPT-4o. Findings reveal that LLMs often depict female characters more frequently than male ones in various occupations, showing a 37% deviation from US BLS data. In crime scenarios, deviations from US FBI data are 54% for gender, 28% for race, and 17% for age. We observe that efforts to reduce gender and racial bias often lead to outcomes that may over-index one sub-class, potentially exacerbating the issue. These results highlight the limitations of current bias mitigation techniques and underscore the need for more effective approaches. △ Less

Submitted 18 October, 2024; v1 submitted 22 September, 2024; originally announced September 2024.

Comments: 11 pages, 17 figures

arXiv:2407.15124 [pdf, other]

Chemical Reaction Extraction from Long Patent Documents

Authors: Aishwarya Jadhav, Ritam Dutt

Abstract: The task of searching through patent documents is crucial for chemical patent recommendation and retrieval. This can be enhanced by creating a patent knowledge base (ChemPatKB) to aid in prior art searches and to provide a platform for domain experts to explore new innovations in chemical compound synthesis and use-cases. An essential foundational component of this KB is the extraction of importan… ▽ More The task of searching through patent documents is crucial for chemical patent recommendation and retrieval. This can be enhanced by creating a patent knowledge base (ChemPatKB) to aid in prior art searches and to provide a platform for domain experts to explore new innovations in chemical compound synthesis and use-cases. An essential foundational component of this KB is the extraction of important reaction snippets from long patents documents which facilitates multiple downstream tasks such as reaction co-reference resolution and chemical entity role identification. In this work, we explore the problem of extracting reactions spans from chemical patents in order to create a reactions resource database. We formulate this task as a paragraph-level sequence tagging problem, where the system is required to return a sequence of paragraphs that contain a description of a reaction. We propose several approaches and modifications of the baseline models and study how different methods generalize across different domains of chemical patents. △ Less

Submitted 23 July, 2024; v1 submitted 21 July, 2024; originally announced July 2024.

Comments: Work completed in 2022 at Carnegie Mellon University

arXiv:2406.12698 [pdf, other]

Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly

Authors: Siddhant Shete, Dennis Mronga, Ankita Jadhav, Frank Kirchner

Abstract: Anomaly detection deals with detecting deviations from established patterns within data. It has various applications like autonomous driving, predictive maintenance, and medical diagnosis. To improve anomaly detection accuracy, transfer learning can be applied to large, pre-trained models and adapt them to the specific application context. In this paper, we propose a novel framework for online-ada… ▽ More Anomaly detection deals with detecting deviations from established patterns within data. It has various applications like autonomous driving, predictive maintenance, and medical diagnosis. To improve anomaly detection accuracy, transfer learning can be applied to large, pre-trained models and adapt them to the specific application context. In this paper, we propose a novel framework for online-adaptive anomaly detection using transfer learning. The approach adapts to different environments by selecting visually similar training images and online fitting a normality model to EfficientNet features extracted from the training subset. Anomaly detection is then performed by computing the Mahalanobis distance between the normality model and the test image features. Different similarity measures (SIFT/FLANN, Cosine) and normality models (MVG, OCSVM) are employed and compared with each other. We evaluate the approach on different anomaly detection benchmarks and data collected in controlled laboratory settings. Experimental results showcase a detection accuracy exceeding 0.975, outperforming the state-of-the-art ET-NET approach. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: This is preprint for the accepted paper

arXiv:2311.12014 [pdf, other]

doi 10.1007/s10909-024-03141-x

MMC Array to Study X-ray Transitions in Muonic Atoms

Authors: Daniel Unger, Andreas Abeln, Thomas Elias Cocolios, Ofir Eizenberg, Christian Enss, Andreas Fleischmann, Loredana Gastaldo, César Godinho, Michael Heines, Daniel Hengstler, Paul Indelicato, Ashish Jadhav, Daniel Kreuzberger, Klaus Kirch, Andreas Knecht, Jorge Machado, Ben Ohayon, Nancy Paul, Randolf Pohl, Katharina von Schoeler, Stergiani Marina Vogiatzi, Frederik Wauters

Abstract: The QUARTET collaboration aims to significantly improve the precision of the absolute nuclear charge radii of light nuclei from Li to Ne by using an array of metallic magnetic calorimeters to perform high-precision X-ray spectroscopy of low-lying states in muonic atoms. A proof-of-principle measurement with lithium, beryllium and boron is planned for fall 2023 at the Paul Scherrer Institute. We di… ▽ More The QUARTET collaboration aims to significantly improve the precision of the absolute nuclear charge radii of light nuclei from Li to Ne by using an array of metallic magnetic calorimeters to perform high-precision X-ray spectroscopy of low-lying states in muonic atoms. A proof-of-principle measurement with lithium, beryllium and boron is planned for fall 2023 at the Paul Scherrer Institute. We discuss the performance achieved with the maXs-30 detector module to be used. To place the detector close to the target chamber where the muon beam will impact the material under study, we have developed a new dilution refrigerator sidearm. We further discuss the expected efficiency given the transparency of the X-ray windows and the quantum efficiency of the detector. The expected muonic X-ray rate combined with the high resolving power and detection efficiency of the detector suggest that QUARTET will be able to study the de-excitation of light muonic atoms at an unprecedented level, increasing the relative energy resolution by up to a factor of 20 compared to conventional detector techniques. △ Less

Submitted 6 April, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

arXiv:2311.02100 [pdf, other]

A Comprehensive Study on Model Initialization Techniques Ensuring Efficient Federated Learning

Authors: Ishmeet Kaur, Adwaita Janardhan Jadhav

Abstract: Advancement in the field of machine learning is unavoidable, but something of major concern is preserving the privacy of the users whose data is being used for training these machine learning algorithms. Federated learning(FL) has emerged as a promising paradigm for training machine learning models in a distributed and privacy-preserving manner which enables one to collaborate and train a global m… ▽ More Advancement in the field of machine learning is unavoidable, but something of major concern is preserving the privacy of the users whose data is being used for training these machine learning algorithms. Federated learning(FL) has emerged as a promising paradigm for training machine learning models in a distributed and privacy-preserving manner which enables one to collaborate and train a global model without sharing local data. But starting this learning process on each device in the right way, called ``model initialization" is critical. The choice of initialization methods used for models plays a crucial role in the performance, convergence speed, communication efficiency, privacy guarantees of federated learning systems, etc. In this survey, we dive deeper into a comprehensive study of various ways of model initialization techniques in FL.Unlike other studies, our research meticulously compares, categorizes, and delineates the merits and demerits of each technique, examining their applicability across diverse FL scenarios. We highlight how factors like client variability, data non-IIDness, model caliber, security considerations, and network restrictions influence FL model outcomes and propose how strategic initialization can address and potentially rectify many such challenges. The motivation behind this survey is to highlight that the right start can help overcome challenges like varying data quality, security issues, and network problems. Our insights provide a foundational base for experts looking to fully utilize FL, also while understanding the complexities of model initialization. △ Less

Submitted 31 October, 2023; originally announced November 2023.

Comments: Accepted to be presented at IEEE 2nd International Conference on Intelligent Computing and Next Generation Networks (ICNGN2023) will be held November 17-18,2023 at Hangzhou, China

arXiv:2310.05567 [pdf, other]

doi 10.1016/j.oceaneng.2023.116011

Collision Avoidance for Autonomous Surface Vessels using Novel Artificial Potential Fields

Authors: Aditya Kailas Jadhav, Anantha Raj Pandi, Abhilash Somayajula

Abstract: As the demand for transportation through waterways continues to rise, the number of vessels plying the waters has correspondingly increased. This has resulted in a greater number of accidents and collisions between ships, some of which lead to significant loss of life and financial losses. Research has shown that human error is a major factor responsible for such incidents. The maritime industry i… ▽ More As the demand for transportation through waterways continues to rise, the number of vessels plying the waters has correspondingly increased. This has resulted in a greater number of accidents and collisions between ships, some of which lead to significant loss of life and financial losses. Research has shown that human error is a major factor responsible for such incidents. The maritime industry is constantly exploring newer approaches to autonomy to mitigate this issue. This study presents the use of novel Artificial Potential Fields (APFs) to perform obstacle and collision avoidance in marine environments. This study highlights the advantage of harmonic functions over traditional functions in modeling potential fields. With a modification, the method is extended to effectively avoid dynamic obstacles while adhering to COLREGs. Improved performance is observed as compared to the traditional potential fields and also against the popular velocity obstacle approach. A comprehensive statistical analysis is also performed through Monte Carlo simulations in different congested environments that emulate real traffic conditions to demonstrate robustness of the approach. △ Less

Submitted 9 October, 2023; originally announced October 2023.

Comments: 28 pages, 30 figures

Journal ref: Ocean Engineering, 288 (2023), 116011

arXiv:2309.01824 [pdf, other]

On the fly Deep Neural Network Optimization Control for Low-Power Computer Vision

Authors: Ishmeet Kaur, Adwaita Janardhan Jadhav

Abstract: Processing visual data on mobile devices has many applications, e.g., emergency response and tracking. State-of-the-art computer vision techniques rely on large Deep Neural Networks (DNNs) that are usually too power-hungry to be deployed on resource-constrained edge devices. Many techniques improve the efficiency of DNNs by using sparsity or quantization. However, the accuracy and efficiency of th… ▽ More Processing visual data on mobile devices has many applications, e.g., emergency response and tracking. State-of-the-art computer vision techniques rely on large Deep Neural Networks (DNNs) that are usually too power-hungry to be deployed on resource-constrained edge devices. Many techniques improve the efficiency of DNNs by using sparsity or quantization. However, the accuracy and efficiency of these techniques cannot be adapted for diverse edge applications with different hardware constraints and accuracy requirements. This paper presents a novel technique to allow DNNs to adapt their accuracy and energy consumption during run-time, without the need for any re-training. Our technique called AdaptiveActivation introduces a hyper-parameter that controls the output range of the DNNs' activation function to dynamically adjust the sparsity and precision in the DNN. AdaptiveActivation can be applied to any existing pre-trained DNN to improve their deployability in diverse edge environments. We conduct experiments on popular edge devices and show that the accuracy is within 1.5% of the baseline. We also show that our approach requires 10%--38% less memory than the baseline techniques leading to more accuracy-efficiency tradeoff options △ Less

Submitted 4 September, 2023; originally announced September 2023.

arXiv:2308.13157 [pdf, other]

Federated Learning in IoT: a Survey from a Resource-Constrained Perspective

Authors: Ishmeet Kaur andAdwaita Janardhan Jadhav

Abstract: The IoT ecosystem is able to leverage vast amounts of data for intelligent decision-making. Federated Learning (FL), a decentralized machine learning technique, is widely used to collect and train machine learning models from a variety of distributed data sources. Both IoT and FL systems can be complementary and used together. However, the resource-constrained nature of IoT devices prevents the wi… ▽ More The IoT ecosystem is able to leverage vast amounts of data for intelligent decision-making. Federated Learning (FL), a decentralized machine learning technique, is widely used to collect and train machine learning models from a variety of distributed data sources. Both IoT and FL systems can be complementary and used together. However, the resource-constrained nature of IoT devices prevents the widescale deployment FL in the real world. This research paper presents a comprehensive survey of the challenges and solutions associated with implementing Federated Learning (FL) in resource-constrained Internet of Things (IoT) environments, viewed from 2 levels, client and server. We focus on solutions regarding limited client resources, presence of heterogeneous client data, server capacity, and high communication costs, and assess their effectiveness in various scenarios. Furthermore, we categorize the solutions based on the location of their application, i.e., the IoT client, and the FL server. In addition to a comprehensive review of existing research and potential future directions, this paper also presents new evaluation metrics that would allow researchers to evaluate their solutions on resource-constrained IoT devices. △ Less

Submitted 24 August, 2023; originally announced August 2023.

Comments: Presented and accepted at The IEEE 2023 International Conference on Artificial Intelligence, Robotics, Signal and Image Processing (AIRoSIP)

arXiv:2308.02553 [pdf, other]

Survey on Computer Vision Techniques for Internet-of-Things Devices

Authors: Ishmeet Kaur, Adwaita Janardhan Jadhav

Abstract: Deep neural networks (DNNs) are state-of-the-art techniques for solving most computer vision problems. DNNs require billions of parameters and operations to achieve state-of-the-art results. This requirement makes DNNs extremely compute, memory, and energy-hungry, and consequently difficult to deploy on small battery-powered Internet-of-Things (IoT) devices with limited computing resources. Deploy… ▽ More Deep neural networks (DNNs) are state-of-the-art techniques for solving most computer vision problems. DNNs require billions of parameters and operations to achieve state-of-the-art results. This requirement makes DNNs extremely compute, memory, and energy-hungry, and consequently difficult to deploy on small battery-powered Internet-of-Things (IoT) devices with limited computing resources. Deployment of DNNs on Internet-of-Things devices, such as traffic cameras, can improve public safety by enabling applications such as automatic accident detection and emergency response.Through this paper, we survey the recent advances in low-power and energy-efficient DNN implementations that improve the deployability of DNNs without significantly sacrificing accuracy. In general, these techniques either reduce the memory requirements, the number of arithmetic operations, or both. The techniques can be divided into three major categories: neural network compression, network architecture search and design, and compiler and graph optimizations. In this paper, we survey both low-power techniques for both convolutional and transformer DNNs, and summarize the advantages, disadvantages, and open research problems. △ Less

Submitted 1 August, 2023; originally announced August 2023.

Comments: Accepted and presented at THE IEEE INTERNATIONAL CONFERENCE ON INDUSTRY 4.0, ARTIFICIAL INTELLIGENCE, AND COMMUNICATIONS TECHNOLOGY

arXiv:2211.11749 [pdf]

Towards Automatic Prediction of Outcome in Treatment of Cerebral Aneurysms

Authors: Ashutosh Jadhav, Satyananda Kashyap, Hakan Bulu, Ronak Dholakia, Amon Y. Liu, Tanveer Syeda-Mahmood, William R. Patterson, Hussain Rangwala, Mehdi Moradi

Abstract: Intrasaccular flow disruptors treat cerebral aneurysms by diverting the blood flow from the aneurysm sac. Residual flow into the sac after the intervention is a failure that could be due to the use of an undersized device, or to vascular anatomy and clinical condition of the patient. We report a machine learning model based on over 100 clinical and imaging features that predict the outcome of wide… ▽ More Intrasaccular flow disruptors treat cerebral aneurysms by diverting the blood flow from the aneurysm sac. Residual flow into the sac after the intervention is a failure that could be due to the use of an undersized device, or to vascular anatomy and clinical condition of the patient. We report a machine learning model based on over 100 clinical and imaging features that predict the outcome of wide-neck bifurcation aneurysm treatment with an intravascular embolization device. We combine clinical features with a diverse set of common and novel imaging measurements within a random forest model. We also develop neural network segmentation algorithms in 2D and 3D to contour the sac in angiographic images and automatically calculate the imaging features. These deliver 90% overlap with manual contouring in 2D and 83% in 3D. Our predictive model classifies complete vs. partial occlusion outcomes with an accuracy of 75.31%, and weighted F1-score of 0.74. △ Less

Submitted 18 November, 2022; originally announced November 2022.

Comments: 10 pages

Report number: https://s4.goeshow.com/amia/annual/2022/schedule_at_a_glance.cfm?session_key=1965BCBD-A832-92DD-9D05-FB2CB132FADB&session_date=

Journal ref: AMAI 2022 Annual Symposium

arXiv:2109.12234 [pdf, other]

Low Cost Bin Picking Solution for E-Commerce Warehouse Fulfillment Centers

Authors: Avnish Gupta, Akash Jadhav, Pradyot VN Korupolu

Abstract: In recent years, the throughput requirements of e-commerce fulfillment warehouses have seen a steep increase. This has resulted in various automation solutions being developed for item picking and movement. In this paper, we address the problem of manipulators picking heterogeneous items placed randomly in a bin. Traditional solutions require that the items be picked to be placed in an orderly man… ▽ More In recent years, the throughput requirements of e-commerce fulfillment warehouses have seen a steep increase. This has resulted in various automation solutions being developed for item picking and movement. In this paper, we address the problem of manipulators picking heterogeneous items placed randomly in a bin. Traditional solutions require that the items be picked to be placed in an orderly manner in the bin and that the exact dimensions of the items be known beforehand. Such solutions do not perform well in the real world since the items in a bin are seldom placed in an orderly manner and new products are added almost every day by e-commerce suppliers. We propose a cost-effective solution that handles both the aforementioned challenges. Our solution comprises of a dual sensor system comprising of a regular RGB camera and a 3D ToF depth sensor. We propose a novel algorithm that fuses data from both these sensors to improve object segmentation while maintaining the accuracy of pose estimation, especially in occluded environments and tightly packed bins. We experimentally verify the performance of our system by picking boxes using an ABB IRB 1200 robot. We also show that our system maintains a high level of accuracy in pose estimation that is independent of the dimensions of the box, texture, occlusion or orientation. We further show that our system is computationally less expensive and maintains a consistent detection time of 1 second. We also discuss how this approach can be easily extended to objects of all shapes. △ Less

Submitted 24 September, 2021; originally announced September 2021.

Journal ref: Australasian Conference on Robotics and Automation 2019

arXiv:2011.13714 [pdf, ps, other]

Detection of Malaria Vector Breeding Habitats using Topographic Models

Authors: Aishwarya Jadhav

Abstract: Treatment of stagnant water bodies that act as a breeding site for malarial vectors is a fundamental step in most malaria elimination campaigns. However, identification of such water bodies over large areas is expensive, labour-intensive and time-consuming and hence, challenging in countries with limited resources. Practical models that can efficiently locate water bodies can target the limited re… ▽ More Treatment of stagnant water bodies that act as a breeding site for malarial vectors is a fundamental step in most malaria elimination campaigns. However, identification of such water bodies over large areas is expensive, labour-intensive and time-consuming and hence, challenging in countries with limited resources. Practical models that can efficiently locate water bodies can target the limited resources by greatly reducing the area that needs to be scanned by the field workers. To this end, we propose a practical topographic model based on easily available, global, high-resolution DEM data to predict locations of potential vector-breeding water sites. We surveyed the Obuasi region of Ghana to assess the impact of various topographic features on different types of water bodies and uncover the features that significantly influence the formation of aquatic habitats. We further evaluate the effectiveness of multiple models. Our best model significantly outperforms earlier attempts that employ topographic variables for detection of small water sites, even the ones that utilize additional satellite imagery data and demonstrates robustness across different settings. △ Less

Submitted 16 July, 2024; v1 submitted 27 November, 2020; originally announced November 2020.

Comments: Presented at ML4D Workshop, NeurIPS 2020. Proceedings available: arXiv:2101.04347. Also presented at MLPH Workshop, NeurIPS 2020. Awarded Best Paper. Recording available https://slideslive.com/38938356/detection-of-malaria-vector-breedding-habitats-using-topographic-models

arXiv:2011.09517 [pdf]

Extracting and Learning Fine-Grained Labels from Chest Radiographs

Authors: Tanveer Syeda-Mahmood, Ph. D, K. C. L Wong, Ph. D, Joy T. Wu, M. D., M. P. H, Ashutosh Jadhav, Ph. D, Orest Boyko, M. D. Ph. D

Abstract: Chest radiographs are the most common diagnostic exam in emergency rooms and intensive care units today. Recently, a number of researchers have begun working on large chest X-ray datasets to develop deep learning models for recognition of a handful of coarse finding classes such as opacities, masses and nodules. In this paper, we focus on extracting and learning fine-grained labels for chest X-ray… ▽ More Chest radiographs are the most common diagnostic exam in emergency rooms and intensive care units today. Recently, a number of researchers have begun working on large chest X-ray datasets to develop deep learning models for recognition of a handful of coarse finding classes such as opacities, masses and nodules. In this paper, we focus on extracting and learning fine-grained labels for chest X-ray images. Specifically we develop a new method of extracting fine-grained labels from radiology reports by combining vocabulary-driven concept extraction with phrasal grouping in dependency parse trees for association of modifiers with findings. A total of 457 fine-grained labels depicting the largest spectrum of findings to date were selected and sufficiently large datasets acquired to train a new deep learning model designed for fine-grained classification. We show results that indicate a highly accurate label extraction process and a reliable learning of fine-grained labels. The resulting network, to our knowledge, is the first to recognize fine-grained descriptions of findings in images covering over nine modifiers including laterality, location, severity, size and appearance. △ Less

Submitted 18 November, 2020; originally announced November 2020.

Comments: This paper won the Homer R. Warner Award at AMIA 2020 awarded to a paper that best describes approaches to improving computerized information acquisition, knowledge data acquisition and management, and experimental results documenting the value of these approaches. The paper shows a combination of textual and visual processing to automatically recognize complex findings in chest X-rays

arXiv:2009.06082 [pdf]

doi 10.5220/0008984901780186

Receptivity of an AI Cognitive Assistant by the Radiology Community: A Report on Data Collected at RSNA

Authors: Karina Kanjaria, Anup Pillai, Chaitanya Shivade, Marina Bendersky, Ashutosh Jadhav, Vandana Mukherjee, Tanveer Syeda-Mahmood

Abstract: Due to advances in machine learning and artificial intelligence (AI), a new role is emerging for machines as intelligent assistants to radiologists in their clinical workflows. But what systematic clinical thought processes are these machines using? Are they similar enough to those of radiologists to be trusted as assistants? A live demonstration of such a technology was conducted at the 2016 Scie… ▽ More Due to advances in machine learning and artificial intelligence (AI), a new role is emerging for machines as intelligent assistants to radiologists in their clinical workflows. But what systematic clinical thought processes are these machines using? Are they similar enough to those of radiologists to be trusted as assistants? A live demonstration of such a technology was conducted at the 2016 Scientific Assembly and Annual Meeting of the Radiological Society of North America (RSNA). The demonstration was presented in the form of a question-answering system that took a radiology multiple choice question and a medical image as inputs. The AI system then demonstrated a cognitive workflow, involving text analysis, image analysis, and reasoning, to process the question and generate the most probable answer. A post demonstration survey was made available to the participants who experienced the demo and tested the question answering system. Of the reported 54,037 meeting registrants, 2,927 visited the demonstration booth, 1,991 experienced the demo, and 1,025 completed a post-demonstration survey. In this paper, the methodology of the survey is shown and a summary of its results are presented. The results of the survey show a very high level of receptiveness to cognitive computing technology and artificial intelligence among radiologists. △ Less

Submitted 13 September, 2020; originally announced September 2020.

Journal ref: Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, ISBN 978-989-758-398-8, pages 178-186. 2020

arXiv:2007.13831 [pdf, other]

Chest X-ray Report Generation through Fine-Grained Label Learning

Authors: Tanveer Syeda-Mahmood, Ken C. L. Wong, Yaniv Gur, Joy T. Wu, Ashutosh Jadhav, Satyananda Kashyap, Alexandros Karargyris, Anup Pillai, Arjun Sharma, Ali Bin Syed, Orest Boyko, Mehdi Moradi

Abstract: Obtaining automated preliminary read reports for common exams such as chest X-rays will expedite clinical workflows and improve operational efficiencies in hospitals. However, the quality of reports generated by current automated approaches is not yet clinically acceptable as they cannot ensure the correct detection of a broad spectrum of radiographic findings nor describe them accurately in terms… ▽ More Obtaining automated preliminary read reports for common exams such as chest X-rays will expedite clinical workflows and improve operational efficiencies in hospitals. However, the quality of reports generated by current automated approaches is not yet clinically acceptable as they cannot ensure the correct detection of a broad spectrum of radiographic findings nor describe them accurately in terms of laterality, anatomical location, severity, etc. In this work, we present a domain-aware automatic chest X-ray radiology report generation algorithm that learns fine-grained description of findings from images and uses their pattern of occurrences to retrieve and customize similar reports from a large report database. We also develop an automatic labeling algorithm for assigning such descriptors to images and build a novel deep learning network that recognizes both coarse and fine-grained descriptions of findings. The resulting report generation algorithm significantly outperforms the state of the art using established score metrics. △ Less

Submitted 27 July, 2020; originally announced July 2020.

Comments: 11 pages, 5 figures, to appear in MICCAI 2020 Conference

ACM Class: I.2.1; I.4.9; J.3

arXiv:2006.00614 [pdf]

Electrospun nanodiamond-silk fibroin membranes: a multifunctional platform for biosensing and wound healing applications

Authors: Asma Khalid, Dongbi Bai, Amanda Abraham, Amit Jadhav, Denver Linklater, Alex Matusica, Duy Nguyen, Billy James Murdoch, Nadia Zakhartchouk, Chaitali Dekiwadia, Philipp Reineck, David Simpson, Achini K. Vidanapathirana, Shadi Houshyar, Christina A. Bursill, Elena Ivanova, Brant Gibson

Abstract: Next generation wound care technology capable of diagnosing wound parameters, promoting healthy cell growth and reducing pathogenic infections noninvasively will provide patients with an improved standard of care and an accelerated wound repair. Temperature is one of the indicating biomarkers specific to chronic wounds. This work reports a hybrid, multifunctional optical platform: nanodiamond-silk… ▽ More Next generation wound care technology capable of diagnosing wound parameters, promoting healthy cell growth and reducing pathogenic infections noninvasively will provide patients with an improved standard of care and an accelerated wound repair. Temperature is one of the indicating biomarkers specific to chronic wounds. This work reports a hybrid, multifunctional optical platform: nanodiamond-silk membranes as bioinspired dressings capable of temperature sensing and wound healing. The hybrid was fabricated through electrospinning and formed sub-micron fibrous membranes with high porosity. The silk fibres are capable of compensating for the lack of extracellular matrix at the wound site, supporting the wound healing. The negatively charged nitrogen vacancy (NV-) color centres in nanodiamonds (NDs) exhibit optically detected magnetic resonance (ODMR) properties and act as fluorescent nanoscale thermometers, capable of sensing temperature variations associated to the presence of infection or inflammation in a wound, without physically removing the dressing. Our results show that the presence of NDs in the hybrid ND-silk membranes improve the thermal stability of silk fibres. The NV- color centres in NDs embedded in silk fibres exhibit well-retained fluorescent and ODMR. Using the NV- centres as fluorescent nanoscale thermometers, we achieved temperature sensing at a range of temperatures, including the biologically relevant temperature window, on cell-cultured ND-silk membranes. Enhancement in the temperature sensitivity of the NV- centres was observed for the hybrids. The membranes were further tested in vivo in a murine wound healing model and demonstrated biocompatibility and equivalent wound closure rates as the control wounds. Additionally, the hybrid ND-silk membranes showed selective antifouling and biocidal propensity toward Gram-negative Pseudomonas aeruginosa and Escherichia coli. △ Less

Submitted 31 May, 2020; originally announced June 2020.

arXiv:2004.04244 [pdf]

doi 10.1109/ICCCI48352.2020.9104085

Variable Rate Video Compression using a Hybrid Recurrent Convolutional Learning Framework

Authors: Aishwarya Jadhav

Abstract: In recent years, neural network-based image compression techniques have been able to outperform traditional codecs and have opened the gates for the development of learning-based video codecs. However, to take advantage of the high temporal correlation in videos, more sophisticated architectures need to be employed. This paper presents PredEncoder, a hybrid video compression framework based on the… ▽ More In recent years, neural network-based image compression techniques have been able to outperform traditional codecs and have opened the gates for the development of learning-based video codecs. However, to take advantage of the high temporal correlation in videos, more sophisticated architectures need to be employed. This paper presents PredEncoder, a hybrid video compression framework based on the concept of predictive auto-encoding that models the temporal correlations between consecutive video frames using a prediction network which is then combined with a progressive encoder network to exploit the spatial redundancies. A variable-rate block encoding scheme has been proposed in the paper that leads to remarkably high quality to bit-rate ratios. By joint training and fine-tuning of this hybrid architecture, PredEncoder has been able to gain significant improvement over the MPEG-4 codec and has achieved bit-rate savings over the H.264 codec in the low to medium bit-rate range for HD videos and comparable results over most bit-rates for non-HD videos. This paper serves to demonstrate how neural architectures can be leveraged to perform at par with the highly optimized traditional methodologies in the video compression domain. △ Less

Submitted 21 August, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

Journal ref: 2020 International Conference on Computer Communication and Informatics (ICCCI)

arXiv:2003.03212 [pdf, other]

Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding

Authors: Seong Hyeon Park, Gyubok Lee, Manoj Bhat, Jimin Seo, Minseok Kang, Jonathan Francis, Ashwin R. Jadhav, Paul Pu Liang, Louis-Philippe Morency

Abstract: Multi-agent trajectory forecasting in autonomous driving requires an agent to accurately anticipate the behaviors of the surrounding vehicles and pedestrians, for safe and reliable decision-making. Due to partial observability in these dynamical scenes, directly obtaining the posterior distribution over future agent trajectories remains a challenging problem. In realistic embodied environments, ea… ▽ More Multi-agent trajectory forecasting in autonomous driving requires an agent to accurately anticipate the behaviors of the surrounding vehicles and pedestrians, for safe and reliable decision-making. Due to partial observability in these dynamical scenes, directly obtaining the posterior distribution over future agent trajectories remains a challenging problem. In realistic embodied environments, each agent's future trajectories should be both diverse since multiple plausible sequences of actions can be used to reach its intended goals, and admissible since they must obey physical constraints and stay in drivable areas. In this paper, we propose a model that synthesizes multiple input signals from the multimodal world|the environment's scene context and interactions between multiple surrounding agents|to best model all diverse and admissible trajectories. We compare our model with strong baselines and ablations across two public datasets and show a significant performance improvement over previous state-of-the-art methods. Lastly, we offer new metrics incorporating admissibility criteria to further study and evaluate the diversity of predictions. Codes are at: https://github.com/kami93/CMU-DATF. △ Less

Submitted 31 August, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

Comments: ECCV 2020

arXiv:2002.11643 [pdf]

Marathi To English Neural Machine Translation With Near Perfect Corpus And Transformers

Authors: Swapnil Ashok Jadhav

Abstract: There have been very few attempts to benchmark performances of state-of-the-art algorithms for Neural Machine Translation task on Indian Languages. Google, Bing, Facebook and Yandex are some of the very few companies which have built translation systems for few of the Indian Languages. Among them, translation results from Google are supposed to be better, based on general inspection. Bing-Translat… ▽ More There have been very few attempts to benchmark performances of state-of-the-art algorithms for Neural Machine Translation task on Indian Languages. Google, Bing, Facebook and Yandex are some of the very few companies which have built translation systems for few of the Indian Languages. Among them, translation results from Google are supposed to be better, based on general inspection. Bing-Translator do not even support Marathi language which has around 95 million speakers and ranks 15th in the world in terms of combined primary and secondary speakers. In this exercise, we trained and compared variety of Neural Machine Marathi to English Translators trained with BERT-tokenizer by huggingface and various Transformer based architectures using Facebook's Fairseq platform with limited but almost correct parallel corpus to achieve better BLEU scores than Google on Tatoeba and Wikimedia open datasets. △ Less

Submitted 26 February, 2020; originally announced February 2020.

Comments: 5 pages, 5 tables. This report is based on applied research work done at Dailyhunt

arXiv:2002.11402 [pdf, other]

Detecting Potential Topics In News Using BERT, CRF and Wikipedia

Authors: Swapnil Ashok Jadhav

Abstract: For a news content distribution platform like Dailyhunt, Named Entity Recognition is a pivotal task for building better user recommendation and notification algorithms. Apart from identifying names, locations, organisations from the news for 13+ Indian languages and use them in algorithms, we also need to identify n-grams which do not necessarily fit in the definition of Named-Entity, yet they are… ▽ More For a news content distribution platform like Dailyhunt, Named Entity Recognition is a pivotal task for building better user recommendation and notification algorithms. Apart from identifying names, locations, organisations from the news for 13+ Indian languages and use them in algorithms, we also need to identify n-grams which do not necessarily fit in the definition of Named-Entity, yet they are important. For example, "me too movement", "beef ban", "alwar mob lynching". In this exercise, given an English language text, we are trying to detect case-less n-grams which convey important information and can be used as topics and/or hashtags for a news. Model is built using Wikipedia titles data, private English news corpus and BERT-Multilingual pre-trained model, Bi-GRU and CRF architecture. It shows promising results when compared with industry best Flair, Spacy and Stanford-caseless-NER in terms of F1 and especially Recall. △ Less

Submitted 28 February, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

Comments: 6 pages, 5 tables, 1 figure, 2 examples. This is a report based on applied research work conducted at Dailyhunt

arXiv:2002.00584 [pdf, other]

A Novel PHY Layer Approach for Enhanced Data Rate in LoRa using Adaptive Symbol Periods

Authors: Akshay Ramesh Jadhav, P. Rajalakshmi

Abstract: LoRaWAN has emerged as one of the promising low-power wide-area network technologies to enable long-range sensing and monitoring applications in Internet of Things. The LoRa physical layer used in LoRaWAN suffers from low data rates and thus increases packet duration. In a dense LoRaWAN network scenario with simple media access protocol like ALOHA, the packet collision probability increases with i… ▽ More LoRaWAN has emerged as one of the promising low-power wide-area network technologies to enable long-range sensing and monitoring applications in Internet of Things. The LoRa physical layer used in LoRaWAN suffers from low data rates and thus increases packet duration. In a dense LoRaWAN network scenario with simple media access protocol like ALOHA, the packet collision probability increases with increase in packet duration. This degrades over-all network throughput because of increased re-transmissions of collided packets. Any increase in data rate directly reduces the packet duration. Thus, in this paper, we have proposed a novel approach to enhance the data rate in LoRa communication system by using adaptive symbol periods in physical layer. To the best of our knowledge, this is the first attempt at using adaptive symbol periods to enhance data rate of the LoRa system. The trade-off of the proposed approach in terms of required symbol overhead and degradation in bit error rate performance due to symbol period reduction has also been analysed. We have shown that for reduction factor \(β\), the data rate directly increases \(1/β\) times. △ Less

Submitted 3 February, 2020; originally announced February 2020.

Comments: Accepted and to be published in Proceedings of IEEE International Conference on Advanced Networks and Telecommunications Systems (IEEE ANTS 2019)

arXiv:1909.01547 [pdf, other]

Aerial multi-object tracking by detection using deep association networks

Authors: Ajit Jadhav, Prerana Mukherjee, Vinay Kaushik, Brejesh Lall

Abstract: A lot a research is focused on object detection and it has achieved significant advances with deep learning techniques in recent years. Inspite of the existing research, these algorithms are not usually optimal for dealing with sequences or images captured by drone-based platforms, due to various challenges such as view point change, scales, density of object distribution and occlusion. In this pa… ▽ More A lot a research is focused on object detection and it has achieved significant advances with deep learning techniques in recent years. Inspite of the existing research, these algorithms are not usually optimal for dealing with sequences or images captured by drone-based platforms, due to various challenges such as view point change, scales, density of object distribution and occlusion. In this paper, we develop a model for detection of objects in drone images using the VisDrone2019 DET dataset. Using the RetinaNet model as our base, we modify the anchor scales to better handle the detection of dense distribution and small size of the objects. We explicitly model the channel interdependencies by using "Squeeze-and-Excitation" (SE) blocks that adaptively recalibrates channel-wise feature responses. This helps to bring significant improvements in performance at a slight additional computational cost. Using this architecture for object detection, we build a custom DeepSORT network for object detection on the VisDrone2019 MOT dataset by training a custom Deep Association network for the algorithm. △ Less

Submitted 3 September, 2019; originally announced September 2019.

arXiv:1711.09389 [pdf]

Whale Optimization Based Energy-Efficient Cluster Head Selection Algorithm for Wireless Sensor Networks

Authors: Ashwin R Jadhav, T. Shankar

Abstract: Wireless Sensor Network (WSN) consists of many individual sensors that are deployed in the area of interest. These sensor nodes have major energy constraints as they are small and their battery can't be replaced. They collaborate together in order to gather, transmit and forward the sensed data to the base station. Consequently, data transmission is one of the biggest reasons for energy depletion… ▽ More Wireless Sensor Network (WSN) consists of many individual sensors that are deployed in the area of interest. These sensor nodes have major energy constraints as they are small and their battery can't be replaced. They collaborate together in order to gather, transmit and forward the sensed data to the base station. Consequently, data transmission is one of the biggest reasons for energy depletion in WSN. Clustering is one of the most effective techniques for energy efficient data transmission in WSN. In this paper, an energy efficient cluster head selection algorithm which is based on Whale Optimization Algorithm (WOA) called WOA-Clustering (WOA-C) is proposed. Accordingly, the proposed algorithm helps in selection of energy aware cluster heads based on a fitness function which considers the residual energy of the node and the sum of energy of adjacent nodes. The proposed algorithm is evaluated for network lifetime, energy efficiency, throughput and overall stability. Furthermore, the performance of WOA-C is evaluated against other standard contemporary routing protocols such as LEACH. Extensive simulations show the superior performance of the proposed algorithm in terms of residual energy, network lifetime and longer stability period. △ Less

Submitted 26 November, 2017; originally announced November 2017.

Comments: 22 pages, 12 figures, 5 tables

arXiv:0912.2319 [pdf]

Steganography An Art of Hiding Data

Authors: Shashikala Channalli, Ajay Jadhav

Abstract: In today's world the art of sending & displaying the hidden information especially in public places, has received more attention and faced many challenges. Therefore, different methods have been proposed so far for hiding information in different cover media. In this paper a method for hiding of information on the billboard display is presented. It is well known that encryption provides secure c… ▽ More In today's world the art of sending & displaying the hidden information especially in public places, has received more attention and faced many challenges. Therefore, different methods have been proposed so far for hiding information in different cover media. In this paper a method for hiding of information on the billboard display is presented. It is well known that encryption provides secure channels for communicating entities. However, due to lack of covertness on these channels, an eavesdropper can identify encrypted streams through statistical tests and capture them for further cryptanalysis. In this paper we propose a new form of steganography, on-line hiding of information on the output screens of the instrument. This method can be used for announcing a secret message in public place. It can be extended to other means such as electronic advertising board around sports stadium, railway station or airport. This method of steganography is very similar to image steganography and video steganography. Private marking system using symmetric key steganography technique and LSB technique is used here for hiding the secret information. △ Less

Submitted 11 December, 2009; originally announced December 2009.

Journal ref: IJCSE Volume 1 Issue 3 2009 137-141

arXiv:cond-mat/0207406 [pdf]

Electrochemical synthesis of superconducting MgB2 thin films: a novel potential technique

Authors: S. H. Pawar, A. B. Jadhav, P. M. Shirage, D. D. Shivagan

Abstract: A complexing molecule electrodeposition technique has been developed for the deposition of low cost, lightweight MgB2 films, for the first time. Different deposition parameters such as bath composition, deposition potential, current density, deposition time were studied and optimized to give uniform, homogeneous and sticky films. The MgB2 films were deposited at a constant potential of -1.3 V wi… ▽ More A complexing molecule electrodeposition technique has been developed for the deposition of low cost, lightweight MgB2 films, for the first time. Different deposition parameters such as bath composition, deposition potential, current density, deposition time were studied and optimized to give uniform, homogeneous and sticky films. The MgB2 films were deposited at a constant potential of -1.3 V with respect to Saturated Calomel Electrode (SCE) onto silver substrate from aqueous bath and at -3.2 V with respect to SCE electrode onto silver substrate from non-aqueous bath. XRD, SEM and EDAX techniques are used to characterize these films. The films formed from non-aqueous electrochemical bath show the superconducting transition temperature at Tc = 36.4 K. The method developed is of less energy inputs than any other existing methods and of versatile nature having potential for large scale applications of MgB2 superconductors. PACS numbers: 74.70.Ad, 81.15.Pq, 74.76.-w *Author for correspondence △ Less

Submitted 17 July, 2002; originally announced July 2002.

Comments: 12 pages, 5 figures. Submitted to Journal of Physics: Condensed Matter

Showing 1–25 of 25 results for author: Jadhav, A