Hedging Is Not All You Need: A Simple Baseline for Online Learning Under Haphazard Inputs

Authors: Himanshu Buckchash, Momojit Biswas, Rohit Agarwal, Dilip K. Prasad

Abstract: Handling haphazard streaming data, such as data from edge devices, presents a challenging problem. Over time, the incoming data becomes inconsistent, with missing, faulty, or new inputs reappearing. Therefore, it requires models that are reliable. Recent methods to solve this problem depend on a hedging-based solution and require specialized elements like auxiliary dropouts, forked architectures, and intricate network design. We observed that hedging can be reduced to a special case of weighted residual connection; this motivated us to approximate it with plain self-attention. In this work, we propose HapNet, a simple baseline that is scalable, does not require online backpropagation, and is adaptable to varying input types. All present methods are restricted to scaling with a fixed window; however, we introduce a more complex problem of scaling with a variable window where the data becomes positionally uncorrelated, and cannot be addressed by present methods. We demonstrate that a variant of the proposed approach can work even for this complex scenario. We extensively evaluated the proposed approach on five benchmarks and found competitive performance.

Submitted 16 September, 2024; originally announced September 2024.
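
A minimal sketch of the observation in the abstract above that a hedged prediction is a weighted sum of per-layer outputs. This is not HapNet and not the authors' code: the function names, the discount factor `beta`, and the sample values are assumptions for illustration, and the update shown is the classic multiplicative Hedge rule.

```python
def hedge_combine(layer_preds, alphas):
    """Final prediction as a weighted sum of per-layer predictions."""
    return sum(a * p for a, p in zip(alphas, layer_preds))


def hedge_update(alphas, losses, beta=0.99):
    """Multiplicative Hedge update: scale each weight by beta**loss, then renormalise."""
    scaled = [a * beta ** loss for a, loss in zip(alphas, losses)]
    total = sum(scaled)
    return [s / total for s in scaled]


# Two hypothetical layers: the first incurs loss 1.0, the second loss 0.0.
alphas = hedge_update([0.5, 0.5], losses=[1.0, 0.0], beta=0.5)
prediction = hedge_combine([1.0, 3.0], alphas)
```

Because the weights are normalised and multiply each layer's output before summation, the combination has the same form as a weighted skip connection from every layer to the output, which is the structure the abstract proposes to approximate with self-attention.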

arXiv:2409.00150 [pdf]

Characterisation of Front-End Electronics of ChaSTE experiment onboard Chandrayaan-3 lander

Authors: K. Durga Prasad, Chandan Kumar, Sanjeev K. Mishra, P. Kalyana S. Reddy, Janmejay Kumar, Tinkal Ladiya, Arpit Patel, Anil Bhardwaj

Abstract: Chandra Surface Thermophysical Experiment (ChaSTE) is one of the payloads flown onboard the Chandrayaan-3 lander. The objective of the experiment is in-situ investigation of thermal behaviour of outermost 100 mm layer of the lunar surface by deploying a thermal probe. The probe consists of 10 temperature sensors (Platinum RTDs) mounted at different locations along the length of the probe to measure lunar soil temperatures as a function of depth. A heater is also mounted on the probe for thermal conductivity measurements. The onboard electronics of ChaSTE has two parts, Front-End Electronics (FEE) and processing electronics (PE). The front-end electronics (FEE) card is responsible for carrying out necessary sensor signal conditioning, which includes exciting the RTD sensors, acquiring analog voltages and then converting the acquired analog signals to digital signals using an Analog to Digital Converter (ADC). The front-end card is further interfaced with the processing electronics card for digital processing and spacecraft interface. The calibration, characterisation and functional test activities of Front-End Electronics of ChaSTE were carried out with the objective of testing and ensuring proper functionality and performance. A two-phase calibration process involving electronic offset correction and temperature calibration was carried out.
All these activities were successfully completed and the results from them provided us with a good understanding of the behaviour of the FEE under different thermal and electrical conditions as well as when subjected to the simulated conditions of the actual ChaSTE experiment. The performance of the ChaSTE front-end electronics was well within the design margins and its behaviour in simulated lunar environment was as desired. The data from these activities is useful in the interpretation of the actual science data of ChaSTE.

Submitted 30 August, 2024; originally announced September 2024.

Comments: 15 pages, 14 figures

Journal ref: Journal of Spacecraft Technology, 34(2), July-Dec. 2023, Publisher: U.R.Rao.Satellite Centre, ISRO, Bangalore, ISSN: 0971-1600

arXiv:2406.09208 [pdf, other]

Python-based DSL for generating Verilog model of Synchronous Digital Circuits

Authors: Mandar Datar, Dhruva S. Hegde, Vendra Durga Prasad, Manish Prajapati, Neralla Manikanta, Devansh Gupta, Janampalli Pavanija, Pratyush Pare, Akash, Shivam Gupta, Sachin B. Patkar

Abstract: We have designed a Python-based Domain Specific Language (DSL) for modeling synchronous digital circuits. In this DSL, hardware is modeled as a collection of transactions -- running in series, parallel, and loops. When the model is executed by a Python interpreter, synthesizable and behavioural Verilog is generated as output, which can be integrated with other RTL designs or directly used for FPGA and ASIC flows. In this paper, we describe - 1) the language (DSL), which allows users to express computation in series/parallel/loop constructs, with explicit cycle boundaries, 2) the internals of a simple Python implementation to produce synthesizable Verilog, and 3) several design examples and case studies for applications in post-quantum cryptography, stereo-vision, digital signal processing and optimization techniques. In the end, we list ideas to extend this framework.

Submitted 13 June, 2024; originally announced June 2024.

Comments: 9 pages, 13 figures

arXiv:2405.05777 [pdf, other]

Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language

Authors: Ronny Paul, Himanshu Buckchash, Shantipriya Parida, Dilip K. Prasad

Abstract: Sámi, an indigenous language group comprising multiple languages, faces digital marginalization due to the limited availability of data and sophisticated language models designed for its linguistic intricacies. This work focuses on increasing technological participation for the Sámi language. We draw the attention of the ML community towards the language modeling problem of Ultra Low Resource (ULR) languages. ULR languages are those for which the amount of available textual resources is very low, and the speaker count for them is also very low. ULRLs are also not supported by mainstream Large Language Models (LLMs) like ChatGPT, due to which gathering artificial training data for them becomes even more challenging. Mainstream AI foundational model development has given less attention to this category of languages. Generally, these languages have very few speakers, making it hard to find them. However, it is important to develop foundational models for these ULR languages to promote inclusion and the tangible abilities and impact of LLMs. To this end, we have compiled the available Sámi language resources from the web to create a clean dataset for training language models. In order to study the behavior of modern LLM models with ULR languages (Sámi), we have experimented with different kinds of LLMs, mainly at the order of $\sim$ seven billion parameters. We have also explored the effect of multilingual LLM training for ULRLs.
We found that the decoder-only models under a sequential multilingual training scenario perform better than joint multilingual training, whereas multilingual training with high semantic overlap, in general, performs better than training from scratch. This is the first study on the Sámi language for adapting non-statistical language models that use the latest developments in the field of natural language processing (NLP).

Submitted 9 May, 2024; originally announced May 2024.

arXiv:2404.04903 [pdf, other]

Online Learning under Haphazard Input Conditions: A Comprehensive Review and Analysis

Authors: Rohit Agarwal, Arijit Das, Alexander Horsch, Krishna Agarwal, Dilip K. Prasad

Abstract: The domain of online learning has experienced multifaceted expansion owing to its prevalence in real-life applications. Nonetheless, this progression operates under the assumption that the input feature space of the streaming data remains constant. In this survey paper, we address the topic of online learning in the context of haphazard inputs, explicitly foregoing such an assumption. We discuss, classify, evaluate, and compare the methodologies that are adept at modeling haphazard inputs, additionally providing the corresponding code implementations and their carbon footprint. Moreover, we classify the datasets related to the field of haphazard inputs and introduce evaluation metrics specifically designed for datasets exhibiting imbalance. The code of each methodology can be found at https://github.com/Rohit102497/HaphazardInputsReview

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2402.11323 [pdf, other]

Towards Development of Automated Knowledge Maps and Databases for Materials Engineering using Large Language Models

Authors: Deepak Prasad, Mayur Pimpude, Alankar Alankar

Abstract: In this work, a Large Language Model (LLM) based workflow is presented that utilizes the OpenAI ChatGPT model GPT-3.5-turbo-1106 and the Google Gemini Pro model to create summaries of text, data and images from research articles. It is demonstrated that by using a series of processing steps, the key information can be arranged in tabular form and knowledge graphs to capture underlying concepts. Our method offers efficiency and comprehension, enabling researchers to extract insights more effectively. Evaluation based on a diverse Scientific Paper Collection demonstrates our approach in facilitating discovery of knowledge. This work contributes to accelerated material design by smart literature review. The method has been tested based on various qualitative and quantitative measures of gathered information. The ChatGPT model achieved an F1 score of 0.40 for an exact match (ROUGE-1, ROUGE-2) but an impressive 0.479 for a relaxed match (ROUGE-L, ROUGE-Lsum) structural data format in performance evaluation. The Google Gemini Pro outperforms ChatGPT with an F1 score of 0.50 for an exact match and 0.63 for a relaxed match. This method facilitates high-throughput development of a database relevant to materials informatics. For demonstration, an example of data extraction and knowledge graph formation based on a manuscript about a titanium alloy is discussed.

Submitted 17 February, 2024; originally announced February 2024.
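
As a rough illustration of the kind of score quoted in the abstract above, the sketch below computes a ROUGE-1 style F1 from unigram overlap. It is an assumption-laden toy, not the authors' evaluation pipeline: whitespace tokenisation, lowercasing, the function name `rouge1_f1`, and the sample strings are all chosen here for illustration.

```python
from collections import Counter


def rouge1_f1(candidate, reference):
    """F1 over clipped unigram overlap between a candidate and a reference text."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Counter intersection keeps the minimum count of each shared token.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Hypothetical extracted text versus a hand-written reference.
score = rouge1_f1("the alloy is titanium", "the titanium alloy")
```

Here three of the four candidate tokens appear in the three-token reference, so precision is 3/4, recall is 1, and the F1 is 6/7.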

arXiv:2402.08884 [pdf, other]

Machine Learning, Density Functional Theory, and Experiments to Understand the Photocatalytic Reduction of CO$_2$ by CuPt/TiO$_2$

Authors: Vaidish Sumaria, Takat B. Rawal, Young Feng Li, David Sommer, Jake Vikoren, Robert J. Bondi, Matthias Rupp, Amrit Prasad, Deeptanshu Prasad

Abstract: The photoconversion of CO$_2$ to hydrocarbons is a sustainable route to its transformation into value-added compounds and, thereby, crucial to mitigating the energy and climate crises. CuPt nanoparticles on TiO$_2$ surfaces have been reported to show promising photoconversion efficiency. For further progress, a mechanistic understanding of the catalytic properties of these CuPt/TiO$_2$ systems is vital. Here, we employ $\textit{ab-initio}$ calculations, machine learning, and photocatalysis experiments to explore their configurational space and examine their reactivity and find that the interface plays a key role in stabilizing *CO$_2$, *CO, and other CH-containing intermediates, facilitating higher activity and selectivity for methane. A bias-corrected machine-learning interatomic potential trained on density functional theory data enables efficient exploration of the potential energy surfaces of numerous CO$_2$@CuPt/TiO$_2$ configurations via basin-hopping Monte Carlo simulations, greatly accelerating the study of these photocatalyst systems. Our simulations show that CO$_2$ preferentially adsorbs at the interface, with the C atom bonded to a Pt site and one O atom occupying an O-vacancy site. The interface also promotes the formation of *CH and *CH$_2$ intermediates. For confirmation, we synthesize CuPt/TiO$_2$ samples with a variety of compositions and analyze their morphologies and compositions using scanning electron microscopy and energy-dispersive X-ray spectroscopy, and measure their photocatalytic activity.
Our computational and experimental findings qualitatively agree and highlight the importance of interface design for selective conversion of CO$_2$ to hydrocarbons.

Submitted 16 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

Comments: Main text: 16 pages and 7 figures; Supporting information: 10 pages and 9 figures; Page 1, affiliation re-ordering; Page 4, typos corrected and abbreviation defined; Page 5, Table 1 caption revised and typos corrected; Page 16 typos corrected

arXiv:2401.16434 [pdf]

doi 10.1016/j.egyr.2023.01.039

A novel ANROA based control approach for grid-tied multi-functional solar energy conversion system

Authors: Dinanath Prasad, Narendra Kumar, Rakhi Sharma, Hasmat Malik, Fausto Pedro García Márquez, Jesús María Pinar Pérez

Abstract: An adaptive control approach for a three-phase grid-interfaced solar photovoltaic system based on the new Neuro-Fuzzy Inference System with Rain Optimization Algorithm (ANROA) methodology is proposed and discussed in this manuscript. This method incorporates an Adaptive Neuro-fuzzy Inference System (ANFIS) with a Rain Optimization Algorithm (ROA). The ANFIS controller has excellent maximum tracking capability because it includes features of both neural and fuzzy techniques. The ROA technique is in charge of controlling the voltage source converter switching. Avoiding power quality problems including voltage fluctuations, harmonics, and flickers as well as unbalanced loads and reactive power usage is the major goal. Besides, the proposed method performs at zero voltage regulation and unity power factor modes. The suggested control approach has been modeled and simulated, and its performance has been assessed using existing alternative methods. A statistical analysis of proposed and existing techniques has also been presented and discussed. The results of the simulations demonstrate that, when compared to alternative approaches, the suggested strategy may properly and effectively identify the best global solutions. Furthermore, the system's robustness has been studied by using the MATLAB/SIMULINK environment and experimentally by Field Programmable Gate Array (FPGA) controller-based Hardware-in-Loop (HIL).

Submitted 26 January, 2024; originally announced January 2024.

Comments: The paper was published in Energy Reports journal (ELSEVIER). Cite as: Prasad, D., Kumar, N., Sharma, R., Malik, H., Márquez, F. P. G., & Pinar-Pérez, J. M. (2023). A novel ANROA based control approach for grid-tied multi-functional solar energy conversion system. Energy Reports, 9, 2044-2057

Journal ref: Energy Reports (2023) Elsevier

arXiv:2311.05704 [pdf, other]

The Case for Hot-Mode Accretion in Abell 2029

Authors: Deovrat Prasad, G. Mark Voit, Brian W. O'Shea

Abstract: Radiative cooling and AGN heating are thought to form a feedback loop that regulates the evolution of low redshift cool-core galaxy clusters. Numerical simulations suggest that formation of multiphase gas in the cluster core imposes a floor on the ratio of cooling time ($t_{\rm cool}$) to free-fall time ($t_{\rm ff}$) at $\min ( t_{\rm cool} / t_{\rm ff} ) \approx 10$. Observations of galaxy clusters show evidence for such a floor, and usually the cluster cores with $\min ( t_{\rm cool} / t_{\rm ff} ) \lesssim 30$ contain abundant multiphase gas. However, there are important outliers. One of them is Abell 2029, a massive galaxy cluster ($M_{200} \gtrsim 10^{15}$ M$_\odot$) with $\min( t_{\rm cool}/t_{\rm ff}) \sim 20$, but little apparent multiphase gas. In this paper, we present high resolution 3D hydrodynamic AMR simulations of a cluster similar to A2029 and study how it evolves over a period of 1-2 Gyr. Those simulations suggest that Abell 2029 self-regulates without producing multiphase gas because the mass of its central black hole ($\sim 5\times 10^{10} \, M_\odot$) is great enough for Bondi accretion of hot ambient gas to produce enough feedback energy to compensate for radiative cooling.

Submitted 9 November, 2023; originally announced November 2023.

Comments: 8 pages, 5 figures, submitted to MNRAS

arXiv:2311.02538 [pdf, other]

Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols

Authors: Iqra Qasim, Alexander Horsch, Dilip K. Prasad

Abstract: Untrimmed videos have interrelated events, dependencies, context, overlapping events, object-object interactions, domain specificity, and other semantics that are worth highlighting while describing a video in natural language. Owing to such a vast diversity, a single sentence can only correctly describe a portion of the video. Dense Video Captioning (DVC) aims at detecting and describing different events in a given video. The term DVC originated in the 2017 ActivityNet challenge, after which considerable effort has been made to address the challenge. Dense Video Captioning is divided into three sub-tasks: (1) Video Feature Extraction (VFE), (2) Temporal Event Localization (TEL), and (3) Dense Caption Generation (DCG). This review aims to discuss all the studies that claim to perform DVC along with its sub-tasks and summarize their results. We also discuss all the datasets that have been used for DVC. Lastly, we highlight some emerging challenges and future trends in the field.

Submitted 4 November, 2023; originally announced November 2023.

Comments: 35 pages, 10 figures

arXiv:2310.07578 [pdf, ps, other]

Nanoparticle Stressor-Induced Single-photon Sources in Monolayer WS$_2$ Emitting into a Narrowband Visible Spectral Range

Authors: J. Thoppil S, Y. Waheed, S. Shit, I. D. Prasad, K. Watanabe, T. Taniguchi, S. Kumar

Abstract: A van der Waals heterostructure containing an atomically thin monolayer transition-metal dichalcogenide as a single-photon emitting layer is emerging as an intriguing solid-state quantum-photonic platform. Here, we report the utilization of spin-coating of silica nanoparticles for deterministically creating spectrally isolated, energetically stable, and narrow-linewidth single-photon emitters in ML-WS$_2$. We also demonstrate that long-duration low-temperature annealing of the photonic heterostructure in vacuum removes the energetically unstable emitters that are present due to fabrication-associated residue and leads to the emission of single photons in a <25 nm narrowband visible spectral range centered at $\sim$620 nm. This work may pave the way toward realizing a hybrid-quantum-photonic platform containing a van der Waals heterostructure/device and an atomic-vapor system emitting/absorbing in the same visible spectral range.

Submitted 11 October, 2023; originally announced October 2023.

Comments: 5 figures

arXiv:2309.08698 [pdf, other]

No Imputation Needed: A Switch Approach to Irregularly Sampled Time Series

Authors: Rohit Agarwal, Aman Sinha, Ayan Vishwakarma, Xavier Coubez, Marianne Clausel, Mathieu Constant, Alexander Horsch, Dilip K. Prasad

Abstract: Modeling irregularly-sampled time series (ISTS) is challenging because of missing values. Most existing methods focus on handling ISTS by converting irregularly sampled data into regularly sampled data via imputation. These models assume an underlying missing mechanism, which may lead to unwanted bias and sub-optimal performance. We present SLAN (Switch LSTM Aggregate Network), which utilizes a group of LSTMs to model ISTS without imputation, eliminating the assumption of any underlying process. It dynamically adapts its architecture on the fly based on the measured sensors using switches. SLAN exploits the irregularity information to explicitly capture each sensor's local summary and maintains a global summary state throughout the observational period. We demonstrate the efficacy of SLAN on two public datasets, namely, MIMIC-III, and Physionet 2012.

Submitted 19 August, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

arXiv:2308.10712 [pdf]

doi 10.18520/cs/v126/i7/774-780

Chandrayaan-3 Alternate Landing Site: Pre-Landing Characterisation

Authors: K. Durga Prasad, Dibyendu Misra, Amitabh, Megha Bhatt, G. Ambily, Sachana Sathyan, Neeraj Srivastava, Anil Bhardwaj

Abstract: India's third Moon mission Chandrayaan 3 will deploy a lander and a rover at a high latitude location of the Moon enabling us to carry out first ever in-situ science investigations of such a pristine location that will potentially improve our understanding on primary crust formation and subsequent modification processes. The primary landing site (PLS) is situated at 69.367621 degS, 32.348126 degE. As a contingency, an alternate landing site (ALS) was also selected at nearly the same latitude but nearly 450 km west of PLS. In this work, a detailed study of the geomorphology, composition, and temperature characteristics of ALS has been carried out using the best-ever high resolution Chandrayaan 2 OHRC DEMs and Ortho images, datasets obtained from Chandrayaan 1 and on-going Lunar Reconnaissance Orbiter. For understanding the thermophysical behaviour, we used a well-established thermophysical model. We found that the Chandrayaan 3 ALS is characterised by a smooth topography with an elevated central part. The ALS is a scientifically interesting site with a high possibility of sampling ejecta materials from Tycho and Moretus. Based on the spectral and elemental analysis of the site, Fe is found to be near approx. 4.8 wt.%, with Mg approx. 5 wt.%, and Ca approx. 11 wt.%. Compositionally, ALS is similar to PLS with a highland soil composition. Spatial and diurnal variability of around 40 K and 175 K has been observed in the surface temperatures at ALS.
Although its location is similar to that of PLS, ALS showed reduced daytime temperatures and enhanced night-time temperatures compared to PLS, indicating a terrain of distinctive thermophysical characteristics. Like PLS, ALS also seems to be an interesting site for science investigations and Chandrayaan 3 is expected to provide new insights into the understanding of lunar science even if it happens to land in the alternate landing site.

Submitted 21 August, 2023; originally announced August 2023.

Comments: 13 pages, 7 figures

Journal ref: Current Science, 126(7), 774-780, 2023

arXiv:2308.06983 [pdf, other]

pNNCLR: Stochastic Pseudo Neighborhoods for Contrastive Learning based Unsupervised Representation Learning Problems

Authors: Momojit Biswas, Himanshu Buckchash, Dilip K. Prasad

Abstract: Nearest neighbor (NN) sampling provides more semantic variations than pre-defined transformations for self-supervised learning (SSL) based image recognition problems. However, its performance is restricted by the quality of the support set, which holds positive samples for the contrastive loss. In this work, we show that the quality of the support set plays a crucial role in any nearest neighbor based method for SSL. We then provide a refined baseline (pNNCLR) to the nearest neighbor based SSL approach (NNCLR). To this end, we introduce pseudo nearest neighbors (pNN) to control the quality of the support set, wherein, rather than sampling the nearest neighbors, we sample in the vicinity of hard nearest neighbors by varying the magnitude of the resultant vector and employing a stochastic sampling strategy to improve the performance. Additionally, to stabilize the effects of uncertainty in NN-based learning, we employ a smooth-weight-update approach for training the proposed network. Evaluation of the proposed method on multiple public image recognition and medical image recognition datasets shows that it performs up to 8 percent better than the baseline nearest neighbor method, and is comparable to other previously proposed SSL methods.

Submitted 14 August, 2023; originally announced August 2023.

Comments: 15 pages, 5 figures

arXiv:2307.04149 [pdf, other]

Latent Graph Attention for Enhanced Spatial Context

Authors: Ayush Singh, Yash Bhambhu, Himanshu Buckchash, Deepak K. Gupta, Dilip K. Prasad

Abstract: Global contexts in images are quite valuable in image-to-image translation problems. Conventional attention-based and graph-based models capture the global context to a large extent; however, these are computationally expensive. Moreover, the existing approaches are limited to only learning the pairwise semantic relation between any two points on the image. In this paper, we present Latent Graph Attention (LGA), a computationally inexpensive (linear to the number of nodes) and stable, modular framework for incorporating the global context in the existing architectures, especially empowering small-scale architectures to give performance closer to large size architectures, thus making the light-weight architectures more useful for edge devices with lower compute power and lower energy needs. LGA propagates information spatially using a network of locally connected graphs, thereby facilitating the construction of a semantically coherent relation between any two spatially distant points that also takes into account the influence of the intermediate pixels. Moreover, the depth of the graph network can be used to adapt the extent of contextual spread to the target dataset, thereby being able to explicitly control the added computational cost. To enhance the learning mechanism of LGA, we also introduce a novel contrastive loss term that helps our LGA module to couple well with the original architecture at the expense of minimal additional computational load.
We show that incorporating LGA improves the performance on three challenging applications, namely transparent object segmentation, image restoration for dehazing and optical flow estimation.

Submitted 12 July, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

Comments: 20 pages, 7 figures

arXiv:2306.08263 [pdf, ps, other]

Semi-Invariant Rings: UFD and Codimension One Orbits

Authors: Charles Paquette, Deepanshu Prasad, David Wehlau

Abstract: Let $A$ be a finite dimensional associative $\mathbb{K}$-algebra over an algebraically closed field $\mathbb{K}$ of characteristic zero. To $A$, we can associate its basic form that is given by a quiver $Q = (Q_0, Q_1)$ with an admissible ideal $R$. For a dimension vector $β$, we consider an irreducible component $\mathcal{C}$ of the module variety of $β$-dimensional representations of $A$. The reductive group ${\rm GL}_β(\mathbb{K}):= \prod_{i \in Q_0}{\rm GL}_{β_i}(\mathbb{K})$ acts on $\mathcal{C}$ by change of basis, and has a unique closed orbit. We consider the corresponding ring of semi-invariants ${\rm SI}(Q, \mathcal{C})$. We prove that if $\mathcal{C}$ is factorial and has maximal orbits of codimension one, then ${\rm SI}(Q, \mathcal{C})$ is a complete intersection and is not multiplicity free. If $\mathcal{C}$ is not factorial, then this conclusion does not necessarily hold. We present examples showing that the codimension of the complete intersection can be arbitrarily large. Finally, we interpret our results in the case of hereditary algebras.

Submitted 14 June, 2023; originally announced June 2023.

arXiv:2306.05974 [pdf, other]

Taxonomy of hybridly polarized Stokes vortex beams

Authors: Gauri Arora, Ankit Butola, Ruchi Rajput, Rohit Agarwal, Krishna Agarwal, Alexander Horsch, Dilip K Prasad, Paramasivam Senthilkumaran

Abstract: Structured beams carrying topological defects, namely phase and Stokes singularities, have gained extensive interest in numerous areas of optics. The non-separable spin and orbital angular momentum states of hybridly polarized Stokes singular beams provide additional freedom for manipulating optical fields. However, the characterization of hybridly polarized Stokes vortex beams remains challenging owing to the degeneracy associated with the complex polarization structures of these beams. In addition, experimental noise factors such as relative phase, amplitude, and polarization difference together with beam fluctuations add to the perplexity in the identification process. Here, we present a generalized diffraction-based Stokes polarimetry approach assisted with deep learning for efficient identification of Stokes singular beams. A total of 15 classes of beams are considered based on the type of Stokes singularity and their associated mode indices. The resultant total and polarization component intensities of Stokes singular beams after diffraction through a triangular aperture are exploited by the deep neural network to recognize these beams. Our approach presents a classification accuracy of 98.67% for 15 types of Stokes singular beams that comprise several degenerate cases. The present study illustrates the potential of diffraction of the Stokes singular beam with polarization transformation, modeling of experimental noise factors, and a deep learning framework for characterizing hybridly polarized beams.

Submitted 9 June, 2023; originally announced June 2023.

arXiv:2304.04147 [pdf]

FedPNN: One-shot Federated Classification via Evolving Clustering Method and Probabilistic Neural Network hybrid

Authors: Polaki Durga Prasad, Yelleti Vivek, Vadlamani Ravi

Abstract: Protecting data privacy is paramount in fields such as finance, banking, and healthcare. Federated Learning (FL) has attracted widespread attention due to its decentralized, distributed training and its ability to protect privacy while obtaining a global shared model. However, FL presents challenges such as communication overhead and limited resource capability. This motivated us to propose a two-stage federated learning approach toward the objective of privacy protection, which is a first-of-its-kind study, as follows: (i) during the first stage, the synthetic dataset is generated by employing two different distributions as noise to the vanilla conditional tabular generative adversarial neural network (CTGAN), resulting in a modified CTGAN, and (ii) in the second stage, the Federated Probabilistic Neural Network (FedPNN) is developed and employed for building a globally shared classification model. We also employed synthetic dataset metrics to check the quality of the generated synthetic dataset. Further, we proposed a meta-clustering algorithm whereby the cluster centers obtained from the clients are clustered at the server for training the global model. Despite PNN being a one-pass learning classifier, its complexity depends on the training data size. Therefore, we employed a modified evolving clustering method (ECM), another one-pass algorithm, to cluster the training data, thereby increasing the speed further. Moreover, we conducted sensitivity analysis by varying Dthr, a hyperparameter of ECM, at the server and client, one at a time. The effectiveness of our approach is validated on four finance and medical datasets.

Submitted 8 April, 2023; originally announced April 2023.

Comments: 27 pages, 13 figures, 7 tables

MSC Class: 68T05; 68T07 ACM Class: I.2.11

arXiv:2303.05155 [pdf, other]

Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary Dropouts

Authors: Rohit Agarwal, Deepak Gupta, Alexander Horsch, Dilip K. Prasad

Abstract: Many real-world applications based on online learning produce streaming data that is haphazard in nature, i.e., contains missing features, features becoming obsolete in time, the appearance of new features at later points in time and a lack of clarity on the total number of input features. These challenges make it hard to build a learnable system for such applications, and almost no work exists in deep learning that addresses this issue. In this paper, we present Aux-Drop, an auxiliary dropout regularization strategy for online learning that handles the haphazard input features in an effective manner. Aux-Drop adapts the conventional dropout regularization scheme for the haphazard input feature space ensuring that the final output is minimally impacted by the chaotic appearance of such features. It helps to prevent the co-adaptation of especially the auxiliary and base features, as well as reduces the strong dependence of the output on any of the auxiliary inputs of the model. This helps in better learning for scenarios where certain features disappear in time or when new features are to be modelled. The efficacy of Aux-Drop has been demonstrated through extensive numerical experiments on SOTA benchmarking datasets that include Italy Power Demand, HIGGS, SUSY and multiple UCI datasets. The code is available at https://github.com/Rohit102497/Aux-Drop.

Submitted 31 May, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

Comments: Accepted at Transactions on Machine Learning Research (TMLR). Link: https://openreview.net/pdf?id=R9CgBkeZ6Z

Journal ref: Transactions on Machine Learning Research, 2023

arXiv:2303.03050 [pdf, other]

MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval

Authors: Rohit Agarwal, Gyanendra Das, Saksham Aggarwal, Alexander Horsch, Dilip K. Prasad

Abstract: Image retrieval has garnered growing interest in recent times. The current approaches are either supervised or self-supervised. These methods do not exploit the benefits of hybrid learning using both supervision and self-supervision. We present a novel Master Assistant Buddy Network (MABNet) for image retrieval which incorporates both learning mechanisms. MABNet consists of master and assistant blocks, both learning independently through supervision and collectively via self-supervision. The master guides the assistant by providing its knowledge base as a reference for self-supervision and the assistant reports its knowledge back to the master by weight transfer. We perform extensive experiments on public datasets with and without post-processing.

Submitted 6 March, 2023; originally announced March 2023.

Comments: Accepted at International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023

arXiv:2303.02095 [pdf, other]

Data-Efficient Training of CNNs and Transformers with Coresets: A Stability Perspective

Authors: Animesh Gupta, Irtiza Hasan, Dilip K. Prasad, Deepak K. Gupta

Abstract: Coreset selection is among the most effective ways to reduce the training time of CNNs; however, little is known about how the resultant models will behave under variations of the coreset size and the choice of datasets and models. Moreover, given the recent paradigm shift towards transformer-based models, it is still an open question how coreset selection would impact their performance. There are several similar intriguing questions that need to be answered for a wide acceptance of coreset selection methods, and this paper attempts to answer some of these. We present a systematic benchmarking setup and perform a rigorous comparison of different coreset selection methods on CNNs and transformers. Our investigation reveals that under certain circumstances, random selection of subsets is more robust and stable when compared with the SOTA selection methods. We demonstrate that the conventional concept of uniform subset sampling across the various classes of the data is not the appropriate choice. Rather, samples should be adaptively chosen based on the complexity of the data distribution for each class. Transformers are generally pretrained on large datasets, and we show that for certain target datasets, it helps to keep their performance stable at even very small coreset sizes. We further show that when no pretraining is done or when the pretrained transformer models are used with non-natural images (e.g. medical data), CNNs tend to generalize better than transformers at even very small coreset sizes. Lastly, we demonstrate that in the absence of the right pretraining, CNNs are better at learning the semantic coherence between spatially distant objects within an image, and these tend to outperform transformers at almost all choices of the coreset size.

Submitted 10 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

arXiv:2303.01546 [pdf, other]

MiShape: 3D Shape Modelling of Mitochondria in Microscopy

Authors: Abhinanda R. Punnakkal, Suyog S Jadhav, Alexander Horsch, Krishna Agarwal, Dilip K. Prasad

Abstract: Fluorescence microscopy is a quintessential tool for observing cells and understanding the underlying mechanisms of life-sustaining processes of all living organisms. The problem of extracting 3D shape of mitochondria from fluorescence microscopy images remains unsolved due to the complex and varied shapes expressed by mitochondria and the poor resolving capacity of these microscopes. We propose an approach to bridge this gap by learning a shape prior for mitochondria termed as MiShape, by leveraging high-resolution electron microscopy data. MiShape is a generative model learned using implicit representations of mitochondrial shapes. It provides a shape distribution that can be used to generate infinite realistic mitochondrial shapes. We demonstrate the representation power of MiShape and its utility for 3D shape reconstruction given a single 2D fluorescence image or a small 3D stack of 2D slices. We also showcase applications of our method by deriving simulated fluorescence microscope datasets that have realistic 3D ground truths for the problem of 2D segmentation and microscope-to-microscope transformation.

Submitted 2 March, 2023; originally announced March 2023.

arXiv:2302.03492 [pdf, ps, other]

Homological aspects of branching laws

Authors: Dipendra Prasad

Abstract: In this mostly expository article, we consider certain homological aspects of branching laws for representations of a group restricted to its subgroups in the context of $p$-adic groups. We follow our earlier paper, ICM 2018 proceedings, updating it with some more recent works. In particular, following Chan and Chan-Savin, see many of their papers listed in the bibliography, we have emphasized in this work that the restriction of a (generic) representation $π$ of a group $G$ to a closed subgroup $H$ (most of the paper is written in the context of GGP) turns out to be a projective representation on most Bernstein blocks of the category of smooth representations of $H$. Further, once $π|_H$ is a projective module in a particular Bernstein block, it has a simple structure.

Submitted 5 February, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

Comments: Revised version

MSC Class: 11F70; 22E55

arXiv:2301.13817 [pdf, other]

Patch Gradient Descent: Training Neural Networks on Very Large Images

Authors: Deepak K. Gupta, Gowreesh Mago, Arnav Chavan, Dilip K. Prasad

Abstract: Traditional CNN models are trained and tested on relatively low resolution images (<300 px), and cannot be directly operated on large-scale images due to compute and memory constraints. We propose Patch Gradient Descent (PatchGD), an effective learning strategy that allows existing CNN architectures to be trained on large-scale images in an end-to-end manner. PatchGD is based on the hypothesis that instead of performing gradient-based updates on an entire image at once, it should be possible to achieve a good solution by performing model updates on only small parts of the image at a time, ensuring that the majority of it is covered over the course of iterations. PatchGD thus enjoys better memory and compute efficiency when training models on large-scale images. PatchGD is thoroughly evaluated on two datasets - PANDA and UltraMNIST with ResNet50 and MobileNetV2 models under different memory constraints. Our evaluation clearly shows that PatchGD is much more stable and efficient than the standard gradient-descent method in handling large images, especially when the compute memory is limited.

Submitted 31 January, 2023; originally announced January 2023.

arXiv:2301.05691 [pdf]

Structural phase transitions in perovskite BaCeO3 with data mining and first-principles theoretical calculations

Authors: Farha Naaz, Manendra S. Chauhan, Kedar Yadav, Surender Singh, Ashok Kumar, Dasari L. V. K. Prasad

Abstract: Several experiments conducted over decades have revealed that the perovskite-structured BaCeO3 goes through a series of temperature-induced structural phase transitions. However, it has been frequently observed that the number of phases and the sequence in which they appear as a function of temperature differ between experiments. Insofar as neutron diffraction and Raman spectroscopy experiments are concerned, four structures are well characterized with three transitions: Pnma to Imma [563 K] to R-3c [673 K] to Pm-3m [1173 K]. In contrast, thermoanalytical methods showed multiple singularities corresponding to at least three more structural transitions at around 830 K, 900 K, and 1030 K. On account of these conflicting experimental findings, we computed the free energy phase diagram for BaCeO3 employing crystal structure data mining in conjunction with first-principles electronic structure and phonon lattice dynamics. A total of 34 polymorphs have been predicted, the most stable of which follows the Glazer classification of the perovskite tilt system. It has been predicted that the Cmcm and P4/mbm phases surpass Pnma at 666 K and 1210 K, respectively. At any temperature, two alternate tetragonal phases (P42/nmc and I4/mcm) are also found to be 20 to 30 meV less favored than the Pnma. While the calculated stability order of the predicted polymorphs is in acceptable agreement with the results of neutron diffraction, the transitions observed in thermoanalytical studies could be ascribed to the development of four novel phases (Cmcm, P4/mbm, P42/nmc, and I4/mcm) at intermediate temperatures. However, our analysis indicates that the R-3c phase is predominantly stabilized over a broad temperature field, masking all subsequent phases up until the cubic Pm-3m. Consequently, the novel phases predicted to occur in thermoanalytical studies are only fleetingly metastable.

Submitted 30 November, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

Comments: 20 pages, 5 figures, 1 table

arXiv:2211.13769 [pdf, other]

On Designing Light-Weight Object Trackers through Network Pruning: Use CNNs or Transformers?

Authors: Saksham Aggarwal, Taneesh Gupta, Pawan Kumar Sahu, Arnav Chavan, Rishabh Tiwari, Dilip K. Prasad, Deepak K. Gupta

Abstract: Object trackers deployed on low-power devices need to be light-weight; however, most of the current state-of-the-art (SOTA) methods rely on using compute-heavy backbones built using CNNs or transformers. Large sizes of such models do not allow their deployment in low-power conditions, and designing compressed variants of large tracking models is of great importance. This paper demonstrates how highly compressed light-weight object trackers can be designed using neural architectural pruning of large CNN and transformer based trackers. Further, a comparative study on architectural choices best suited to design light-weight trackers is provided. A comparison between SOTA trackers using CNNs, transformers as well as the combination of the two is presented to study their stability at various compression ratios. Finally, results for extreme pruning scenarios going as low as 1% in some cases are shown to study the limits of network pruning in object tracking. This work provides deeper insights into designing highly efficient trackers from existing SOTA methods.

Submitted 26 March, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

Comments: Accepted at IEEE ICASSP 2023

arXiv:2211.06739 [pdf, other]

Partial Binarization of Neural Networks for Budget-Aware Efficient Learning

Authors: Udbhav Bamba, Neeraj Anand, Saksham Aggarwal, Dilip K. Prasad, Deepak K. Gupta

Abstract: Binarization is a powerful compression technique for neural networks, significantly reducing FLOPs, but often results in a significant drop in model performance. To address this issue, partial binarization techniques have been developed, but a systematic approach to mixing binary and full-precision parameters in a single network is still lacking. In this paper, we propose a controlled approach to partial binarization, creating a budgeted binary neural network (B2NN) with our MixBin strategy. This method optimizes the mixing of binary and full-precision components, allowing for explicit selection of the fraction of the network to remain binary. Our experiments show that B2NNs created using MixBin outperform those from random or iterative searches and state-of-the-art layer selection methods by up to 3% on the ImageNet-1K dataset. We also show that B2NNs outperform the structured pruning baseline by approximately 23% at the extreme FLOP budget of 15%, and perform well in object tracking, with up to a 12.4% relative improvement over other baselines. Additionally, we demonstrate that B2NNs developed by MixBin can be transferred across datasets, with some cases showing improved performance over directly applying MixBin on the downstream data.

Submitted 8 November, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

Comments: Accepted at WACV 2023 Conference

arXiv:2206.12681 [pdf, other]

UltraMNIST Classification: A Benchmark to Train CNNs for Very Large Images

Authors: Deepak K. Gupta, Udbhav Bamba, Abhishek Thakur, Akash Gupta, Suraj Sharan, Ertugrul Demir, Dilip K. Prasad

Abstract: Convolutional neural network (CNN) approaches available in the current literature are designed to work primarily with low-resolution images. When applied on very large images, challenges related to GPU memory, smaller receptive field than needed for semantic correspondence and the need to incorporate multi-scale features arise. The resolution of input images can be reduced, however, with significant loss of critical information. Based on the outlined issues, we introduce a novel research problem of training CNN models for very large images, and present 'UltraMNIST dataset', a simple yet representative benchmark dataset for this task. UltraMNIST has been designed using the popular MNIST digits with additional levels of complexity added to replicate well the challenges of real-world problems. We present two variants of the problem: 'UltraMNIST classification' and 'Budget-aware UltraMNIST classification'. The standard UltraMNIST classification benchmark is intended to facilitate the development of novel CNN training methods that make the effective use of the best available GPU resources. The budget-aware variant is intended to promote development of methods that work under constrained GPU memory. For the development of competitive solutions, we present several baseline models for the standard benchmark and its budget-aware variant. We study the effect of reducing resolution on the performance and present results for baseline models involving pretrained backbones from among the popular state-of-the-art models. Finally, with the presented benchmark dataset and the baselines, we hope to pave the ground for a new generation of CNN methods suitable for handling large images in an efficient and resource-light manner.

Submitted 25 June, 2022; originally announced June 2022.

arXiv:2204.10108 [pdf, ps, other]

Twisted GGP Problems and Conjectures

Authors: Wee Teck Gan, Benedict H. Gross, Dipendra Prasad

Abstract: In an earlier work, we considered a family of restriction problems for classical groups (over local and global fields) and proposed precise answers to these problems using the local and global Langlands correspondence. These restriction problems were formulated in terms of a pair $W \subset V$ of orthogonal, Hermitian, symplectic, or skew-Hermitian spaces. In this paper, we consider a twisted variant of these conjectures in one particular case -- that of a pair of skew-Hermitian spaces $W = V$.

Submitted 24 April, 2023; v1 submitted 21 April, 2022; originally announced April 2022.

Comments: Revised and final version; to appear in Compositio Mathematica

MSC Class: Primary 11F70; Secondary 22E55

arXiv:2203.16973 [pdf, other]

Analyzing the factors affecting usefulness of Self-Supervised Pre-trained Representations for Speech Recognition

Authors: Ashish Seth, Lodagala V S V Durga Prasad, Sreyan Ghosh, S. Umesh

Abstract: Self-supervised learning (SSL) to learn high-level speech representations has been a popular approach to building Automatic Speech Recognition (ASR) systems in low-resource settings. However, the common assumption made in the literature is that a considerable amount of unlabeled data is available for the same domain or language that can be leveraged for SSL pre-training, which we acknowledge is not feasible in a real-world setting. In this paper, as part of the Interspeech Gram Vaani ASR challenge, we try to study the effect of domain, language, dataset size, and other aspects of our upstream pre-training SSL data on the final performance of the low-resource downstream ASR task. We also build on the continued pre-training paradigm to study the effect of prior knowledge possessed by models trained using SSL. Extensive experiments and studies reveal that the performance of ASR systems is susceptible to the data used for SSL pre-training. Their performance improves with an increase in similarity and volume of pre-training data. We believe our work will be helpful to the speech community in building better ASR systems in low-resource settings and steer research towards improving generalization in SSL-based pre-training for speech systems.

Submitted 17 May, 2023; v1 submitted 31 March, 2022; originally announced March 2022.

arXiv:2203.16965 [pdf, other]

PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations

Authors: Lodagala V S V Durga Prasad, Sreyan Ghosh, S. Umesh

Abstract: While self-supervised speech representation learning (SSL) models serve a variety of downstream tasks, these models have been observed to overfit to the domain from which the unlabelled data originates. To alleviate this issue, we propose PADA (Pruning Assisted Domain Adaptation) and zero out redundant weights from models pre-trained on large amounts of out-of-domain (OOD) data. Intuitively, this helps to make space for the target-domain ASR finetuning. The redundant weights can be identified through various pruning strategies which have been discussed in detail as a part of this work. Specifically, we investigate the effect of the recently discovered Task-Agnostic and Task-Aware pruning on PADA and propose a new pruning paradigm based on the latter, which we call Cross-Domain Task-Aware Pruning (CD-TAW). CD-TAW obtains the initial pruning mask from a well fine-tuned OOD model, which makes it starkly different from the rest of the pruning strategies discussed in the paper. Our proposed CD-TAW methodology achieves up to 20.6% relative WER improvement over our baseline when fine-tuned on a 2-hour subset of Switchboard data without language model (LM) decoding. Furthermore, we conduct a detailed analysis to highlight the key design choices of our proposed method.

Submitted 13 May, 2023; v1 submitted 31 March, 2022; originally announced March 2022.

Comments: Accepted to IEEE SLT 2022

arXiv:2202.01062 [pdf, ps, other]

Primes dividing values of a given Polynomial

Authors: Devendra Prasad

Abstract: Let $P(x) \in \mathbb{Z}[x]$ be a polynomial. We give an easy and new proof of the fact that the set of primes $p$ such that $p \mid P(n)$, for some $n \in \mathbb{Z}$, is infinite. We also obtain an analog of this result for some special domains.

Submitted 1 January, 2022; originally announced February 2022.

Comments: accepted for publication in The Mathematics Student

arXiv:2111.09109 [pdf, other]

Physics-guided Loss Functions Improve Deep Learning Performance in Inverse Scattering

Authors: Zicheng Liu, Mayank Roy, Dilip K. Prasad, Krishna Agarwal

Abstract: Solving electromagnetic inverse scattering problems (ISPs) is challenging due to the intrinsic nonlinearity, ill-posedness, and expensive computational cost. Recently, deep neural network (DNN) techniques have been successfully applied on ISPs and shown potential of superior imaging over conventional methods. In this paper, we analyse the analogy between DNN solvers and traditional iterative algorithms and discuss how important physical phenomena cannot be effectively incorporated in the training process. We show the importance of including near-field priors in the learning process of DNNs. To this end, we propose new designs of loss functions which incorporate multiple-scattering based near-field quantities (such as scattered fields or induced currents within domain of interest). Effects of physics-guided loss functions are studied using a variety of numerical experiments. Pros and cons of the investigated ISP solvers with different loss functions are summarized.

Submitted 13 November, 2021; originally announced November 2021.

arXiv:2110.15382 [pdf, other]

doi 10.3847/1538-4357/ac69ee

Atmospheric Circulation in Simulations of the AGN-CGM Connection at Halo Masses $\sim 10^{13.5} \, M_\odot$

Authors: Deovrat Prasad, G. Mark Voit, Brian W. O'Shea

Abstract: Coupling between active galactic nuclei (AGN) and the circumgalactic medium (CGM) is critical to the interplay between radiative cooling and feedback heating in the atmospheres of the universe's most massive galaxies. This paper presents a detailed analysis of numerical simulations showing how kinetic AGN feedback with a strong momentum flux interacts with the CGM. Our analysis shows that large scale CGM circulation plays an important role in reconfiguring the galactic atmosphere and regulating the atmosphere's central entropy level. We find that most of the AGN energy output goes into lifting of circumgalactic gas rather than heating of atmospheric gas within the galaxy, consequently reconfiguring the circumgalactic medium (CGM) in our simulations. Large scale (10s of kpc) circulation of the CGM on ~ 10-100 kpc scales therefore plays a critical role in preventing over-cooling of gas in these simulated galaxies. The simulations also show that our choices of accretion efficiency and jet opening angle significantly affect the AGN-CGM coupling. Reducing the jet opening angle to a quarter of the fiducial opening angle increases the jet momentum flux, enabling it to drill through to larger radii without effectively coupling with the CGM at the center ($r < 5$ kpc). Outflows with a lower momentum flux decelerate and thermalize the bulk of their energy at smaller radii ($r \lesssim 10$ kpc).

Submitted 2 May, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

Comments: 26 pages, 17 figures, submitted to ApJ

arXiv:2110.01532 [pdf, other]

Differentiable Spline Approximations

Authors: Minsu Cho, Aditya Balu, Ameya Joshi, Anjana Deva Prasad, Biswajit Khara, Soumik Sarkar, Baskar Ganapathysubramanian, Adarsh Krishnamurthy, Chinmay Hegde

Abstract: The paradigm of differentiable programming has significantly enhanced the scope of machine learning via the judicious use of gradient-based optimization. However, standard differentiable programming methods (such as autodiff) typically require that the machine learning models be differentiable, limiting their applicability. Our goal in this paper is to use a new, principled approach to extend gradient-based optimization to functions well modeled by splines, which encompass a large family of piecewise polynomial models. We derive the form of the (weak) Jacobian of such functions and show that it exhibits a block-sparse structure that can be computed implicitly and efficiently. Overall, we show that leveraging this redesigned Jacobian in the form of a differentiable "layer" in predictive models leads to improved performance in diverse applications such as image segmentation, 3D point cloud reconstruction, and finite element analysis.

Submitted 4 October, 2021; originally announced October 2021.

Comments: 9 pages, accepted in NeurIPS 2021

arXiv:2109.12867 [pdf, ps, other]

Bhargava factorials and irreducibility of integer-valued polynomials

Authors: Devendra Prasad

Abstract: The ring of integer-valued polynomials over a given subset $S$ of $\mathbb{Z}$ (or $\mathrm{Int}(S,\mathbb{Z})$) is defined as the set of polynomials in $\mathbb{Q}[x]$ which map $S$ to $\mathbb{Z}$. In factorization theory, it is crucial to check the irreducibility of a polynomial. In this article, we make Bhargava factorials our main tool to check the irreducibility of a given polynomial $f \in \mathrm{Int}(S,\mathbb{Z})$. We also generalize our results to arbitrary subsets of a Dedekind domain.

Submitted 27 September, 2021; originally announced September 2021.

Comments: Accepted for publication in Rocky Mountain Journal of Mathematics, 2021

arXiv:2108.07458 [pdf, ps, other]

Irreducibility of integer-valued polynomials in several variables

Authors: Devendra Prasad

Abstract: Let $S$ be an arbitrary subset of $R^n$ where $R$ is a domain with the field of fractions $\mathbb{K}$. Denote the ring of polynomials in $n$ variables over $\mathbb{K}$ by $\mathbb{K}[\mathbf{x}]$. The ring of integer-valued polynomials over $S$, denoted by $\mathrm{Int}(S,R)$, is defined as the set of the polynomials of $\mathbb{K}[\mathbf{x}]$ which map $S$ to $R$. In this article, we study the irreducibility of the polynomials of $\mathrm{Int}(S,R)$ for the first time in the case when $R$ is a Unique Factorization Domain. We also show that our results remain valid when $R$ is a Dedekind domain or sometimes any domain.

Submitted 17 August, 2021; originally announced August 2021.

Comments: Periodica Mathematica Hungarica, 2021

arXiv:2106.01400 [pdf, other]

Dual Script E2E framework for Multilingual and Code-Switching ASR

Authors: Mari Ganesh Kumar, Jom Kuriakose, Anand Thyagachandran, Arun Kumar A, Ashish Seth, Lodagala Durga Prasad, Saish Jaiswal, Anusha Prakash, Hema Murthy

Abstract: India is home to multiple languages, and training automatic speech recognition (ASR) systems for languages is challenging. Over time, each language has adopted words from other languages, such as English, leading to code-mixing. Most Indian languages also have their own unique scripts, which poses a major limitation in training multilingual and code-switching ASR systems. Inspired by results in text-to-speech synthesis, in this work, we use an in-house rule-based phoneme-level common label set (CLS) representation to train multilingual and code-switching ASR for Indian languages. We propose two end-to-end (E2E) ASR systems. In the first system, the E2E model is trained on the CLS representation, and we use a novel data-driven back-end to recover the native language script. In the second system, we propose a modification to the E2E model, wherein the CLS representation and the native language characters are used simultaneously for training. We show our results on the multilingual and code-switching tasks of the Indic ASR Challenge 2021. Our best results achieve 6% and 5% improvement (approx) in word error rate over the baseline system for the multilingual and code-switching tasks, respectively, on the challenge development data.

Submitted 2 June, 2021; originally announced June 2021.

Comments: Accepted for publication at Interspeech 2021

arXiv:2106.00437 [pdf, ps, other]

Homological duality for covering groups of reductive $p$-adic groups

Authors: Dragos Fratila, Dipendra Prasad

Abstract: In this largely expository paper we extend properties of the homological duality functor $RHom_{\mathcal H}(-,{\mathcal H})$ where ${\mathcal H}$ is the Hecke algebra of a reductive $p$-adic group, to the case where it is the Hecke algebra of a finite central extension of a reductive $p$-adic group. The most important properties being that $RHom_{\mathcal H}(-,{\mathcal H})$ is concentrated in a single degree for irreducible representations and that it gives rise to Schneider--Stuhler duality for Ext groups (a Serre functor like property). Along the way we also study Grothendieck--Serre duality with respect to the Bernstein center and provide a proof of the folklore result that on admissible modules this functor is nothing but the contragredient duality. We single out a necessary and sufficient condition for when these three dualities agree on finite length modules in a given block. In particular, we show this is the case for all cuspidal blocks as well as, due to a result of Roche, on all blocks with trivial stabilizer in the relative Weyl group.

Submitted 4 August, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

Comments: To appear in Pure Appl. Math. Q. in a volume in honor of Benedict Gross

MSC Class: 22E50; 22E45

arXiv:2104.14547 [pdf, other]

NURBS-Diff: A Differentiable Programming Module for NURBS

Authors: Anjana Deva Prasad, Aditya Balu, Harshil Shah, Soumik Sarkar, Chinmay Hegde, Adarsh Krishnamurthy

Abstract: Boundary representations (B-reps) using Non-Uniform Rational B-splines (NURBS) are the de facto standard used in CAD, but their utility in deep learning-based approaches is not well researched. We propose a differentiable NURBS module to integrate NURBS representations of CAD models with deep learning methods. We mathematically define the derivatives of the NURBS curves or surfaces with respect to the input parameters (control points, weights, and the knot vector). These derivatives are used to define an approximate Jacobian used for performing the "backward" evaluation to train the deep learning models. We have implemented our NURBS module using GPU-accelerated algorithms and integrated it with PyTorch, a popular deep learning framework. We demonstrate the efficacy of our NURBS module in performing CAD operations such as curve or surface fitting and surface offsetting. Further, we show its utility in deep learning for unsupervised point cloud reconstruction and enforce analysis constraints. These examples show that our module performs better for certain deep learning frameworks and can be directly integrated with any deep-learning framework requiring NURBS.

Submitted 13 January, 2022; v1 submitted 29 April, 2021; originally announced April 2021.

arXiv:2009.02617 [pdf, other]

doi 10.1364/BOE.410617

Artefact removal in ground truth and noise model deficient sub-cellular nanoscopy images using auto-encoder deep learning

Authors: Suyog Jadhav, Sebastian Acuña, Krishna Agarwal, Dilip K. Prasad

Abstract: Image denoising or artefact removal using deep learning is possible in the availability of supervised training dataset acquired in real experiments or synthesized using known noise models. Neither of the conditions can be fulfilled for nanoscopy (super-resolution optical microscopy) images that are generated from microscopy videos through statistical analysis techniques. Due to several physical constraints, supervised dataset cannot be measured. Due to non-linear spatio-temporal mixing of data and valuable statistics of fluctuations from fluorescent molecules which compete with noise statistics, noise or artefact models in nanoscopy images cannot be explicitly learnt. Therefore, such problem poses unprecedented challenges to deep learning. Here, we propose a robust and versatile simulation-supervised training approach of deep learning auto-encoder architectures for the highly challenging nanoscopy images of sub-cellular structures inside biological samples. We show the proof of concept for one nanoscopy method and investigate the scope of generalizability across structures, noise models, and nanoscopy algorithms not included during simulation-supervised training. We also investigate a variety of loss functions and learning models and discuss the limitation of existing performance metrics for nanoscopy images. We generate valuable insights for this highly challenging and unsolved problem in nanoscopy, and set the foundation for application of deep learning problems in nanoscopy for life sciences.

Submitted 5 September, 2020; originally announced September 2020.

Comments: 22 pages, 13 figures

arXiv:2009.00344 [pdf, ps, other]

doi 10.1080/00927872.2020.1823990

Irreducibility of integer-valued polynomials I

Authors: Devendra Prasad

Abstract: Let $S \subset R$ be an arbitrary subset of a unique factorization domain $R$ and $\mathbb{K}$ be the field of fractions of $R$. The ring of integer-valued polynomials over $S$ is the set $\mathrm{Int}(S,R)= \{ f \in \mathbb{K}[x]: f(a) \in R\ \forall\ a \in S \}.$ This article is an effort to study the irreducibility of integer-valued polynomials over arbitrary subsets of a unique factorization domain. We give a method to construct special kinds of sequences, which we call $d$-sequences. We then use these sequences to obtain a criterion for the irreducibility of the polynomials in $\mathrm{Int}(S,R).$ In some special cases, we explicitly construct these sequences and use them to check the irreducibility of some polynomials in $\mathrm{Int}(S,R).$ At the end, we suggest a generalization of our results to an arbitrary subset of a Dedekind domain.

Submitted 1 September, 2020; originally announced September 2020.

Comments: Accepted for publication in Communications in Algebra

Journal ref: Communications in Algebra 2021

arXiv:2008.12617 [pdf, other]

Simulation-supervised deep learning for analysing organelles states and behaviour in living cells

Authors: Arif Ahmed Sekh, Ida S. Opstad, Rohit Agarwal, Asa Birna Birgisdottir, Truls Myrmel, Balpreet Singh Ahluwalia, Krishna Agarwal, Dilip K. Prasad

Abstract: In many real-world scientific problems, generating ground truth (GT) for supervised learning is almost impossible. The causes include limitations imposed by scientific instrument, physical phenomenon itself, or the complexity of modeling. Performing artificial intelligence (AI) tasks such as segmentation, tracking, and analytics of small sub-cellular structures such as mitochondria in microscopy videos of living cells is a prime example. The 3D blurring function of microscope, digital resolution from pixel size, optical resolution due to the character of light, noise characteristics, and complex 3D deformable shapes of mitochondria, all contribute to making this problem GT hard. Manual segmentation of 100s of mitochondria across 1000s of frames and then across many such videos is not only herculean but also physically inaccurate because of the instrument and phenomena imposed limitations. Unsupervised learning produces less than optimal results and accuracy is important if inferences relevant to therapy are to be derived. In order to solve this insurmountable problem, we bring modeling and deep learning to a nexus. We show that accurate physics based modeling of microscopy data including all its limitations can be the solution for generating simulated training datasets for supervised learning. We show here that our simulation-supervised segmentation approach is a great enabler for studying mitochondrial states and behaviour in heart muscle cells, where mitochondria have a significant role to play in the health of the cells.
We report unprecedented mean IoU score of 91% for binary segmentation (19% better than the best performing unsupervised approach) of mitochondria in actual microscopy videos of living cells. We further demonstrate the possibility of performing multi-class classification, tracking, and morphology associated analytics at the scale of individual mitochondrion.

Submitted 26 August, 2020; originally announced August 2020.

Comments: under review at NIPS 2020

arXiv:2008.11828 [pdf, other]

Auxiliary Network: Scalable and agile online learning for dynamic system with inconsistently available inputs

Authors: Rohit Agarwal, Arif Ahmed Sekh, Krishna Agarwal, Dilip K. Prasad

Abstract: Streaming classification methods assume the number of input features is fixed and always received. But in many real-world scenarios, some input features are reliable while others are unreliable or inconsistent. In this paper, we propose a novel deep learning-based model called Auxiliary Network (Aux-Net), which is scalable and agile. It employs a weighted ensemble of classifiers to give a final outcome. The Aux-Net model is based on the hedging algorithm and online gradient descent. It employs a model of varying depth in an online setting using single pass learning. Aux-Net is a foundational work towards a scalable neural network model for a dynamic complex environment requiring ad hoc or inconsistent input data. The efficacy of Aux-Net is shown on a public dataset.

Submitted 26 August, 2020; originally announced August 2020.

Comments: under review at NIPS 2020

arXiv:2008.06713 [pdf, other]

Single image dehazing for a variety of haze scenarios using back projected pyramid network

Authors: Ayush Singh, Ajay Bhave, Dilip K. Prasad

Abstract: Learning to dehaze single hazy images, especially using a small training dataset, is quite challenging. We propose a novel generative adversarial network architecture for this problem, namely back projected pyramid network (BPPNet), that gives good performance for a variety of challenging haze conditions, including dense haze and inhomogeneous haze. Our architecture incorporates learning of multiple levels of complexities while retaining spatial context through iterative blocks of UNets and structural information of multiple scales through a novel pyramidal convolution block. These blocks together form the generator and are amenable to learning through back projection. We have shown that our network can be trained without over-fitting using as few as 20 image pairs of hazy and non-hazy images. We report the state of the art performances on NTIRE 2018 homogeneous haze datasets for indoor and outdoor images, NTIRE 2019 denseHaze dataset, and NTIRE 2020 non-homogeneous haze dataset.

Submitted 15 August, 2020; originally announced August 2020.

Comments: 16 pages, 8 figures, to be published in Computer Vision ECCV 2020 Workshops

arXiv:2007.14639 [pdf, ps, other]

Relations between cusp forms sharing Hecke eigenvalues

Authors: Dipendra Prasad, Ravi Raghunathan

Abstract: In this paper we consider the question of when the set of Hecke eigenvalues of a cusp form on $GL_n(A_F)$ is contained in the set of Hecke eigenvalues of a cusp form on $GL_m(A_F)$ for $n \leq m$. This question is closely related to a question about finite dimensional representations of an abstract group, which we also consider in this work.

Submitted 4 August, 2022; v1 submitted 29 July, 2020; originally announced July 2020.

Comments: To appear in the Electronic J. of the AMS, Representation Theory

MSC Class: 11F70; 22E55

arXiv:2007.02397 [pdf]

doi 10.1364/OE.402666

High space-bandwidth in quantitative phase imaging using partially spatially coherent optical coherence microscopy and deep neural network

Authors: Ankit Butola, Sheetal Raosaheb Kanade, Sunil Bhatt, Vishesh Kumar Dubey, Anand Kumar, Azeem Ahmad, Dilip K Prasad, Paramasivam Senthilkumaran, Balpreet Singh Ahluwalia, Dalip Singh Mehta

Abstract: Quantitative phase microscopy (QPM) is a label-free technique that enables monitoring of morphological changes at the subcellular level. The performance of the QPM system in terms of spatial sensitivity and resolution depends on the coherence properties of the light source and the numerical aperture (NA) of objective lenses. Here, we propose high space-bandwidth QPM using partially spatially coherent optical coherence microscopy (PSC-OCM) assisted with a deep neural network. The PSC source is synthesized to improve the spatial sensitivity of the reconstructed phase map from the interferometric images. Further, a compatible generative adversarial network (GAN) is used and trained with paired low-resolution (LR) and high-resolution (HR) datasets acquired from the PSC-OCM system. The training of the network is performed on two different types of samples, i.e. mostly homogenous human red blood cells (RBC) and highly heterogenous macrophages. The performance is evaluated by predicting the HR images from the datasets captured with a low NA lens and comparing them with the actual HR phase images. An improvement of 9 times in space-bandwidth product is demonstrated for both RBC and macrophages datasets. We believe that the PSC-OCM+GAN approach would be applicable in single-shot label free tissue imaging, disease classification and other high-resolution tomography applications by utilizing the longitudinal spatial coherence properties of the light source.

Submitted 5 July, 2020; originally announced July 2020.

arXiv:2006.10809 [pdf, other]

doi 10.3847/1538-4357/abc33c

Environmental Dependence of Self-Regulating Black-hole Feedback in Massive Galaxies

Authors: Deovrat Prasad, G. Mark Voit, Brian W. O'Shea, Forrest Glines

Abstract: In the universe's most massive galaxies, kinetic feedback from a central supermassive black hole appears to limit star formation. Abundant circumstantial evidence suggests that accumulation of cold gas near the central black hole strongly boosts the feedback output, keeping the ambient medium in a state marginally unstable to condensation and formation of cold gas clouds. However, the ability of that mechanism to self-regulate may depend on numerous environmental factors, including the depth of the potential well and the pressure of the surrounding circumgalactic medium (CGM). Here we present a suite of numerical simulations that explores the dependence of cold-fuelled bipolar kinetic feedback on those environmental factors. Halo mass in this simulation suite ranges from $2 \times 10^{12} \, M_\odot$ to $8 \times 10^{14} \, M_\odot$. We include the spatially extended mass and energy input from the massive galaxy's old stellar population, which is capable of sweeping gas out of the galaxy and away from the central black hole if the confining CGM pressure is sufficiently low. Our simulations show that this feedback mechanism is tightly self-regulating in a massive galaxy with a deep central potential and low CGM pressure, permitting only small amounts of multiphase gas to accumulate and allowing almost no star formation.
In a massive galaxy of similar mass but a shallower central potential and greater CGM pressure the same feedback mechanism is more episodic, producing extended multiphase gas and occasionally allowing small rates of star formation ($\sim 0.1 \, M_\odot \, {\rm yr}^{-1}$). At the low-mass end of the explored range the mechanism becomes implausibly explosive, perhaps because the ambient gas initially has no angular momentum, which would have reduced the amount of condensed gas capable of fueling feedback.

Submitted 23 November, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

Comments: 17 pages, 9 figures, Accepted for publication in ApJ

arXiv:2006.09381 [pdf, other]

doi 10.3847/1538-4357/aba42e

A Black-Hole Feedback Valve in Massive Galaxies

Authors: G. M. Voit, G. L. Bryan, D. Prasad, R. Frisbie, Y. Li, M. Donahue, B. W. O'Shea, M. Sun, N. Werner

Abstract: Star formation in the universe's most massive galaxies proceeds furiously early in time but then nearly ceases. Plenty of hot gas remains available but does not cool and condense into star-forming clouds. Active galactic nuclei (AGN) release enough energy to inhibit cooling of the hot gas, but energetic arguments alone do not explain why quenching of star formation is most effective in high-mass galaxies. In fact, optical observations show that quenching is more closely related to a galaxy's central stellar velocity dispersion ($\sigma_v$) than to any other characteristic. Here, we show that high $\sigma_v$ is critical to quenching because a deep central potential well maximizes the efficacy of AGN feedback. In order to remain quenched, a galaxy must continually sweep out the gas ejected from its aging stars. Supernova heating can accomplish this task as long as the AGN sufficiently reduces the gas pressure of the surrounding circumgalactic medium (CGM). We find that CGM pressure acts as the control knob on a valve that regulates AGN feedback and suggest that feedback power self-adjusts so that it suffices to lift the CGM out of the galaxy's potential well. Supernova heating then drives a galactic outflow that remains homogeneous if $\sigma_v \gtrsim 240 \, {\rm km \, s^{-1}}$. AGN feedback can effectively quench galaxies with a comparable velocity dispersion, but feedback in galaxies with a much lower velocity dispersion tends to result in convective circulation and accumulation of multiphase gas within the galaxy.

Submitted 23 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

Comments: 29 pages, 8 figures, published in ApJ

arXiv:2005.01770 [pdf]

HOG, LBP and SVM based Traffic Density Estimation at Intersection

Authors: Devashish Prasad, Kshitij Kapadni, Ayan Gadpal, Manish Visave, Kavita Sultanpure

Abstract: The increasing amount of vehicular traffic on roads is a significant issue. Heavy traffic causes congestion, unwanted delays, pollution, monetary loss, health issues, accidents, impeded emergency vehicle passage, and traffic violations, all of which reduce productivity. During peak hours, these issues become even worse. Traditional traffic management and control systems fail to tackle this problem. Currently, traffic lights at intersections are not adaptive and have fixed time delays. An optimized and sensible control system is needed to improve the efficiency of traffic flow. Smart traffic systems estimate traffic density and adjust the traffic lights according to the quantity of traffic. We propose an efficient way to estimate traffic density at intersections in real time using image processing and machine learning techniques. The proposed methodology takes pictures of traffic at a junction to estimate the traffic density. We use an approach based on Histogram of Oriented Gradients (HOG), Local Binary Patterns (LBP), and Support Vector Machine (SVM) for traffic density estimation. The approach is computationally inexpensive and can run efficiently on a Raspberry Pi board. Code is released at https://github.com/DevashishPrasad/Smart-Traffic-Junction.

Submitted 4 May, 2020; originally announced May 2020.

Comments: paper accepted at IEEE PuneCon 2019

Showing 1–50 of 127 results for author: Prasad, D