Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks

Authors: Orchid Chetia Phukan, Devyani Koshal, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma

Abstract: Speech forensic tasks (SFTs), such as automatic speaker recognition (ASR), speech emotion recognition (SER), gender recognition (GR), and age estimation (AE), find use in different security and biometric applications. Previous works have applied various techniques, with recent studies focusing on applying speech foundation models (SFMs) for improved performance. However, most prior efforts have centered on building individual models for each task separately, despite the inherent similarities among these tasks. This isolated approach results in higher computational resource requirements, increased costs, time consumption, and maintenance challenges. In this study, we address these challenges by employing a multi-task learning strategy. Firstly, we explore the various state-of-the-art (SOTA) SFMs by extracting their representations for learning these SFTs and investigating their effectiveness at each task specifically. Secondly, we analyze the performance of the extracted representations on the SFTs in a multi-task learning framework. We observe a decline in performance when SFTs are modeled together compared to individual task-specific models, and as a remedy, we propose multi-view learning (MVL). Views are representations from different SFMs transformed into distinct abstract spaces by characteristics unique to each SFM. By leveraging MVL, we integrate these diverse representations to capture complementary information across tasks, enhancing the shared learning process.
We introduce a new framework called TANGO (Task Alignment with iNter-view Gated Optimal transport) to implement this approach. With TANGO, we achieve the topmost performance in comparison to individual SFM representations as well as baseline fusion techniques across benchmark datasets such as CREMA-D, emo-DB, and BAVED.

Submitted 16 October, 2024; originally announced October 2024.

MSC Class: 68T45 ACM Class: I.2.7

arXiv:2410.12645 [pdf, other]

Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals

Authors: Orchid Chetia Phukan, Swarup Ranjan Behera, Girish, Mohd Mujtaba Akhtar, Arun Balaji Buduru, Rajesh Sharma

Abstract: Despite being trained exclusively on speech data, speech foundation models (SFMs) like Whisper have shown impressive performance in non-speech tasks such as audio classification. This is partly because speech shares some common traits with audio, enabling SFMs to transfer effectively. In this study, we push the boundaries by evaluating SFMs on a more challenging out-of-domain (OOD) task: classifying physiological time-series signals. We test two key hypotheses: first, that SFMs can generalize to physiological signals by capturing shared temporal patterns; second, that multilingual SFMs will outperform others due to their exposure to greater variability during pre-training, leading to more robust, generalized representations. Our experiments, conducted for stress recognition using ECG (Electrocardiogram), EMG (Electromyography), and EDA (Electrodermal Activity) signals, reveal that models trained on SFM-derived representations outperform those trained on raw physiological signals. Among all models, multilingual SFMs achieve the highest accuracy, supporting our hypothesis and demonstrating their OOD capabilities. This work positions SFMs as promising tools for new uncharted domains beyond speech.

Submitted 16 October, 2024; originally announced October 2024.

MSC Class: 68T45 ACM Class: I.2.7

arXiv:2410.12567 [pdf, other]

SeQuiFi: Mitigating Catastrophic Forgetting in Speech Emotion Recognition with Sequential Class-Finetuning

Authors: Sarthak Jain, Orchid Chetia Phukan, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma

Abstract: In this work, we introduce SeQuiFi, a novel approach for mitigating catastrophic forgetting (CF) in speech emotion recognition (SER). SeQuiFi adopts a sequential class-finetuning strategy, where the model is fine-tuned incrementally on one emotion class at a time, preserving and enhancing retention for each class. While various state-of-the-art (SOTA) methods, such as regularization-based, memory-based, and weight-averaging techniques, have been proposed to address CF, it still remains a challenge, particularly with diverse and multilingual datasets. Through extensive experiments, we demonstrate that SeQuiFi significantly outperforms both vanilla fine-tuning and SOTA continual learning techniques in terms of accuracy and F1 scores on multiple benchmark SER datasets, including CREMA-D, RAVDESS, Emo-DB, MESD, and SHEMO, covering different languages.

Submitted 16 October, 2024; originally announced October 2024.

MSC Class: 68T45 ACM Class: I.2.7

arXiv:2409.15767 [pdf, other]

Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection

Authors: Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Swarup Ranjan Behera, Nitin Choudhury, Arun Balaji Buduru, Rajesh Sharma, S. R Mahadeva Prasanna

Abstract: The adaptation of foundation models has significantly advanced environmental audio deepfake detection (EADD), a rapidly growing area of research. These models are typically fine-tuned or utilized in their frozen states for downstream tasks. However, the dimensionality of their representations can lead to a substantially higher parameter count in downstream models and thus to higher computational demands. A common remedy is to compress these representations with state-of-the-art (SOTA) unsupervised dimensionality reduction techniques (PCA, SVD, KPCA, GRP) for efficient EADD. However, with the application of such techniques, we observe a drop in performance. In this paper, we show that representation vectors contain redundant information, and that randomly selecting 40-50% of representation values and building downstream models on them preserves or sometimes even improves performance. We show that such random selection preserves more performance than the SOTA dimensionality reduction techniques while reducing model parameters and inference time by roughly half.

Submitted 24 September, 2024; originally announced September 2024.

Comments: Submitted to ICASSP 2025

MSC Class: 68T45 ACM Class: I.2.7
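
Illustrative sketch (not from the paper): the abstract above describes randomly selecting 40-50% of representation values before training a downstream model. A minimal way to picture this, assuming the same fixed index subset is reused for training and inference, is shown below. The function names, the 768-dimensional size, and the seed are hypothetical and chosen only for illustration.

```python
import random

def make_random_selector(dim, keep_fraction=0.5, seed=0):
    """Draw one fixed random subset of feature indices (assumed reused everywhere)."""
    rng = random.Random(seed)
    k = max(1, round(dim * keep_fraction))
    return sorted(rng.sample(range(dim), k))

def select_features(representations, indices):
    """Keep only the chosen positions of each representation vector."""
    return [[row[i] for i in indices] for row in representations]

# Hypothetical data: 4 clips, each with a 768-dimensional frozen representation.
rng = random.Random(1)
features = [[rng.gauss(0.0, 1.0) for _ in range(768)] for _ in range(4)]

idx = make_random_selector(dim=768, keep_fraction=0.5)
reduced = select_features(features, idx)
print(len(reduced), len(reduced[0]))  # 4 384
```

Unlike PCA or SVD, this step needs no fitting; the only state is the index list, which would have to be stored alongside the downstream model.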

arXiv:2409.14312 [pdf, other]

Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection

Authors: Orchid Chetia Phukan, Swarup Ranjan Behera, Shubham Singh, Muskaan Singh, Vandana Rajan, Arun Balaji Buduru, Rajesh Sharma, S. R. Mahadeva Prasanna

Abstract: In this study, we address the challenge of depression detection from speech, focusing on the potential of non-semantic features (NSFs) to capture subtle markers of depression. While prior research has leveraged various features for this task, NSFs, extracted from pre-trained models (PTMs) designed for non-semantic tasks such as paralinguistic speech processing (TRILLsson), speaker recognition (x-vector), and emotion recognition (emoHuBERT), have shown significant promise. However, the potential of combining these diverse features has not been fully explored. In this work, we demonstrate that the amalgamation of NSFs results in complementary behavior, leading to enhanced depression detection performance. Furthermore, to this end, we introduce a simple novel framework, FuSeR, designed to effectively combine these features. Our results show that FuSeR outperforms models utilizing individual NSFs as well as baseline fusion techniques and obtains state-of-the-art (SOTA) performance on the E-DAIC benchmark with an RMSE of 5.51 and an MAE of 4.48, establishing it as a robust approach for depression detection.

Submitted 22 September, 2024; originally announced September 2024.

Comments: Submitted to ICASSP 2025

MSC Class: 68T45 ACM Class: I.2.7

arXiv:2409.14221 [pdf, other]

Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition

Authors: Orchid Chetia Phukan, Mohd Mujtaba Akhtar, Girish, Swarup Ranjan Behera, Sishir Kalita, Arun Balaji Buduru, Rajesh Sharma, S. R Mahadeva Prasanna

Abstract: In this study, we investigate multimodal foundation models (MFMs) for emotion recognition from non-verbal sounds. We hypothesize that MFMs, with their joint pre-training across multiple modalities, will be more effective in non-verbal sounds emotion recognition (NVER) by better interpreting and differentiating subtle emotional cues that may be ambiguous in audio-only foundation models (AFMs). To validate our hypothesis, we extract representations from state-of-the-art (SOTA) MFMs and AFMs and evaluate them on benchmark NVER datasets. Inspired by research in speech recognition and audio deepfake detection, we also investigate the potential of combining selected foundation model representations to enhance NVER further. To achieve this, we propose a framework called MATA (Intra-Modality Alignment through Transport Attention). Through MATA coupled with the combination of the MFMs LanguageBind and ImageBind, we report the topmost performance with accuracies of 76.47%, 77.40%, 75.12% and F1-scores of 70.35%, 76.19%, 74.63% for the ASVP-ESD, JNV, and VIVAE datasets against individual FMs and baseline fusion techniques and report SOTA on the benchmark datasets.

Submitted 21 September, 2024; originally announced September 2024.

Comments: Submitted to ICASSP 2025

MSC Class: 68T45 ACM Class: I.2.7

arXiv:2409.14131 [pdf, other]

Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models

Authors: Orchid Chetia Phukan, Sarthak Jain, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma, S. R Mahadeva Prasanna

Abstract: In this study, for the first time, we extensively investigate whether music foundation models (MFMs) or speech foundation models (SFMs) work better for singing voice deepfake detection (SVDD), which has recently attracted attention in the research community. For this, we perform a comprehensive comparative study of state-of-the-art (SOTA) MFMs (MERT variants and music2vec) and SFMs (pre-trained for general speech representation learning as well as speaker recognition). We show that speaker recognition SFM representations perform the best amongst all the foundation models (FMs), and this performance can be attributed to their higher efficacy in capturing the pitch, tone, intensity, and similar characteristics present in singing voices. To this end, we also explore the fusion of FMs to exploit their complementary behavior for improved SVDD, and we propose a novel framework, FIONA, for this purpose. With FIONA, through the synchronization of x-vector (speaker recognition SFM) and MERT-v1-330M (MFM), we report the best performance with the lowest Equal Error Rate (EER) of 13.74%, beating all the individual FMs as well as baseline FM fusions and achieving SOTA results.

Submitted 21 September, 2024; originally announced September 2024.

Comments: Submitted to ICASSP 2025

MSC Class: 68T45 ACM Class: I.2.7

arXiv:2408.13530 [pdf, ps, other]

Homogeneous Dirichlet problem for degenerate parabolic-hyperbolic PDE driven by Levy noise

Authors: Soumya Ranjan Behera, Ananta K Majee

Abstract: In this article, we study the homogeneous Dirichlet problem for a degenerate parabolic-hyperbolic PDE perturbed by Lévy noise. In particular, we develop the well-posedness theory of entropy solutions based on Kružkov's semi-entropy formulation. In comparison to the pioneering work by Bauzet et al. (J. Funct. Anal. 266 (2014), 2503-2545) concerning the existence and uniqueness of entropy solutions for the Dirichlet problem for conservation laws driven by Brownian noise, our present analysis involves a simpler approach to obtain the global Kato's inequality.

Submitted 24 August, 2024; originally announced August 2024.

arXiv:2408.13528 [pdf, ps, other]

Renormalized stochastic entropy solution for degenerate parabolic-hyperbolic equations with Levy noise

Authors: Soumya Ranjan Behera, Ananta K Majee

Abstract: In this article, we establish the well-posedness theory for renormalized entropy solutions of a degenerate parabolic-hyperbolic PDE perturbed by a multiplicative Lévy noise with general $L^1$-data on the unbounded domain. By using a suitable approximation procedure based on the vanishing viscosity technique and bounded data, we prove the existence of a renormalized entropy solution to the underlying problem. The uniqueness of the solution is settled by adapting Kružkov's doubling the variables technique in the presence of noise.

Submitted 24 August, 2024; originally announced August 2024.

arXiv:2406.09156 [pdf, other]

Towards Multilingual Audio-Visual Question Answering

Authors: Orchid Chetia Phukan, Priyabrata Mallick, Swarup Ranjan Behera, Aalekhya Satya Narayani, Arun Balaji Buduru, Rajesh Sharma

Abstract: In this paper, we work towards extending Audio-Visual Question Answering (AVQA) to multilingual settings. Existing AVQA research has predominantly revolved around English, and replicating it for addressing AVQA in other languages requires a substantial allocation of resources. As a scalable solution, we leverage machine translation and present two multilingual AVQA datasets for eight languages created from existing benchmark AVQA datasets. This avoids the extra human annotation effort of collecting questions and answers manually. To this end, we propose the MERA framework, which leverages state-of-the-art (SOTA) video, audio, and textual foundation models for AVQA in multiple languages. We introduce a suite of models, namely MERA-L, MERA-C, and MERA-T, with varied model architectures to benchmark the proposed datasets. We believe our work will open new research directions and act as a reference benchmark for future works in multilingual AVQA.

Submitted 13 June, 2024; originally announced June 2024.

Comments: Accepted to Interspeech 2024

MSC Class: 68T45

arXiv:2406.07676 [pdf, other]

FastAST: Accelerating Audio Spectrogram Transformer via Token Merging and Cross-Model Knowledge Distillation

Authors: Swarup Ranjan Behera, Abhishek Dhiman, Karthik Gowda, Aalekhya Satya Narayani

Abstract: Audio classification models, particularly the Audio Spectrogram Transformer (AST), play a crucial role in efficient audio analysis. However, optimizing their efficiency without compromising accuracy remains a challenge. In this paper, we introduce FastAST, a framework that integrates Token Merging (ToMe) into the AST framework. FastAST enhances inference speed without requiring extensive retraining by merging similar tokens in audio spectrograms. Furthermore, during training, FastAST brings about significant speed improvements. The experiments indicate that FastAST can increase audio classification throughput with minimal impact on accuracy. To mitigate the accuracy impact, we integrate Cross-Model Knowledge Distillation (CMKD) into the FastAST framework. Integrating ToMe and CMKD into AST results in improved accuracy compared to AST while maintaining faster inference speeds. FastAST represents a step towards real-time, resource-efficient audio analysis.

Submitted 11 June, 2024; originally announced June 2024.

Comments: Accepted to Interspeech 2024

MSC Class: 68T10
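
Illustrative sketch (not from the paper): the abstract above says FastAST speeds up AST "by merging similar tokens". The snippet below is a heavily simplified picture of that idea, assuming a bipartite matching in the spirit of Token Merging: tokens are split into two alternating sets, each token in the first set is matched to its most similar token in the second, and the r most similar pairs are averaged. The function names and the toy data are hypothetical; the actual method operates inside transformer layers and includes details omitted here.

```python
import math

def cosine(u, v):
    """Cosine similarity, treating a zero vector as having unit norm."""
    nu = math.sqrt(sum(x * x for x in u)) or 1.0
    nv = math.sqrt(sum(x * x for x in v)) or 1.0
    return sum(x * y for x, y in zip(u, v)) / (nu * nv)

def merge_tokens(tokens, r):
    """Merge up to r tokens from set A into their most similar token in set B."""
    a, b = tokens[0::2], tokens[1::2]  # alternate tokens between the two sets
    if not b:
        return [list(t) for t in tokens]
    # For every token in A, find its most similar partner in B.
    matches = []
    for i, t in enumerate(a):
        sims = [cosine(t, u) for u in b]
        j = max(range(len(b)), key=sims.__getitem__)
        matches.append((sims[j], i, j))
    # The r most similar pairs are merged; the remaining A tokens are kept as-is.
    matches.sort(key=lambda m: m[0], reverse=True)
    merged = matches[:r]
    kept = sorted(i for _, i, _ in matches[r:])
    sums = [list(u) for u in b]
    counts = [1] * len(b)
    for _, i, j in merged:
        sums[j] = [s + x for s, x in zip(sums[j], a[i])]
        counts[j] += 1
    b_out = [[s / c for s in row] for row, c in zip(sums, counts)]
    # Note: the output lists kept A tokens first, then B tokens (order is not preserved).
    return [list(a[i]) for i in kept] + b_out

# Hypothetical toy input: four 2-dimensional tokens forming two similar pairs.
tokens = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
print(len(merge_tokens(tokens, r=2)))  # 2
```

Each call removes at most r tokens, so applying such a step in successive layers shrinks the sequence the later layers must process, which is where the throughput gain would come from.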

arXiv:2404.03012 [pdf, other]

Spectral Clustering in Convex and Constrained Settings

Authors: Swarup Ranjan Behera, Vijaya V. Saradhi

Abstract: Spectral clustering methods have gained widespread recognition for their effectiveness in clustering high-dimensional data. Among these techniques, constrained spectral clustering has emerged as a prominent approach, demonstrating enhanced performance by integrating pairwise constraints. However, the application of such constraints to semidefinite spectral clustering, a variant that leverages semidefinite programming to optimize clustering objectives, remains largely unexplored. In this paper, we introduce a novel framework for seamlessly integrating pairwise constraints into semidefinite spectral clustering. Our methodology systematically extends the capabilities of semidefinite spectral clustering to capture complex data structures, thereby addressing real-world clustering challenges more effectively. Additionally, we extend this framework to encompass both active and self-taught learning scenarios, further enhancing its versatility and applicability. Empirical studies conducted on well-known datasets demonstrate the superiority of our proposed framework over existing spectral clustering methods, showcasing its robustness and scalability across diverse datasets and learning settings. By bridging the gap between constrained learning and semidefinite spectral clustering, our work contributes to the advancement of spectral clustering techniques, offering researchers and practitioners a versatile tool for addressing complex clustering challenges in various real-world applications.
Access to the data, code, and experimental results is provided for further exploration (https://github.com/swarupbehera/SCCCS).

Submitted 3 April, 2024; originally announced April 2024.

ACM Class: I.2.7

arXiv:2404.00030 [pdf, other]

Visualization of Unstructured Sports Data -- An Example of Cricket Short Text Commentary

Authors: Swarup Ranjan Behera, Vijaya V Saradhi

Abstract: Sports visualization focuses on the use of structured data, such as box-score data and tracking data. Unstructured data sources pertaining to sports are available in various places such as blogs, social media posts, and online news articles. Sports visualization methods have either not fully exploited the information present in these sources, or the visualizations proposed using these sources have not added to the body of sports visualization methods. We propose the use of unstructured data, namely cricket short text commentary, for visualization. The short text commentary data is used for constructing individual players' strength rules and weakness rules. A computationally feasible definition of a player's strength rule and weakness rule is proposed. A visualization method for the constructed rules is presented. In addition, players having similar strength rules or weakness rules are computed and visualized. We demonstrate the usefulness of short text commentary in visualization by analyzing the strengths and weaknesses of cricket players using more than one million text commentaries. We validate the constructed rules through two validation methods. The collected data, source code, and obtained results on more than 500 players are made publicly available.

Submitted 22 March, 2024; originally announced April 2024.

ACM Class: I.2.7

arXiv:2401.02303 [pdf, other]

doi 10.1140/epjqt/s40507-024-00279-1

Estimating the link budget of satellite-based Quantum Key Distribution (QKD) for uplink transmission through the atmosphere

Authors: Satya Ranjan Behera, Urbasi Sinha

Abstract: Satellite-based quantum communications including quantum key distribution (QKD) represent one of the most promising approaches toward global-scale quantum communications. To determine the viability of transmitting quantum signals through the atmosphere, it is essential to conduct atmospheric simulations for both uplink and downlink quantum communications. In the case of the uplink scenario, the initial phase of the beam's propagation involves interaction with the atmosphere, making simulation particularly critical. To analyze the atmosphere over the Indian subcontinent, we begin by validating our approach by utilizing atmospheric data obtained from the experiments carried out in the Canary Islands within the framework of Quantum Communication (QC). We also verify our simulation methodology by reproducing simulation outcomes from diverse Canadian locations, taking into account both uplink and downlink scenarios in Low Earth Orbit (LEO). In this manuscript, we explore the practicality of utilizing three different ground station locations in India for uplink-based QC, while also considering beacon signals for both uplink and downlink scenarios. The atmospheric conditions of various geographical regions in India are simulated, and a dedicated link budget analysis is performed for each location, specifically focusing on three renowned observatories: IAO Hanle, Aries Nainital, and Mount Abu. The analysis involves computing the overall losses of the signal and beacon beams.
The findings indicate that the IAO Hanle site is a more suitable choice for uplink-based QC when compared to the other two sites.

Submitted 4 January, 2024; originally announced January 2024.

Comments: 15 pages main text, 11 pages of appendices

Journal ref: EPJ Quantum Technology Volume 11, article number 66, (2024)

arXiv:2312.17343 [pdf, other]

AQUALLM: Audio Question Answering Data Generation Using Large Language Models

Authors: Swarup Ranjan Behera, Krishna Mohan Injeti, Jaya Sai Kiran Patibandla, Praveen Kumar Pokala, Balakrishna Reddy Pailla

Abstract: Audio Question Answering (AQA) constitutes a pivotal task in which machines analyze both audio signals and natural language questions to produce precise natural language answers. The significance of possessing high-quality, diverse, and extensive AQA datasets cannot be overstated when aiming for the precision of an AQA system. While there has been notable focus on developing accurate and efficient AQA models, the creation of high-quality, diverse, and extensive datasets for the specific task at hand has not garnered considerable attention. To address this challenge, this work makes several contributions. We introduce a scalable AQA data generation pipeline, denoted as the AQUALLM framework, which relies on Large Language Models (LLMs). This framework utilizes existing audio-caption annotations and incorporates state-of-the-art LLMs to generate expansive, high-quality AQA datasets. Additionally, we present three extensive and high-quality benchmark datasets for AQA, contributing significantly to the progression of AQA research. AQA models trained on the proposed datasets set superior benchmarks compared to the existing state-of-the-art. Moreover, models trained on our datasets demonstrate enhanced generalizability when compared to models trained using human-annotated AQA data. Code and datasets will be accessible on GitHub (https://github.com/swarupbehera/AQUALLM).

Submitted 28 December, 2023; originally announced December 2023.

ACM Class: I.2.7

arXiv:2311.06818 [pdf, other]

Cricket Player Profiling: Unraveling Strengths and Weaknesses Using Text Commentary Data

Authors: Swarup Ranjan Behera, Vijaya V. Saradhi

Abstract: Devising player-specific strategies in cricket necessitates a meticulous understanding of each player's unique strengths and weaknesses. Nevertheless, the absence of a definitive computational approach to extract such insights from cricket players poses a significant challenge. This paper seeks to address this gap by establishing computational models designed to extract the rules governing player strengths and weaknesses, thereby facilitating the development of tailored strategies for individual players. The complexity of this endeavor lies in several key areas: the selection of a suitable dataset, the precise definition of strength and weakness rules, the identification of an appropriate learning algorithm, and the validation of the derived rules. To tackle these challenges, we propose the utilization of unstructured data, specifically cricket text commentary, as a valuable resource for constructing comprehensive strength and weakness rules for cricket players. We also introduce computationally feasible definitions for the construction of these rules, and present a dimensionality reduction technique for the rule-building process. In order to showcase the practicality of this approach, we conduct an in-depth analysis of cricket player strengths and weaknesses using a vast corpus of more than one million text commentaries. Furthermore, we validate the constructed rules through two distinct methodologies: intrinsic and extrinsic.
The outcomes of this research are made openly accessible, including the collected data, source code, and results for over 250 cricket players, which can be accessed at https://bit.ly/2PKuzx8.

Submitted 12 November, 2023; originally announced November 2023.

Comments: The initial work was published in the ICMLA 2019 conference

ACM Class: I.2.7

arXiv:2310.02115 [pdf, other]

Daytime and Nighttime QKD over an atmospheric free space channel with passive polarisation bases compensation

Authors: Saumya Ranjan Behera, Melvee George, Urbasi Sinha

Abstract: Quantum Communication (QC) represents a promising futuristic technology, revolutionizing secure communication. Photon-based Quantum Key Distribution (QKD) is the most widely explored area in QC research, utilizing the polarisation degree of freedom of photons for both fibre and free-space communication. In this work, we investigate and mitigate the challenges posed by fibre birefringence and atmospheric effects on QKD, using a 50-meter free-space optical link and the entanglement-based BBM92 QKD protocol. We implement a passive polarisation correction scheme to address the critical issue of polarisation scrambling induced by fibre birefringence and the difference in the frame of reference between Alice and Bob. This scheme effectively mitigates these adverse effects, ensuring reliable polarisation control over the quantum channel. Furthermore, we conduct QKD experiments in both day and night conditions, encountering challenges such as high background noise levels and dynamic environmental changes. To overcome these issues, we employ various filtering techniques to enhance signal quality and security. Our results demonstrate the successful implementation of QKD over a free-space optical link by producing an information-theoretically secure QBER of $<11\%$ on average and a high key rate, even under varying lighting and weather conditions.
Over one 24-hour cycle of data acquisition, we measured an average daylight key rate and QBER of $3.9118 \pm 0.7339$ kHz and $10.5518 \pm 1.3428\%$, respectively, and a night-time key rate and QBER of $4.6118 \pm 0.8088$ kHz and $10.3545 \pm 1.2501\%$, respectively.

Submitted 3 October, 2023; originally announced October 2023.

Comments: 11 pages, 8 figures

arXiv:2306.04294 [pdf, ps, other]

Stochastic Fractional Conservation Laws: Large deviation principle, Central limit theorem and Moderate deviation principle

Authors: Soumya Ranjan Behera, Ananta K. Majee

Abstract: In this article, we establish the Freidlin-Wentzell type large deviation principle and central limit theorem for stochastic fractional conservation laws with small multiplicative noise in the kinetic formulation framework. The weak convergence method and the doubling of variables method play a crucial role. As a consequence, we also establish a moderate deviation principle for the underlying problem.

Submitted 7 June, 2023; originally announced June 2023.

arXiv:2212.12846

On rate of convergence of finite difference scheme for degenerate parabolic-hyperbolic PDE with Levy noise

Authors: Soumya Ranjan Behera, Ananta K. Majee

Abstract: In this article, we consider a semi-discrete finite difference scheme for a degenerate parabolic-hyperbolic PDE driven by Lévy noise in one space dimension. Using bounded variation estimates and a variant of the classical Kružkov doubling of variables approach, we prove that the expected value of the $L^1$-difference between the unique entropy solution and the approximate solution converges at a rate of $(\Delta x)^{\frac{1}{7}}$, where $\Delta x$ is the spatial mesh size.

Submitted 20 December, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

Comments: We found an error in Lemma 3.5, which is used in the subsequent analysis to establish the rate of convergence. Since the error is not fixable, we would like to withdraw the article.

arXiv:2212.02041 [pdf, other]

Convergence of an operator splitting scheme for fractional conservation laws with Levy noise

Authors: Soumya Ranjan Behera, Ananta K. Majee

Abstract: In this paper, we are concerned with an operator splitting scheme for linear fractional and fractional degenerate stochastic conservation laws driven by multiplicative Levy noise. More specifically, using a variant of the classical Kruzkov doubling of variables approach, we show that the approximate solutions generated by the splitting scheme converge to the unique stochastic entropy solution of the underlying problems. Finally, the convergence analysis is illustrated by several numerical examples.

Submitted 11 March, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

Showing 1–20 of 20 results for author: Behera, S R