-
Domain penalisation for improved Out-of-Distribution Generalisation
Authors:
Shuvam Jena,
Sushmetha Sumathi Rajendran,
Karthik Seemakurthy,
Sasithradevi A,
Vijayalakshmi M,
Prakash Poornachari
Abstract:
In the field of object detection, domain generalisation (DG) aims to ensure robust performance across diverse and unseen target domains by learning the robust domain-invariant features corresponding to the objects of interest across multiple source domains. While there are many approaches established for performing DG for the task of classification, there has been a very little focus on object det…
▽ More
In the field of object detection, domain generalisation (DG) aims to ensure robust performance across diverse and unseen target domains by learning the robust domain-invariant features corresponding to the objects of interest across multiple source domains. While there are many approaches established for performing DG for the task of classification, there has been a very little focus on object detection. In this paper, we propose a domain penalisation (DP) framework for the task of object detection, where the data is assumed to be sampled from multiple source domains and tested on completely unseen test domains. We assign penalisation weights to each domain, with the values updated based on the detection networks performance on the respective source domains. By prioritising the domains that needs more attention, our approach effectively balances the training process. We evaluate our solution on the GWHD 2021 dataset, a component of the WiLDS benchmark and we compare against ERM and GroupDRO as these are primarily loss function based. Our extensive experimental results reveals that the proposed approach improves the accuracy by 0.3 percent and 0.5 percent on validation and test out-of-distribution (OOD) sets, respectively for FasterRCNN. We also compare the performance of our approach on FCOS detector and show that our approach improves the baseline OOD performance over the existing approaches by 1.3 percent and 1.4 percent on validation and test sets, respectively. This study underscores the potential of performance based domain penalisation in enhancing the generalisation ability of object detection models across diverse environments.
△ Less
Submitted 3 August, 2024;
originally announced August 2024.
-
Advancing Melanoma Diagnosis with Self-Supervised Neural Networks: Evaluating the Effectiveness of Different Techniques
Authors:
Srivishnu Vusirikala,
Suraj Rajendran
Abstract:
We investigate the potential of self-supervision in improving the accuracy of deep learning models trained to classify melanoma patches. Various self-supervision techniques such as rotation prediction, missing patch prediction, and corruption removal were implemented and assessed for their impact on the convolutional neural network's performance. Preliminary results suggest a positive influence of…
▽ More
We investigate the potential of self-supervision in improving the accuracy of deep learning models trained to classify melanoma patches. Various self-supervision techniques such as rotation prediction, missing patch prediction, and corruption removal were implemented and assessed for their impact on the convolutional neural network's performance. Preliminary results suggest a positive influence of self-supervision methods on the model's accuracy. The study notably demonstrates the efficacy of the corruption removal method in enhancing model performance. Despite observable improvements, we conclude that the self-supervised models have considerable potential for further enhancement, achievable through training over more epochs or expanding the dataset. We suggest exploring other self-supervision methods like Bootstrap Your Own Latent (BYOL) and contrastive learning in future research, emphasizing the cost-benefit trade-off due to their resource-intensive nature. The findings underline the promise of self-supervision in augmenting melanoma detection capabilities of deep learning models.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
An Efficient Learning Control Framework With Sim-to-Real for String-Type Artificial Muscle-Driven Robotic Systems
Authors:
Jiyue Tao,
Yunsong Zhang,
Sunil Kumar Rajendran,
Feitian Zhang,
Dexin Zhao,
Tongsheng Shen
Abstract:
Robotic systems driven by artificial muscles present unique challenges due to the nonlinear dynamics of actuators and the complex designs of mechanical structures. Traditional model-based controllers often struggle to achieve desired control performance in such systems. Deep reinforcement learning (DRL), a trending machine learning technique widely adopted in robot control, offers a promising alte…
▽ More
Robotic systems driven by artificial muscles present unique challenges due to the nonlinear dynamics of actuators and the complex designs of mechanical structures. Traditional model-based controllers often struggle to achieve desired control performance in such systems. Deep reinforcement learning (DRL), a trending machine learning technique widely adopted in robot control, offers a promising alternative. However, integrating DRL into these robotic systems faces significant challenges, including the requirement for large amounts of training data and the inevitable sim-to-real gap when deployed to real-world robots. This paper proposes an efficient reinforcement learning control framework with sim-to-real transfer to address these challenges. Bootstrap and augmentation enhancements are designed to improve the data efficiency of baseline DRL algorithms, while a sim-to-real transfer technique, namely randomization of muscle dynamics, is adopted to bridge the gap between simulation and real-world deployment. Extensive experiments and ablation studies are conducted utilizing two string-type artificial muscle-driven robotic systems including a two degree-of-freedom robotic eye and a parallel robotic wrist, the results of which demonstrate the effectiveness of the proposed learning control strategy.
△ Less
Submitted 7 June, 2024; v1 submitted 17 May, 2024;
originally announced May 2024.
-
On the Three Demons in Causality in Finance: Time Resolution, Nonstationarity, and Latent Factors
Authors:
Xinshuai Dong,
Haoyue Dai,
Yewen Fan,
Songyao Jin,
Sathyamoorthy Rajendran,
Kun Zhang
Abstract:
Financial data is generally time series in essence and thus suffers from three fundamental issues: the mismatch in time resolution, the time-varying property of the distribution - nonstationarity, and causal factors that are important but unknown/unobserved. In this paper, we follow a causal perspective to systematically look into these three demons in finance. Specifically, we reexamine these iss…
▽ More
Financial data is generally time series in essence and thus suffers from three fundamental issues: the mismatch in time resolution, the time-varying property of the distribution - nonstationarity, and causal factors that are important but unknown/unobserved. In this paper, we follow a causal perspective to systematically look into these three demons in finance. Specifically, we reexamine these issues in the context of causality, which gives rise to a novel and inspiring understanding of how the issues can be addressed. Following this perspective, we provide systematic solutions to these problems, which hopefully would serve as a foundation for future research in the area.
△ Less
Submitted 12 January, 2024; v1 submitted 28 December, 2023;
originally announced January 2024.
-
Nanorobotics in Medicine: A Systematic Review of Advances, Challenges, and Future Prospects
Authors:
Shishir Rajendran,
Prathic Sundararajan,
Ashi Awasthi,
Suraj Rajendran
Abstract:
Nanorobotics offers an emerging frontier in biomedicine, holding the potential to revolutionize diagnostic and therapeutic applications through its unique capabilities in manipulating biological systems at the nanoscale. Following PRISMA guidelines, a comprehensive literature search was conducted using IEEE Xplore and PubMed databases, resulting in the identification and analysis of a total of 414…
▽ More
Nanorobotics offers an emerging frontier in biomedicine, holding the potential to revolutionize diagnostic and therapeutic applications through its unique capabilities in manipulating biological systems at the nanoscale. Following PRISMA guidelines, a comprehensive literature search was conducted using IEEE Xplore and PubMed databases, resulting in the identification and analysis of a total of 414 papers. The studies were filtered to include only those that addressed both nanorobotics and direct medical applications. Our analysis traces the technology's evolution, highlighting its growing prominence in medicine as evidenced by the increasing number of publications over time. Applications ranged from targeted drug delivery and single-cell manipulation to minimally invasive surgery and biosensing. Despite the promise, limitations such as biocompatibility, precise control, and ethical concerns were also identified. This review aims to offer a thorough overview of the state of nanorobotics in medicine, drawing attention to current challenges and opportunities, and providing directions for future research in this rapidly advancing field.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Transparency in Sleep Staging: Deep Learning Method for EEG Sleep Stage Classification with Model Interpretability
Authors:
Shivam Sharma,
Suvadeep Maiti,
S. Mythirayee,
Srijithesh Rajendran,
Raju Surampudi Bapi
Abstract:
Automated Sleep stage classification using raw single channel EEG is a critical tool for sleep quality assessment and disorder diagnosis. However, modelling the complexity and variability inherent in this signal is a challenging task, limiting their practicality and effectiveness in clinical settings. To mitigate these challenges, this study presents an end-to-end deep learning (DL) model which in…
▽ More
Automated Sleep stage classification using raw single channel EEG is a critical tool for sleep quality assessment and disorder diagnosis. However, modelling the complexity and variability inherent in this signal is a challenging task, limiting their practicality and effectiveness in clinical settings. To mitigate these challenges, this study presents an end-to-end deep learning (DL) model which integrates squeeze and excitation blocks within the residual network to extract features and stacked Bi-LSTM to understand complex temporal dependencies. A distinctive aspect of this study is the adaptation of GradCam for sleep staging, marking the first instance of an explainable DL model in this domain with alignment of its decision-making with sleep expert's insights. We evaluated our model on the publically available datasets (SleepEDF-20, SleepEDF-78, and SHHS), achieving Macro-F1 scores of 82.5, 78.9, and 81.9, respectively. Additionally, a novel training efficiency enhancement strategy was implemented by increasing stride size, leading to 8x faster training times with minimal impact on performance. Comparative analyses underscore our model outperforms all existing baselines, indicating its potential for clinical usage.
△ Less
Submitted 14 January, 2024; v1 submitted 10 September, 2023;
originally announced September 2023.
-
Applications of machine Learning to improve the efficiency and range of microbial biosynthesis: a review of state-of-art techniques
Authors:
Akshay Bhalla,
Suraj Rajendran
Abstract:
In the modern world, technology is at its peak. Different avenues in programming and technology have been explored for data analysis, automation, and robotics. Machine learning is key to optimize data analysis, make accurate predictions, and hasten/improve existing functions. Thus, presently, the field of machine learning in artificial intelligence is being developed and its uses in varying fields…
▽ More
In the modern world, technology is at its peak. Different avenues in programming and technology have been explored for data analysis, automation, and robotics. Machine learning is key to optimize data analysis, make accurate predictions, and hasten/improve existing functions. Thus, presently, the field of machine learning in artificial intelligence is being developed and its uses in varying fields are being explored. One field in which its uses stand out is that of microbial biosynthesis. In this paper, a comprehensive overview of the differing machine learning programs used in biosynthesis is provided, alongside brief descriptions of the fields of machine learning and microbial biosynthesis separately. This information includes past trends, modern developments, future improvements, explanations of processes, and current problems they face. Thus, this paper's main contribution is to distill developments in, and provide a holistic explanation of, 2 key fields and their applicability to improve industry/research. It also highlights challenges and research directions, acting to instigate more research and development in the growing fields. Finally, the paper aims to act as a reference for academics performing research, industry professionals improving their processes, and students looking to understand the concept of machine learning in biosynthesis.
△ Less
Submitted 14 October, 2023; v1 submitted 26 August, 2023;
originally announced August 2023.
-
Patchwork Learning: A Paradigm Towards Integrative Analysis across Diverse Biomedical Data Sources
Authors:
Suraj Rajendran,
Weishen Pan,
Mert R. Sabuncu,
Yong Chen,
Jiayu Zhou,
Fei Wang
Abstract:
Machine learning (ML) in healthcare presents numerous opportunities for enhancing patient care, population health, and healthcare providers' workflows. However, the real-world clinical and cost benefits remain limited due to challenges in data privacy, heterogeneous data sources, and the inability to fully leverage multiple data modalities. In this perspective paper, we introduce "patchwork learni…
▽ More
Machine learning (ML) in healthcare presents numerous opportunities for enhancing patient care, population health, and healthcare providers' workflows. However, the real-world clinical and cost benefits remain limited due to challenges in data privacy, heterogeneous data sources, and the inability to fully leverage multiple data modalities. In this perspective paper, we introduce "patchwork learning" (PL), a novel paradigm that addresses these limitations by integrating information from disparate datasets composed of different data modalities (e.g., clinical free-text, medical images, omics) and distributed across separate and secure sites. PL allows the simultaneous utilization of complementary data sources while preserving data privacy, enabling the development of more holistic and generalizable ML models. We present the concept of patchwork learning and its current implementations in healthcare, exploring the potential opportunities and applicable data sources for addressing various healthcare challenges. PL leverages bridging modalities or overlapping feature spaces across sites to facilitate information sharing and impute missing data, thereby addressing related prediction tasks. We discuss the challenges associated with PL, many of which are shared by federated and multimodal learning, and provide recommendations for future research in this field. By offering a more comprehensive approach to healthcare data integration, patchwork learning has the potential to revolutionize the clinical applicability of ML models. This paradigm promises to strike a balance between personalization and generalizability, ultimately enhancing patient experiences, improving population health, and optimizing healthcare providers' workflows.
△ Less
Submitted 13 May, 2023; v1 submitted 10 May, 2023;
originally announced May 2023.
-
mulEEG: A Multi-View Representation Learning on EEG Signals
Authors:
Vamsi Kumar,
Likith Reddy,
Shivam Kumar Sharma,
Kamalakar Dadi,
Chiranjeevi Yarra,
Bapi S. Raju,
Srijithesh Rajendran
Abstract:
Modeling effective representations using multiple views that positively influence each other is challenging, and the existing methods perform poorly on Electroencephalogram (EEG) signals for sleep-staging tasks. In this paper, we propose a novel multi-view self-supervised method (mulEEG) for unsupervised EEG representation learning. Our method attempts to effectively utilize the complementary info…
▽ More
Modeling effective representations using multiple views that positively influence each other is challenging, and the existing methods perform poorly on Electroencephalogram (EEG) signals for sleep-staging tasks. In this paper, we propose a novel multi-view self-supervised method (mulEEG) for unsupervised EEG representation learning. Our method attempts to effectively utilize the complementary information available in multiple views to learn better representations. We introduce diverse loss that further encourages complementary information across multiple views. Our method with no access to labels beats the supervised training while outperforming multi-view baseline methods on transfer learning experiments carried out on sleep-staging tasks. We posit that our method was able to learn better representations by using complementary multi-views.
△ Less
Submitted 7 April, 2022;
originally announced April 2022.
-
Simulation-based Algorithm for Determining Best Package Delivery Alternatives under Three Criteria: Time, Cost and Sustainability
Authors:
Suchithra Rajendran,
Aidan Harper
Abstract:
With the significant rise in demand for same-day instant deliveries, several courier services are exploring alternatives to transport packages in a cost- and time-effective, as well as, sustainable manner. Motivated by a real-life case study, this paper focuses on developing a simulation algorithm that assists same-day package delivery companies to serve customers instantly. The proposed recommend…
▽ More
With the significant rise in demand for same-day instant deliveries, several courier services are exploring alternatives to transport packages in a cost- and time-effective, as well as, sustainable manner. Motivated by a real-life case study, this paper focuses on developing a simulation algorithm that assists same-day package delivery companies to serve customers instantly. The proposed recommender system provides the best solution with respect to three criteria: cost, time, and sustainability, considering the variation in travel time and cost parameters. The decision support tool provides recommendations on the best alternative for transporting products based on factors, such as source and destination locations, time of the day, package weight, and volume. Besides considering existing new technologies like electric-assisted cargo bikes, we also analyze the impact of emerging methods of deliveries, such as robots and air taxis. Finally, this paper also considers the best delivery alternative during the presence of a pandemic, such as COVID-19. For the purpose of illustrating our approach, we consider the delivery options in New York City. We believe that the proposed tool is the first to provide solutions to courier companies considering evolving modes of transportation and under logistics disruptions due to pandemic.
Keywords: Instant package delivery; Courier services; Simulation algorithm; Recommender system; Emerging technologies; COVID-19 pandemic.
△ Less
Submitted 5 June, 2021;
originally announced June 2021.
-
Predicting Demand for Air Taxi Urban Aviation Services using Machine Learning Algorithms
Authors:
Suchithra Rajendran,
Sharan Srinivas,
Trenton Grimshaw
Abstract:
This research focuses on predicting the demand for air taxi urban air mobility (UAM) services during different times of the day in various geographic regions of New York City using machine learning algorithms (MLAs). Several ride-related factors (such as month of the year, day of the week and time of the day) and weather-related variables (such as temperature, weather conditions and visibility) ar…
▽ More
This research focuses on predicting the demand for air taxi urban air mobility (UAM) services during different times of the day in various geographic regions of New York City using machine learning algorithms (MLAs). Several ride-related factors (such as month of the year, day of the week and time of the day) and weather-related variables (such as temperature, weather conditions and visibility) are used as predictors for four popular MLAs, namely, logistic regression, artificial neural networks, random forests, and gradient boosting. Experimental results suggest gradient boosting to consistently provide higher prediction performance. Specific locations, certain time periods and weekdays consistently emerged as critical predictors.
△ Less
Submitted 26 March, 2021;
originally announced March 2021.
-
Air taxi service for urban mobility: A critical review of recent developments, future challenges, and opportunities
Authors:
Suchithra Rajendran,
Sharan Srinivas
Abstract:
Expected to operate in the imminent future, air taxi service (ATS) is an aerial on-demand transport for a single passenger or a small group of riders, which seeks to transform the method of everyday commute. This uncharted territory in the emerging transportation world is anticipated to enable consumers bypass traffic congestion in urban road networks. By adopting an electric vertical takeoff and…
▽ More
Expected to operate in the imminent future, air taxi service (ATS) is an aerial on-demand transport for a single passenger or a small group of riders, which seeks to transform the method of everyday commute. This uncharted territory in the emerging transportation world is anticipated to enable consumers bypass traffic congestion in urban road networks. By adopting an electric vertical takeoff and landing concept (eVTOL), air taxis could be operational from skyports retrofitted on building rooftops, thus gaining advantage from an implementation standpoint. Motivated by the potential impact of ATS, this study provides a review of air taxi systems and associated operations. We first discuss the current developments in the ATS (demand prediction, air taxi network design, and vehicle configuration). Next, we anticipate potential future challenges of ATS from an operations management perspective, and review the existing literature that could be leveraged to tackle these problems (ride-matching, pricing strategies, vehicle maintenance scheduling, and pilot training and recruitment). Finally, we detail future research opportunities in the air taxi domain.
△ Less
Submitted 24 February, 2021;
originally announced March 2021.
-
BioALBERT: A Simple and Effective Pre-trained Language Model for Biomedical Named Entity Recognition
Authors:
Usman Naseem,
Matloob Khushi,
Vinay Reddy,
Sakthivel Rajendran,
Imran Razzak,
Jinman Kim
Abstract:
In recent years, with the growing amount of biomedical documents, coupled with advancement in natural language processing algorithms, the research on biomedical named entity recognition (BioNER) has increased exponentially. However, BioNER research is challenging as NER in the biomedical domain are: (i) often restricted due to limited amount of training data, (ii) an entity can refer to multiple t…
▽ More
In recent years, with the growing amount of biomedical documents, coupled with advancement in natural language processing algorithms, the research on biomedical named entity recognition (BioNER) has increased exponentially. However, BioNER research is challenging as NER in the biomedical domain are: (i) often restricted due to limited amount of training data, (ii) an entity can refer to multiple types and concepts depending on its context and, (iii) heavy reliance on acronyms that are sub-domain specific. Existing BioNER approaches often neglect these issues and directly adopt the state-of-the-art (SOTA) models trained in general corpora which often yields unsatisfactory results. We propose biomedical ALBERT (A Lite Bidirectional Encoder Representations from Transformers for Biomedical Text Mining) bioALBERT, an effective domain-specific language model trained on large-scale biomedical corpora designed to capture biomedical context-dependent NER. We adopted a self-supervised loss used in ALBERT that focuses on modelling inter-sentence coherence to better learn context-dependent representations and incorporated parameter reduction techniques to lower memory consumption and increase the training speed in BioNER. In our experiments, BioALBERT outperformed comparative SOTA BioNER models on eight biomedical NER benchmark datasets with four different entity types. We trained four different variants of BioALBERT models which are available for the research community to be used in future research.
△ Less
Submitted 19 September, 2020;
originally announced September 2020.
-
Improving Services Offered by Internet Providers by Analyzing Online Reviews using Text Analytics
Authors:
Suchithra Rajendran,
John Fennewald
Abstract:
With the proliferation of digital infrastructure, there is a plethora of demand for internet services, which makes the wireless communications industry highly competitive. Thus internet service providers (ISPs) must ensure that their efforts are targeted towards attracting and retaining customers to ensure continued growth. As Web 2.0 has gained traction and more tools have become available, custo…
▽ More
With the proliferation of digital infrastructure, there is a plethora of demand for internet services, which makes the wireless communications industry highly competitive. Thus internet service providers (ISPs) must ensure that their efforts are targeted towards attracting and retaining customers to ensure continued growth. As Web 2.0 has gained traction and more tools have become available, customers in recent times are equipped to make well-informed decisions, specifically due to the colossal information available in online reviews. ISPs can use this information to better understand the views of the customers about their products and services. The goal of this paper is to identify the current strengths, weaknesses, opportunities, and threats (SWOT) of each ISP by exploring consumer reviews using text analytics. The proposed approach consists of four different stages: bigram and trigram analyses, topic identification, SWOT analysis and Root Cause Analysis (RCA). For each ISP, we first categorize online reviews into positive and negative based on customer ratings and then leverage text analytic tools to determine the most frequently used and co-occurring words in each categorization of reviews. Subsequently, looking at the positive and negative topics in each ISP, we conduct the SWOT analysis as well as the RCA to help companies identify the internal and external factors impacting customer satisfaction. We use a case study to illustrate the proposed approach. The proposed managerial insights that are derived from the results can act as a decision support tool for ISPs to offer better products and services for their customers.
△ Less
Submitted 16 August, 2020;
originally announced August 2020.
-
Recommendations for Emerging Air Taxi Network Operations based on Online Review Analysis of Helicopter Services
Authors:
Suchithra Rajendran,
Emily Pagel
Abstract:
The effects of traffic congestion are adverse, primarily including air pollution, commuter stress, and an increase in vehicle operating costs and accidents on the road. In efforts to alleviate these problems in metropolitan cities, logistics companies plan to introduce a new method of everyday commute called air taxis, an Urban Air Mobility (UAM) service. These are electric-powered vehicles that a…
▽ More
The effects of traffic congestion are adverse, primarily including air pollution, commuter stress, and an increase in vehicle operating costs and accidents on the road. In efforts to alleviate these problems in metropolitan cities, logistics companies plan to introduce a new method of everyday commute called air taxis, an Urban Air Mobility (UAM) service. These are electric-powered vehicles that are expected to operate in the forthcoming years by international transportation companies like Airbus, Uber, and Kitty Hawk. Since these flying taxis are emerging mode of transportation, it is necessary to provide recommendations for the initial design, implementation, and operation. This study proposes managerial insights for these upcoming services by analyzing online customer reviews and conducting an internal assessment of helicopter operations. Helicopters are similar to air taxis in regards to their operations, and therefore, customer reviews pertaining to the former can enable us to obtain insights into the strengths and weaknesses of the short-distance aviation service, in general. A four-stage sequential approach is used in this research, wherein the online reviews are mined in Stage 1, analyzed using the bigram and trigram models in Stage 2, 7S internal assessment is conducted for helicopter services in Stage 3, and managerial recommendations for air taxis are proposed in Stage 4. The insights obtained in this paper could assist any air taxi companies in providing better customer service when they venture into the market.
Keywords: Air taxi; Emerging technology; Urban Air Mobility (UAM); Helicopter services; Online customer reviews; Text analytics;
△ Less
Submitted 18 June, 2020;
originally announced June 2020.
-
Injecting Reliable Radio Frequency Fingerprints Using Metasurface for The Internet of Things
Authors:
Sekhar Rajendran,
Zhi Sun,
Feng Lin,
Kui Ren
Abstract:
In Internet of Things, where billions of devices with limited resources are communicating with each other, security has become a major stumbling block affecting the progress of this technology. Existing authentication schemes-based on digital signatures have overhead costs associated with them in terms of computation time, battery power, bandwidth, memory, and related hardware costs. Radio frequen…
▽ More
In Internet of Things, where billions of devices with limited resources are communicating with each other, security has become a major stumbling block affecting the progress of this technology. Existing authentication schemes-based on digital signatures have overhead costs associated with them in terms of computation time, battery power, bandwidth, memory, and related hardware costs. Radio frequency fingerprint (RFF), utilizing the unique device-based information, can be a promising solution for IoT. However, traditional RFFs have become obsolete because of low reliability and reduced user capability. Our proposed solution, Metasurface RF-Fingerprinting Injection (MeRFFI), is to inject a carefully-designed radio frequency fingerprint into the wireless physical layer that can increase the security of a stationary IoT device with minimal overhead. The injection of fingerprint is implemented using a low cost metasurface developed and fabricated in our lab, which is designed to make small but detectable perturbations in the specific frequency band in which the IoT devices are communicating. We have conducted comprehensive system evaluations including distance, orientation, multiple channels where the feasibility, effectiveness, and reliability of these fingerprints are validated. The proposed MeRFFI system can be easily integrated into the existing authentication schemes. The security vulnerabilities are analyzed for some of the most threatening wireless physical layer-based attacks.
△ Less
Submitted 5 December, 2020; v1 submitted 11 June, 2020;
originally announced June 2020.
-
MagicEyes: A Large Scale Eye Gaze Estimation Dataset for Mixed Reality
Authors:
Zhengyang Wu,
Srivignesh Rajendran,
Tarrence van As,
Joelle Zimmermann,
Vijay Badrinarayanan,
Andrew Rabinovich
Abstract:
With the emergence of Virtual and Mixed Reality (XR) devices, eye tracking has received significant attention in the computer vision community. Eye gaze estimation is a crucial component in XR -- enabling energy efficient rendering, multi-focal displays, and effective interaction with content. In head-mounted XR devices, the eyes are imaged off-axis to avoid blocking the field of view. This leads…
▽ More
With the emergence of Virtual and Mixed Reality (XR) devices, eye tracking has received significant attention in the computer vision community. Eye gaze estimation is a crucial component in XR -- enabling energy efficient rendering, multi-focal displays, and effective interaction with content. In head-mounted XR devices, the eyes are imaged off-axis to avoid blocking the field of view. This leads to increased challenges in inferring eye related quantities and simultaneously provides an opportunity to develop accurate and robust learning based approaches. To this end, we present MagicEyes, the first large scale eye dataset collected using real MR devices with comprehensive ground truth labeling. MagicEyes includes $587$ subjects with $80,000$ images of human-labeled ground truth and over $800,000$ images with gaze target labels. We evaluate several state-of-the-art methods on MagicEyes and also propose a new multi-task EyeNet model designed for detecting the cornea, glints and pupil along with eye segmentation in a single forward pass.
△ Less
Submitted 18 March, 2020;
originally announced March 2020.
-
EyeNet: A Multi-Task Network for Off-Axis Eye Gaze Estimation and User Understanding
Authors:
Zhengyang Wu,
Srivignesh Rajendran,
Tarrence van As,
Joelle Zimmermann,
Vijay Badrinarayanan,
Andrew Rabinovich
Abstract:
Eye gaze estimation and simultaneous semantic understanding of a user through eye images is a crucial component in Virtual and Mixed Reality; enabling energy efficient rendering, multi-focal displays and effective interaction with 3D content. In head-mounted VR/MR devices the eyes are imaged off-axis to avoid blocking the user's gaze, this view-point makes drawing eye related inferences very chall…
▽ More
Eye gaze estimation and simultaneous semantic understanding of a user through eye images is a crucial component in Virtual and Mixed Reality; enabling energy efficient rendering, multi-focal displays and effective interaction with 3D content. In head-mounted VR/MR devices the eyes are imaged off-axis to avoid blocking the user's gaze, this view-point makes drawing eye related inferences very challenging. In this work, we present EyeNet, the first single deep neural network which solves multiple heterogeneous tasks related to eye gaze estimation and semantic user understanding for an off-axis camera setting. The tasks include eye segmentation, blink detection, emotive expression classification, IR LED glints detection, pupil and cornea center estimation. To train EyeNet end-to-end we employ both hand labelled supervision and model based supervision. We benchmark all tasks on MagicEyes, a large and new dataset of 587 subjects with varying morphology, gender, skin-color, make-up and imaging conditions.
△ Less
Submitted 23 August, 2019;
originally announced August 2019.
-
Localization in Ultra Narrow Band IoT Networks: Design Guidelines and Trade-Offs
Authors:
Hazem Sallouha,
Alessandro Chiumento,
Sreeraj Rajendran,
Sofie Pollin
Abstract:
Localization in long-range Internet of Things networks is a challenging task, mainly due to the long distances and low bandwidth used. Moreover, the cost, power, and size limitations restrict the integration of a GPS receiver in each device. In this work, we introduce a novel received signal strength indicator (RSSI) based localization solution for ultra narrow band (UNB) long-range IoT networks s…
▽ More
Localization in long-range Internet of Things networks is a challenging task, mainly due to the long distances and low bandwidth used. Moreover, the cost, power, and size limitations restrict the integration of a GPS receiver in each device. In this work, we introduce a novel received signal strength indicator (RSSI) based localization solution for ultra narrow band (UNB) long-range IoT networks such as Sigfox. The essence of our approach is to leverage the existence of a few GPS-enabled sensors (GSNs) in the network to split the wide coverage into classes, enabling RSSI based fingerprinting of other sensors (SNs). By using machine learning algorithms at the network backed-end, the proposed approach does not impose extra power, payload, or hardware requirements. To comprehensively validate the performance of the proposed method, a measurement-based dataset that has been collected in the city of Antwerp is used. We show that a location classification accuracy of 80% is achieved by virtually splitting a city with a radius of 2.5 km into seven classes. Moreover, separating classes, by increasing the spacing between them, brings the classification accuracy up-to 92% based on our measurements. Furthermore, when the density of GSN nodes is high enough to enable device-to-device communication, using multilateration, we improve the probability of localizing SNs with an error lower than 20 m by 40% in our measurement scenario.
△ Less
Submitted 25 July, 2019;
originally announced July 2019.
-
Crowdsourced wireless spectrum anomaly detection
Authors:
Sreeraj Rajendran,
Vincent Lenders,
Wannes Meert,
Sofie Pollin
Abstract:
Automated wireless spectrum monitoring across frequency, time and space will be essential for many future applications. Manual and fine-grained spectrum analysis is becoming impossible because of the large number of measurement locations and complexity of the spectrum use landscape. Detecting unexpected behaviors in the wireless spectrum from the collected data is a crucial part of this automated…
▽ More
Automated wireless spectrum monitoring across frequency, time and space will be essential for many future applications. Manual and fine-grained spectrum analysis is becoming impossible because of the large number of measurement locations and complexity of the spectrum use landscape. Detecting unexpected behaviors in the wireless spectrum from the collected data is a crucial part of this automated monitoring, and the control of detected anomalies is a key functionality to enable interaction between the automated system and the end user. In this paper we look into the wireless spectrum anomaly detection problem for crowdsourced sensors. We first analyze in detail the nature of these anomalies and design effective algorithms to bring the higher dimensional input data to a common feature space across sensors. Anomalies can then be detected as outliers in this feature space. In addition, we investigate the importance of user feedback in the anomaly detection process to improve the performance of unsupervised anomaly detection. Furthermore, schemes for generalizing user feedback across sensors are also developed to close the anomaly detection loop.
△ Less
Submitted 13 March, 2019;
originally announced March 2019.
-
A multi-layered energy consumption model for smart wireless acoustic sensor networks
Authors:
Gert Dekkers,
Fernando Rosas,
Steven Lauwereins,
Sreeraj Rajendran,
Sofie Pollin,
Bart Vanrumste,
Toon van Waterschoot,
Marian Verhelst,
Peter Karsmakers
Abstract:
Smart sensing is expected to become a pervasive technology in smart cities and environments of the near future. These services are improving their capabilities due to integrated devices shrinking in size while maintaining their computational power, which can run diverse Machine Learning algorithms and achieve high performance in various data-processing tasks. One attractive sensor modality to be u…
▽ More
Smart sensing is expected to become a pervasive technology in smart cities and environments of the near future. These services are improving their capabilities due to integrated devices shrinking in size while maintaining their computational power, which can run diverse Machine Learning algorithms and achieve high performance in various data-processing tasks. One attractive sensor modality to be used for smart sensing are acoustic sensors, which can convey highly informative data while keeping a moderate energy consumption. Unfortunately, the energy budget of current wireless sensor networks is usually not enough to support the requirements of standard microphones. Therefore, energy efficiency needs to be increased at all layers --- sensing, signal processing and communication --- in order to bring wireless smart acoustic sensors into the market. To help to attain this goal, this paper introduces WASN-EM: an energy consumption model for wireless acoustic sensors networks (WASN), whose aim is to aid in the development of novel techniques to increase the energy-efficient of smart wireless acoustic sensors. This model provides a first step of exploration prior to custom design of a smart wireless acoustic sensor, and also can be used to compare the energy consumption of different protocols.
△ Less
Submitted 17 December, 2018;
originally announced December 2018.
-
Electrosense+: Crowdsourcing Radio Spectrum Decoding using IoT Receivers
Authors:
Roberto Calvo-Palomino,
Héctor Cordobés,
Markus Engel,
Markus Fuchs,
Pratiksha Jain,
Marc Liechti,
Sreeraj Rajendran,
Matthias Schäfer,
Bertold Van den Bergh,
Sofie Pollin,
Domenico Giustiniano,
Vincent Lenders
Abstract:
Web spectrum monitoring systems based on crowdsourcing have recently gained popularity. These systems are however limited to applications of interest for governamental organizationsor telecom providers, and only provide aggregated information about spectrum statistics. Theresult is that there is a lack of interest for layman users to participate, which limits its widespreaddeployment. We present E…
▽ More
Web spectrum monitoring systems based on crowdsourcing have recently gained popularity. These systems are however limited to applications of interest for governamental organizationsor telecom providers, and only provide aggregated information about spectrum statistics. Theresult is that there is a lack of interest for layman users to participate, which limits its widespreaddeployment. We present Electrosense+ which addresses this challenge and creates a general-purpose and open platform for spectrum monitoring using low-cost, embedded, and software-defined spectrum IoT sensors. Electrosense+ allows users to remotely decode specific parts ofthe radio spectrum. It builds on the centralized architecture of its predecessor, Electrosense, forcontrolling and monitoring the spectrum IoT sensors, but implements a real-time and peer-to-peercommunication system for scalable spectrum data decoding. We propose different mechanismsto incentivize the participation of users for deploying new sensors and keep them operational inthe Electrosense network. As a reward for the user, we propose an incentive accounting systembased on virtual tokens to encourage the participants to host IoT sensors. We present the newElectrosense+ system architecture and evaluate its performance at decoding various wireless sig-nals, including FM radio, AM radio, ADS-B, AIS, LTE, and ACARS.
△ Less
Submitted 11 May, 2020; v1 submitted 29 November, 2018;
originally announced November 2018.
-
Key Technologies and System Trade-Offs for Detection and Localization of Amateur Drones
Authors:
Mohammad Mahdi Azari,
Hazem Sallouha,
Alessandro Chiumento,
Sreeraj Rajendran,
Evgenii Vinogradov,
Sofie Pollin
Abstract:
The use of amateur drones (ADrs) is expected to significantly increase over the upcoming years. However, regulations do not allow such drones to fly over all areas, in addition to typical altitude limitations. As a result, there is an urgent need for ADrs surveillance solutions. These solutions should include means of accurate detection, classification, and localization of the unwanted drones in a…
▽ More
The use of amateur drones (ADrs) is expected to significantly increase over the upcoming years. However, regulations do not allow such drones to fly over all areas, in addition to typical altitude limitations. As a result, there is an urgent need for ADrs surveillance solutions. These solutions should include means of accurate detection, classification, and localization of the unwanted drones in a no-fly zone. In this paper, we give an overview of promising techniques for modulation classification and signal strength based localization of ADrs by using surveillance drones (SDrs). By introducing a generic altitude dependent propagation model, we show how detection and localization performance depend on the altitude of SDrs. Particularly, our simulation results show a 25 dB reduction in the minimum detectable power or 10 times coverage enhancement of an SDr by flying at the optimum altitude. Moreover, for a target no-fly zone, the location estimation error of an ADr can be remarkably reduced by optimizing the positions of the SDrs. Finally, we conclude the paper with a general discussion about the future work and possible challenges of the aerial surveillance systems.
△ Less
Submitted 14 October, 2017;
originally announced October 2017.
-
Distributed Deep Learning Models for Wireless Signal Classification with Low-Cost Spectrum Sensors
Authors:
Sreeraj Rajendran,
Wannes Meert,
Domenico Giustiniano,
Vincent Lenders,
Sofie Pollin
Abstract:
This paper looks into the technology classification problem for a distributed wireless spectrum sensing network. First, a new data-driven model for Automatic Modulation Classification (AMC) based on long short term memory (LSTM) is proposed. The model learns from the time domain amplitude and phase information of the modulation schemes present in the training data without requiring expert features…
▽ More
This paper looks into the technology classification problem for a distributed wireless spectrum sensing network. First, a new data-driven model for Automatic Modulation Classification (AMC) based on long short term memory (LSTM) is proposed. The model learns from the time domain amplitude and phase information of the modulation schemes present in the training data without requiring expert features like higher order cyclic moments. Analyses show that the proposed model yields an average classification accuracy of close to 90% at varying SNR conditions ranging from 0dB to 20dB. Further, we explore the utility of this LSTM model for a variable symbol rate scenario. We show that a LSTM based model can learn good representations of variable length time domain sequences, which is useful in classifying modulation signals with different symbol rates. The achieved accuracy of 75% on an input sample length of 64 for which it was not trained, substantiates the representation power of the model. To reduce the data communication overhead from distributed sensors, the feasibility of classification using averaged magnitude spectrum data, or online classification on the low cost sensors is studied. Furthermore, quantized realizations of the proposed models are analyzed for deployment on sensors with low processing power.
△ Less
Submitted 11 July, 2018; v1 submitted 27 July, 2017;
originally announced July 2017.
-
Dependency resolution and semantic mining using Tree Adjoining Grammars for Tamil Language
Authors:
Vijay Krishna Menon,
S Rajendran,
M Anandkumar,
K P Soman
Abstract:
Tree adjoining grammars (TAGs) provide an ample tool to capture syntax of many Indian languages. Tamil represents a special challenge to computational formalisms as it has extensive agglutinative morphology and a comparatively difficult argument structure. Modelling Tamil syntax and morphology using TAG is an interesting problem which has not been in focus even though TAGs are over 4 decades old,…
▽ More
Tree adjoining grammars (TAGs) provide an ample tool to capture syntax of many Indian languages. Tamil represents a special challenge to computational formalisms as it has extensive agglutinative morphology and a comparatively difficult argument structure. Modelling Tamil syntax and morphology using TAG is an interesting problem which has not been in focus even though TAGs are over 4 decades old, since its inception. Our research with Tamil TAGs have shown us that we can not only represent syntax of the language, but to an extent mine out semantics through dependency resolution of the sentence. But in order to demonstrate this phenomenal property, we need to parse Tamil language sentences using TAGs we have built and through parsing obtain a derivation we could use to resolve dependencies, thus proving the semantic property. We use an in-house developed pseudo lexical TAG chart parser; algorithm given by Schabes and Joshi (1988), for generating derivations of sentences. We do not use any statistics to rank out ambiguous derivations but rather use all of them to understand the mentioned semantic relation with in TAGs for Tamil. We shall also present a brief parser analysis for the completeness of our discussions.
△ Less
Submitted 19 April, 2017;
originally announced April 2017.
-
Electrosense: Open and Big Spectrum Data
Authors:
Sreeraj Rajendran,
Roberto Calvo-Palomino,
Markus Fuchs,
Bertold Van den Bergh,
Héctor Cordobés,
Domenico Giustiniano,
Sofie Pollin,
Vincent Lenders
Abstract:
While the radio spectrum allocation is well regulated, there is little knowledge about its actual utilization over time and space. This limitation hinders taking effective actions in various applications including cognitive radios, electrosmog monitoring, and law enforcement. We introduce Electrosense, an initiative that seeks a more efficient, safe and reliable monitoring of the electromagnetic s…
▽ More
While the radio spectrum allocation is well regulated, there is little knowledge about its actual utilization over time and space. This limitation hinders taking effective actions in various applications including cognitive radios, electrosmog monitoring, and law enforcement. We introduce Electrosense, an initiative that seeks a more efficient, safe and reliable monitoring of the electromagnetic space by improving the accessibility of spectrum data for the general public. A collaborative spectrum monitoring network is designed that monitors the spectrum at large scale with low-cost spectrum sensing nodes. The large set of data is stored and processed in a big data architecture and provided back to the community with an open spectrum data as a service model, that allows users to build diverse and novel applications with different requirements. We illustrate useful usage scenarios of the Electrosense data.
△ Less
Submitted 31 May, 2018; v1 submitted 29 March, 2017;
originally announced March 2017.
-
A new TAG Formalism for Tamil and Parser Analytics
Authors:
Vijay Krishna Menon,
S. Rajendran,
M. Anand Kumar,
K. P. Soman
Abstract:
Tree adjoining grammar (TAG) is specifically suited for morph rich and agglutinated languages like Tamil due to its psycho linguistic features and parse time dependency and morph resolution. Though TAG and LTAG formalisms have been known for about 3 decades, efforts on designing TAG Syntax for Tamil have not been entirely successful due to the complexity of its specification and the rich morpholog…
▽ More
Tree adjoining grammar (TAG) is specifically suited for morph rich and agglutinated languages like Tamil due to its psycho linguistic features and parse time dependency and morph resolution. Though TAG and LTAG formalisms have been known for about 3 decades, efforts on designing TAG Syntax for Tamil have not been entirely successful due to the complexity of its specification and the rich morphology of Tamil language. In this paper we present a minimalistic TAG for Tamil without much morphological considerations and also introduce a parser implementation with some obvious variations from the XTAG system
△ Less
Submitted 5 April, 2016;
originally announced April 2016.