-
Cyclicity Analysis of the Ornstein-Uhlenbeck Process
Authors:
Vivek Kaushik
Abstract:
In this thesis, we consider an $N$-dimensional Ornstein-Uhlenbeck (OU) process satisfying the linear stochastic differential equation $d\mathbf x(t) = - \mathbf B\mathbf x(t) dt + \boldsymbol Σd \mathbf w(t).$ Here, $\mathbf B$ is a fixed $N \times N$ circulant friction matrix whose eigenvalues have positive real parts, $\boldsymbol Σ$ is a fixed $N \times M$ matrix. We consider a signal propagati…
▽ More
In this thesis, we consider an $N$-dimensional Ornstein-Uhlenbeck (OU) process satisfying the linear stochastic differential equation $d\mathbf x(t) = - \mathbf B\mathbf x(t) dt + \boldsymbol Σd \mathbf w(t).$ Here, $\mathbf B$ is a fixed $N \times N$ circulant friction matrix whose eigenvalues have positive real parts, $\boldsymbol Σ$ is a fixed $N \times M$ matrix. We consider a signal propagation model governed by this OU process. In this model, an underlying signal propagates throughout a network consisting of $N$ linked sensors located in space. We interpret the $n$-th component of the OU process as the measurement of the propagating effect made by the $n$-th sensor. The matrix $\mathbf B$ represents the sensor network structure: if $\mathbf B$ has first row $(b_1 \ , \ \dots \ , \ b_N),$ where $b_1>0$ and $b_2 \ , \ \dots \ ,\ b_N \le 0,$ then the magnitude of $b_p$ quantifies how receptive the $n$-th sensor is to activity within the $(n+p-1)$-th sensor. Finally, the $(m,n)$-th entry of the matrix $\mathbf D = \frac{\boldsymbol Σ\boldsymbol Σ^\text T}{2}$ is the covariance of the component noises injected into the $m$-th and $n$-th sensors. For different choices of $\mathbf B$ and $\boldsymbol Σ,$ we investigate whether Cyclicity Analysis enables us to recover the structure of network. Roughly speaking, Cyclicity Analysis studies the lead-lag dynamics pertaining to the components of a multivariate signal. We specifically consider an $N \times N$ skew-symmetric matrix $\mathbf Q,$ known as the lead matrix, in which the sign of its $(m,n)$-th entry captures the lead-lag relationship between the $m$-th and $n$-th component OU processes. We investigate whether the structure of the leading eigenvector of $\mathbf Q,$ the eigenvector corresponding to the largest eigenvalue of $\mathbf Q$ in modulus, reflects the network structure induced by $\mathbf B.$
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
Analyzing Customer-Facing Vendor Experiences with Time Series Forecasting and Monte Carlo Techniques
Authors:
Vivek Kaushik,
Jason Tang
Abstract:
eBay partners with external vendors, which allows customers to freely select a vendor to complete their eBay experiences. However, vendor outages can hinder customer experiences. Consequently, eBay can disable a problematic vendor to prevent customer loss. Disabling the vendor too late risks losing customers willing to switch to other vendors, while disabling it too early risks losing those unwill…
▽ More
eBay partners with external vendors, which allows customers to freely select a vendor to complete their eBay experiences. However, vendor outages can hinder customer experiences. Consequently, eBay can disable a problematic vendor to prevent customer loss. Disabling the vendor too late risks losing customers willing to switch to other vendors, while disabling it too early risks losing those unwilling to switch. In this paper, we propose a data-driven solution to answer whether eBay should disable a problematic vendor and when to disable it. Our solution involves forecasting customer behavior. First, we use a multiplicative seasonality model to represent behavior if all vendors are fully functioning. Next, we use a Monte Carlo simulation to represent behavior if the problematic vendor remains enabled. Finally, we use a linear model to represent behavior if the vendor is disabled. By comparing these forecasts, we determine the optimal time for eBay to disable the problematic vendor.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
GraVITON: Graph based garment warping with attention guided inversion for Virtual-tryon
Authors:
Sanhita Pathak,
Vinay Kaushik,
Brejesh Lall
Abstract:
Virtual try-on, a rapidly evolving field in computer vision, is transforming e-commerce by improving customer experiences through precise garment warping and seamless integration onto the human body. While existing methods such as TPS and flow address the garment warping but overlook the finer contextual details. In this paper, we introduce a novel graph based warping technique which emphasizes th…
▽ More
Virtual try-on, a rapidly evolving field in computer vision, is transforming e-commerce by improving customer experiences through precise garment warping and seamless integration onto the human body. While existing methods such as TPS and flow address the garment warping but overlook the finer contextual details. In this paper, we introduce a novel graph based warping technique which emphasizes the value of context in garment flow. Our graph based warping module generates warped garment as well as a coarse person image, which is utilised by a simple refinement network to give a coarse virtual tryon image. The proposed work exploits latent diffusion model to generate the final tryon, treating garment transfer as an inpainting task. The diffusion model is conditioned with decoupled cross attention based inversion of visual and textual information. We introduce an occlusion aware warping constraint that generates dense warped garment, without any holes and occlusion. Our method, validated on VITON-HD and Dresscode datasets, showcases substantial state-of-the-art qualitative and quantitative results showing considerable improvement in garment warping, texture preservation, and overall realism.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Feedback Pulses
Authors:
Vishesh Kaushik,
Navin Khaneja
Abstract:
We have a new paradigm to design NMR pulses. Pulses, we call feedback pulses. We want broadband inversion and excitation. We have many offsets, start evolving them all starting from the north pole. Monitor them on the Bloch sphere, see which offset is worst (most away from south pole). Change the rf-phase to the offset ($π/2$ ahead of offset transverse magnetization phase) and irradiate at that of…
▽ More
We have a new paradigm to design NMR pulses. Pulses, we call feedback pulses. We want broadband inversion and excitation. We have many offsets, start evolving them all starting from the north pole. Monitor them on the Bloch sphere, see which offset is worst (most away from south pole). Change the rf-phase to the offset ($π/2$ ahead of offset transverse magnetization phase) and irradiate at that offset frequency and evolve for some time and monitor and repeat, looking for worst offset. When we are on resonance to a offset, we are doing well, inverting it and when we are off resonant, we don't hurt much (even if hurt little, we will come back to the offset in good time). By the process of monitoring, and setting phase we eventually push everything to the south pole and bingo, we have an inversion pulse. Feedback is done in simulation, but what results in end is a broadband inversion pulse. For broadband excitation, start with all offsets (symmetric around origin) on y axis. By feedback push them to the south pole. When we run the resulting sequence backward with phases, $π$ incremented, we will get an excitation pulse. For band-selective excitation pulse put offsets in pass band on the $y$ axis and in the stop band on the south pole. Use feedback to push everything to the south pole. Again, run backwards with $π$ incremented phases, to get band selective excitation. Suddenly, we have it all, simple and easy. The paper, introduces the feedback pulse algorithm, simulations and experiments.
△ Less
Submitted 28 January, 2024;
originally announced February 2024.
-
Single Stage Warped Cloth Learning and Semantic-Contextual Attention Feature Fusion for Virtual TryOn
Authors:
Sanhita Pathak,
Vinay Kaushik,
Brejesh Lall
Abstract:
Image-based virtual try-on aims to fit an in-shop garment onto a clothed person image. Garment warping, which aligns the target garment with the corresponding body parts in the person image, is a crucial step in achieving this goal. Existing methods often use multi-stage frameworks to handle clothes warping, person body synthesis and tryon generation separately or rely on noisy intermediate parser…
▽ More
Image-based virtual try-on aims to fit an in-shop garment onto a clothed person image. Garment warping, which aligns the target garment with the corresponding body parts in the person image, is a crucial step in achieving this goal. Existing methods often use multi-stage frameworks to handle clothes warping, person body synthesis and tryon generation separately or rely on noisy intermediate parser-based labels. We propose a novel single-stage framework that implicitly learns the same without explicit multi-stage learning. Our approach utilizes a novel semantic-contextual fusion attention module for garment-person feature fusion, enabling efficient and realistic cloth warping and body synthesis from target pose keypoints. By introducing a lightweight linear attention framework that attends to garment regions and fuses multiple sampled flow fields, we also address misalignment and artifacts present in previous methods. To achieve simultaneous learning of warped garment and try-on results, we introduce a Warped Cloth Learning Module. Our proposed approach significantly improves the quality and efficiency of virtual try-on methods, providing users with a more reliable and realistic virtual try-on experience.
△ Less
Submitted 25 May, 2024; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Engineering of Niobium Surfaces Through Accelerated Neutral Atom Beam Technology For Quantum Applications
Authors:
Soumen Kar,
Conan Weiland,
Chenyu Zhou,
Ekta Bhatia,
Brian Martinick,
Jakub Nalaskowski,
John Mucci,
Stephen Olson,
Pui Yee Hung,
Ilyssa Wells,
Hunter Frost,
Corbet S. Johnson,
Thomas Murray,
Vidya Kaushik,
Sean Kirkpatrick,
Kiet Chau,
Michael J. Walsh,
Mingzhao Liu,
Satyavolu S. Papa Rao
Abstract:
A major roadblock to scalable quantum computing is phase decoherence and energy relaxation caused by qubits interacting with defect-related two-level systems (TLS). Native oxides present on the surfaces of superconducting metals used in quantum devices are acknowledged to be a source of TLS that decrease qubit coherence times. Reducing microwave loss by surface engineering (i.e., replacing uncontr…
▽ More
A major roadblock to scalable quantum computing is phase decoherence and energy relaxation caused by qubits interacting with defect-related two-level systems (TLS). Native oxides present on the surfaces of superconducting metals used in quantum devices are acknowledged to be a source of TLS that decrease qubit coherence times. Reducing microwave loss by surface engineering (i.e., replacing uncontrolled native oxide of superconducting metals with a thin, stable surface with predictable characteristics) can be a key enabler for pushing performance forward with devices of higher quality factor. In this work, we present a novel approach to replace the native oxide of niobium (typically formed in an uncontrolled fashion when its pristine surface is exposed to air) with an engineered oxide, using a room-temperature process that leverages Accelerated Neutral Atom Beam (ANAB) technology at 300 mm wafer scale. This ANAB beam is composed of a mixture of argon and oxygen, with tunable energy per atom, which is rastered across the wafer surface. The ANAB-engineered Nb-oxide thickness was found to vary from 2 nm to 6 nm depending on ANAB process parameters. Modeling of variable-energy XPS data confirm thickness and compositional control of the Nb surface oxide by the ANAB process. These results correlate well with those from transmission electron microscopy and X-ray reflectometry. Since ANAB is broadly applicable to material surfaces, the present study indicates its promise for modification of the surfaces of superconducting quantum circuits to achieve longer coherence times.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
RADIUS: Risk-Aware, Real-Time, Reachability-Based Motion Planning
Authors:
Jinsun Liu,
Challen Enninful Adu,
Lucas Lymburner,
Vishrut Kaushik,
Lena Trang,
Ram Vasudevan
Abstract:
Deterministic methods for motion planning guarantee safety amidst uncertainty in obstacle locations by trying to restrict the robot from operating in any possible location that an obstacle could be in. Unfortunately, this can result in overly conservative behavior. Chance-constrained optimization can be applied to improve the performance of motion planning algorithms by allowing for a user-specifi…
▽ More
Deterministic methods for motion planning guarantee safety amidst uncertainty in obstacle locations by trying to restrict the robot from operating in any possible location that an obstacle could be in. Unfortunately, this can result in overly conservative behavior. Chance-constrained optimization can be applied to improve the performance of motion planning algorithms by allowing for a user-specified amount of bounded constraint violation. However, state-of-the-art methods rely either on moment-based inequalities, which can be overly conservative, or make it difficult to satisfy assumptions about the class of probability distributions used to model uncertainty. To address these challenges, this work proposes a real-time, risk-aware reachability-based motion planning framework called RADIUS. The method first generates a reachable set of parameterized trajectories for the robot offline. At run time, RADIUS computes a closed-form over-approximation of the risk of a collision with an obstacle. This is done without restricting the probability distribution used to model uncertainty to a simple class (e.g., Gaussian). Then, RADIUS performs real-time optimization to construct a trajectory that can be followed by the robot in a manner that is certified to have a risk of collision that is less than or equal to a user-specified threshold. The proposed algorithm is compared to several state-of-the-art chance-constrained and deterministic methods in simulation, and is shown to consistently outperform them in a variety of driving scenarios. A demonstration of the proposed framework on hardware is also provided.
△ Less
Submitted 19 June, 2023; v1 submitted 15 February, 2023;
originally announced February 2023.
-
Probing electron-electron interaction along with superconducting fluctuations in disordered TiN thin films
Authors:
Sachin Yadav,
Vinay Kaushik,
M. P. Saravanan,
Sangeeta Sahoo
Abstract:
Here, we demonstrate an interplay between superconducting fluctuations and electron-electron interaction (EEI) by low temperature magnetotransport measurements for a set of 2D disordered TiN thin films. While cooling down the sample, a characteristic temperature T* is obtained from the R(T) at which superconducting fluctuations start to appear. The upturn in R(T) above T* corresponds to weak local…
▽ More
Here, we demonstrate an interplay between superconducting fluctuations and electron-electron interaction (EEI) by low temperature magnetotransport measurements for a set of 2D disordered TiN thin films. While cooling down the sample, a characteristic temperature T* is obtained from the R(T) at which superconducting fluctuations start to appear. The upturn in R(T) above T* corresponds to weak localization (WL) and/or EEI. By the temperature and field dependences of the observed resistance, we show that the upturn in R(T) originates mainly from EEI with a negligible contribution from WL. Further, we have used the modified Larkins electron-electron attraction strength beta(T/Tc), containing a field induced pair breaking parameter, in the Maki-Thompson (MT) superconducting fluctuation term. Here, the temperature dependence of the beta(T/Tc) obtained from the magnetoresistance analysis shows a diverging behavior close to Tc and it remains almost constant at higher temperature within the limit of ln(T/Tc) < 1. Interestingly, the variation of beta(T/Tc) on the reduced temperature (T/Tc) offers a common trend which has been closely followed by all the concerned samples presented in this study. Finally, the temperature dependence of inverse phase scattering time , as obtained from the magnetoresistance analysis, clearly shows two different regimes; the first one close to Tc follows the Ginzburg-Landau relaxation rate , whereas, the second one at high temperature varies almost linearly with temperature indicating the dominance of inelastic electron-electron scattering for the dephasing mechanism. These two regimes are followed in a generic way by all the samples in spite of being grown under different growth conditions.
△ Less
Submitted 18 January, 2023;
originally announced January 2023.
-
REFINE: Reachability-based Trajectory Design using Robust Feedback Linearization and Zonotopes
Authors:
Jinsun Liu,
Yifei Shao,
Lucas Lymburner,
Hansen Qin,
Vishrut Kaushik,
Lena Trang,
Ruiyang Wang,
Vladimir Ivanovic,
H. Eric Tseng,
Ram Vasudevan
Abstract:
Performing real-time receding horizon motion planning for autonomous vehicles while providing safety guarantees remains difficult. This is because existing methods to accurately predict ego vehicle behavior under a chosen controller use online numerical integration that requires a fine time discretization and thereby adversely affects real-time performance. To address this limitation, several rece…
▽ More
Performing real-time receding horizon motion planning for autonomous vehicles while providing safety guarantees remains difficult. This is because existing methods to accurately predict ego vehicle behavior under a chosen controller use online numerical integration that requires a fine time discretization and thereby adversely affects real-time performance. To address this limitation, several recent papers have proposed to apply offline reachability analysis to conservatively predict the behavior of the ego vehicle. This reachable set can be constructed by utilizing a simplified model whose behavior is assumed a priori to conservatively bound the dynamics of a full-order model. However, guaranteeing that one satisfies this assumption is challenging. This paper proposes a framework named REFINE to overcome the limitations of these existing approaches. REFINE utilizes a parameterized robust controller that partially linearizes the vehicle dynamics even in the presence of modeling error. Zonotope-based reachability analysis is then performed on the closed-loop, full-order vehicle dynamics to compute the corresponding control-parameterized, over-approximate Forward Reachable Sets (FRS). Because reachability analysis is applied to the full-order model, the potential conservativeness introduced by using a simplified model is avoided. The pre-computed, control-parameterized FRS is then used online in an optimization framework to ensure safety. The proposed method is compared to several state of the art methods during a simulation-based evaluation on a full-size vehicle model and is evaluated on a 1/10th race car robot in real hardware testing. In contrast to existing methods, REFINE is shown to enable the vehicle to safely navigate itself through complex environments.
△ Less
Submitted 21 November, 2022;
originally announced November 2022.
-
MaskMTL: Attribute prediction in masked facial images with deep multitask learning
Authors:
Prerana Mukherjee,
Vinay Kaushik,
Ronak Gupta,
Ritika Jha,
Daneshwari Kankanwadi,
Brejesh Lall
Abstract:
Predicting attributes in the landmark free facial images is itself a challenging task which gets further complicated when the face gets occluded due to the usage of masks. Smart access control gates which utilize identity verification or the secure login to personal electronic gadgets may utilize face as a biometric trait. Particularly, the Covid-19 pandemic increasingly validates the essentiality…
▽ More
Predicting attributes in the landmark free facial images is itself a challenging task which gets further complicated when the face gets occluded due to the usage of masks. Smart access control gates which utilize identity verification or the secure login to personal electronic gadgets may utilize face as a biometric trait. Particularly, the Covid-19 pandemic increasingly validates the essentiality of hygienic and contactless identity verification. In such cases, the usage of masks become more inevitable and performing attribute prediction helps in segregating the target vulnerable groups from community spread or ensuring social distancing for them in a collaborative environment. We create a masked face dataset by efficiently overlaying masks of different shape, size and textures to effectively model variability generated by wearing mask. This paper presents a deep Multi-Task Learning (MTL) approach to jointly estimate various heterogeneous attributes from a single masked facial image. Experimental results on benchmark face attribute UTKFace dataset demonstrate that the proposed approach supersedes in performance to other competing techniques. The source code is available at https://github.com/ritikajha/Attribute-prediction-in-masked-facial-images-with-deep-multitask-learning
△ Less
Submitted 11 January, 2022; v1 submitted 9 January, 2022;
originally announced January 2022.
-
On-chip Nanophotonic Broadband Wavelength Detector with 2D-Electron Gas
Authors:
Vishal Kaushik,
Swati Rajput,
Sulabh Srivastav,
Lalit Singh,
Prem Babu,
Elham Heidari,
Moustafa Ahmed,
Yas Al-Hadeethi,
Hamed Dalir,
Volker J. Sorger,
Mukesh Kumar
Abstract:
Miniaturized, low-cost wavelength detectors are gaining enormous interest as we step into the new age of photonics. Incompatibility with integrated circuits or complex fabrication requirement in most of the conventionally used filters necessitates the development of a simple, on-chip platform for easy-to-use wavelength detection system. Also, intensity fluctuations hinder precise, noise free detec…
▽ More
Miniaturized, low-cost wavelength detectors are gaining enormous interest as we step into the new age of photonics. Incompatibility with integrated circuits or complex fabrication requirement in most of the conventionally used filters necessitates the development of a simple, on-chip platform for easy-to-use wavelength detection system. Also, intensity fluctuations hinder precise, noise free detection of spectral information. Here we propose a novel approach of utilizing wavelength sensitive photocurrent across semiconductor heterojunctions to experimentally validate broadband wavelength detection on an on-chip platform with simple fabrication process. The proposed device utilizes linear frequency response of internal photoemission via 2-D electron gas in a ZnO based heterojunction along with a reference junction for coherent common mode rejection. We report sensitivity of 0.96 uA/nm for a broad wavelength-range of 280 nm from 660-940 nm. Simple fabrication process, efficient intensity noise cancelation along with heat resistance and radiation hardness of ZnO makes the proposed platform simple, low-cost and efficient alternative for several applications such as optical spectrometers, sensing and IOTs.
△ Less
Submitted 4 November, 2021;
originally announced November 2021.
-
BreakingBERT@IITK at SemEval-2021 Task 9 : Statement Verification and Evidence Finding with Tables
Authors:
Aditya Jindal,
Ankur Gupta,
Jaya Srivastava,
Preeti Menghwani,
Vijit Malik,
Vishesh Kaushik,
Ashutosh Modi
Abstract:
Recently, there has been an interest in factual verification and prediction over structured data like tables and graphs. To circumvent any false news incident, it is necessary to not only model and predict over structured data efficiently but also to explain those predictions. In this paper, as part of the SemEval-2021 Task 9, we tackle the problem of fact verification and evidence finding over ta…
▽ More
Recently, there has been an interest in factual verification and prediction over structured data like tables and graphs. To circumvent any false news incident, it is necessary to not only model and predict over structured data efficiently but also to explain those predictions. In this paper, as part of the SemEval-2021 Task 9, we tackle the problem of fact verification and evidence finding over tabular data. There are two subtasks. Given a table and a statement/fact, subtask A determines whether the statement is inferred from the tabular data, and subtask B determines which cells in the table provide evidence for the former subtask. We make a comparison of the baselines and state-of-the-art approaches over the given SemTabFact dataset. We also propose a novel approach CellBERT to solve evidence finding as a form of the Natural Language Inference task. We obtain a 3-way F1 score of 0.69 on subtask A and an F1 score of 0.65 on subtask B.
△ Less
Submitted 10 April, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
A Robust nitridation technique for fabrication of disordered superconducting TiN thin films featuring phase slip events
Authors:
Sachin Yadav,
Vinay Kaushik,
M. P. Saravanan,
R. P. Aloysius,
V. Ganesan,
Sangeeta Sahoo
Abstract:
Disorder induced phase slip (PS) events appearing in the current voltage characteristics (IVCs) are reported for two-dimensional TiN thin films produced by a robust substrate mediated nitridation technique. Here, high temperature annealing of Ti/Si3N4 based metal/substrate assembly is the key to produce majority phase TiN accompanied by TiSi2 and elemental Si as minority phases. The method itself…
▽ More
Disorder induced phase slip (PS) events appearing in the current voltage characteristics (IVCs) are reported for two-dimensional TiN thin films produced by a robust substrate mediated nitridation technique. Here, high temperature annealing of Ti/Si3N4 based metal/substrate assembly is the key to produce majority phase TiN accompanied by TiSi2 and elemental Si as minority phases. The method itself introduces different level of disorder intrinsically by tuning the amount of the non-superconducting minority phases that are controlled by annealing temperature (Ta) and the film thickness. The superconducting critical temperature (Tc) strongly depends on Ta and the maximum Tc obtained from the demonstrated technique is about 4.8 K for the thickness range of about 12 nm and above. Besides, the dynamics of IVCs get modulated by the appearance of intermediated resistive steps for decreased Ta and the steps get more prominent for reduced thickness. Further, the deviation in the temperature dependent critical current (Ic) from the Ginzburg-Landau theoretical limit varies strongly with the thickness. Finally, the Tc, intermediate resistive steps in the IVCs and the depairing current are observed to alter in a similar fashion with Ta and the thickness indicating the robustness of the synthesis process to fabricate disordered nitride-based superconductor.
△ Less
Submitted 22 March, 2021; v1 submitted 19 March, 2021;
originally announced March 2021.
-
ADAADepth: Adapting Data Augmentation and Attention for Self-Supervised Monocular Depth Estimation
Authors:
Vinay Kaushik,
Kartik Jindgar,
Brejesh Lall
Abstract:
Self-supervised learning of depth has been a highly studied topic of research as it alleviates the requirement of having ground truth annotations for predicting depth. Depth is learnt as an intermediate solution to the task of view synthesis, utilising warped photometric consistency. Although it gives good results when trained using stereo data, the predicted depth is still sensitive to noise, ill…
▽ More
Self-supervised learning of depth has been a highly studied topic of research as it alleviates the requirement of having ground truth annotations for predicting depth. Depth is learnt as an intermediate solution to the task of view synthesis, utilising warped photometric consistency. Although it gives good results when trained using stereo data, the predicted depth is still sensitive to noise, illumination changes and specular reflections. Also, occlusion can be tackled better by learning depth from a single camera. We propose ADAA, utilising depth augmentation as depth supervision for learning accurate and robust depth. We propose a relational self-attention module that learns rich contextual features and further enhances depth results. We also optimize the auto-masking strategy across all losses by enforcing L1 regularisation over mask. Our novel progressive training strategy first learns depth at a lower resolution and then progresses to the original resolution with slight training. We utilise a ResNet18 encoder, learning features for prediction of both depth and pose. We evaluate our predicted depth on the standard KITTI driving dataset and achieve state-of-the-art results for monocular depth estimation whilst having significantly lower number of trainable parameters in our deep learning framework. We also evaluate our model on Make3D dataset showing better generalization than other methods.
△ Less
Submitted 1 March, 2021;
originally announced March 2021.
-
Deep feature fusion for self-supervised monocular depth prediction
Authors:
Vinay Kaushik,
Brejesh Lall
Abstract:
Recent advances in end-to-end unsupervised learning has significantly improved the performance of monocular depth prediction and alleviated the requirement of ground truth depth. Although a plethora of work has been done in enforcing various structural constraints by incorporating multiple losses utilising smoothness, left-right consistency, regularisation and matching surface normals, a few of th…
▽ More
Recent advances in end-to-end unsupervised learning has significantly improved the performance of monocular depth prediction and alleviated the requirement of ground truth depth. Although a plethora of work has been done in enforcing various structural constraints by incorporating multiple losses utilising smoothness, left-right consistency, regularisation and matching surface normals, a few of them take into consideration multi-scale structures present in real world images. Most works utilise a VGG16 or ResNet50 model pre-trained on ImageNet weights for predicting depth. We propose a deep feature fusion method utilising features at multiple scales for learning self-supervised depth from scratch. Our fusion network selects features from both upper and lower levels at every level in the encoder network, thereby creating multiple feature pyramid sub-networks that are fed to the decoder after applying the CoordConv solution. We also propose a refinement module learning higher scale residual depth from a combination of higher level deep features and lower level residual depth using a pixel shuffling framework that super-resolves lower level residual depth. We select the KITTI dataset for evaluation and show that our proposed architecture can produce better or comparable results in depth prediction.
△ Less
Submitted 16 May, 2020;
originally announced May 2020.
-
A Simple Multiple Integral Solution to the Broken Stick Problem
Authors:
Vivek Kaushik
Abstract:
Regard the closed interval $[0,1]$ as a stick. Partition $[0,1]$ into $n+1$ different intervals $I_1, \ \dots \ , I_{n+1},$ where $n \geq 2,$ which represent smaller sticks. The classical Broken Stick problem asks to find the probability that the lengths of these smaller sticks can be the side lengths of a polygon with $n+1$ sides. We will show that this probability is $1-\frac{n+1}{2^{n}}$ by usi…
▽ More
Regard the closed interval $[0,1]$ as a stick. Partition $[0,1]$ into $n+1$ different intervals $I_1, \ \dots \ , I_{n+1},$ where $n \geq 2,$ which represent smaller sticks. The classical Broken Stick problem asks to find the probability that the lengths of these smaller sticks can be the side lengths of a polygon with $n+1$ sides. We will show that this probability is $1-\frac{n+1}{2^{n}}$ by using multiple integration.
△ Less
Submitted 11 December, 2021; v1 submitted 10 January, 2020;
originally announced January 2020.
-
Aerial multi-object tracking by detection using deep association networks
Authors:
Ajit Jadhav,
Prerana Mukherjee,
Vinay Kaushik,
Brejesh Lall
Abstract:
A lot a research is focused on object detection and it has achieved significant advances with deep learning techniques in recent years. Inspite of the existing research, these algorithms are not usually optimal for dealing with sequences or images captured by drone-based platforms, due to various challenges such as view point change, scales, density of object distribution and occlusion. In this pa…
▽ More
A lot a research is focused on object detection and it has achieved significant advances with deep learning techniques in recent years. Inspite of the existing research, these algorithms are not usually optimal for dealing with sequences or images captured by drone-based platforms, due to various challenges such as view point change, scales, density of object distribution and occlusion. In this paper, we develop a model for detection of objects in drone images using the VisDrone2019 DET dataset. Using the RetinaNet model as our base, we modify the anchor scales to better handle the detection of dense distribution and small size of the objects. We explicitly model the channel interdependencies by using "Squeeze-and-Excitation" (SE) blocks that adaptively recalibrates channel-wise feature responses. This helps to bring significant improvements in performance at a slight additional computational cost. Using this architecture for object detection, we build a custom DeepSORT network for object detection on the VisDrone2019 MOT dataset by training a custom Deep Association network for the algorithm.
△ Less
Submitted 3 September, 2019;
originally announced September 2019.
-
A Comment on the Sums $\sum_{n \in \mathbb{Z}} \frac{(-1)^{nk}}{(an+1)^k}$
Authors:
Vivek Kaushik
Abstract:
We recall a proof of Euler's identity $\sum_{n=1}^{\infty} \frac{1}{n^2}=\frac{π^2}{6}$ involving the evaluation of a double integral. We extend the method to find Hurwitz Zeta series of the form $S(k,a)=\sum_{n \in \mathbb{Z}} \frac{(-1)^{nk}}{(an+1)^k},$ where $a \in \mathbb{N} \setminus \lbrace 1 \rbrace$ and $k \in \mathbb{N}.$ In particular, we consider a general $k$-dimensional integral over…
▽ More
We recall a proof of Euler's identity $\sum_{n=1}^{\infty} \frac{1}{n^2}=\frac{π^2}{6}$ involving the evaluation of a double integral. We extend the method to find Hurwitz Zeta series of the form $S(k,a)=\sum_{n \in \mathbb{Z}} \frac{(-1)^{nk}}{(an+1)^k},$ where $a \in \mathbb{N} \setminus \lbrace 1 \rbrace$ and $k \in \mathbb{N}.$ In particular, we consider a general $k$-dimensional integral over $(0,1)^k$ that equals the series representation $S(k,a).$ Then we use an algebraic change of variables that diffeomorphically maps $(0,1)^k$ to a $k$-dimensional hyperbolic polytope. We interpret the integral as a sum of two probabilities, and find explicit representations of such probabilities with combinatorial techniques.
△ Less
Submitted 5 March, 2019;
originally announced March 2019.
-
Fast Hierarchical Depth Map Computation from Stereo
Authors:
Vinay Kaushik,
Brejesh Lall
Abstract:
Disparity by Block Matching stereo is usually used in applications with limited computational power in order to get depth estimates. However, the research on simple stereo methods has been lesser than the energy based counterparts which promise a better quality depth map with more potential for future improvements. Semi-global-matching (SGM) methods offer good performance and easy implementation b…
▽ More
Disparity by Block Matching stereo is usually used in applications with limited computational power in order to get depth estimates. However, the research on simple stereo methods has been lesser than the energy based counterparts which promise a better quality depth map with more potential for future improvements. Semi-global-matching (SGM) methods offer good performance and easy implementation but suffer from the problem of very high memory footprint because it's working on the full disparity space image. On the other hand, Block matching stereo needs much less memory. In this paper, we introduce a novel multi-scale-hierarchical block-matching approach using a pyramidal variant of depth and cost functions which drastically improves the results of standard block matching stereo techniques while preserving the low memory footprint and further reducing the complexity of standard block matching. We tested our new multi block matching scheme on the Middlebury stereo benchmark. For the Middlebury benchmark we get results that are only slightly worse than state of the art SGM implementations.
△ Less
Submitted 28 January, 2019;
originally announced January 2019.
-
Nrityantar: Pose oblivious Indian classical dance sequence classification system
Authors:
Vinay Kaushik,
Prerana Mukherjee,
Brejesh Lall
Abstract:
In this paper, we attempt to advance the research work done in human action recognition to a rather specialized application namely Indian Classical Dance (ICD) classification. The variation in such dance forms in terms of hand and body postures, facial expressions or emotions and head orientation makes pose estimation an extremely challenging task. To circumvent this problem, we construct a pose-o…
▽ More
In this paper, we attempt to advance the research work done in human action recognition to a rather specialized application namely Indian Classical Dance (ICD) classification. The variation in such dance forms in terms of hand and body postures, facial expressions or emotions and head orientation makes pose estimation an extremely challenging task. To circumvent this problem, we construct a pose-oblivious shape signature which is fed to a sequence learning framework. The pose signature representation is done in two-fold process. First, we represent person-pose in first frame of a dance video using symmetric Spatial Transformer Networks (STN) to extract good person object proposals and CNN-based parallel single person pose estimator (SPPE). Next, the pose basis are converted to pose flows by assigning a similarity score between successive poses followed by non-maximal suppression. Instead of feeding a simple chain of joints in the sequence learner which generally hinders the network performance we constitute a feature vector of the normalized distance vectors, flow, angles between anchor joints which captures the adjacency configuration in the skeletal pattern. Thus, the kinematic relationship amongst the body joints across the frames using pose estimation helps in better establishing the spatio-temporal dependencies. We present an exhaustive empirical evaluation of state-of-the-art deep network based methods for dance classification on ICD dataset.
△ Less
Submitted 12 December, 2018;
originally announced December 2018.
-
On Central Binomial Series Related to Zeta(4)
Authors:
Vivek Kaushik
Abstract:
In this paper, we prove two related central binomial series identities: $B(4)=\sum_{n \geq 0} \frac{\binom{2n}n}{2^{4n}(2n+1)^3}=\frac{7 π^3}{216}$ and $C(4)=\sum_{n \in \mathbb{N}} \frac{1}{n^4 \binom{2n}n}=\frac{17 π^4}{3240}.$ Both series resist all the standard approaches used to evaluate other well-known series. To prove the first series identity, we will evaluate a log-sine integral that is…
▽ More
In this paper, we prove two related central binomial series identities: $B(4)=\sum_{n \geq 0} \frac{\binom{2n}n}{2^{4n}(2n+1)^3}=\frac{7 π^3}{216}$ and $C(4)=\sum_{n \in \mathbb{N}} \frac{1}{n^4 \binom{2n}n}=\frac{17 π^4}{3240}.$ Both series resist all the standard approaches used to evaluate other well-known series. To prove the first series identity, we will evaluate a log-sine integral that is equal to $B(4).$ Evaluating this log-sine integral will lead us to computing closed forms of polylogarithms evaluated at certain complex exponentials. To prove the second identity, we will evaluate a double integral that is equal to $C(4).$ Evaluating this double integral will lead us to computing several polylogarithmic integrals, one of which has a closed form that is a linear combination of $B(4)$ and $C(4).$ After proving these series identities, we evaluate several challenging logarithmic and polylogarithmic integrals, whose evaluations involve surprising appearances of integral representations of $B(4)$ and $C(4).$ We also provide an insight into the generalization of a modern double integral proof of Euler's celebrated identity $\sum_{n \in \mathbb{N}} \frac{1}{n^2}=\frac{π^2}{6},$ in which we encounter an integral representation of $C(4).$
△ Less
Submitted 27 December, 2019; v1 submitted 14 November, 2018;
originally announced November 2018.
-
Evaluation of Harmonic Sums with Integrals
Authors:
Vivek Kaushik,
Daniele Ritelli
Abstract:
We consider the sums $S(k)=\sum_{n=0}^{\infty}\frac{(-1)^{nk}}{(2n+1)^k}$ and $ζ(2k)=\sum_{n=1}^{\infty}\frac{1}{n^{2k}}$ with $k$ being a positive integer. We evaluate these sums with multiple integration, a modern technique. First, we start with three different double integrals that have been previously used in the literature to show $S(2)=π^2/8,$ which implies Euler's identity $ζ(2)=π^2/6.$ The…
▽ More
We consider the sums $S(k)=\sum_{n=0}^{\infty}\frac{(-1)^{nk}}{(2n+1)^k}$ and $ζ(2k)=\sum_{n=1}^{\infty}\frac{1}{n^{2k}}$ with $k$ being a positive integer. We evaluate these sums with multiple integration, a modern technique. First, we start with three different double integrals that have been previously used in the literature to show $S(2)=π^2/8,$ which implies Euler's identity $ζ(2)=π^2/6.$ Then, we generalize each integral in order to find the considered sums. The $k$ dimensional analogue of the first integral is the density function of the quotient of $k$ independent, nonnegative Cauchy random variables. In seeking this function, we encounter a special logarithmic integral that we can directly relate to $S(k).$ The $k$ dimensional analogue of the second integral, upon a change of variables, is the volume of a convex polytope, which can be expressed as a probability involving certain pairwise sums of $k$ independent uniform random variables. We use combinatorial arguments to find the volume, which in turn gives new closed formulas for $S(k)$ and $ζ(2k).$ The $k$ dimensional analogue of the last integral, upon another change of variables, is an integral of the joint density function of $k$ Cauchy random variables over a hyperbolic polytope. This integral can be expressed as a probability involving certain pairwise products of these random variables, and it is equal to the probability from the second generalization. Thus, we specifically highlight the similarities in the combinatorial arguments between the second and third generalizations.
△ Less
Submitted 1 September, 2018; v1 submitted 10 October, 2017;
originally announced October 2017.
-
In-Line-Test of Variability and Bit-Error-Rate of HfOx-Based Resistive Memory
Authors:
B. L. Ji,
H. Li,
Q. Ye,
S. Gausepohl,
S. Deora,
D. Veksler,
S. Vivekanand,
H. Chong,
H. Stamper,
T. Burroughs,
C. Johnson,
M. Smalley,
S. Bennett,
V. Kaushik,
J. Piccirillo,
M. Rodgers,
M. Passaro,
M. Liehr
Abstract:
Spatial and temporal variability of HfOx-based resistive random access memory (RRAM) are investigated for manufacturing and product designs. Manufacturing variability is characterized at different levels including lots, wafers, and chips. Bit-error-rate (BER) is proposed as a holistic parameter for the write cycle resistance statistics. Using the electrical in-line-test cycle data, a method is dev…
▽ More
Spatial and temporal variability of HfOx-based resistive random access memory (RRAM) are investigated for manufacturing and product designs. Manufacturing variability is characterized at different levels including lots, wafers, and chips. Bit-error-rate (BER) is proposed as a holistic parameter for the write cycle resistance statistics. Using the electrical in-line-test cycle data, a method is developed to derive BERs as functions of the design margin, to provide guidance for technology evaluation and product design. The proposed BER calculation can also be used in the off-line bench test and build-in-self-test (BIST) for adaptive error correction and for the other types of random access memories.
△ Less
Submitted 31 August, 2015;
originally announced September 2015.
-
Micro-Raman and field emission studies of silicon nanowires prepared by metal assisted chemical etching
Authors:
Vivek Kumar,
Shailendra K. Saxena,
Vishakha Kaushik,
Kapil Saxena,
Rajesh Kumar,
A. K. Shukla
Abstract:
Micro-Raman scattering and electron field emission characteristics of silicon nanowires (SiNWs) synthesized by metal assisted chemical etching (MACE) are investigated. Scanning electron microscopy images reveal the growth of well aligned vertical SiNWs. Raman shift and size relation from bond-polarizability model has been used to calculate exact confinement sizes in SiNWs. The Si optical phonon pe…
▽ More
Micro-Raman scattering and electron field emission characteristics of silicon nanowires (SiNWs) synthesized by metal assisted chemical etching (MACE) are investigated. Scanning electron microscopy images reveal the growth of well aligned vertical SiNWs. Raman shift and size relation from bond-polarizability model has been used to calculate exact confinement sizes in SiNWs. The Si optical phonon peak for SiNWs showed a downshift and an asymmetric broadening with decreasing diameter of the SiNWs due to quantum confinement of optical phonons. The field emission characteristics of these SiNWs are studied based by carrying out current-voltage measurements followed by a theoretical analysis using Fowler-Nordheim equation. The electron field emission increased with decreasing diameter of SiNWs. Field emission from these SiNWs exhibits significant enhancement in turn-on field and total emission current with decreasing nanowire size. The reported results in the current study indicate that MACE is a simple technique to prepare well-aligned SiNWs with potentials for applications in field emission devices.
△ Less
Submitted 28 May, 2014;
originally announced May 2014.
-
Architecture for Automated Tagging and Clustering of Song Files According to Mood
Authors:
Puneet Singh,
Ashutosh Kapoor,
Vishal Kaushik,
Hima Bindu Maringanti
Abstract:
Music is one of the basic human needs for recreation and entertainment. As song files are digitalized now a days, and digital libraries are expanding continuously, which makes it difficult to recall a song. Thus need of a new classification system other than genre is very obvious and mood based classification system serves the purpose very well. In this paper we will present a well-defined archite…
▽ More
Music is one of the basic human needs for recreation and entertainment. As song files are digitalized now a days, and digital libraries are expanding continuously, which makes it difficult to recall a song. Thus need of a new classification system other than genre is very obvious and mood based classification system serves the purpose very well. In this paper we will present a well-defined architecture to classify songs into different mood-based categories, using audio content analysis, affective value of song lyrics to map a song onto a psychological-based emotion space and information from online sources. In audio content analysis we will use music features such as intensity, timbre and rhythm including their subfeatures to map music in a 2-Dimensional emotional space. In lyric based classification 1-Dimensional emotional space is used. Both the results are merged onto a 2-Dimensional emotional space, which will classify song into a particular mood category. Finally clusters of mood based song files are formed and arranged according to data acquired from various Internet sources.
△ Less
Submitted 12 June, 2012;
originally announced June 2012.
-
A New Approach of Improving CFA Image for Digital Camera's
Authors:
Manoj Kumar,
Vikas Kaushik,
Pradeep Singla
Abstract:
This paper work directly towards the improving the quality of the image for the digital cameras and other visual capturing products. In this Paper, the authors clearly defines the problems occurs in the CFA image. A different methodology for removing the noise is discuses in the paper for color correction and color balancing of the image. At the same time, the authors also proposed a new methodolo…
▽ More
This paper work directly towards the improving the quality of the image for the digital cameras and other visual capturing products. In this Paper, the authors clearly defines the problems occurs in the CFA image. A different methodology for removing the noise is discuses in the paper for color correction and color balancing of the image. At the same time, the authors also proposed a new methodology of providing denoisiing process before the demosaickingfor the improving the image quality of CFA which is much efficient then the other previous defined. The demosaicking process for producing the colors in the image in a best way is also discuss.
△ Less
Submitted 24 April, 2012;
originally announced April 2012.