-
Safe Control of Quadruped in Varying Dynamics via Safety Index Adaptation
Authors:
Kai S. Yun,
Rui Chen,
Chase Dunaway,
John M. Dolan,
Changliu Liu
Abstract:
Varying dynamics pose a fundamental difficulty when deploying safe control laws in the real world. Safety Index Synthesis (SIS) deeply relies on the system dynamics and once the dynamics change, the previously synthesized safety index becomes invalid. In this work, we show the real-time efficacy of Safety Index Adaptation (SIA) in varying dynamics. SIA enables real-time adaptation to the changing…
▽ More
Varying dynamics pose a fundamental difficulty when deploying safe control laws in the real world. Safety Index Synthesis (SIS) deeply relies on the system dynamics and once the dynamics change, the previously synthesized safety index becomes invalid. In this work, we show the real-time efficacy of Safety Index Adaptation (SIA) in varying dynamics. SIA enables real-time adaptation to the changing dynamics so that the adapted safe control law can still guarantee 1) forward invariance within a safe region and 2) finite time convergence to that safe region. This work employs SIA on a package-carrying quadruped robot, where the payload weight changes in real-time. SIA updates the safety index when the dynamics change, e.g., a change in payload weight, so that the quadruped can avoid obstacles while achieving its performance objectives. Numerical study provides theoretical guarantees for SIA and a series of hardware experiments demonstrate the effectiveness of SIA in real-world deployment in avoiding obstacles under varying dynamics.
△ Less
Submitted 15 September, 2024;
originally announced September 2024.
-
ModelVerification.jl: a Comprehensive Toolbox for Formally Verifying Deep Neural Networks
Authors:
Tianhao Wei,
Luca Marzari,
Kai S. Yun,
Hanjiang Hu,
Peizhi Niu,
Xusheng Luo,
Changliu Liu
Abstract:
Deep Neural Networks (DNN) are crucial in approximating nonlinear functions across diverse applications, ranging from image classification to control. Verifying specific input-output properties can be a highly challenging task due to the lack of a single, self-contained framework that allows a complete range of verification types. To this end, we present \texttt{ModelVerification.jl (MV)}, the fir…
▽ More
Deep Neural Networks (DNN) are crucial in approximating nonlinear functions across diverse applications, ranging from image classification to control. Verifying specific input-output properties can be a highly challenging task due to the lack of a single, self-contained framework that allows a complete range of verification types. To this end, we present \texttt{ModelVerification.jl (MV)}, the first comprehensive, cutting-edge toolbox that contains a suite of state-of-the-art methods for verifying different types of DNNs and safety specifications. This versatile toolbox is designed to empower developers and machine learning practitioners with robust tools for verifying and ensuring the trustworthiness of their DNN models.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Holographic reconstruction of black hole spacetime: machine learning and entanglement entropy
Authors:
Byoungjoon Ahn,
Hyun-Sik Jeong,
Keun-Young Kim,
Kwan Yun
Abstract:
We investigate the bulk reconstruction of AdS black hole spacetime emergent from quantum entanglement within a machine learning framework. Utilizing neural ordinary differential equations alongside Monte-Carlo integration, we develop a method tailored for continuous training functions to extract the general isotropic bulk metric from entanglement entropy data. To validate our approach, we first ap…
▽ More
We investigate the bulk reconstruction of AdS black hole spacetime emergent from quantum entanglement within a machine learning framework. Utilizing neural ordinary differential equations alongside Monte-Carlo integration, we develop a method tailored for continuous training functions to extract the general isotropic bulk metric from entanglement entropy data. To validate our approach, we first apply our machine learning algorithm to holographic entanglement entropy data derived from the Gubser-Rocha and superconductor models, which serve as representative models of strongly coupled matters in holography. Our algorithm successfully extracts the corresponding bulk metrics from these data. Additionally, we extend our methodology to many-body systems by employing entanglement entropy data from a fermionic tight-binding chain at half filling, exemplifying critical one-dimensional systems, and derive the associated bulk metric. We find that the metrics for a tight-binding chain and the Gubser-Rocha model are similar. We speculate this similarity is due to the metallic property of these models.
△ Less
Submitted 8 September, 2024; v1 submitted 11 June, 2024;
originally announced June 2024.
-
Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking
Authors:
Xiaokang Zhang,
Zijun Yao,
Jing Zhang,
Kaifeng Yun,
Jifan Yu,
Juanzi Li,
Jie Tang
Abstract:
Detecting non-factual content is a longstanding goal to increase the trustworthiness of large language models (LLMs) generations. Current factuality probes, trained using humanannotated labels, exhibit limited transferability to out-of-distribution content, while online selfconsistency checking imposes extensive computation burden due to the necessity of generating multiple outputs. This paper pro…
▽ More
Detecting non-factual content is a longstanding goal to increase the trustworthiness of large language models (LLMs) generations. Current factuality probes, trained using humanannotated labels, exhibit limited transferability to out-of-distribution content, while online selfconsistency checking imposes extensive computation burden due to the necessity of generating multiple outputs. This paper proposes PINOSE, which trains a probing model on offline self-consistency checking results, thereby circumventing the need for human-annotated data and achieving transferability across diverse data distributions. As the consistency check process is offline, PINOSE reduces the computational burden of generating multiple responses by online consistency verification. Additionally, it examines various aspects of internal states prior to response decoding, contributing to more effective detection of factual inaccuracies. Experiment results on both factuality detection and question answering benchmarks show that PINOSE achieves surpassing results than existing factuality detection methods. Our code and datasets are publicly available on this anonymized repository.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example
Authors:
Soyeon Yoon,
Kwan Yun,
Kwanggyoon Seo,
Sihun Cha,
Jung Eun Yoo,
Junyong Noh
Abstract:
Recent advances in 3D face stylization have made significant strides in few to zero-shot settings. However, the degree of stylization achieved by existing methods is often not sufficient for practical applications because they are mostly based on statistical 3D Morphable Models (3DMM) with limited variations. To this end, we propose a method that can produce a highly stylized 3D face model with de…
▽ More
Recent advances in 3D face stylization have made significant strides in few to zero-shot settings. However, the degree of stylization achieved by existing methods is often not sufficient for practical applications because they are mostly based on statistical 3D Morphable Models (3DMM) with limited variations. To this end, we propose a method that can produce a highly stylized 3D face model with desired topology. Our methods train a surface deformation network with 3DMM and translate its domain to the target style using a paired exemplar. The network achieves stylization of the 3D face mesh by mimicking the style of the target using a differentiable renderer and directional CLIP losses. Additionally, during the inference process, we utilize a Mesh Agnostic Encoder (MAGE) that takes deformation target, a mesh of diverse topologies as input to the stylization process and encodes its shape into our latent space. The resulting stylized face model can be animated by commonly used 3DMM blend shapes. A set of quantitative and qualitative evaluations demonstrate that our method can produce highly stylized face meshes according to a given style and output them in a desired topology. We also demonstrate example applications of our method including image-based stylized avatar generation, linear interpolation of geometric styles, and facial animation of stylized avatars.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
Stylized Face Sketch Extraction via Generative Prior with Limited Data
Authors:
Kwan Yun,
Kwanggyoon Seo,
Chang Wook Seo,
Soyeon Yoon,
Seongcheol Kim,
Soohyun Ji,
Amirsaman Ashtari,
Junyong Noh
Abstract:
Facial sketches are both a concise way of showing the identity of a person and a means to express artistic intention. While a few techniques have recently emerged that allow sketches to be extracted in different styles, they typically rely on a large amount of data that is difficult to obtain. Here, we propose StyleSketch, a method for extracting high-resolution stylized sketches from a face image…
▽ More
Facial sketches are both a concise way of showing the identity of a person and a means to express artistic intention. While a few techniques have recently emerged that allow sketches to be extracted in different styles, they typically rely on a large amount of data that is difficult to obtain. Here, we propose StyleSketch, a method for extracting high-resolution stylized sketches from a face image. Using the rich semantics of the deep features from a pretrained StyleGAN, we are able to train a sketch generator with 16 pairs of face and the corresponding sketch images. The sketch generator utilizes part-based losses with two-stage learning for fast convergence during training for high-quality sketch extraction. Through a set of comparisons, we show that StyleSketch outperforms existing state-of-the-art sketch extraction methods and few-shot image adaptation methods for the task of extracting high-resolution abstract face sketches. We further demonstrate the versatility of StyleSketch by extending its use to other domains and explore the possibility of semantic editing. The project page can be found in https://kwanyun.github.io/stylesketch_project.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Task-Specific Adaptation of Segmentation Foundation Model via Prompt Learning
Authors:
Hyung-Il Kim,
Kimin Yun,
Jun-Seok Yun,
Yuseok Bae
Abstract:
Recently, foundation models trained on massive datasets to adapt to a wide range of tasks have attracted considerable attention and are actively being explored within the computer vision community. Among these, the Segment Anything Model (SAM) stands out for its remarkable progress in generalizability and flexibility for image segmentation tasks, achieved through prompt-based object mask generatio…
▽ More
Recently, foundation models trained on massive datasets to adapt to a wide range of tasks have attracted considerable attention and are actively being explored within the computer vision community. Among these, the Segment Anything Model (SAM) stands out for its remarkable progress in generalizability and flexibility for image segmentation tasks, achieved through prompt-based object mask generation. However, despite its strength, SAM faces two key limitations when applied to instance segmentation that segments specific objects or those in unique environments (e.g., task-specific adaptation for out-of-distribution objects) not typically present in the training data: 1) the ambiguity inherent in input prompts and 2) the necessity for extensive additional training to achieve optimal segmentation. To address these challenges, we propose a task-specific adaptation (i.e., customization) of the segmentation foundation model via prompt learning tailored to SAM. Our method involves a prompt learning module (PLM), which adjusts input prompts into the embedding space to better align with peculiarities of the target task, thereby enabling more efficient training. Furthermore, we introduce a point matching module (PMM) to enhance the feature representation for finer segmentation by ensuring detailed alignment with ground truth boundaries. Experimental results on various customized segmentation scenarios demonstrate the effectiveness of the proposed method.
△ Less
Submitted 11 October, 2024; v1 submitted 14 March, 2024;
originally announced March 2024.
-
Maximum plasmon thermal conductivity of a thin metal film
Authors:
Kuk Hyun Yun,
Dong-min Kim,
Bong Jae Lee
Abstract:
Due to their extremely long propagation lengths compared to the wavelengths, surface plasmon polaritons (SPPs) have been considered as a key in enhancing thermal conductivity in thin metal films. This study explores the conditions at which the plasmon thermal conductivity is maximized, considering the thickness-dependent metal permittivity. We derived the analytical solutions for the plasmon therm…
▽ More
Due to their extremely long propagation lengths compared to the wavelengths, surface plasmon polaritons (SPPs) have been considered as a key in enhancing thermal conductivity in thin metal films. This study explores the conditions at which the plasmon thermal conductivity is maximized, considering the thickness-dependent metal permittivity. We derived the analytical solutions for the plasmon thermal conductivity in both the thin-film and thick-film limits to analyze the effect of the permittivities of metals and substrates. From the analytical solutions of plasmon thermal conductivity, we deduced that the plasmon thermal conductivity is proportional to the electron thermal conductivity based on the Wiedemann-Franz law. Additionally, we analyzed the conditions where the enhancement ratio of the thermal conductivity via SPPs is maximized. Metals with high plasma frequency and low damping coefficient are desirable for achieving the maximum plasmon thermal conductivity as well as the maximum enhancement ratio of thermal conductivity among metals. Significantly, 10-cm-long and 14-nm-thick Al film demonstrates most superior in-plane heat transfer via SPPs, showing a 53.5\% enhancement in thermal conductivity compared to its electron thermal counterpart on a lossless glass substrate.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example
Authors:
Kwan Yun,
Youngseo Kim,
Kwanggyoon Seo,
Chang Wook Seo,
Junyong Noh
Abstract:
We introduce DiffSketch, a method for generating a variety of stylized sketches from images. Our approach focuses on selecting representative features from the rich semantics of deep features within a pretrained diffusion model. This novel sketch generation method can be trained with one manual drawing. Furthermore, efficient sketch extraction is ensured by distilling a trained generator into a st…
▽ More
We introduce DiffSketch, a method for generating a variety of stylized sketches from images. Our approach focuses on selecting representative features from the rich semantics of deep features within a pretrained diffusion model. This novel sketch generation method can be trained with one manual drawing. Furthermore, efficient sketch extraction is ensured by distilling a trained generator into a streamlined extractor. We select denoising diffusion features through analysis and integrate these selected features with VAE features to produce sketches. Additionally, we propose a sampling scheme for training models using a conditional generative approach. Through a series of comparisons, we verify that distilled DiffSketch not only outperforms existing state-of-the-art sketch extraction methods but also surpasses diffusion-based stylization methods in the task of extracting sketches.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
Deep learning bulk spacetime from boundary optical conductivity
Authors:
Byoungjoon Ahn,
Hyun-Sik Jeong,
Keun-Young Kim,
Kwan Yun
Abstract:
We employ a deep learning method to deduce the \textit{bulk} spacetime from \textit{boundary} optical conductivity. We apply the neural ordinary differential equation technique, tailored for continuous functions such as the metric, to the typical class of holographic condensed matter models featuring broken translations: linear-axion models. We successfully extract the bulk metric from the boundar…
▽ More
We employ a deep learning method to deduce the \textit{bulk} spacetime from \textit{boundary} optical conductivity. We apply the neural ordinary differential equation technique, tailored for continuous functions such as the metric, to the typical class of holographic condensed matter models featuring broken translations: linear-axion models. We successfully extract the bulk metric from the boundary holographic optical conductivity. Furthermore, as an example for real material, we use experimental optical conductivity of $\text{UPd}_2\text{Al}_3$, a representative of heavy fermion metals in strongly correlated electron systems, and construct the corresponding bulk metric. To our knowledge, our work is the first illustration of deep learning bulk spacetime from \textit{boundary} holographic or experimental conductivity data.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
Constraint-Informed Learning for Warm Starting Trajectory Optimization
Authors:
Julia Briden,
Changrak Choi,
Kyongsik Yun,
Richard Linares,
Abhishek Cauligi
Abstract:
Future spacecraft and surface robotic missions require increasingly capable autonomy stacks for exploring challenging and unstructured domains, and trajectory optimization will be a cornerstone of such autonomy stacks. However, the nonlinear optimization solvers required remain too slow for use on relatively resource-constrained flight-grade computers. In this work, we turn towards amortized optim…
▽ More
Future spacecraft and surface robotic missions require increasingly capable autonomy stacks for exploring challenging and unstructured domains, and trajectory optimization will be a cornerstone of such autonomy stacks. However, the nonlinear optimization solvers required remain too slow for use on relatively resource-constrained flight-grade computers. In this work, we turn towards amortized optimization, a learning-based technique for accelerating optimization run times, and present TOAST: Trajectory Optimization with Merit Function Warm Starts. Offline, using data collected from a simulation, we train a neural network to learn a mapping to the full primal and dual solutions given the problem parameters. Crucially, we build upon recent results from decision-focused learning and present a set of decision-focused loss functions using the notion of merit functions for optimization problems. We show that training networks with such constraint-informed losses can better encode the structure of the trajectory optimization problem and jointly learn to reconstruct the primal-dual solution while yielding improved constraint satisfaction. Through numerical experiments on a Lunar rover problem and a 3-degrees-of-freedom Mars powered descent guidance problem, we demonstrate that TOAST outperforms benchmark approaches in terms of both computation times and network prediction constraint satisfaction.
△ Less
Submitted 16 September, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Synthesis and verification of robust-adaptive safe controllers
Authors:
Simin Liu,
Kai S. Yun,
John M. Dolan,
Changliu Liu
Abstract:
Safe control with guarantees generally requires the system model to be known. It is far more challenging to handle systems with uncertain parameters. In this paper, we propose a generic algorithm that can synthesize and verify safe controllers for systems with constant, unknown parameters. In particular, we use robust-adaptive control barrier functions (raCBFs) to achieve safety. We develop new th…
▽ More
Safe control with guarantees generally requires the system model to be known. It is far more challenging to handle systems with uncertain parameters. In this paper, we propose a generic algorithm that can synthesize and verify safe controllers for systems with constant, unknown parameters. In particular, we use robust-adaptive control barrier functions (raCBFs) to achieve safety. We develop new theories and techniques using sum-of-squares that enable us to pose synthesis and verification as a series of convex optimization problems. In our experiments, we show that our algorithms are general and scalable, applying them to three different polynomial systems of up to moderate size (7D). Our raCBFs are currently the most effective way to guarantee safety for uncertain systems, achieving 100% safety and up to 55% performance improvement over a robust baseline.
△ Less
Submitted 2 April, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
You Only Train Once: A Unified Framework for Both Full-Reference and No-Reference Image Quality Assessment
Authors:
Yi Ke Yun,
Weisi Lin
Abstract:
Although recent efforts in image quality assessment (IQA) have achieved promising performance, there still exists a considerable gap compared to the human visual system (HVS). One significant disparity lies in humans' seamless transition between full reference (FR) and no reference (NR) tasks, whereas existing models are constrained to either FR or NR tasks. This disparity implies the necessity of…
▽ More
Although recent efforts in image quality assessment (IQA) have achieved promising performance, there still exists a considerable gap compared to the human visual system (HVS). One significant disparity lies in humans' seamless transition between full reference (FR) and no reference (NR) tasks, whereas existing models are constrained to either FR or NR tasks. This disparity implies the necessity of designing two distinct systems, thereby greatly diminishing the model's versatility. Therefore, our focus lies in unifying FR and NR IQA under a single framework. Specifically, we first employ an encoder to extract multi-level features from input images. Then a Hierarchical Attention (HA) module is proposed as a universal adapter for both FR and NR inputs to model the spatial distortion at each encoder stage. Furthermore, considering that different distortions contaminate encoder stages and damage image semantic meaning differently, a Semantic Distortion Aware (SDA) module is proposed to examine feature correlations between shallow and deep layers of the encoder. By adopting HA and SDA, the proposed network can effectively perform both FR and NR IQA. When our proposed model is independently trained on NR or FR IQA tasks, it outperforms existing models and achieves state-of-the-art performance. Moreover, when trained jointly on NR and FR IQA tasks, it further enhances the performance of NR IQA while achieving on-par performance in the state-of-the-art FR IQA. You only train once to perform both IQA tasks. Code will be released at: https://github.com/BarCodeReader/YOTO.
△ Less
Submitted 5 April, 2024; v1 submitted 14 October, 2023;
originally announced October 2023.
-
KoLA: Carefully Benchmarking World Knowledge of Large Language Models
Authors:
Jifan Yu,
Xiaozhi Wang,
Shangqing Tu,
Shulin Cao,
Daniel Zhang-Li,
Xin Lv,
Hao Peng,
Zijun Yao,
Xiaohan Zhang,
Hanming Li,
Chunyang Li,
Zheyuan Zhang,
Yushi Bai,
Yantao Liu,
Amy Xin,
Nianyi Lin,
Kaifeng Yun,
Linlu Gong,
Jianhui Chen,
Zhili Wu,
Yunjia Qi,
Weikai Li,
Yong Guan,
Kaisheng Zeng,
Ji Qi
, et al. (10 additional authors not shown)
Abstract:
The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations. Rather than merely exploring the breadth of LLM abilities, we believe meticulous and thoughtful designs are essential to thorough, unbiased, and applicable evaluations. Given the importance of world knowledge to LLMs, we construct a Knowledge-oriented LLM Assessment benchmark (KoLA), in which we…
▽ More
The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations. Rather than merely exploring the breadth of LLM abilities, we believe meticulous and thoughtful designs are essential to thorough, unbiased, and applicable evaluations. Given the importance of world knowledge to LLMs, we construct a Knowledge-oriented LLM Assessment benchmark (KoLA), in which we carefully design three crucial factors: (1) For \textbf{ability modeling}, we mimic human cognition to form a four-level taxonomy of knowledge-related abilities, covering $19$ tasks. (2) For \textbf{data}, to ensure fair comparisons, we use both Wikipedia, a corpus prevalently pre-trained by LLMs, along with continuously collected emerging corpora, aiming to evaluate the capacity to handle unseen data and evolving knowledge. (3) For \textbf{evaluation criteria}, we adopt a contrastive system, including overall standard scores for better numerical comparability across tasks and models and a unique self-contrast metric for automatically evaluating knowledge-creating ability. We evaluate $28$ open-source and commercial LLMs and obtain some intriguing findings. The KoLA dataset and open-participation leaderboard are publicly released at https://kola.xlore.cn and will be continuously updated to provide references for developing LLMs and knowledge-related systems.
△ Less
Submitted 30 June, 2024; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Remote estimation of geologic composition using interferometric synthetic-aperture radar in California's Central Valley
Authors:
Kyongsik Yun,
Kyra Adams,
John Reager,
Zhen Liu,
Caitlyn Chavez,
Michael Turmon,
Thomas Lu
Abstract:
California's Central Valley is the national agricultural center, producing 1/4 of the nation's food. However, land in the Central Valley is sinking at a rapid rate (as much as 20 cm per year) due to continued groundwater pumping. Land subsidence has a significant impact on infrastructure resilience and groundwater sustainability. In this study, we aim to identify specific regions with different te…
▽ More
California's Central Valley is the national agricultural center, producing 1/4 of the nation's food. However, land in the Central Valley is sinking at a rapid rate (as much as 20 cm per year) due to continued groundwater pumping. Land subsidence has a significant impact on infrastructure resilience and groundwater sustainability. In this study, we aim to identify specific regions with different temporal dynamics of land displacement and find relationships with underlying geological composition. Then, we aim to remotely estimate geologic composition using interferometric synthetic aperture radar (InSAR)-based land deformation temporal changes using machine learning techniques. We identified regions with different temporal characteristics of land displacement in that some areas (e.g., Helm) with coarser grain geologic compositions exhibited potentially reversible land deformation (elastic land compaction). We found a significant correlation between InSAR-based land deformation and geologic composition using random forest and deep neural network regression models. We also achieved significant accuracy with 1/4 sparse sampling to reduce any spatial correlations among data, suggesting that the model has the potential to be generalized to other regions for indirect estimation of geologic composition. Our results indicate that geologic composition can be estimated using InSAR-based land deformation data. In-situ measurements of geologic composition can be expensive and time consuming and may be impractical in some areas. The generalizability of the model sheds light on high spatial resolution geologic composition estimation utilizing existing measurements.
△ Less
Submitted 4 December, 2022;
originally announced December 2022.
-
Face Shape-Guided Deep Feature Alignment for Face Recognition Robust to Face Misalignment
Authors:
Hyung-Il Kim,
Kimin Yun,
Yong Man Ro
Abstract:
For the past decades, face recognition (FR) has been actively studied in computer vision and pattern recognition society. Recently, due to the advances in deep learning, the FR technology shows high performance for most of the benchmark datasets. However, when the FR algorithm is applied to a real-world scenario, the performance has been known to be still unsatisfactory. This is mainly attributed…
▽ More
For the past decades, face recognition (FR) has been actively studied in computer vision and pattern recognition society. Recently, due to the advances in deep learning, the FR technology shows high performance for most of the benchmark datasets. However, when the FR algorithm is applied to a real-world scenario, the performance has been known to be still unsatisfactory. This is mainly attributed to the mismatch between training and testing sets. Among such mismatches, face misalignment between training and testing faces is one of the factors that hinder successful FR. To address this limitation, we propose a face shape-guided deep feature alignment framework for FR robust to the face misalignment. Based on a face shape prior (e.g., face keypoints), we train the proposed deep network by introducing alignment processes, i.e., pixel and feature alignments, between well-aligned and misaligned face images. Through the pixel alignment process that decodes the aggregated feature extracted from a face image and face shape prior, we add the auxiliary task to reconstruct the well-aligned face image. Since the aggregated features are linked to the face feature extraction network as a guide via the feature alignment process, we train the robust face feature to the face misalignment. Even if the face shape estimation is required in the training stage, the additional face alignment process, which is usually incorporated in the conventional FR pipeline, is not necessarily needed in the testing phase. Through the comparative experiments, we validate the effectiveness of the proposed method for the face misalignment with the FR datasets.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
Warped Disk Galaxies. I. Linking U type Warps in Groups/Clusters to Jellyfish Galaxies
Authors:
Woong-Bae G. Zee,
Suk-Jin Yoon,
Jun-Sung Moon,
Sung-Ho An,
Sanjaya Paudel,
Kiyun Yun
Abstract:
arped disk galaxies are classified into two morphologies: S- and U-types. Conventional theories routinely attribute both types to galactic tidal interaction and/or gas accretion, but reproducing of U-types in simulations is extremely challenging. Here we investigate whether both types are governed by the same mechanisms using the most extensive sample of $\sim$8000 nearby (0.02\,$<$\,z\,$<$\,0.06)…
▽ More
arped disk galaxies are classified into two morphologies: S- and U-types. Conventional theories routinely attribute both types to galactic tidal interaction and/or gas accretion, but reproducing of U-types in simulations is extremely challenging. Here we investigate whether both types are governed by the same mechanisms using the most extensive sample of $\sim$8000 nearby (0.02\,$<$\,z\,$<$\,0.06) massive ($M_{*}/M_{\odot}$\,$>$\,$10^9$) edge-on disks from SDSS. We find that U-types show on average bluer optical colors and higher specific star formation rate (sSFR) than S-types, with more strongly warped U-types having higher sSFR. We also find that while the S-type warp properties correlate with the tidal force by the nearest neighbor regardless of the environment, there is no such correlation for U-types in groups/clusters, suggesting a non-tidal environmental could be at play for U-types, such as ram pressure stripping (RPS). Indeed, U-types are more common in groups/clusters than in fields and they have stellar mass, gas fraction, sSFR enhancement and phase-space distribution closely analogous to RPS-induced jellyfish galaxies in clusters. We furthermore show that the stellar disks of most RPS galaxies in the IllustirsTNG simulation are warped in U-shape and bent in opposite direction of stripped gas tails, satisfying theoretical expectations for stellar warps embeded in jellyfishes. We therefore suggest that despite the majority of U-types that live in fields being still less explained, RPS can be an alternative origin for those in groups/clusters.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
SelfReformer: Self-Refined Network with Transformer for Salient Object Detection
Authors:
Yi Ke Yun,
Weisi Lin
Abstract:
The global and local contexts significantly contribute to the integrity of predictions in Salient Object Detection (SOD). Unfortunately, existing methods still struggle to generate complete predictions with fine details. There are two major problems in conventional approaches: first, for global context, high-level CNN-based encoder features cannot effectively catch long-range dependencies, resulti…
▽ More
The global and local contexts significantly contribute to the integrity of predictions in Salient Object Detection (SOD). Unfortunately, existing methods still struggle to generate complete predictions with fine details. There are two major problems in conventional approaches: first, for global context, high-level CNN-based encoder features cannot effectively catch long-range dependencies, resulting in incomplete predictions. Second, downsampling the ground truth to fit the size of predictions will introduce inaccuracy as the ground truth details are lost during interpolation or pooling. Thus, in this work, we developed a Transformer-based network and framed a supervised task for a branch to learn the global context information explicitly. Besides, we adopt Pixel Shuffle from Super-Resolution (SR) to reshape the predictions back to the size of ground truth instead of the reverse. Thus details in the ground truth are untouched. In addition, we developed a two-stage Context Refinement Module (CRM) to fuse global context and automatically locate and refine the local details in the predictions. The proposed network can guide and correct itself based on the global and local context generated, thus is named, Self-Refined Transformer (SelfReformer). Extensive experiments and evaluation results on five benchmark datasets demonstrate the outstanding performance of the network, and we achieved the state-of-the-art.
△ Less
Submitted 18 July, 2022; v1 submitted 23 May, 2022;
originally announced May 2022.
-
Position-aware Location Regression Network for Temporal Video Grounding
Authors:
Sunoh Kim,
Kimin Yun,
Jin Young Choi
Abstract:
The key to successful grounding for video surveillance is to understand a semantic phrase corresponding to important actors and objects. Conventional methods ignore comprehensive contexts for the phrase or require heavy computation for multiple phrases. To understand comprehensive contexts with only one semantic phrase, we propose Position-aware Location Regression Network (PLRN) which exploits po…
▽ More
The key to successful grounding for video surveillance is to understand a semantic phrase corresponding to important actors and objects. Conventional methods ignore comprehensive contexts for the phrase or require heavy computation for multiple phrases. To understand comprehensive contexts with only one semantic phrase, we propose Position-aware Location Regression Network (PLRN) which exploits position-aware features of a query and a video. Specifically, PLRN first encodes both the video and query using positional information of words and video segments. Then, a semantic phrase feature is extracted from an encoded query with attention. The semantic phrase feature and encoded video are merged and made into a context-aware feature by reflecting local and global contexts. Finally, PLRN predicts start, end, center, and width values of a grounding boundary. Our experiments show that PLRN achieves competitive performance over existing methods with less computation time and memory.
△ Less
Submitted 11 April, 2022;
originally announced April 2022.
-
Neurosymbolic hybrid approach to driver collision warning
Authors:
Kyongsik Yun,
Thomas Lu,
Alexander Huyen,
Patrick Hammer,
Pei Wang
Abstract:
There are two main algorithmic approaches to autonomous driving systems: (1) An end-to-end system in which a single deep neural network learns to map sensory input directly into appropriate warning and driving responses. (2) A mediated hybrid recognition system in which a system is created by combining independent modules that detect each semantic feature. While some researchers believe that deep…
▽ More
There are two main algorithmic approaches to autonomous driving systems: (1) An end-to-end system in which a single deep neural network learns to map sensory input directly into appropriate warning and driving responses. (2) A mediated hybrid recognition system in which a system is created by combining independent modules that detect each semantic feature. While some researchers believe that deep learning can solve any problem, others believe that a more engineered and symbolic approach is needed to cope with complex environments with less data. Deep learning alone has achieved state-of-the-art results in many areas, from complex gameplay to predicting protein structures. In particular, in image classification and recognition, deep learning models have achieved accuracies as high as humans. But sometimes it can be very difficult to debug if the deep learning model doesn't work. Deep learning models can be vulnerable and are very sensitive to changes in data distribution. Generalization can be problematic. It's usually hard to prove why it works or doesn't. Deep learning models can also be vulnerable to adversarial attacks. Here, we combine deep learning-based object recognition and tracking with an adaptive neurosymbolic network agent, called the Non-Axiomatic Reasoning System (NARS), that can adapt to its environment by building concepts based on perceptual sequences. We achieved an improved intersection-over-union (IOU) object recognition performance of 0.65 in the adaptive retraining model compared to IOU 0.31 in the COCO data pre-trained model. We improved the object detection limits using RADAR sensors in a simulated environment, and demonstrated the weaving car detection capability by combining deep learning-based object detection and tracking with a neurosymbolic model.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
Machine Learning Based Relative Orbit Transfer for Swarm Spacecraft Motion Planning
Authors:
Alex Sabol,
Kyongsik Yun,
Muhammad Adil,
Changrak Choi,
Ramtin Madani
Abstract:
In this paper we describe a machine learning based framework for spacecraft swarm trajectory planning. In particular, we focus on coordinating motions of multi-spacecraft in formation flying through passive relative orbit(PRO) transfers. Accounting for spacecraft dynamics while avoiding collisions between the agents makes spacecraft swarm trajectory planning difficult. Centralized approaches can b…
▽ More
In this paper we describe a machine learning based framework for spacecraft swarm trajectory planning. In particular, we focus on coordinating motions of multi-spacecraft in formation flying through passive relative orbit(PRO) transfers. Accounting for spacecraft dynamics while avoiding collisions between the agents makes spacecraft swarm trajectory planning difficult. Centralized approaches can be used to solve this problem, but are computationally demanding and scale poorly with the number of agents in the swarm. As a result, centralized algorithms are ill-suited for real time trajectory planning on board small spacecraft (e.g. CubeSats) comprising the swarm. In our approach a neural network is used to approximate solutions of a centralized method. The necessary training data is generated using a centralized convex optimization framework through which several instances of the n=10 spacecraft swarm trajectory planning problem are solved. We are interested in answering the following questions which will give insight on the potential utility of deep learning-based approaches to the multi-spacecraft motion planning problem: 1) Can neural networks produce feasible trajectories that satisfy safety constraints (e.g. collision avoidance) and low in fuel cost? 2) Can a neural network trained using n spacecraft data be used to solve problems for spacecraft swarms of differing size?
△ Less
Submitted 28 January, 2022;
originally announced January 2022.
-
Explainability Tools Enabling Deep Learning in Future In-Situ Real-Time Planetary Explorations
Authors:
Daniel Lundstrom,
Alexander Huyen,
Arya Mevada,
Kyongsik Yun,
Thomas Lu
Abstract:
Deep learning (DL) has proven to be an effective machine learning and computer vision technique. DL-based image segmentation, object recognition and classification will aid many in-situ Mars rover tasks such as path planning and artifact recognition/extraction. However, most of the Deep Neural Network (DNN) architectures are so complex that they are considered a 'black box'. In this paper, we used…
▽ More
Deep learning (DL) has proven to be an effective machine learning and computer vision technique. DL-based image segmentation, object recognition and classification will aid many in-situ Mars rover tasks such as path planning and artifact recognition/extraction. However, most of the Deep Neural Network (DNN) architectures are so complex that they are considered a 'black box'. In this paper, we used integrated gradients to describe the attributions of each neuron to the output classes. It provides a set of explainability tools (ET) that opens the black box of a DNN so that the individual contribution of neurons to category classification can be ranked and visualized. The neurons in each dense layer are mapped and ranked by measuring expected contribution of a neuron to a class vote given a true image label. The importance of neurons is prioritized according to their correct or incorrect contribution to the output classes and suppression or bolstering of incorrect classes, weighted by the size of each class. ET provides an interface to prune the network to enhance high-rank neurons and remove low-performing neurons. ET technology will make DNNs smaller and more efficient for implementation in small embedded systems. It also leads to more explainable and testable DNNs that can make systems easier for Validation \& Verification. The goal of ET technology is to enable the adoption of DL in future in-situ planetary exploration missions.
△ Less
Submitted 15 January, 2022;
originally announced January 2022.
-
Time Series Comparisons in Deep Space Network
Authors:
Kyongsik Yun,
Rishi Verma,
Umaa Rebbapragada
Abstract:
The Deep Space Network is NASA's international array of antennas that support interplanetary spacecraft missions. A track is a block of multi-dimensional time series from the beginning to end of DSN communication with the target spacecraft, containing thousands of monitor data items lasting several hours at a frequency of 0.2-1Hz. Monitor data on each track reports on the performance of specific s…
▽ More
The Deep Space Network is NASA's international array of antennas that support interplanetary spacecraft missions. A track is a block of multi-dimensional time series from the beginning to end of DSN communication with the target spacecraft, containing thousands of monitor data items lasting several hours at a frequency of 0.2-1Hz. Monitor data on each track reports on the performance of specific spacecraft operations and the DSN itself. DSN is receiving signals from 32 spacecraft across the solar system. DSN has pressure to reduce costs while maintaining the quality of support for DSN mission users. DSN Link Control Operators need to simultaneously monitor multiple tracks and identify anomalies in real time. DSN has seen that as the number of missions increases, the data that needs to be processed increases over time. In this project, we look at the last 8 years of data for analysis. Any anomaly in the track indicates a problem with either the spacecraft, DSN equipment, or weather conditions. DSN operators typically write Discrepancy Reports for further analysis. It is recognized that it would be quite helpful to identify 10 similar historical tracks out of the huge database to quickly find and match anomalies. This tool has three functions: (1) identification of the top 10 similar historical tracks, (2) detection of anomalies compared to the reference normal track, and (3) comparison of statistical differences between two given tracks. The requirements for these features were confirmed by survey responses from 21 DSN operators and engineers. The preliminary machine learning model has shown promising performance (AUC=0.92). We plan to increase the number of data sets and perform additional testing to improve performance further before its planned integration into the track visualizer interface to assist DSN field operators and engineers.
△ Less
Submitted 2 November, 2021;
originally announced November 2021.
-
Robust Pedestrian Attribute Recognition Using Group Sparsity for Occlusion Videos
Authors:
Geonu Lee,
Kimin Yun,
Jungchan Cho
Abstract:
Occlusion processing is a key issue in pedestrian attribute recognition (PAR). Nevertheless, several existing video-based PAR methods have not yet considered occlusion handling in depth. In this paper, we formulate finding non-occluded frames as sparsity-based temporal attention of a crowded video. In this manner, a model is guided not to pay attention to the occluded frame. However, temporal spar…
▽ More
Occlusion processing is a key issue in pedestrian attribute recognition (PAR). Nevertheless, several existing video-based PAR methods have not yet considered occlusion handling in depth. In this paper, we formulate finding non-occluded frames as sparsity-based temporal attention of a crowded video. In this manner, a model is guided not to pay attention to the occluded frame. However, temporal sparsity cannot include a correlation between attributes when occlusion occurs. For example, "boots" and "shoe color" cannot be recognized when the foot is invisible. To solve the uncorrelated attention issue, we also propose a novel group sparsity-based temporal attention module. Group sparsity is applied across attention weights in correlated attributes. Thus, attention weights in a group are forced to pay attention to the same frames. Experimental results showed that the proposed method achieved a higher F1-score than the state-of-the-art methods on two video-based PAR datasets.
△ Less
Submitted 21 July, 2022; v1 submitted 16 October, 2021;
originally announced October 2021.
-
Recursive Contour Saliency Blending Network for Accurate Salient Object Detection
Authors:
Yi Ke Yun,
Takahiro Tsubono
Abstract:
Contour information plays a vital role in salient object detection. However, excessive false positives remain in predictions from existing contour-based models due to insufficient contour-saliency fusion. In this work, we designed a network for better edge quality in salient object detection. We proposed a contour-saliency blending module to exchange information between contour and saliency. We ad…
▽ More
Contour information plays a vital role in salient object detection. However, excessive false positives remain in predictions from existing contour-based models due to insufficient contour-saliency fusion. In this work, we designed a network for better edge quality in salient object detection. We proposed a contour-saliency blending module to exchange information between contour and saliency. We adopted recursive CNN to increase contour-saliency fusion while keeping the total trainable parameters the same. Furthermore, we designed a stage-wise feature extraction module to help the model pick up the most helpful features from previous intermediate saliency predictions. Besides, we proposed two new loss functions, namely Dual Confinement Loss and Confidence Loss, for our model to generate better boundary predictions. Evaluation results on five common benchmark datasets reveal that our model achieves competitive state-of-the-art performance.
△ Less
Submitted 22 August, 2021; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Single-Crystalline Metallic Films Induced by van der Waals Epitaxy on Black Phosphorus
Authors:
Yangjin Lee,
Han-gyu Kim,
Tae Keun Yun,
Jong Chan Kim,
Sol Lee,
Sung Jin Yang,
Myeongjin Jang,
Donggyu Kim,
Huije Ryu,
Gwan-Hyoung Lee,
Seongil Im,
Hu Young Jeong,
Hyoung Joon Choi,
Kwanpyo Kim
Abstract:
The properties of metal-semiconductor junctions are often unpredictable because of non-ideal interfacial structures, such as interfacial defects or chemical reactions introduced at junctions. Black phosphorus (BP), an elemental two-dimensional (2D) semiconducting crystal, possesses the puckered atomic structure with high chemical reactivity, and the establishment of a realistic atomic-scale pictur…
▽ More
The properties of metal-semiconductor junctions are often unpredictable because of non-ideal interfacial structures, such as interfacial defects or chemical reactions introduced at junctions. Black phosphorus (BP), an elemental two-dimensional (2D) semiconducting crystal, possesses the puckered atomic structure with high chemical reactivity, and the establishment of a realistic atomic-scale picture of BP's interface toward metallic contact has remained elusive. Here we examine the interfacial structures and properties of physically-deposited metals of various kinds on BP. We find that Au, Ag, and Bi form single-crystalline films with (110) orientation through guided van der Waals epitaxy. Transmission electron microscopy and X-ray photoelectron spectroscopy confirm that atomically sharp van der Waals metal-BP interfaces forms with exceptional rotational alignment. Under a weak metal-BP interaction regime, the BP's puckered structure play an essential role in the adatom assembly process and can lead to the formation of a single crystal, which is supported by our theoretical analysis and calculations. The experimental survey also demonstrates that the BP-metal junctions can exhibit various types of interfacial structures depending on metals, such as the formation of polycrystalline microstructure or metal phosphides. This study provides a guideline for obtaining a realistic view on metal-2D semiconductor interfacial structures, especially for atomically puckered 2D crystals.
△ Less
Submitted 3 May, 2021;
originally announced May 2021.
-
Ultrafast carrier-lattice interactions and interlayer modulations of Bi2Se3 by X-ray free electron laser diffraction
Authors:
Sungwon Kim,
Youngsam Kim,
Jaeseung Kim,
Sungwook Choi,
Kyuseok Yun,
Dongjin Kim,
Soo Yeon Lim,
Sunam Kim,
Sae Hwan Chun,
Jaeku Park,
Intae Eom,
Kyung Sook Kim,
Tae-Yeong Koo,
Yunbo Ou,
Ferhat Katmis,
Haidan Wen,
Anthony Dichiara,
Donald Walko,
Eric C. Landahl,
Hyeonsik Cheong,
Eunji Sim,
Jagadeesh Moodera,
Hyunjung Kim
Abstract:
As a 3D topological insulator, bismuth selenide (Bi2Se3) has potential applications for electrically and optically controllable magnetic and optoelectronic devices. How the carriers interact with lattice is important to understand the coupling with its topological phase. It is essential to measure with a time scale smaller than picoseconds for initial interaction. Here we use an X-ray free-electro…
▽ More
As a 3D topological insulator, bismuth selenide (Bi2Se3) has potential applications for electrically and optically controllable magnetic and optoelectronic devices. How the carriers interact with lattice is important to understand the coupling with its topological phase. It is essential to measure with a time scale smaller than picoseconds for initial interaction. Here we use an X-ray free-electron laser to perform time-resolved diffraction to study ultrafast carrier-induced lattice contractions and interlayer modulations in Bi2Se3 thin films. The lattice contraction depends on the carrier concentration and is followed by an interlayer expansion accompanied by oscillations. Using density functional theory (DFT) and the Lifshitz model, the initial contraction can be explained by van der Waals force modulation of the confined free carrier layers. Band inversion, related to a topological phase transition, is modulated by the expansion of the interlayer distance. These results provide insight into instantaneous topological phases on ultrafast timescales.
△ Less
Submitted 22 March, 2021;
originally announced March 2021.
-
Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification
Authors:
Youngwan Lee,
Hyung-Il Kim,
Kimin Yun,
Jinyoung Moon
Abstract:
Video classification researches that have recently attracted attention are the fields of temporal modeling and 3D efficient architecture. However, the temporal modeling methods are not efficient or the 3D efficient architecture is less interested in temporal modeling. For bridging the gap between them, we propose an efficient temporal modeling 3D architecture, called VoV3D, that consists of a temp…
▽ More
Video classification researches that have recently attracted attention are the fields of temporal modeling and 3D efficient architecture. However, the temporal modeling methods are not efficient or the 3D efficient architecture is less interested in temporal modeling. For bridging the gap between them, we propose an efficient temporal modeling 3D architecture, called VoV3D, that consists of a temporal one-shot aggregation (T-OSA) module and depthwise factorized component, D(2+1)D. The T-OSA is devised to build a feature hierarchy by aggregating temporal features with different temporal receptive fields. Stacking this T-OSA enables the network itself to model short-range as well as long-range temporal relationships across frames without any external modules. Inspired by kernel factorization and channel factorization, we also design a depthwise spatiotemporal factorization module, named, D(2+1)D that decomposes a 3D depthwise convolution into two spatial and temporal depthwise convolutions for making our network more lightweight and efficient. By using the proposed temporal modeling method (T-OSA), and the efficient factorized component (D(2+1)D), we construct two types of VoV3D networks, VoV3D-M and VoV3D-L. Thanks to its efficiency and effectiveness of temporal modeling, VoV3D-L has 6x fewer model parameters and 16x less computation, surpassing a state-of-the-art temporal modeling method on both Something-Something and Kinetics-400. Furthermore, VoV3D shows better temporal modeling ability than a state-of-the-art efficient 3D architecture, X3D having comparable model capacity. We hope that VoV3D can serve as a baseline for efficient video classification.
△ Less
Submitted 21 April, 2021; v1 submitted 1 December, 2020;
originally announced December 2020.
-
Multi-Agent Motion Planning using Deep Learning for Space Applications
Authors:
Kyongsik Yun,
Changrak Choi,
Ryan Alimo,
Anthony Davis,
Linda Forster,
Amir Rahmani,
Muhammad Adil,
Ramtin Madani
Abstract:
State-of-the-art motion planners cannot scale to a large number of systems. Motion planning for multiple agents is an NP (non-deterministic polynomial-time) hard problem, so the computation time increases exponentially with each addition of agents. This computational demand is a major stumbling block to the motion planner's application to future NASA missions involving the swarm of space vehicles.…
▽ More
State-of-the-art motion planners cannot scale to a large number of systems. Motion planning for multiple agents is an NP (non-deterministic polynomial-time) hard problem, so the computation time increases exponentially with each addition of agents. This computational demand is a major stumbling block to the motion planner's application to future NASA missions involving the swarm of space vehicles. We applied a deep neural network to transform computationally demanding mathematical motion planning problems into deep learning-based numerical problems. We showed optimal motion trajectories can be accurately replicated using deep learning-based numerical models in several 2D and 3D systems with multiple agents. The deep learning-based numerical model demonstrates superior computational efficiency with plans generated 1000 times faster than the mathematical model counterpart.
△ Less
Submitted 15 October, 2020;
originally announced October 2020.
-
Localization Uncertainty Estimation for Anchor-Free Object Detection
Authors:
Youngwan Lee,
Joong-won Hwang,
Hyung-Il Kim,
Kimin Yun,
Yongjin Kwon,
Yuseok Bae,
Sung Ju Hwang
Abstract:
Since many safety-critical systems, such as surgical robots and autonomous driving cars operate in unstable environments with sensor noise and incomplete data, it is desirable for object detectors to take the localization uncertainty into account. However, there are several limitations of the existing uncertainty estimation methods for anchor-based object detection. 1) They model the uncertainty o…
▽ More
Since many safety-critical systems, such as surgical robots and autonomous driving cars operate in unstable environments with sensor noise and incomplete data, it is desirable for object detectors to take the localization uncertainty into account. However, there are several limitations of the existing uncertainty estimation methods for anchor-based object detection. 1) They model the uncertainty of the heterogeneous object properties with different characteristics and scales, such as location (center point) and scale (width, height), which could be difficult to estimate. 2) They model box offsets as Gaussian distributions, which is not compatible with the ground truth bounding boxes that follow the Dirac delta distribution. 3) Since anchor-based methods are sensitive to anchor hyper-parameters, their localization uncertainty could also be highly sensitive to the choice of hyper-parameters. To tackle these limitations, we propose a new localization uncertainty estimation method called UAD for anchor-free object detection. Our method captures the uncertainty in four directions of box offsets (left, right, top, bottom) that are homogeneous, so that it can tell which direction is uncertain, and provide a quantitative value of uncertainty in [0, 1]. To enable such uncertainty estimation, we design a new uncertainty loss, negative power log-likelihood loss, to measure the localization uncertainty by weighting the likelihood loss by its IoU, which alleviates the model misspecification problem. Furthermore, we propose an uncertainty-aware focal loss for reflecting the estimated uncertainty to the classification score. Experimental results on COCO datasets demonstrate that our method significantly improves FCOS, by up to 1.8 points, without sacrificing computational efficiency.
△ Less
Submitted 6 July, 2022; v1 submitted 28 June, 2020;
originally announced June 2020.
-
Transforming unstructured voice and text data into insight for paramedic emergency service using recurrent and convolutional neural networks
Authors:
Kyongsik Yun,
Thomas Lu,
Alexander Huyen
Abstract:
Paramedics often have to make lifesaving decisions within a limited time in an ambulance. They sometimes ask the doctor for additional medical instructions, during which valuable time passes for the patient. This study aims to automatically fuse voice and text data to provide tailored situational awareness information to paramedics. To train and test speech recognition models, we built a bidirecti…
▽ More
Paramedics often have to make lifesaving decisions within a limited time in an ambulance. They sometimes ask the doctor for additional medical instructions, during which valuable time passes for the patient. This study aims to automatically fuse voice and text data to provide tailored situational awareness information to paramedics. To train and test speech recognition models, we built a bidirectional deep recurrent neural network (long short-term memory (LSTM)). Then we used convolutional neural networks on top of custom-trained word vectors for sentence-level classification tasks. Each sentence is automatically categorized into four classes, including patient status, medical history, treatment plan, and medication reminder. Subsequently, incident reports were automatically generated to extract keywords and assist paramedics and physicians in making decisions. The proposed system found that it could provide timely medication notifications based on unstructured voice and text data, which was not possible in paramedic emergencies at present. In addition, the automatic incident report generation provided by the proposed system improves the routine but error-prone tasks of paramedics and doctors, helping them focus on patient care.
△ Less
Submitted 30 May, 2020;
originally announced June 2020.
-
Smoke Sky -- Exploring New Frontiers of Unmanned Aerial Systems for Wildland Fire Science and Applications
Authors:
E. Natasha Stavros,
Ali Agha,
Allen Sirota,
Marco Quadrelli,
Kamak Ebadi,
Kyongsik Yun
Abstract:
Wildfire has had increasing impacts on society as the climate changes and the wildland urban interface grows. As such, there is a demand for innovative solutions to help manage fire. Managing wildfire can include proactive fire management such as prescribed burning within constrained areas or advancements for reactive fire management (e.g., fire suppression). Because of the growing societal impact…
▽ More
Wildfire has had increasing impacts on society as the climate changes and the wildland urban interface grows. As such, there is a demand for innovative solutions to help manage fire. Managing wildfire can include proactive fire management such as prescribed burning within constrained areas or advancements for reactive fire management (e.g., fire suppression). Because of the growing societal impact, the JPL BlueSky program sought to assess the current state of fire management and technology and determine areas with high return on investment. To accomplish this, we met with the national interagency Unmanned Aerial System (UAS) Advisory Group (UASAG) and with leading technology transfer experts for fire science and management applications. We provide an overview of the current state as well as an analysis of the impact, maturity and feasibility of integrating different technologies that can be developed by JPL. Based on the findings, the highest return on investment technologies for fire management are first to develop single micro-aerial vehicle (MAV) autonomy, autonomous sensing over fire, and the associated data and information system for active fire local environment mapping. Once this is completed for a single MAV, expanding the work to include many in a swarm would require further investment of distributed MAV autonomy and MAV swarm mechanics, but could greatly expand the breadth of application over large fires. Important to investing in these technologies will be in developing collaborations with the key influencers and champions for using UAS technology in fire management.
△ Less
Submitted 12 November, 2019;
originally announced November 2019.
-
Quantitative estimates for enhancement of the field excited by an emitter due to presence of two closely located spherical inclusions
Authors:
Hyeonbae Kang,
KiHyun Yun
Abstract:
A field in a homogeneous medium can be amplified or enhanced by inserting closely located perfectly conducting inclusions into the medium. In this paper precise quantitative estimates for such enhancement are derived when the given field is the one excited by an emitter of a dipole type and inclusions are spheres of the same radii in three dimensions. Derived estimates reveal the difference, as we…
▽ More
A field in a homogeneous medium can be amplified or enhanced by inserting closely located perfectly conducting inclusions into the medium. In this paper precise quantitative estimates for such enhancement are derived when the given field is the one excited by an emitter of a dipole type and inclusions are spheres of the same radii in three dimensions. Derived estimates reveal the difference, as well as the similarity, between enhancement of the field excited by the emitter and that of the smooth back-ground field. In particular, an estimate shows that when the enhancement occurs, the factor of enhancement is $(\sqrtε|\log ε|)^{-1}$, which is different from that for the smooth background field, which is known to be $(ε|\log ε|)^{-1}$ ($ε$ is the distance between two inclusions).
△ Less
Submitted 8 September, 2019;
originally announced September 2019.
-
Improved visible to IR image transformation using synthetic data augmentation with cycle-consistent adversarial networks
Authors:
Kyongsik Yun,
Kevin Yu,
Joseph Osborne,
Sarah Eldin,
Luan Nguyen,
Alexander Huyen,
Thomas Lu
Abstract:
Infrared (IR) images are essential to improve the visibility of dark or camouflaged objects. Object recognition and segmentation based on a neural network using IR images provide more accuracy and insight than color visible images. But the bottleneck is the amount of relevant IR images for training. It is difficult to collect real-world IR images for special purposes, including space exploration,…
▽ More
Infrared (IR) images are essential to improve the visibility of dark or camouflaged objects. Object recognition and segmentation based on a neural network using IR images provide more accuracy and insight than color visible images. But the bottleneck is the amount of relevant IR images for training. It is difficult to collect real-world IR images for special purposes, including space exploration, military and fire-fighting applications. To solve this problem, we created color visible and IR images using a Unity-based 3D game editor. These synthetically generated color visible and IR images were used to train cycle consistent adversarial networks (CycleGAN) to convert visible images to IR images. CycleGAN has the advantage that it does not require precisely matching visible and IR pairs for transformation training. In this study, we discovered that additional synthetic data can help improve CycleGAN performance. Neural network training using real data (N = 20) performed more accurate transformations than training using real (N = 10) and synthetic (N = 10) data combinations. The result indicates that the synthetic data cannot exceed the quality of the real data. Neural network training using real (N = 10) and synthetic (N = 100) data combinations showed almost the same performance as training using real data (N = 20). At least 10 times more synthetic data than real data is required to achieve the same performance. In summary, CycleGAN is used with synthetic data to improve the IR image conversion performance of visible images.
△ Less
Submitted 25 April, 2019;
originally announced April 2019.
-
Small Target Detection for Search and Rescue Operations using Distributed Deep Learning and Synthetic Data Generation
Authors:
Kyongsik Yun,
Luan Nguyen,
Tuan Nguyen,
Doyoung Kim,
Sarah Eldin,
Alexander Huyen,
Thomas Lu,
Edward Chow
Abstract:
It is important to find the target as soon as possible for search and rescue operations. Surveillance camera systems and unmanned aerial vehicles (UAVs) are used to support search and rescue. Automatic object detection is important because a person cannot monitor multiple surveillance screens simultaneously for 24 hours. Also, the object is often too small to be recognized by the human eye on the…
▽ More
It is important to find the target as soon as possible for search and rescue operations. Surveillance camera systems and unmanned aerial vehicles (UAVs) are used to support search and rescue. Automatic object detection is important because a person cannot monitor multiple surveillance screens simultaneously for 24 hours. Also, the object is often too small to be recognized by the human eye on the surveillance screen. This study used UAVs around the Port of Houston and fixed surveillance cameras to build an automatic target detection system that supports the US Coast Guard (USCG) to help find targets (e.g., person overboard). We combined image segmentation, enhancement, and convolution neural networks to reduce detection time to detect small targets. We compared the performance between the auto-detection system and the human eye. Our system detected the target within 8 seconds, but the human eye detected the target within 25 seconds. Our systems also used synthetic data generation and data augmentation techniques to improve target detection accuracy. This solution may help the search and rescue operations of the first responders in a timely manner.
△ Less
Submitted 25 April, 2019;
originally announced April 2019.
-
Skeleton-based Action Recognition of People Handling Objects
Authors:
Sunoh Kim,
Kimin Yun,
Jongyoul Park,
Jin Young Choi
Abstract:
In visual surveillance systems, it is necessary to recognize the behavior of people handling objects such as a phone, a cup, or a plastic bag. In this paper, to address this problem, we propose a new framework for recognizing object-related human actions by graph convolutional networks using human and object poses. In this framework, we construct skeletal graphs of reliable human poses by selectiv…
▽ More
In visual surveillance systems, it is necessary to recognize the behavior of people handling objects such as a phone, a cup, or a plastic bag. In this paper, to address this problem, we propose a new framework for recognizing object-related human actions by graph convolutional networks using human and object poses. In this framework, we construct skeletal graphs of reliable human poses by selectively sampling the informative frames in a video, which include human joints with high confidence scores obtained in pose estimation. The skeletal graphs generated from the sampled frames represent human poses related to the object position in both the spatial and temporal domains, and these graphs are used as inputs to the graph convolutional networks. Through experiments over an open benchmark and our own data sets, we verify the validity of our framework in that our method outperforms the state-of-the-art method for skeleton-based action recognition.
△ Less
Submitted 21 January, 2019;
originally announced January 2019.
-
Quantitative estimates of the field excited by an emitter in a narrow region between two circular inclusions
Authors:
Hyeonbae Kang,
KiHyun Yun
Abstract:
A field excited by an emitter can be enhanced due to presence of closely located inclusions. In this paper we consider such field enhancement when inclusions are disks of the same radii, and the emitter is of dipole type and located in the narrow region between two inclusions. We derive quantitatively precise estimates of the field enhancement in the narrow region. The estimates reveal that the fi…
▽ More
A field excited by an emitter can be enhanced due to presence of closely located inclusions. In this paper we consider such field enhancement when inclusions are disks of the same radii, and the emitter is of dipole type and located in the narrow region between two inclusions. We derive quantitatively precise estimates of the field enhancement in the narrow region. The estimates reveal that the field is enhanced by a factor of $ε^{-1/2}$ in most area, where $ε$ is the distance between two inclusions. This factor is the same as that of gradient blow-up when there is a smooth back-ground field, not a field excited by an emitter. The method of deriving estimates shows clearly that enhancement is due to potential gap between two inclusions.
△ Less
Submitted 5 November, 2018;
originally announced November 2018.
-
Precise estimates of the field excited by an emitter in presence of closely located inclusions of a bow-tie shape
Authors:
Hyeonbae Kang,
KiHyun Yun
Abstract:
This paper studies in a quantitatively precise manner the field enhancement due to presence of an emitter of the dipole type near the bow-tie structure of perfectly conducting inclusions in the two-dimensional space. We put special emphasis on field enhancement near vertices of the bow-tie structure, and derive upper and lower bounds of the gradient blow-up there. All three different kinds of symm…
▽ More
This paper studies in a quantitatively precise manner the field enhancement due to presence of an emitter of the dipole type near the bow-tie structure of perfectly conducting inclusions in the two-dimensional space. We put special emphasis on field enhancement near vertices of the bow-tie structure, and derive upper and lower bounds of the gradient blow-up there. All three different kinds of symmetries are considered by varying locations and directions of the emitter, and a different estimate is derived for each case.
△ Less
Submitted 21 October, 2018;
originally announced October 2018.
-
Jellyfish galaxies with the IllustrisTNG simulations: I. Gas-stripping phenomena in the full cosmological context
Authors:
Kiyun Yun,
Annalisa Pillepich,
Elad Zinger,
Dylan Nelson,
Martina Donnari,
Gandhali Joshi,
Vicente Rodriguez-Gomez,
Shy Genel,
Rainer Weinberger,
Mark Vogelsberger,
Lars Hernquist
Abstract:
We use IllustrisTNG, a suite of gravity and MHD simulations, to study the demographics and properties of jellyfish galaxies in the full cosmological context. By jellyfish galaxies, we mean satellites orbiting in massive groups and clusters that exhibit highly asymmetric distributions of gas and gas tails. We use the TNG100 run and select galaxies at redshifts $z\le0.6$ with stellar mass exceeding…
▽ More
We use IllustrisTNG, a suite of gravity and MHD simulations, to study the demographics and properties of jellyfish galaxies in the full cosmological context. By jellyfish galaxies, we mean satellites orbiting in massive groups and clusters that exhibit highly asymmetric distributions of gas and gas tails. We use the TNG100 run and select galaxies at redshifts $z\le0.6$ with stellar mass exceeding $10^{9.5}{\rm M_\odot}$ and with host halo masses of $10^{13}-10^{14.6}\,{\rm M_\odot}$. Among more than about 6000 (2600) galaxies with stars (and some gas), we identify 800 jellyfish galaxies by visually inspecting their gas and stellar mass maps in random projections. About $31\%$ of cluster satellites are found with signatures of ram-pressure stripping and gaseous tails stemming from the main luminous bodies. This is a lower limit, since the random orientation entails a loss of about $30\%$ of galaxies that in an optimal projection would otherwise be identified as jellyfish. The connection with ram-pressure stripping is further confirmed by a series of findings: jellyfish galaxies are more frequent at intermediate and large cluster-centric distances ($r/R_{\rm 200c}\gtrsim 0.25$); they move through the ICM with larger bulk velocities and Mach numbers than the general cluster population, typically orbiting supersonically and experiencing larger ram pressures. Furthermore, the gaseous tails usually extend in opposite directions to the galaxy trajectory, with no relation between tail orientation and the host's center. The frequency of jellyfish galaxies shows a very weak dependence on redshift $(0\le z\le0.6)$ but larger fractions of disturbed gaseous morphologies occur in more massive hosts and at smaller satellite masses. Finally, jellyfish galaxies are late infallers ($< 2.5-3$ Gyrs ago, at $z=0$) and the emergence of gaseous tails correlates well with the presence of bow shocks in the ICM.
△ Less
Submitted 26 November, 2018; v1 submitted 28 September, 2018;
originally announced October 2018.
-
Deep Neural Networks for Pattern Recognition
Authors:
Kyongsik Yun,
Alexander Huyen,
Thomas Lu
Abstract:
In the field of pattern recognition research, the method of using deep neural networks based on improved computing hardware recently attracted attention because of their superior accuracy compared to conventional methods. Deep neural networks simulate the human visual system and achieve human equivalent accuracy in image classification, object detection, and segmentation. This chapter introduces t…
▽ More
In the field of pattern recognition research, the method of using deep neural networks based on improved computing hardware recently attracted attention because of their superior accuracy compared to conventional methods. Deep neural networks simulate the human visual system and achieve human equivalent accuracy in image classification, object detection, and segmentation. This chapter introduces the basic structure of deep neural networks that simulate human neural networks. Then we identify the operational processes and applications of conditional generative adversarial networks, which are being actively researched based on the bottom-up and top-down mechanisms, the most important functions of the human visual perception process. Finally, recent developments in training strategies for effective learning of complex deep neural networks are addressed.
△ Less
Submitted 25 September, 2018;
originally announced September 2018.
-
Occluded object reconstruction for first responders with augmented reality glasses using conditional generative adversarial networks
Authors:
Kyongsik Yun,
Thomas Lu,
Edward Chow
Abstract:
Firefighters suffer a variety of life-threatening risks, including line-of-duty deaths, injuries, and exposures to hazardous substances. Support for reducing these risks is important. We built a partially occluded object reconstruction method on augmented reality glasses for first responders. We used a deep learning based on conditional generative adversarial networks to train associations between…
▽ More
Firefighters suffer a variety of life-threatening risks, including line-of-duty deaths, injuries, and exposures to hazardous substances. Support for reducing these risks is important. We built a partially occluded object reconstruction method on augmented reality glasses for first responders. We used a deep learning based on conditional generative adversarial networks to train associations between the various images of flammable and hazardous objects and their partially occluded counterparts. Our system then reconstructed an image of a new flammable object. Finally, the reconstructed image was superimposed on the input image to provide "transparency". The system imitates human learning about the laws of physics through experience by learning the shape of flammable objects and the flame characteristics.
△ Less
Submitted 20 April, 2018;
originally announced May 2018.
-
Automatic speech recognition for launch control center communication using recurrent neural networks with data augmentation and custom language model
Authors:
Kyongsik Yun,
Joseph Osborne,
Madison Lee,
Thomas Lu,
Edward Chow
Abstract:
Transcribing voice communications in NASA's launch control center is important for information utilization. However, automatic speech recognition in this environment is particularly challenging due to the lack of training data, unfamiliar words in acronyms, multiple different speakers and accents, and conversational characteristics of speaking. We used bidirectional deep recurrent neural networks…
▽ More
Transcribing voice communications in NASA's launch control center is important for information utilization. However, automatic speech recognition in this environment is particularly challenging due to the lack of training data, unfamiliar words in acronyms, multiple different speakers and accents, and conversational characteristics of speaking. We used bidirectional deep recurrent neural networks to train and test speech recognition performance. We showed that data augmentation and custom language models can improve speech recognition accuracy. Transcribing communications from the launch control center will help the machine analyze information and accelerate knowledge generation.
△ Less
Submitted 24 April, 2018;
originally announced April 2018.
-
Smartphone-based point-of-care lipid blood test performance evaluation compared with a clinical diagnostic laboratory method
Authors:
Kyongsik Yun,
Juhee Lee,
Jaekyu Choi,
In-Uk Song,
Yong-An Chung
Abstract:
Managing blood lipid levels is important for the treatment and prevention of diabetes, cardiovascular disease, and obesity. An easy-to-use, portable lipid blood test will accelerate more frequent testing by patients and at-risk populations. We used smartphone systems that are already familiar to many people. Because smartphone systems can be carried around everywhere, blood can be measured easily…
▽ More
Managing blood lipid levels is important for the treatment and prevention of diabetes, cardiovascular disease, and obesity. An easy-to-use, portable lipid blood test will accelerate more frequent testing by patients and at-risk populations. We used smartphone systems that are already familiar to many people. Because smartphone systems can be carried around everywhere, blood can be measured easily and frequently. We compared the results of lipid tests with those of existing clinical diagnostic laboratory methods. We found that smartphone-based point-of-care lipid blood tests are as accurate as hospital-grade laboratory tests. Our system will be useful for those who need to manage blood lipid levels to motivate them to track and control their behavior.
△ Less
Submitted 19 April, 2018;
originally announced April 2018.
-
Predicting Rapid Fire Growth (Flashover) Using Conditional Generative Adversarial Networks
Authors:
Kyongsik Yun,
Jessi Bustos,
Thomas Lu
Abstract:
A flashover occurs when a fire spreads very rapidly through crevices due to intense heat. Flashovers present one of the most frightening and challenging fire phenomena to those who regularly encounter them: firefighters. Firefighters' safety and lives often depend on their ability to predict flashovers before they occur. Typical pre-flashover fire characteristics include dark smoke, high heat, and…
▽ More
A flashover occurs when a fire spreads very rapidly through crevices due to intense heat. Flashovers present one of the most frightening and challenging fire phenomena to those who regularly encounter them: firefighters. Firefighters' safety and lives often depend on their ability to predict flashovers before they occur. Typical pre-flashover fire characteristics include dark smoke, high heat, and rollover ("angel fingers") and can be quantified by color, size, and shape. Using a color video stream from a firefighter's body camera, we applied generative adversarial neural networks for image enhancement. The neural networks were trained to enhance very dark fire and smoke patterns in videos and monitor dynamic changes in smoke and fire areas. Preliminary tests with limited flashover training videos showed that we predicted a flashover as early as 55 seconds before it occurred.
△ Less
Submitted 29 January, 2018;
originally announced January 2018.
-
Neural correlates of flow using auditory evoked potential suppression
Authors:
Kyongsik Yun,
Saeran Doh,
Elisa Carrus,
Daw-An Wu,
Shinsuke Shimojo
Abstract:
"Flow" is a hyper-engaged state of consciousness most commonly described in athletics, popularly termed "being in the zone." Quantitative research into flow has been hampered by the disruptive nature of gathering subjective reports. Here we show that a passive probe (suppression of Auditory Evoked Potential in EEG) that allowed our participants to remain engaged in a first-person shooting game whi…
▽ More
"Flow" is a hyper-engaged state of consciousness most commonly described in athletics, popularly termed "being in the zone." Quantitative research into flow has been hampered by the disruptive nature of gathering subjective reports. Here we show that a passive probe (suppression of Auditory Evoked Potential in EEG) that allowed our participants to remain engaged in a first-person shooting game while we continually tracked the depth of their immersion corresponded with the participants' subjective experiences, and with their objective performance levels. Comparing this time-varying record of flow against the overall EEG record, we identified neural correlates of flow in the anterior cingulate cortex and the temporal pole. These areas displayed increased beta band activity, mutual connectivity, and feedback connectivity with primary motor cortex. These results corroborate the notion that the flow state is an objective and quantifiable state of consciousness, which we identify and characterize across subjective, behavioral and neural measures.
△ Less
Submitted 24 November, 2017; v1 submitted 18 November, 2017;
originally announced November 2017.
-
Optimal estimates of the field enhancement in presence of a bow-tie structure of perfectly conducting inclusions in two dimensions
Authors:
Hyeonbae Kang,
KiHyun Yun
Abstract:
This paper deals with the field enhancement, that is, the gradient blow-up, due to presence of a bow-tie structure of perfectly conducting inclusions in two dimensions. The bow-tie structure consists of two disjoint bounded domains which have corners with possibly different aperture angles. The domains are parts of cones near the vertices, and they are nearly touching to each other. We characteriz…
▽ More
This paper deals with the field enhancement, that is, the gradient blow-up, due to presence of a bow-tie structure of perfectly conducting inclusions in two dimensions. The bow-tie structure consists of two disjoint bounded domains which have corners with possibly different aperture angles. The domains are parts of cones near the vertices, and they are nearly touching to each other. We characterize the field enhancement using explicit functions and, as consequences, derive optimal estimates of the gradient in terms of the distance between two inclusions and aperture angles of the corners. The estimates show that the field is enhanced beyond the corner singularities due to the interaction between two inclusions.
△ Less
Submitted 9 July, 2017; v1 submitted 1 July, 2017;
originally announced July 2017.
-
On dissolving knot surgery $4$-manifolds under a $\mathbb{CP}^2$-connected sum
Authors:
Hakho Choi,
Jongil Park,
Ki-Heon Yun
Abstract:
In this article we prove that, if $X$ is a smooth $4$-manifold containing an embedded double node neighborhood, all knot surgery $4$-manifolds $X_K$ are mutually diffeomorphic to each other after a connected sum with $\mathbb{CP}^2$. Hence, by applying to the simply connected elliptic surface $E(n)$, we also show that every knot surgery $4$-manifold $E(n)_K$ is almost completely decomposable.
In this article we prove that, if $X$ is a smooth $4$-manifold containing an embedded double node neighborhood, all knot surgery $4$-manifolds $X_K$ are mutually diffeomorphic to each other after a connected sum with $\mathbb{CP}^2$. Hence, by applying to the simply connected elliptic surface $E(n)$, we also show that every knot surgery $4$-manifold $E(n)_K$ is almost completely decomposable.
△ Less
Submitted 24 April, 2017; v1 submitted 7 April, 2017;
originally announced April 2017.
-
Two types of electric field enhancements by infinitely many circular conductors arranged closely in two parallel line
Authors:
KiHyun Yun
Abstract:
In this paper, we consider very high concentration of electric field in between infinitely many circular perfect conductors arranged closely in two rows. In stiff fiber-reinforced composite, shear stress concentrations occur in between neighboring fibers, and the electric field means shear stress in this paper. Due to material failure of composites, there have been intensive studies so far to esti…
▽ More
In this paper, we consider very high concentration of electric field in between infinitely many circular perfect conductors arranged closely in two rows. In stiff fiber-reinforced composite, shear stress concentrations occur in between neighboring fibers, and the electric field means shear stress in this paper. Due to material failure of composites, there have been intensive studies so far to estimate the field in between only a finite number of inclusions. Indeed, fiber reinforced composites contain a large number of stiff fibers, and the concentration can be strongly enhanced by some combination of inclusions. Thus, we establish some asymptotes and optimal blow-up rates for the field in narrow regions in between infinitely many conductors in two rows to describe the horizontally and vertically combined effects of a large number of ones. Especially, one of the blow-up rates is substantially different from the existing result in the case of finite inclusions.
△ Less
Submitted 16 April, 2017; v1 submitted 14 December, 2016;
originally announced December 2016.
-
On the minimal number of singular fibers in Lefschetz fibrations over the torus
Authors:
András I. Stipsicz,
Ki-Heon Yun
Abstract:
We show that the minimal number of singular fibers $N(g,1)$ in a genus-$g$ Lefschetz fibration over the torus is at least $3$. As an application, we show that $N(g, 1) \in \{ 3, 4\}$ for $g\ge 5$, $N(g, 1) \in \{3, 4,5 \}$ for $g= 3, 4$ and $N(2,1) = 7$.
We show that the minimal number of singular fibers $N(g,1)$ in a genus-$g$ Lefschetz fibration over the torus is at least $3$. As an application, we show that $N(g, 1) \in \{ 3, 4\}$ for $g\ge 5$, $N(g, 1) \in \{3, 4,5 \}$ for $g= 3, 4$ and $N(2,1) = 7$.
△ Less
Submitted 4 June, 2016; v1 submitted 17 April, 2016;
originally announced April 2016.
-
Odyssey: A Public GPU-Based Code for General-Relativistic Radiative Transfer in Kerr Spacetime
Authors:
Hung-Yi Pu,
Kiyun Yun,
Ziri Younsi,
Suk-Jin Yoon
Abstract:
General-relativistic radiative transfer (GRRT) calculations coupled with the calculation of geodesics in the Kerr spacetime are an essential tool for determining the images, spectra and light curves from matter in the vicinity of black holes. Such studies are especially important for ongoing and upcoming millimeter/submillimeter (mm/sub-mm) Very Long Baseline Interferometry (VLBI) observations of…
▽ More
General-relativistic radiative transfer (GRRT) calculations coupled with the calculation of geodesics in the Kerr spacetime are an essential tool for determining the images, spectra and light curves from matter in the vicinity of black holes. Such studies are especially important for ongoing and upcoming millimeter/submillimeter (mm/sub-mm) Very Long Baseline Interferometry (VLBI) observations of the supermassive black holes at the centres of Sgr A^{*} and M87. To this end we introduce Odyssey, a Graphics Processing Unit(GPU)-based code for ray tracing and radiative transfer in the Kerr spacetime. On a single GPU, the performance of Odyssey can exceed 1 nanosecond per photon, per Runge-Kutta integration step. Odyssey is publicly available, fast, accurate, and flexible enough to be modified to suit the specific needs of new users. Along with a Graphical User Interface (GUI) powered by a video-accelerated display architecture, we also present an educational software tool, Odyssey_Edu, for showing in real time how null geodesics around a Kerr black hole vary as a function of black hole spin and angle of incidence onto the black hole.
△ Less
Submitted 19 January, 2016; v1 submitted 8 January, 2016;
originally announced January 2016.