-
NT-ViT: Neural Transcoding Vision Transformers for EEG-to-fMRI Synthesis
Authors:
Romeo Lanzino,
Federico Fontana,
Luigi Cinque,
Francesco Scarcello,
Atsuto Maki
Abstract:
This paper introduces the Neural Transcoding Vision Transformer (\modelname), a generative model designed to estimate high-resolution functional Magnetic Resonance Imaging (fMRI) samples from simultaneous Electroencephalography (EEG) data. A key feature of \modelname is its Domain Matching (DM) sub-module which effectively aligns the latent EEG representations with those of fMRI volumes, enhancing…
▽ More
This paper introduces the Neural Transcoding Vision Transformer (\modelname), a generative model designed to estimate high-resolution functional Magnetic Resonance Imaging (fMRI) samples from simultaneous Electroencephalography (EEG) data. A key feature of \modelname is its Domain Matching (DM) sub-module which effectively aligns the latent EEG representations with those of fMRI volumes, enhancing the model's accuracy and reliability. Unlike previous methods that tend to struggle with fidelity and reproducibility of images, \modelname addresses these challenges by ensuring methodological integrity and higher-quality reconstructions which we showcase through extensive evaluation on two benchmark datasets; \modelname outperforms the current state-of-the-art by a significant margin in both cases, e.g. achieving a $10\times$ reduction in RMSE and a $3.14\times$ increase in SSIM on the Oddball dataset. An ablation study also provides insights into the contribution of each component to the model's overall effectiveness. This development is critical in offering a new approach to lessen the time and financial constraints typically linked with high-resolution brain imaging, thereby aiding in the swift and precise diagnosis of neurological disorders. Although it is not a replacement for actual fMRI but rather a step towards making such imaging more accessible, we believe that it represents a pivotal advancement in clinical practice and neuroscience research. Code is available at \url{https://github.com/rom42pla/ntvit}.
△ Less
Submitted 18 September, 2024;
originally announced September 2024.
-
Conceptual Design on the Field of View of Celestial Navigation Systems for Maritime Autonomous Surface Ships
Authors:
Kouki Wakita,
Fuyuki Hane,
Takeshi Sekiguchi,
Shigehito Shimizu,
Shinji Mitani,
Youhei Akimoto,
Atsuo Maki
Abstract:
In order to understand the appropriate field of view (FOV) size of celestial automatic navigation systems for surface ships, we investigate the variations of measurement accuracy of star position and probability of successful star identification with respect to FOV, focusing on the decreasing number of observable star magnitudes and the presence of physically covered stars in marine environments.…
▽ More
In order to understand the appropriate field of view (FOV) size of celestial automatic navigation systems for surface ships, we investigate the variations of measurement accuracy of star position and probability of successful star identification with respect to FOV, focusing on the decreasing number of observable star magnitudes and the presence of physically covered stars in marine environments. The results revealed that, although a larger FOV reduces the measurement accuracy of star positions, it increases the number of observable objects and thus improves the probability of star identification using subgraph isomorphism-based methods. It was also found that, although at least four objects need to be observed for accurate identification, four objects may not be sufficient for wider FOVs. On the other hand, from the point of view of celestial navigation systems, a decrease in the measurement accuracy leads to a decrease in positioning accuracy. Therefore, it was found that maximizing the FOV is required for celestial automatic navigation systems as long as the desired positioning accuracy can be ensured. Furthermore, it was found that algorithms incorporating more than four observed celestial objects are required to achieve highly accurate star identification over a wider FOV.
△ Less
Submitted 28 August, 2024;
originally announced August 2024.
-
Quantitative Evaluation of Full-Scale Ship Maneuvering Characteristics During Berthing and Unberthing
Authors:
Agnes N. Mwange,
Yoshiki Miyauchi,
Taichi Kambara,
Hiroaki Koike,
Kazuyoshi Hosogaya,
Atsuo Maki
Abstract:
Leveraging empirical data is crucial in the development of accurate and reliable virtual models for the advancement of autonomous ship technologies and the optimization of port operations. This study presents an in-depth analysis of ship berthing and unberthing maneuvering characteristics by utilizing a comprehensive dataset encompassing the operation of a full-scale ship in diverse infrastructura…
▽ More
Leveraging empirical data is crucial in the development of accurate and reliable virtual models for the advancement of autonomous ship technologies and the optimization of port operations. This study presents an in-depth analysis of ship berthing and unberthing maneuvering characteristics by utilizing a comprehensive dataset encompassing the operation of a full-scale ship in diverse infrastructural and environmental conditions. Various statistical techniques and time-series analysis were employed to process and interpret the operational data. A systematic analysis was conducted on key performance variables, including approach speed, drift angles, turning motions, distance from obstacles, and actuator utilization. The results demonstrate significant discrepancies between the empirical data and the established maneuvering characteristics. These findings have the potential to significantly enhance the accuracy and reliability of conventional maneuvering models, such as the Mathematical Modeling Group (MMG) model, and improve the conditions used in captive model tests for the identification of maneuvering model parameters. Furthermore, these findings could inform the development of more robust autonomous berthing and unberthing algorithms and digital twins.
△ Less
Submitted 24 August, 2024;
originally announced August 2024.
-
Perspective on the Marine Simulator for Autonomous Vessel Development
Authors:
Ryouhei Sawada,
Yoshiki Miyauchi,
Suisei Wada,
Takuya Tanigushi,
Satoru Hamada,
Hiroaki Koike,
Kouki Wakita,
Atsuo Maki
Abstract:
There is a growing demand for simulators for the research and development of maritime autonomous surface ships (MASS) and the approval of autonomous navigation algorithms. Simulators are used for purposes such as evaluation and training and are taken on various configurations accordingly. The ship maneuvering mathematical model used in such a simulator is an important element that characterizes th…
▽ More
There is a growing demand for simulators for the research and development of maritime autonomous surface ships (MASS) and the approval of autonomous navigation algorithms. Simulators are used for purposes such as evaluation and training and are taken on various configurations accordingly. The ship maneuvering mathematical model used in such a simulator is an important element that characterizes the simulator. In this paper, we discuss the dynamic model of the hull and its position in the simulator that will be required for MASSs in the future. It also discusses guidelines for selecting an appropriate model, which has not been discussed extensively in previous studies. Finally, we discuss the functional requirements that simulators should have.
△ Less
Submitted 28 July, 2024;
originally announced July 2024.
-
Simultaneous optimization of control gains and reference filter coefficients for trajectory tracking control
Authors:
Amane Sakanashi,
Rin Suyama,
Atsuo Maki,
Youhei Akimoto
Abstract:
Research on vessel automation and autonomy is currently being conducted by various countries and institutions. Safe and accurate ship control algorithms are crucial to realize automated operation. Actuator drive constraints of a target ship may jeopardize the stability of the control law and require complex theory. In this study, we include a penalty term to the control law gain optimization stage…
▽ More
Research on vessel automation and autonomy is currently being conducted by various countries and institutions. Safe and accurate ship control algorithms are crucial to realize automated operation. Actuator drive constraints of a target ship may jeopardize the stability of the control law and require complex theory. In this study, we include a penalty term to the control law gain optimization stage of dynamic positioning systems to account for the amounts by which the actuator input value and its rate of change exceed the constraint. The parameters for generating a suitable reference path for the control law are identified simultaneously with the control gains. The simulation results show that the proposed method can realize control parameters and a reference design with excellent tracking performance while determining the cost of the controller design by considering the effects of both the actuators and rate saturation.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Validation of Theoretical Estimation Methods and Maximum Value Distribution Calculation for Parametric Roll Amplitude in Long-Crested Irregular Waves
Authors:
Keiji Katsumura,
Leo Dostal,
Taiga Kono,
Yuuki Maruyama,
Masahiro Sakai,
Atsuo Maki
Abstract:
Parametric rolling is a parametric excitation phenomenon caused by GM variation in waves. There are a lot of studies of the estimation the conditions, the occurrence, and the amplitude of parametric rolling. On the other hand, there are relatively few cases in which theoretical methods for estimating parametric roll amplitudes in irregular waves have been validated in tank tests. The primary objec…
▽ More
Parametric rolling is a parametric excitation phenomenon caused by GM variation in waves. There are a lot of studies of the estimation the conditions, the occurrence, and the amplitude of parametric rolling. On the other hand, there are relatively few cases in which theoretical methods for estimating parametric roll amplitudes in irregular waves have been validated in tank tests. The primary objective of this study is to validate theoretical estimation methods for the parametric roll amplitude in irregular waves and improve their accuracy. First, the probability density functions (PDF) of the parametric roll amplitude obtained from the model ship motion experiment in irregular waves are compared with that obtained from theoretical estimation methods. Second, the method to improve the accuracy of estimation of the roll restoring variation in irregular waves is suggested. Third, the method to estimate the distribution of the maximum amplitude of parametric rolling in irregular waves. As a result, the PDFs of the roll amplitude obtained from the experiments differ from the results of theoretical estimation. After that, by correcting GM variation, the results of theoretical estimation are closer to the experimental results. Moreover, by the theoretical estimation method using the moment equation, the qualitative estimation for the PDF of the maximum roll amplitude is succeeded.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Review of the analytical prediction method of surf-riding threshold in following sea, and its relation to IMO second-generation intact stability criteria
Authors:
Atsuo Maki,
Masahiro Sakai,
Tetsushi Ueta
Abstract:
In high-speed maritime operations, the broaching phenomenon can pose a significant risk when navigating in following/quartering seas. The occurrence of this phenomenon can result in a violent yaw motion, regardless of the steering effort, which, in turn, cause the resulting centrifugal force to capsize a vessel. A necessary condition for the occurrence of broaching is the surf-riding phenomenon. T…
▽ More
In high-speed maritime operations, the broaching phenomenon can pose a significant risk when navigating in following/quartering seas. The occurrence of this phenomenon can result in a violent yaw motion, regardless of the steering effort, which, in turn, cause the resulting centrifugal force to capsize a vessel. A necessary condition for the occurrence of broaching is the surf-riding phenomenon. Therefore, the International Maritime Organization (IMO) has set up criteria to include theoretical formulas for estimating the occurrence of surf-riding phenomena. The theoretical equation used in the IMO's second-generation intact stability criteria (SGISC) to estimate the surf-riding threshold is based on Melnikov's method. This paper presents nonlinear equations describing the forward and backward motions of a ship. However, such equations cannot be directly solved; therefore, we proposed the use of and explain various approximate solution methods, including Meknikov's method. Subsequently, the relationship between the theoretical prediction method of the surf-riding threshold rooted in Melnikov's method and the IMO's SGISC is determined.
△ Less
Submitted 3 December, 2023;
originally announced June 2024.
-
Towards Sim-to-Real Industrial Parts Classification with Synthetic Dataset
Authors:
Xiaomeng Zhu,
Talha Bilal,
Pär Mårtensson,
Lars Hanson,
Mårten Björkman,
Atsuto Maki
Abstract:
This paper is about effectively utilizing synthetic data for training deep neural networks for industrial parts classification, in particular, by taking into account the domain gap against real-world images. To this end, we introduce a synthetic dataset that may serve as a preliminary testbed for the Sim-to-Real challenge; it contains 17 objects of six industrial use cases, including isolated and…
▽ More
This paper is about effectively utilizing synthetic data for training deep neural networks for industrial parts classification, in particular, by taking into account the domain gap against real-world images. To this end, we introduce a synthetic dataset that may serve as a preliminary testbed for the Sim-to-Real challenge; it contains 17 objects of six industrial use cases, including isolated and assembled parts. A few subsets of objects exhibit large similarities in shape and albedo for reflecting challenging cases of industrial parts. All the sample images come with and without random backgrounds and post-processing for evaluating the importance of domain randomization. We call it Synthetic Industrial Parts dataset (SIP-17). We study the usefulness of SIP-17 through benchmarking the performance of five state-of-the-art deep network models, supervised and self-supervised, trained only on the synthetic data while testing them on real data. By analyzing the results, we deduce some insights on the feasibility and challenges of using synthetic data for industrial parts classification and for further developing larger-scale synthetic datasets. Our dataset and code are publicly available.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer
Authors:
Hiroki Azuma,
Yusuke Matsui,
Atsuto Maki
Abstract:
Deep learning models achieve high accuracy in segmentation tasks among others, yet domain shift often degrades the models' performance, which can be critical in real-world scenarios where no target images are available. This paper proposes a zero-shot domain adaptation method based on diffusion models, called ZoDi, which is two-fold by the design: zero-shot image transfer and model adaptation. Fir…
▽ More
Deep learning models achieve high accuracy in segmentation tasks among others, yet domain shift often degrades the models' performance, which can be critical in real-world scenarios where no target images are available. This paper proposes a zero-shot domain adaptation method based on diffusion models, called ZoDi, which is two-fold by the design: zero-shot image transfer and model adaptation. First, we utilize an off-the-shelf diffusion model to synthesize target-like images by transferring the domain of source images to the target domain. In this we specifically try to maintain the layout and content by utilising layout-to-image diffusion models with stochastic inversion. Secondly, we train the model using both source images and synthesized images with the original segmentation maps while maximizing the feature similarity of images from the two domains to learn domain-robust representations. Through experiments we show benefits of ZoDi in the task of image segmentation over state-of-the-art methods. It is also more applicable than existing CLIP-based methods because it assumes no specific backbone or models, and it enables to estimate the model's performance without target images by inspecting generated images. Our implementation will be publicly available.
△ Less
Submitted 25 September, 2024; v1 submitted 20 March, 2024;
originally announced March 2024.
-
A Practical and Online Trajectory Planner for Autonomous Ships' Berthing, Incorporating Speed Control
Authors:
Agnes Ngina Mwange,
Dimas Maulana Rachman,
Rin Suyama,
Atsuo Maki
Abstract:
Autonomous ships are essentially designed and equipped to perceive their internal and external environment and subsequently perform appropriate actions depending on the predetermined objective(s) without human intervention. Consequently, trajectory planning algorithms for autonomous berthing must consider factors such as system dynamics, ship actuators, environmental disturbances, and the safety o…
▽ More
Autonomous ships are essentially designed and equipped to perceive their internal and external environment and subsequently perform appropriate actions depending on the predetermined objective(s) without human intervention. Consequently, trajectory planning algorithms for autonomous berthing must consider factors such as system dynamics, ship actuators, environmental disturbances, and the safety of the ship, other ships, and port structures, among others. In this study, basing the ship dynamics on the low-speed MMG model, trajectory planning for an autonomous ship is modeled as an optimal control problem (OCP) that is transcribed into a nonlinear programming problem (NLP) using the direct multiple shooting technique. To enhance berthing safety, besides considering wind disturbances, speed control, actuators' limitations, and collision avoidance features are incorporated as constraints in the NLP, which is then solved using the Sequential Quadratic Programming (SQP) algorithm in MATLAB. Finally, the performance of the proposed planner is evaluated through (i) comparison with solutions obtained using CMA-ES for two different model ships, (ii) trajectory planning for different harbor entry and berth approach scenarios, and (iii) feasibility study using stochastically generated initial conditions and positions within the port boundaries. Simulation results indicate enhanced berthing safety as well as practical and computational feasibility making the planner suitable for real-time applications.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Approximate probability density function for nonlinear surging in irregular following seas
Authors:
Atsuo Maki,
Yuuki Maruyama,
Keiji Katsumura,
Leo Dostal
Abstract:
The broaching that follows the surf-riding is a dangerous phenomenon that can lead to the capsizing of a vessel due to its violent yaw motion. Most of the previous studies on surf-riding phenomena in irregular waves have been conducted by replacing irregular waves with regular waves. In contrast, this study provides suggestions on how to directly calculate nonlinear surge motion in irregular seas.…
▽ More
The broaching that follows the surf-riding is a dangerous phenomenon that can lead to the capsizing of a vessel due to its violent yaw motion. Most of the previous studies on surf-riding phenomena in irregular waves have been conducted by replacing irregular waves with regular waves. In contrast, this study provides suggestions on how to directly calculate nonlinear surge motion in irregular seas. In this study, the statistical aspects of the surf-riding phenomenon are first presented. Then, under several approximations, we show how to calculate the probability density function theoretically. Although the results obtained are based on strong approximations, it is found that the nonlinear surge oscillations in irregular following seas can be explained from a qualitative point of view.
△ Less
Submitted 9 December, 2023;
originally announced January 2024.
-
Parameter fine-tuning method for MMG model using real-scale ship data
Authors:
Rin Suyama,
Rintaro Matsushita,
Ryo Kakuta,
Kouki Wakita,
Atsuo Maki
Abstract:
In this paper, a fine-tuning method of the parameters in the MMG model for the real-scale ship is proposed. In the proposed method, all of the arbitrarily indicated target parameters of the MMG model are tuned simultaneously in the framework of SI using time series data of real-sale ship maneuvering motion data to steadily improve the accuracy of the MMG model. Parameter tuning is formulated as a…
▽ More
In this paper, a fine-tuning method of the parameters in the MMG model for the real-scale ship is proposed. In the proposed method, all of the arbitrarily indicated target parameters of the MMG model are tuned simultaneously in the framework of SI using time series data of real-sale ship maneuvering motion data to steadily improve the accuracy of the MMG model. Parameter tuning is formulated as a minimization problem of the deviation of the maneuvering motion simulated with given parameters and the real-scale ship trials, and the global solution is explored using CMA-ES. By constraining the exploration ranges to the neighborhood of the previously determined parameter values, the proposed method limits the output in a realistic range. The proposed method is applied to the tuning of 12 parameters for a container ship with five different widths of the exploration range. The results show that, in all cases, the accuracy of the maneuvering simulation is improved by applying the tuned parameters to the MMG model, and the validity of the proposed parameter fine-tuning method is confirmed.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Nonlinear steering control under input magnitude and rate constraints with exponential convergence
Authors:
Rin Suyama,
Satoshi Satoh,
Atsuo Maki
Abstract:
A ship steering control is designed for a nonlinear maneuvering model whose rudder manipulation is constrained in both magnitude and rate. In our method, the tracking problem of the target heading angle with input constraints is converted into the tracking problem for a strict-feedback system without any input constraints. To derive this system, hyperbolic tangent ($\tanh$) function and auxiliary…
▽ More
A ship steering control is designed for a nonlinear maneuvering model whose rudder manipulation is constrained in both magnitude and rate. In our method, the tracking problem of the target heading angle with input constraints is converted into the tracking problem for a strict-feedback system without any input constraints. To derive this system, hyperbolic tangent ($\tanh$) function and auxiliary variables are introduced to deal with the input constraints. Furthermore, using the feature of the derivative of $\tanh$ function, auxiliary systems are successfully derived in the strict-feedback form. The backstepping method is utilized to construct the feedback control law for the resulting cascade system. The proposed steering control is verified in numerical experiments, and the result shows that the tracking of the target heading angle is successful using the proposed control law.
△ Less
Submitted 25 October, 2023;
originally announced October 2023.
-
Discrete approximation of reflected Brownian motions by Markov chains on partitions of domains
Authors:
Masanori Hino,
Arata Maki,
Kouhei Matsuura
Abstract:
In this paper, we study discrete approximation of reflected Brownian motions on domains in Euclidean space. Our approximation is given by a sequence of Markov chains on partitions of the domain, where we allow uneven or random partitions. We provide sufficient conditions for the weak convergence of the Markov chains.
In this paper, we study discrete approximation of reflected Brownian motions on domains in Euclidean space. Our approximation is given by a sequence of Markov chains on partitions of the domain, where we allow uneven or random partitions. We provide sufficient conditions for the weak convergence of the Markov chains.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
Comparison of stochastic stability boundaries for parametrically forced systems with application to ship rolling motion
Authors:
Atsuo Maki,
Yuuki Maruyama,
Yaliu Liu,
Leo Dostal
Abstract:
Numerous accidents caused by parametric rolling have been reported on container ships and pure car carriers (PCCs). A number of theoretical studies have been performed to estimate the occurrence condition of parametric rolling in both regular and irregular seas. Some studies in random wave conditions have been the approximate extension of the occurrence conditions for regular waves (e.g. Maki et a…
▽ More
Numerous accidents caused by parametric rolling have been reported on container ships and pure car carriers (PCCs). A number of theoretical studies have been performed to estimate the occurrence condition of parametric rolling in both regular and irregular seas. Some studies in random wave conditions have been the approximate extension of the occurrence conditions for regular waves (e.g. Maki et al). Furthermore, several researches have been based on the stochastic process in ocean engineering (Roberts and Dostal). This study tackled the parametric rolling in irregular seas from the stability of the system's origin. It provided a novel theoretical explanation of the instability mechanism for two cases: white noise parametric excitation and colored noise parametric excitation. The authors then confirmed the usefulness of the previously provided formulae by Roberts and Dostal through numerical examples.
△ Less
Submitted 8 April, 2023;
originally announced September 2023.
-
Data Augmentation Methods of Parameter Identification of a Dynamic Model for Harbor Maneuvers
Authors:
Kouki Wakita,
Yoshiki Miyauchi,
Youhei Akimoto,
Atsuo Maki
Abstract:
A dynamic model for an automatic berthing and unberthing controller has to estimate harbor maneuvers, which include berthing, unberthing, approach maneuvers to berths, and entering and leaving the port. When the dynamic model is estimated by the system identification, a large number of tests or trials are required to measure the various motions of harbor maneuvers. However, the amount of data that…
▽ More
A dynamic model for an automatic berthing and unberthing controller has to estimate harbor maneuvers, which include berthing, unberthing, approach maneuvers to berths, and entering and leaving the port. When the dynamic model is estimated by the system identification, a large number of tests or trials are required to measure the various motions of harbor maneuvers. However, the amount of data that can be obtained is limited due to the high costs and time-consuming nature of full-scale ship trials. In this paper, we improve the generalization performance of the dynamic model for the automatic berthing and unberthing controller by introducing data augmentation. This study used slicing and jittering as data augmentation methods and confirmed their effectiveness by numerical experiments using the free-running model tests. The dynamic model is represented by a neural network-based model in numerical experiments. Results of numerical experiments demonstrated that slicing and jittering are effective data augmentation methods but could not improve generalization performance for extrapolation states of the original dataset.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Marginal Thresholding in Noisy Image Segmentation
Authors:
Marcus Nordström,
Henrik Hult,
Atsuto Maki
Abstract:
This work presents a study on label noise in medical image segmentation by considering a noise model based on Gaussian field deformations. Such noise is of interest because it yields realistic looking segmentations and because it is unbiased in the sense that the expected deformation is the identity mapping. Efficient methods for sampling and closed form solutions for the marginal probabilities ar…
▽ More
This work presents a study on label noise in medical image segmentation by considering a noise model based on Gaussian field deformations. Such noise is of interest because it yields realistic looking segmentations and because it is unbiased in the sense that the expected deformation is the identity mapping. Efficient methods for sampling and closed form solutions for the marginal probabilities are provided. Moreover, theoretically optimal solutions to the loss functions cross-entropy and soft-Dice are studied and it is shown how they diverge as the level of noise increases. Based on recent work on loss function characterization, it is shown that optimal solutions to soft-Dice can be recovered by thresholding solutions to cross-entropy with a particular a priori unknown threshold that efficiently can be computed. This raises the question whether the decrease in performance seen when using cross-entropy as compared to soft-Dice is caused by using the wrong threshold. The hypothesis is validated in 5-fold studies on three organ segmentation problems from the TotalSegmentor data set, using 4 different strengths of noise. The results show that changing the threshold leads the performance of cross-entropy to go from systematically worse than soft-Dice to similar or better results than soft-Dice.
△ Less
Submitted 8 July, 2023; v1 submitted 8 April, 2023;
originally announced April 2023.
-
Noisy Image Segmentation With Soft-Dice
Authors:
Marcus Nordström,
Henrik Hult,
Atsuto Maki,
Fredrik Löfman
Abstract:
This paper presents a study on the soft-Dice loss, one of the most popular loss functions in medical image segmentation, for situations where noise is present in target labels. In particular, the set of optimal solutions are characterized and sharp bounds on the volume bias of these solutions are provided. It is further shown that a sequence of soft segmentations converging to optimal soft-Dice al…
▽ More
This paper presents a study on the soft-Dice loss, one of the most popular loss functions in medical image segmentation, for situations where noise is present in target labels. In particular, the set of optimal solutions are characterized and sharp bounds on the volume bias of these solutions are provided. It is further shown that a sequence of soft segmentations converging to optimal soft-Dice also converges to optimal Dice when converted to hard segmentations using thresholding. This is an important result because soft-Dice is often used as a proxy for maximizing the Dice metric. Finally, experiments confirming the theoretical results are provided.
△ Less
Submitted 4 May, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
Time-series Anomaly Detection based on Difference Subspace between Signal Subspaces
Authors:
Takumi Kanai,
Naoya Sogi,
Atsuto Maki,
Kazuhiro Fukui
Abstract:
This paper proposes a new method for anomaly detection in time-series data by incorporating the concept of difference subspace into the singular spectrum analysis (SSA). The key idea is to monitor slight temporal variations of the difference subspace between two signal subspaces corresponding to the past and present time-series data, as anomaly score. It is a natural generalization of the conventi…
▽ More
This paper proposes a new method for anomaly detection in time-series data by incorporating the concept of difference subspace into the singular spectrum analysis (SSA). The key idea is to monitor slight temporal variations of the difference subspace between two signal subspaces corresponding to the past and present time-series data, as anomaly score. It is a natural generalization of the conventional SSA-based method which measures the minimum angle between the two signal subspaces as the degree of changes. By replacing the minimum angle with the difference subspace, our method boosts the performance while using the SSA-based framework as it can capture the whole structural difference between the two subspaces in its magnitude and direction. We demonstrate our method's effectiveness through performance evaluations on public time-series datasets.
△ Less
Submitted 4 April, 2023; v1 submitted 31 March, 2023;
originally announced March 2023.
-
Covariance Matrix Adaptation Evolutionary Strategy with Worst-Case Ranking Approximation for Min--Max Optimization and its Application to Berthing Control Tasks
Authors:
Atsuhiro Miyagi,
Yoshiki Miyauchi,
Atsuo Maki,
Kazuto Fukuchi,
Jun Sakuma,
Youhei Akimoto
Abstract:
In this study, we consider a continuous min--max optimization problem $\min_{x \in \mathbb{X} \max_{y \in \mathbb{Y}}}f(x,y)$ whose objective function is a black-box. We propose a novel approach to minimize the worst-case objective function $F(x) = \max_{y} f(x,y)$ directly using a covariance matrix adaptation evolution strategy (CMA-ES) in which the rankings of solution candidates are approximate…
▽ More
In this study, we consider a continuous min--max optimization problem $\min_{x \in \mathbb{X} \max_{y \in \mathbb{Y}}}f(x,y)$ whose objective function is a black-box. We propose a novel approach to minimize the worst-case objective function $F(x) = \max_{y} f(x,y)$ directly using a covariance matrix adaptation evolution strategy (CMA-ES) in which the rankings of solution candidates are approximated by our proposed worst-case ranking approximation (WRA) mechanism. We develop two variants of WRA combined with CMA-ES and approximate gradient ascent as numerical solvers for the inner maximization problem. Numerical experiments show that our proposed approach outperforms several existing approaches when the objective function is a smooth strongly convex--concave function and the interaction between $x$ and $y$ is strong. We investigate the advantages of the proposed approach for problems where the objective function is not limited to smooth strongly convex--concave functions. The effectiveness of the proposed approach is demonstrated in the robust berthing control problem with uncertainty.ngly convex--concave functions. The effectiveness of the proposed approach is demonstrated in the robust berthing control problem with uncertainty.
△ Less
Submitted 28 March, 2023;
originally announced March 2023.
-
Ship trajectory planning method for reproducing human operation at ports
Authors:
Rin Suyama,
Yoshiki Miyauchi,
Atsuo Maki
Abstract:
Among ship maneuvers, berthing/unberthing maneuvers are one of the most challenging and stressful phases for captains. Concerning burden reduction on ship operators and preventing accidents, several researches have been conducted on trajectory planning to automate berthing/unberthing. However, few studies have aimed at assisting captains in berthing/unberthing. The trajectory to be presented to th…
▽ More
Among ship maneuvers, berthing/unberthing maneuvers are one of the most challenging and stressful phases for captains. Concerning burden reduction on ship operators and preventing accidents, several researches have been conducted on trajectory planning to automate berthing/unberthing. However, few studies have aimed at assisting captains in berthing/unberthing. The trajectory to be presented to the captain should be a maneuver that reproduces human captain's control characteristics. The previously proposed methods cannot explicitly reflect the motion and navigation, which human captains pay particular attention to reduce the mental burden in the trajectory planning. Herein, mild constraints to the trajectory planning method are introduced. The constraints impose certain states (position, bow heading angle, ship speed, and yaw angular velocity), to be taken approximately at any given time. The introduction of this new constraint allows imposing careful trajectory planning (e.g., in-situ turns at zero speed or a pause for safety before going astern), as if performed by a human during berthing/unberthing. The algorithm proposed herein was used to optimize the berthing/unberthing trajectories for a large car ferry. The results show that this method can generate the quantitatively equivalent trajectory recorded in the actual berthing/unberthing maneuver performed by a human captain.
△ Less
Submitted 8 March, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
Development of a Mathematical Model for Harbor-Maneuvers to Realize Modeling Automation
Authors:
Yoshiki Miyauchi,
Youhei Akimoto,
Naoya Umeda,
Atsuo Maki
Abstract:
A simulation environment of harbor maneuvers is critical for developing automatic berthing. Dynamic models are widely used to estimate harbor maneuvers. However, human decision-making and data analysis are necessary to derive, select, and identify the model because each actuator configuration needs an inherent mathematical expression. We proposed a new dynamic model for arbitrary configurations to…
▽ More
A simulation environment of harbor maneuvers is critical for developing automatic berthing. Dynamic models are widely used to estimate harbor maneuvers. However, human decision-making and data analysis are necessary to derive, select, and identify the model because each actuator configuration needs an inherent mathematical expression. We proposed a new dynamic model for arbitrary configurations to overcome that issue. The new model is a hybrid model that combines the simplicity of the derivation of the Taylor expansion and the high degree of freedom of the MMG low-speed maneuvering model. We also developed a method to select mathematical expressions for the proposed model using system identification. Because the proposed model can easily derive mathematical expressions, we can generate multiple models simultaneously and choose the best one. This method can reduce the workload of model identification and selection. Furthermore, the proposed method will enable the automatic generation of dynamic models because it can reduce human decision-making and data analysis for the model generation due to its less dependency on the knowledge of ship hydrodynamics and captive model test. The proposed method was validated with free-running model tests and showed equivalent or better estimation performance than the conventional model generation method.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
Experimental Low-speed Positioning System with VecTwin Rudder for Automatic Docking (Berthing)
Authors:
Dimas M. Rachman,
Yusuke Aoki,
Yoshiki Miyauchi,
Naoya Umeda,
Atsuo Maki
Abstract:
A VecTwin rudder system comprises twin fishtail rudders with reaction fins to increase its performance. With a constant propeller revolution number, the vessel can execute special low-speed maneuvers like hover, crabbing, reverse, and rotation. Such low-speed maneuvers are termed dynamic positioning (DP), and a DP vessel should be fully/overly actuated with several thrusters. This article introduc…
▽ More
A VecTwin rudder system comprises twin fishtail rudders with reaction fins to increase its performance. With a constant propeller revolution number, the vessel can execute special low-speed maneuvers like hover, crabbing, reverse, and rotation. Such low-speed maneuvers are termed dynamic positioning (DP), and a DP vessel should be fully/overly actuated with several thrusters. This article introduces a novel and experimental VecTwin positioning system (VTPS) without making the ship fully/overly actuated. Unlike the usual dynamic positioning system (DPS), the VTPS is developed for low-speed operations in a calm harbor area. It is designed upon an assumption that the forces due to the interaction between the rudders, the propeller, and the hull are linear with the rudder angles within a range around the hover rudder angle. The linear relationship is obtained through linear regression of the results from several CFD simulations. The VTPS implements a PID controller that regulates the actuator forces to achieve the given low-speed positioning objective. It was tested in combined automatic docking and position-keeping experiments where disturbances from the environment exist. It shows promising potential for a practical application but with further improvements.
△ Less
Submitted 19 December, 2022;
originally announced December 2022.
-
Collision probability reduction method for tracking control in automatic docking / berthing using reinforcement learning
Authors:
Kouki Wakita,
Youhei Akimoto,
Dimas M. Rachman,
Yoshiki Miyauchi,
Umeda Naoya,
Atsuo Maki
Abstract:
Automation of berthing maneuvers in shipping is a pressing issue as the berthing maneuver is one of the most stressful tasks seafarers undertake. Berthing control problems are often tackled via tracking a predefined trajectory or path. Maintaining a tracking error of zero under an uncertain environment is impossible; the tracking controller is nonetheless required to bring vessels close to desired…
▽ More
Automation of berthing maneuvers in shipping is a pressing issue as the berthing maneuver is one of the most stressful tasks seafarers undertake. Berthing control problems are often tackled via tracking a predefined trajectory or path. Maintaining a tracking error of zero under an uncertain environment is impossible; the tracking controller is nonetheless required to bring vessels close to desired berths. The tracking controller must prioritize the avoidance of tracking errors that may cause collisions with obstacles. This paper proposes a training method based on reinforcement learning for a trajectory tracking controller that reduces the probability of collisions with static obstacles. Via numerical simulations, we show that the proposed method reduces the probability of collisions during berthing maneuvers. Furthermore, this paper shows the tracking performance in a model experiment.
△ Less
Submitted 13 December, 2022; v1 submitted 13 December, 2022;
originally announced December 2022.
-
Dense FixMatch: a simple semi-supervised learning method for pixel-wise prediction tasks
Authors:
Miquel Martí i Rabadán,
Alessandro Pieropan,
Hossein Azizpour,
Atsuto Maki
Abstract:
We propose Dense FixMatch, a simple method for online semi-supervised learning of dense and structured prediction tasks combining pseudo-labeling and consistency regularization via strong data augmentation. We enable the application of FixMatch in semi-supervised learning problems beyond image classification by adding a matching operation on the pseudo-labels. This allows us to still use the full…
▽ More
We propose Dense FixMatch, a simple method for online semi-supervised learning of dense and structured prediction tasks combining pseudo-labeling and consistency regularization via strong data augmentation. We enable the application of FixMatch in semi-supervised learning problems beyond image classification by adding a matching operation on the pseudo-labels. This allows us to still use the full strength of data augmentation pipelines, including geometric transformations. We evaluate it on semi-supervised semantic segmentation on Cityscapes and Pascal VOC with different percentages of labeled data and ablate design choices and hyper-parameters. Dense FixMatch significantly improves results compared to supervised learning using only labeled data, approaching its performance with 1/4 of the labeled samples.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
Towards a Unified View of Affinity-Based Knowledge Distillation
Authors:
Vladimir Li,
Atsuto Maki
Abstract:
Knowledge transfer between artificial neural networks has become an important topic in deep learning. Among the open questions are what kind of knowledge needs to be preserved for the transfer, and how it can be effectively achieved. Several recent work have shown good performance of distillation methods using relation-based knowledge. These algorithms are extremely attractive in that they are bas…
▽ More
Knowledge transfer between artificial neural networks has become an important topic in deep learning. Among the open questions are what kind of knowledge needs to be preserved for the transfer, and how it can be effectively achieved. Several recent work have shown good performance of distillation methods using relation-based knowledge. These algorithms are extremely attractive in that they are based on simple inter-sample similarities. Nevertheless, a proper metric of affinity and use of it in this context is far from well understood. In this paper, by explicitly modularising knowledge distillation into a framework of three components, i.e. affinity, normalisation, and loss, we give a unified treatment of these algorithms as well as study a number of unexplored combinations of the modules. With this framework we perform extensive evaluations of numerous distillation objectives for image classification, and obtain a few useful insights for effective design choices while demonstrating how relation-based knowledge distillation could achieve comparable performance to the state of the art in spite of the simplicity.
△ Less
Submitted 30 September, 2022;
originally announced September 2022.
-
Stochastic Assessment of Acceleration Probability Density Function for Parametric Rolling Using Moment Method
Authors:
Yuuki Maruyama,
Atsuo Maki,
Leo Dostal,
Naoya Umeda
Abstract:
Container ships encounter large roll angles and high acceleration, and container loss remains a problem. This study proposes a method for calculating the probability density function~(PDF) of roll angular and cargo lateral accelerations. First, the moment values of these accelerations are derived using the linearity of expectation and the validity of this method is examined. Second, the PDF shapes…
▽ More
Container ships encounter large roll angles and high acceleration, and container loss remains a problem. This study proposes a method for calculating the probability density function~(PDF) of roll angular and cargo lateral accelerations. First, the moment values of these accelerations are derived using the linearity of expectation and the validity of this method is examined. Second, the PDF shapes of these accelerations are proposed and their coefficients are determined using the obtained moment values. Our proposed method can be used to derive the PDFs of roll angular and cargo lateral accelerations.
△ Less
Submitted 24 September, 2022;
originally announced September 2022.
-
Revisiting a kNN-based Image Classification System with High-capacity Storage
Authors:
Kengo Nakata,
Youyang Ng,
Daisuke Miyashita,
Asuka Maki,
Yu-Chieh Lin,
Jun Deguchi
Abstract:
In existing image classification systems that use deep neural networks, the knowledge needed for image classification is implicitly stored in model parameters. If users want to update this knowledge, then they need to fine-tune the model parameters. Moreover, users cannot verify the validity of inference results or evaluate the contribution of knowledge to the results. In this paper, we investigat…
▽ More
In existing image classification systems that use deep neural networks, the knowledge needed for image classification is implicitly stored in model parameters. If users want to update this knowledge, then they need to fine-tune the model parameters. Moreover, users cannot verify the validity of inference results or evaluate the contribution of knowledge to the results. In this paper, we investigate a system that stores knowledge for image classification, such as image feature maps, labels, and original images, not in model parameters but in external high-capacity storage. Our system refers to the storage like a database when classifying input images. To increase knowledge, our system updates the database instead of fine-tuning model parameters, which avoids catastrophic forgetting in incremental learning scenarios. We revisit a kNN (k-Nearest Neighbor) classifier and employ it in our system. By analyzing the neighborhood samples referred by the kNN algorithm, we can interpret how knowledge learned in the past is used for inference results. Our system achieves 79.8% top-1 accuracy on the ImageNet dataset without fine-tuning model parameters after pretraining, and 90.8% accuracy on the Split CIFAR-100 dataset in the task incremental learning setting.
△ Less
Submitted 28 July, 2022; v1 submitted 3 April, 2022;
originally announced April 2022.
-
An analysis of over-sampling labeled data in semi-supervised learning with FixMatch
Authors:
Miquel Martí i Rabadán,
Sebastian Bujwid,
Alessandro Pieropan,
Hossein Azizpour,
Atsuto Maki
Abstract:
Most semi-supervised learning methods over-sample labeled data when constructing training mini-batches. This paper studies whether this common practice improves learning and how. We compare it to an alternative setting where each mini-batch is uniformly sampled from all the training data, labeled or not, which greatly reduces direct supervision from true labels in typical low-label regimes. Howeve…
▽ More
Most semi-supervised learning methods over-sample labeled data when constructing training mini-batches. This paper studies whether this common practice improves learning and how. We compare it to an alternative setting where each mini-batch is uniformly sampled from all the training data, labeled or not, which greatly reduces direct supervision from true labels in typical low-label regimes. However, this simpler setting can also be seen as more general and even necessary in multi-task problems where over-sampling labeled data would become intractable. Our experiments on semi-supervised CIFAR-10 image classification using FixMatch show a performance drop when using the uniform sampling approach which diminishes when the amount of labeled data or the training time increases. Further, we analyse the training dynamics to understand how over-sampling of labeled data compares to uniform sampling. Our main finding is that over-sampling is especially beneficial early in training but gets less important in the later stages when more pseudo-labels become correct. Nevertheless, we also find that keeping some true labels remains important to avoid the accumulation of confirmation errors from incorrect pseudo-labels.
△ Less
Submitted 8 April, 2022; v1 submitted 3 January, 2022;
originally announced January 2022.
-
Warm-started Semionline Trajectory Planner for Ship's Automatic Docking (Berthing)
Authors:
Dimas M. Rachman,
Atsuo Maki,
Yoshiki Miyauchi,
Naoya Umeda
Abstract:
In the usual framework of control, a reference trajectory is needed as the set point for a feedback controller. This reference trajectory can be generated by solving a trajectory optimization problem. This problem is a continuous optimal control problem (OCP) that is transcribed into a finite-dimensional nonlinear optimization problem (NLP) and solved by SQP. For an underactuated conventional vess…
▽ More
In the usual framework of control, a reference trajectory is needed as the set point for a feedback controller. This reference trajectory can be generated by solving a trajectory optimization problem. This problem is a continuous optimal control problem (OCP) that is transcribed into a finite-dimensional nonlinear optimization problem (NLP) and solved by SQP. For an underactuated conventional vessel, the mathematical model can be very intricate, hence the NLP itself. This causes significant computational time. This article demonstrates that the balance between the feasibility of the reference trajectory and the computational time can be achieved for an underactuated vessel in a disturbed and restricted environment. This is done by: (1) using an almost-globally optimal offline solution as a warm start in a semionline trajectory optimization to speed up the calculation, (2) including the prediction of wind dynamics, and (3) representing the ship as a rigid body and using a predefined boundary to generate the necessary spatial constraints via a point-in-polygon method that ensure a collision-free trajectory in a nonconvex region. Incorporation of these three things maintains a safe and dynamically feasible trajectory where the warm start gives a considerable computational speedup and better results than that without a warm start.
△ Less
Submitted 7 April, 2022; v1 submitted 21 December, 2021;
originally announced December 2021.
-
Application of Linear Filter and Moment Equation for Parametric Rolling in Irregular Longitudinal Waves
Authors:
Yuuki Maruyama,
Atsuo Maki,
Leo Dostal,
Naoya Umeda
Abstract:
Parametric rolling is one of the dangerous dynamic phenomena. In order to discuss the safety of a vessel when a dangerous phenomenon occurs, it is important to estimate the probability of certain dynamical behavior of the ship with respect to a certain threshold level. In this paper, the moment values are obtained by solving the moment equations. Since the stochastic differential equation(SDE) is…
▽ More
Parametric rolling is one of the dangerous dynamic phenomena. In order to discuss the safety of a vessel when a dangerous phenomenon occurs, it is important to estimate the probability of certain dynamical behavior of the ship with respect to a certain threshold level. In this paper, the moment values are obtained by solving the moment equations. Since the stochastic differential equation(SDE) is needed to obtain the moment equations, the Autoregressive Moving Average(ARMA) filter is used. The effective wave is modeled by using the 6th-order ARMA filter. In addition, the parametric excitation process is modeled by using a non-memory transformation obtained from the relationship between GM and wave elevation. The resulting system of equations is represented by the 8th-order Itô stochastic differential equation, which consists of a second-order SDE for the ship motion and a 6th-order SDE for the effective wave. This system has nonlinear components. Therefore, the cumulant neglect closure method is used as higher-order moments need to be truncated. Furthermore, the probability density function of roll angle is determined by using moment values obtained from the SDE and the moment equation. Here, two types of the probability density function are suggested and have a good agreement.
△ Less
Submitted 1 December, 2022; v1 submitted 13 December, 2021;
originally announced December 2021.
-
System Parameter Exploration of Ship Maneuvering Model for Automatic Docking / Berthing using CMA-ES
Authors:
Yoshiki Miyauchi,
Atsuo Maki,
Naoya Umeda,
Dimas M. Rachman,
Youhei Akimoto
Abstract:
Accurate maneuvering estimation is essential to establish autonomous berthing control. The system-based mathematical model is widely used to estimate the ship's maneuver. Commonly, the system parameters of the mathematical model are obtained by the captive model test (CMT), which is time-consuming to construct an accurate model suitable for complex berthing maneuvers. System identification (SI) is…
▽ More
Accurate maneuvering estimation is essential to establish autonomous berthing control. The system-based mathematical model is widely used to estimate the ship's maneuver. Commonly, the system parameters of the mathematical model are obtained by the captive model test (CMT), which is time-consuming to construct an accurate model suitable for complex berthing maneuvers. System identification (SI) is an alternative to constructing the mathematical model. However, SI on the mathematical model of ship's maneuver has been only conducted on much simpler maneuver: turning and zig-zag. Therefore, this study investigates the SI on a mathematical model capable of berthing maneuver. The main contributions of this study are as follows: (i) construct the system-based mathematical model on berthing by optimizing system parameters with a reduced amount of model tests than the CMT-based scheme; (ii) Find the favorable choice of objective function and type of training data for optimization. Global optimization scheme CMA-ES explored the system parameters of the MMG model from the free-running model's trajectories. The berthing simulation with the parameters obtained by the proposed method showed better agreement with the free-running model test than parameters obtained by the CMT. Furthermore, the proposed method required fewer data amounts than a CMT-based scheme.
△ Less
Submitted 11 November, 2021;
originally announced November 2021.
-
On Neural Network Identification for Low-Speed Ship Maneuvering Model
Authors:
Kouki Wakita,
Atsuo Maki,
Umeda Naoya,
Yoshiki Miyauchi,
Tohga Shimoji,
Dimas M. Rachman,
Youhei Akimoto
Abstract:
Several studies on ship maneuvering models have been conducted using captive model tests or computational fluid dynamics (CFD) and physical models, such as the maneuvering modeling group (MMG) model. A new system identification method for generating a low-speed maneuvering model using recurrent neural networks (RNNs) and free running model tests is proposed in this study. We especially focus on a…
▽ More
Several studies on ship maneuvering models have been conducted using captive model tests or computational fluid dynamics (CFD) and physical models, such as the maneuvering modeling group (MMG) model. A new system identification method for generating a low-speed maneuvering model using recurrent neural networks (RNNs) and free running model tests is proposed in this study. We especially focus on a low-speed maneuver such as the final phase in berthing to achieve automatic berthing control. Accurate dynamic modeling with minimum modeling error is highly desired to establish a model-based control system. We propose a new loss function that reduces the effect of the noise included in the training data. Besides, we revealed the following facts - an RNN that ignores the memory before a certain time improved the prediction accuracy compared with the "standard" RNN, and the random maneuver test was effective in obtaining an accurate berthing maneuver model. In addition, several low-speed free running model tests were performed for the scale model of the M.V. Esso Osaka. As a result, this paper showed that the proposed method using a neural network model could accurately represent low-speed maneuvering motions.
△ Less
Submitted 11 November, 2021;
originally announced November 2021.
-
Optimization on Planning of Trajectory and Control of Autonomous Berthing and Unberthing for the Realistic Port Geometry
Authors:
Yoshiki Miyauchi,
Ryohei Sawada,
Youhei Akimoto,
Naoya Umeda,
Atsuo Maki
Abstract:
To realize autonomous shipping, autonomous berthing and unberthing are some of the technical challenges. In the past, numerous research have been done on the optimization of trajectory planning of berthing problems. However, these studies assumed only a simple berth and did not consider obstacles. Optimization of trajectory planning on berthing and unberthing in actual ports must consider the spat…
▽ More
To realize autonomous shipping, autonomous berthing and unberthing are some of the technical challenges. In the past, numerous research have been done on the optimization of trajectory planning of berthing problems. However, these studies assumed only a simple berth and did not consider obstacles. Optimization of trajectory planning on berthing and unberthing in actual ports must consider the spatial constraints and maintain sufficient distance to obstacles. The main contributions of this study are as follows: (i) a collision avoidance algorithm based on the ship domain which has variable size by the ship speed is proposed, to include the spatial constraints to optimization; (ii) the effect of wind disturbance is taken into account to the trajectory planning to make a feasible trajectory based on the capacity limit of actuators; (iii) showing that the optimization method for berthing is also eligible for the unberthing, which has been almost neglected; (iv) waypoints are included to the optimization process, to make optimization easier on practical applications. The authors tested the proposed method on two existing ports. The proposed method performed well on both the berthing and the unberthing problem and optimized the control input and the trajectory while avoiding collision with the complex obstacles.
△ Less
Submitted 12 January, 2022; v1 submitted 4 June, 2021;
originally announced June 2021.
-
Saddle Point Optimization with Approximate Minimization Oracle and its Application to Robust Berthing Control
Authors:
Youhei Akimoto,
Yoshiki Miyauchi,
Atsuo Maki
Abstract:
We propose an approach to saddle point optimization relying only on oracles that solve minimization problems approximately. We analyze its convergence property on a strongly convex--concave problem and show its linear convergence toward the global min--max saddle point. Based on the convergence analysis, we develop a heuristic approach to adapt the learning rate. An implementation of the developed…
▽ More
We propose an approach to saddle point optimization relying only on oracles that solve minimization problems approximately. We analyze its convergence property on a strongly convex--concave problem and show its linear convergence toward the global min--max saddle point. Based on the convergence analysis, we develop a heuristic approach to adapt the learning rate. An implementation of the developed approach using the (1+1)-CMA-ES as the minimization oracle, namely Adversarial-CMA-ES, is shown to outperform several existing approaches on test problems. Numerical evaluation confirms the tightness of the theoretical convergence rate bound as well as the efficiency of the learning rate adaptation mechanism. As an example of real-world problems, the suggested optimization method is applied to automatic berthing control problems under model uncertainties, showing its usefulness in obtaining solutions robust to uncertainty.
△ Less
Submitted 4 January, 2022; v1 submitted 24 May, 2021;
originally announced May 2021.
-
Discriminant analysis based on projection onto generalized difference subspace
Authors:
Kazuhiro Fukui,
Naoya Sogi,
Takumi Kobayashi,
Jing-Hao Xue,
Atsuto Maki
Abstract:
This paper discusses a new type of discriminant analysis based on the orthogonal projection of data onto a generalized difference subspace (GDS). In our previous work, we have demonstrated that GDS projection works as the quasi-orthogonalization of class subspaces, which is an effective feature extraction for subspace based classifiers. Interestingly, GDS projection also works as a discriminant fe…
▽ More
This paper discusses a new type of discriminant analysis based on the orthogonal projection of data onto a generalized difference subspace (GDS). In our previous work, we have demonstrated that GDS projection works as the quasi-orthogonalization of class subspaces, which is an effective feature extraction for subspace based classifiers. Interestingly, GDS projection also works as a discriminant feature extraction through a similar mechanism to the Fisher discriminant analysis (FDA). A direct proof of the connection between GDS projection and FDA is difficult due to the significant difference in their formulations. To avoid the difficulty, we first introduce geometrical Fisher discriminant analysis (gFDA) based on a simplified Fisher criterion. Our simplified Fisher criterion is derived from a heuristic yet practically plausible principle: the direction of the sample mean vector of a class is in most cases almost equal to that of the first principal component vector of the class, under the condition that the principal component vectors are calculated by applying the principal component analysis (PCA) without data centering. gFDA can work stably even under few samples, bypassing the small sample size (SSS) problem of FDA. Next, we prove that gFDA is equivalent to GDS projection with a small correction term. This equivalence ensures GDS projection to inherit the discriminant ability from FDA via gFDA. Furthermore, to enhance the performances of gFDA and GDS projection, we normalize the projected vectors on the discriminant spaces. Extensive experiments using the extended Yale B+ database and the CMU face database show that gFDA and GDS projection have equivalent or better performance than the original FDA and its extensions.
△ Less
Submitted 29 October, 2019; v1 submitted 29 October, 2019;
originally announced October 2019.
-
Regularizing CNN Transfer Learning with Randomised Regression
Authors:
Yang Zhong,
Atsuto Maki
Abstract:
This paper is about regularizing deep convolutional networks (CNNs) based on an adaptive framework for transfer learning with limited training data in the target domain. Recent advances of CNN regularization in this context are commonly due to the use of additional regularization objectives. They guide the training away from the target task using some forms of concrete tasks. Unlike those related…
▽ More
This paper is about regularizing deep convolutional networks (CNNs) based on an adaptive framework for transfer learning with limited training data in the target domain. Recent advances of CNN regularization in this context are commonly due to the use of additional regularization objectives. They guide the training away from the target task using some forms of concrete tasks. Unlike those related approaches, we suggest that an objective without a concrete goal can still serve well as a regularized. In particular, we demonstrate Pseudo-task Regularization (PtR) which dynamically regularizes a network by simply attempting to regress image representations to pseudo-regression targets during fine-tuning. That is, a CNN is efficiently regularized without additional resources of data or prior domain expertise. In sum, the proposed PtR provides: a) an alternative for network regularization without dependence on the design of concrete regularization objectives or extra annotations; b) a dynamically adjusted and maintained strength of regularization effect by balancing the gradient norms between objectives on-line. Through numerous experiments, surprisingly, the improvements on classification accuracy by PtR are shown greater or on a par to the recent state-of-the-art methods.
△ Less
Submitted 28 April, 2020; v1 submitted 16 August, 2019;
originally announced August 2019.
-
Deep Active Learning for Efficient Training of a LiDAR 3D Object Detector
Authors:
Di Feng,
Xiao Wei,
Lars Rosenbaum,
Atsuto Maki,
Klaus Dietmayer
Abstract:
Training a deep object detector for autonomous driving requires a huge amount of labeled data. While recording data via on-board sensors such as camera or LiDAR is relatively easy, annotating data is very tedious and time-consuming, especially when dealing with 3D LiDAR points or radar data. Active learning has the potential to minimize human annotation efforts while maximizing the object detector…
▽ More
Training a deep object detector for autonomous driving requires a huge amount of labeled data. While recording data via on-board sensors such as camera or LiDAR is relatively easy, annotating data is very tedious and time-consuming, especially when dealing with 3D LiDAR points or radar data. Active learning has the potential to minimize human annotation efforts while maximizing the object detector's performance. In this work, we propose an active learning method to train a LiDAR 3D object detector with the least amount of labeled training data necessary. The detector leverages 2D region proposals generated from the RGB images to reduce the search space of objects and speed up the learning process. Experiments show that our proposed method works under different uncertainty estimations and query functions, and can save up to 60% of the labeling efforts while reaching the same network performance.
△ Less
Submitted 5 May, 2019; v1 submitted 29 January, 2019;
originally announced January 2019.
-
Target Aware Network Adaptation for Efficient Representation Learning
Authors:
Yang Zhong,
Vladimir Li,
Ryuzo Okada,
Atsuto Maki
Abstract:
This paper presents an automatic network adaptation method that finds a ConvNet structure well-suited to a given target task, e.g., image classification, for efficiency as well as accuracy in transfer learning. We call the concept target-aware transfer learning. Given only small-scale labeled data, and starting from an ImageNet pre-trained network, we exploit a scheme of removing its potential red…
▽ More
This paper presents an automatic network adaptation method that finds a ConvNet structure well-suited to a given target task, e.g., image classification, for efficiency as well as accuracy in transfer learning. We call the concept target-aware transfer learning. Given only small-scale labeled data, and starting from an ImageNet pre-trained network, we exploit a scheme of removing its potential redundancy for the target task through iterative operations of filter-wise pruning and network optimization. The basic motivation is that compact networks are on one hand more efficient and should also be more tolerant, being less complex, against the risk of overfitting which would hinder the generalization of learned representations in the context of transfer learning. Further, unlike existing methods involving network simplification, we also let the scheme identify redundant portions across the entire network, which automatically results in a network structure adapted to the task at hand. We achieve this with a few novel ideas: (i) cumulative sum of activation statistics for each layer, and (ii) a priority evaluation of pruning across multiple layers. Experimental results by the method on five datasets (Flower102, CUB200-2011, Dog120, MIT67, and Stanford40) show favorable accuracies over the related state-of-the-art techniques while enhancing the computational and storage efficiency of the transferred model.
△ Less
Submitted 2 October, 2018;
originally announced October 2018.
-
A multitask deep learning model for real-time deployment in embedded systems
Authors:
Miquel Martí,
Atsuto Maki
Abstract:
We propose an approach to Multitask Learning (MTL) to make deep learning models faster and lighter for applications in which multiple tasks need to be solved simultaneously, which is particularly useful in embedded, real-time systems. We develop a multitask model for both Object Detection and Semantic Segmentation and analyze the challenges that appear during its training. Our multitask network is…
▽ More
We propose an approach to Multitask Learning (MTL) to make deep learning models faster and lighter for applications in which multiple tasks need to be solved simultaneously, which is particularly useful in embedded, real-time systems. We develop a multitask model for both Object Detection and Semantic Segmentation and analyze the challenges that appear during its training. Our multitask network is 1.6x faster, lighter and uses less memory than deploying the single-task models in parallel. We conclude that MTL has the potential to give superior performance in exchange of a more complex training process that introduces challenges not present in single-task models.
△ Less
Submitted 31 October, 2017;
originally announced November 2017.
-
A systematic study of the class imbalance problem in convolutional neural networks
Authors:
Mateusz Buda,
Atsuto Maki,
Maciej A. Mazurowski
Abstract:
In this study, we systematically investigate the impact of class imbalance on classification performance of convolutional neural networks (CNNs) and compare frequently used methods to address the issue. Class imbalance is a common problem that has been comprehensively studied in classical machine learning, yet very limited systematic research is available in the context of deep learning. In our st…
▽ More
In this study, we systematically investigate the impact of class imbalance on classification performance of convolutional neural networks (CNNs) and compare frequently used methods to address the issue. Class imbalance is a common problem that has been comprehensively studied in classical machine learning, yet very limited systematic research is available in the context of deep learning. In our study, we use three benchmark datasets of increasing complexity, MNIST, CIFAR-10 and ImageNet, to investigate the effects of imbalance on classification and perform an extensive comparison of several methods to address the issue: oversampling, undersampling, two-phase training, and thresholding that compensates for prior class probabilities. Our main evaluation metric is area under the receiver operating characteristic curve (ROC AUC) adjusted to multi-class tasks since overall accuracy metric is associated with notable difficulties in the context of imbalanced data. Based on results from our experiments we conclude that (i) the effect of class imbalance on classification performance is detrimental; (ii) the method of addressing class imbalance that emerged as dominant in almost all analyzed scenarios was oversampling; (iii) oversampling should be applied to the level that completely eliminates the imbalance, whereas the optimal undersampling ratio depends on the extent of imbalance; (iv) as opposed to some classical machine learning models, oversampling does not cause overfitting of CNNs; (v) thresholding should be applied to compensate for prior class probabilities when overall number of properly classified cases is of interest.
△ Less
Submitted 12 October, 2018; v1 submitted 15 October, 2017;
originally announced October 2017.
-
Power packet transferability via symbol propagation matrix
Authors:
Shinya Nawata,
Atsuto Maki,
Takashi Hikihara
Abstract:
Power packet is a unit of electric power transferred by a power pulse with an information tag. In Shannon's information theory, messages are represented by symbol sequences in a digitized manner. Referring to this formulation, we define symbols in power packetization as a minimum unit of power transferred by a tagged pulse. Here, power is digitized and quantized. In this paper, we consider packeti…
▽ More
Power packet is a unit of electric power transferred by a power pulse with an information tag. In Shannon's information theory, messages are represented by symbol sequences in a digitized manner. Referring to this formulation, we define symbols in power packetization as a minimum unit of power transferred by a tagged pulse. Here, power is digitized and quantized. In this paper, we consider packetized power in networks for a finite duration, giving symbols and their energies to the networks. A network structure is defined using a graph whose nodes represent routers, sources, and destinations. First, we introduce symbol propagation matrix (SPM) in which symbols are transferred at links during unit times. Packetized power is described as a network flow in a spatio-temporal structure. Then, we study the problem of selecting an SPM in terms of transferability, that is, the possibility to represent given energies at sources and destinations during the finite duration. To select an SPM, we consider a network flow problem of packetized power. The problem is formulated as an M-convex submodular flow problem which is known as generalization of the minimum cost flow problem and solvable. Finally, through examples, we verify that this formulation provides reasonable packetized power.
△ Less
Submitted 8 August, 2017;
originally announced August 2017.
-
Deep Predictive Policy Training using Reinforcement Learning
Authors:
Ali Ghadirzadeh,
Atsuto Maki,
Danica Kragic,
Mårten Björkman
Abstract:
Skilled robot task learning is best implemented by predictive action policies due to the inherent latency of sensorimotor processes. However, training such predictive policies is challenging as it involves finding a trajectory of motor activations for the full duration of the action. We propose a data-efficient deep predictive policy training (DPPT) framework with a deep neural network policy arch…
▽ More
Skilled robot task learning is best implemented by predictive action policies due to the inherent latency of sensorimotor processes. However, training such predictive policies is challenging as it involves finding a trajectory of motor activations for the full duration of the action. We propose a data-efficient deep predictive policy training (DPPT) framework with a deep neural network policy architecture which maps an image observation to a sequence of motor activations. The architecture consists of three sub-networks referred to as the perception, policy and behavior super-layers. The perception and behavior super-layers force an abstraction of visual and motor data trained with synthetic and simulated training samples, respectively. The policy super-layer is a small sub-network with fewer parameters that maps data in-between the abstracted manifolds. It is trained for each task using methods for policy search reinforcement learning. We demonstrate the suitability of the proposed architecture and learning framework by training predictive policies for skilled object grasping and ball throwing on a PR2 robot. The effectiveness of the method is illustrated by the fact that these tasks are trained using only about 180 real robot attempts with qualitative terminal rewards.
△ Less
Submitted 2 March, 2017;
originally announced March 2017.
-
A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction
Authors:
Ali Ghadirzadeh,
Judith Bütepage,
Atsuto Maki,
Danica Kragic,
Mårten Björkman
Abstract:
Modeling of physical human-robot collaborations is generally a challenging problem due to the unpredictive nature of human behavior. To address this issue, we present a data-efficient reinforcement learning framework which enables a robot to learn how to collaborate with a human partner. The robot learns the task from its own sensorimotor experiences in an unsupervised manner. The uncertainty of t…
▽ More
Modeling of physical human-robot collaborations is generally a challenging problem due to the unpredictive nature of human behavior. To address this issue, we present a data-efficient reinforcement learning framework which enables a robot to learn how to collaborate with a human partner. The robot learns the task from its own sensorimotor experiences in an unsupervised manner. The uncertainty of the human actions is modeled using Gaussian processes (GP) to implement action-value functions. Optimal action selection given the uncertain GP model is ensured by Bayesian optimization. We apply the framework to a scenario in which a human and a PR2 robot jointly control the ball position on a plank based on vision and force/torque data. Our experimental results show the suitability of the proposed method in terms of fast and data-efficient model learning, optimal action selection under uncertainties and equal role sharing between the partners.
△ Less
Submitted 26 July, 2016;
originally announced July 2016.
-
Visual Instance Retrieval with Deep Convolutional Networks
Authors:
Ali Sharif Razavian,
Josephine Sullivan,
Stefan Carlsson,
Atsuto Maki
Abstract:
This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval. Besides the choice of convolutional layers, we present an efficient pipeline exploiting multi-scale schemes to extract local features, in particular, by taking geometric invariance into explicit account, i.e. positions, scales and…
▽ More
This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval. Besides the choice of convolutional layers, we present an efficient pipeline exploiting multi-scale schemes to extract local features, in particular, by taking geometric invariance into explicit account, i.e. positions, scales and spatial consistency. In our experiments using five standard image retrieval datasets, we demonstrate that generic ConvNet image representations can outperform other state-of-the-art methods if they are extracted appropriately.
△ Less
Submitted 9 May, 2016; v1 submitted 19 December, 2014;
originally announced December 2014.
-
Persistent Evidence of Local Image Properties in Generic ConvNets
Authors:
Ali Sharif Razavian,
Hossein Azizpour,
Atsuto Maki,
Josephine Sullivan,
Carl Henrik Ek,
Stefan Carlsson
Abstract:
Supervised training of a convolutional network for object classification should make explicit any information related to the class of objects and disregard any auxiliary information associated with the capture of the image or the variation within the object class. Does this happen in practice? Although this seems to pertain to the very final layers in the network, if we look at earlier layers we f…
▽ More
Supervised training of a convolutional network for object classification should make explicit any information related to the class of objects and disregard any auxiliary information associated with the capture of the image or the variation within the object class. Does this happen in practice? Although this seems to pertain to the very final layers in the network, if we look at earlier layers we find that this is not the case. Surprisingly, strong spatial information is implicit. This paper addresses this, in particular, exploiting the image representation at the first fully connected layer, i.e. the global image descriptor which has been recently shown to be most effective in a range of visual recognition tasks. We empirically demonstrate evidences for the finding in the contexts of four different tasks: 2d landmark detection, 2d object keypoints prediction, estimation of the RGB values of input image, and recovery of semantic label of each pixel. We base our investigation on a simple framework with ridge rigression commonly across these tasks, and show results which all support our insight. Such spatial information can be used for computing correspondence of landmarks to a good accuracy, but should potentially be useful for improving the training of the convolutional nets for classification purposes.
△ Less
Submitted 24 November, 2014;
originally announced November 2014.
-
Factors of Transferability for a Generic ConvNet Representation
Authors:
Hossein Azizpour,
Ali Sharif Razavian,
Josephine Sullivan,
Atsuto Maki,
Stefan Carlsson
Abstract:
Evidence is mounting that Convolutional Networks (ConvNets) are the most effective representation learning method for visual recognition tasks. In the common scenario, a ConvNet is trained on a large labeled dataset (source) and the feed-forward units activation of the trained network, at a certain layer of the network, is used as a generic representation of an input image for a task with relative…
▽ More
Evidence is mounting that Convolutional Networks (ConvNets) are the most effective representation learning method for visual recognition tasks. In the common scenario, a ConvNet is trained on a large labeled dataset (source) and the feed-forward units activation of the trained network, at a certain layer of the network, is used as a generic representation of an input image for a task with relatively smaller training set (target). Recent studies have shown this form of representation transfer to be suitable for a wide range of target visual recognition tasks. This paper introduces and investigates several factors affecting the transferability of such representations. It includes parameters for training of the source ConvNet such as its architecture, distribution of the training data, etc. and also the parameters of feature extraction such as layer of the trained ConvNet, dimensionality reduction, etc. Then, by optimizing these factors, we show that significant improvements can be achieved on various (17) visual recognition tasks. We further show that these visual recognition tasks can be categorically ordered based on their distance from the source task such that a correlation between the performance of tasks and their distance from the source task w.r.t. the proposed factors is observed.
△ Less
Submitted 15 July, 2015; v1 submitted 22 June, 2014;
originally announced June 2014.
-
The MEG detector for $μ+\to e+γ$ decay search
Authors:
J. Adam,
X. Bai,
A. M. Baldini,
E. Baracchini,
C. Bemporad,
G. Boca,
P. W. Cattaneo,
G. Cavoto,
F. Cei,
C. Cerri,
M. Corbo,
N. Curalli,
A. De Bari,
M. De Gerone,
L. Del Frate,
S. Doke,
S. Dussoni,
J. Egger,
K. Fratini,
Y. Fujii,
L. Galli,
S. Galeotti,
G. Gallucci,
F. Gatti,
B. Golden
, et al. (51 additional authors not shown)
Abstract:
The MEG (Mu to Electron Gamma) experiment has been running at the Paul Scherrer Institut (PSI), Switzerland since 2008 to search for the decay \meg\ by using one of the most intense continuous $μ^+$ beams in the world. This paper presents the MEG components: the positron spectrometer, including a thin target, a superconducting magnet, a set of drift chambers for measuring the muon decay vertex and…
▽ More
The MEG (Mu to Electron Gamma) experiment has been running at the Paul Scherrer Institut (PSI), Switzerland since 2008 to search for the decay \meg\ by using one of the most intense continuous $μ^+$ beams in the world. This paper presents the MEG components: the positron spectrometer, including a thin target, a superconducting magnet, a set of drift chambers for measuring the muon decay vertex and the positron momentum, a timing counter for measuring the positron time, and a liquid xenon detector for measuring the photon energy, position and time. The trigger system, the read-out electronics and the data acquisition system are also presented in detail. The paper is completed with a description of the equipment and techniques developed for the calibration in time and energy and the simulation of the whole apparatus.
△ Less
Submitted 10 April, 2013; v1 submitted 10 March, 2013;
originally announced March 2013.
-
New constraint on the existence of the mu+-> e+ gamma decay
Authors:
MEG Collaboration,
J. Adam,
X. Bai,
A. M. Baldini,
E. Baracchini,
C. Bemporad,
G. Boca,
P. W. Cattaneo,
G. Cavoto,
F. Cei,
C. Cerri,
A. de Bari,
M. De Gerone,
T. Doke,
S. Dussoni,
J. Egger,
K. Fratini,
Y. Fujii,
L. Galli,
G. Gallucci,
F. Gatti,
B. Golden,
M. Grassi,
A. Graziosi,
D. N. Grigoriev
, et al. (49 additional authors not shown)
Abstract:
The analysis of a combined data set, totaling 3.6 \times 10^14 stopped muons on target, in the search for the lepton flavour violating decay mu^+ -> e^+ gamma is presented. The data collected by the MEG experiment at the Paul Scherrer Institut show no excess of events compared to background expectations and yield a new upper limit on the branching ratio of this decay of 5.7 \times 10^-13 (90% conf…
▽ More
The analysis of a combined data set, totaling 3.6 \times 10^14 stopped muons on target, in the search for the lepton flavour violating decay mu^+ -> e^+ gamma is presented. The data collected by the MEG experiment at the Paul Scherrer Institut show no excess of events compared to background expectations and yield a new upper limit on the branching ratio of this decay of 5.7 \times 10^-13 (90% confidence level). This represents a four times more stringent limit than the previous world best limit set by MEG.
△ Less
Submitted 23 April, 2013; v1 submitted 4 March, 2013;
originally announced March 2013.
-
New limit on the lepton-flavour violating decay mu -> e gamma
Authors:
MEG collaboration,
J. Adam,
X. Bai,
A. M. Baldini,
E. Baracchini,
C. Bemporad,
G. Boca,
P. W. Cattaneo,
G. Cavoto,
F. Cei,
C. Cerri,
A. de Bari,
M. De Gerone,
T. Doke,
S. Dussoni,
J. Egger,
K. Fratini,
Y. Fujii,
L. Galli,
G. Gallucci,
F. Gatti,
B. Golden,
M. Grassi,
D. N. Grigoriev,
T. Haruyama
, et al. (42 additional authors not shown)
Abstract:
We present a new result based on an analysis of the data collected by the MEG detector at the Paul Scherrer Institut in 2009 and 2010, in search of the lepton flavour violating decay mu->e gamma. The likelihood analysis of the combined data sample, which corresponds to a total of 1.8 x 10**14 muon decays, gives a 90% C.L. upper limit of 2.4 x 10**-12 on the branching ratio of the mu->e gamma decay…
▽ More
We present a new result based on an analysis of the data collected by the MEG detector at the Paul Scherrer Institut in 2009 and 2010, in search of the lepton flavour violating decay mu->e gamma. The likelihood analysis of the combined data sample, which corresponds to a total of 1.8 x 10**14 muon decays, gives a 90% C.L. upper limit of 2.4 x 10**-12 on the branching ratio of the mu->e gamma decay, constituting the most stringent limit on the existence of this decay to date.
△ Less
Submitted 2 September, 2011; v1 submitted 27 July, 2011;
originally announced July 2011.