-
Reducing Catastrophic Forgetting in Online Class Incremental Learning Using Self-Distillation
Authors:
Kotaro Nagata,
Hiromu Ono,
Kazuhiro Hotta
Abstract:
In continual learning, there is a serious problem of catastrophic forgetting, in which previous knowledge is forgotten when a model learns new tasks. Various methods have been proposed to solve this problem. Replay methods, which replay data from previous tasks in later training, have shown good accuracy. However, replay methods suffer from a generalizability problem due to the limited memory buffer. In this paper, we tried to solve this problem by acquiring transferable knowledge through self-distillation using highly generalizable output in a shallow layer as a teacher. Furthermore, when we deal with a large number of classes or challenging data, there is a risk of learning not converging and not experiencing overfitting. Therefore, we attempted to achieve more efficient and thorough learning by prioritizing the storage of easily misclassified samples through a new method of memory update. We confirmed through experiments on the CIFAR10, CIFAR100, and MiniimageNet datasets that our proposed method outperformed conventional methods.
Submitted 17 September, 2024;
originally announced September 2024.
-
Mesoscopic Bayesian Inference by Solvable Models
Authors:
Shun Katakami,
Shuhei Kashiwamura,
Kenji Nagata,
Masaichiro Mizumaki,
Masato Okada
Abstract:
The rapid advancement of data science and artificial intelligence has affected physics in numerous ways, including the application of Bayesian inference, setting the stage for a revolution in research methodology. Our group has proposed Bayesian measurement, a framework that applies Bayesian inference to measurement science with broad applicability across various natural sciences. This framework enables the determination of posterior probability distributions of system parameters, model selection, and the integration of multiple measurement datasets. However, applying Bayesian measurement to real data analysis requires a more sophisticated approach than traditional statistical methods like Akaike information criterion (AIC) and Bayesian information criterion (BIC), which are designed for an infinite number of measurements $N$. Therefore, in this paper, we propose an analytical theory that explicitly addresses the case where $N$ is finite in the linear regression model. We introduce $O(1)$ mesoscopic variables for $N$ observation noises. Using this mesoscopic theory, we analyze the three core principles of Bayesian measurement: parameter estimation, model selection, and measurement integration. Furthermore, by introducing these mesoscopic variables, we demonstrate that the difference in free energies, critical for both model selection and measurement integration, can be analytically reduced by two mesoscopic variables of $N$ observation noises. This provides a deeper qualitative understanding of model selection and measurement integration and further provides deeper insights into actual measurements for nonlinear models. Our framework presents a novel approach to understanding Bayesian measurement results.
Submitted 27 August, 2024; v1 submitted 4 June, 2024;
originally announced June 2024.
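For readers unfamiliar with the two criteria named in the abstract above, the following is a minimal sketch of their textbook definitions, AIC = 2k - 2 ln L and BIC = k ln N - 2 ln L, where k is the number of parameters, N the number of observations, and L the maximized likelihood. The model names and log-likelihood values are hypothetical, chosen only for illustration; the sketch does not implement the mesoscopic theory proposed in the paper.

```python
import math

def aic(log_likelihood, n_params):
    # Akaike information criterion: 2k - 2 ln L
    return 2 * n_params - 2 * log_likelihood

def bic(log_likelihood, n_params, n_obs):
    # Bayesian information criterion: k ln N - 2 ln L
    return n_params * math.log(n_obs) - 2 * log_likelihood

# Hypothetical maximized log-likelihoods and parameter counts for two
# candidate regression models fit to N = 20 observations.
n_obs = 20
candidates = {"linear": (-31.0, 2), "cubic": (-29.5, 4)}

# Map each model name to its (AIC, BIC) pair; lower is better for both.
scores = {
    name: (aic(ll, k), bic(ll, k, n_obs))
    for name, (ll, k) in candidates.items()
}
```

Both criteria are derived in the large-N limit, which is the limitation the abstract points to when N is finite.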
-
Algebraic Geometrical Analysis of Metropolis Algorithm When Parameters Are Non-identifiable
Authors:
Kenji Nagata,
Yoh-ichi Mototake
Abstract:
The Metropolis algorithm is one of the Markov chain Monte Carlo (MCMC) methods that realize sampling from the target probability distribution. In this paper, we are concerned with the sampling from the distribution in non-identifiable cases that involve models with Fisher information matrices that may fail to be invertible. The theoretical adjustment of the step size, which is the variance of the candidate distribution, is difficult for non-identifiable cases. In this study, to establish such a principle, the average acceptance rate, which is used as a guideline to optimize the step size in the MCMC method, was analytically derived in non-identifiable cases. The optimization principle for the step size was developed from the viewpoint of the average acceptance rate. In addition, we performed numerical experiments on some specific target distributions to verify the effectiveness of our theoretical results.
Submitted 1 June, 2024;
originally announced June 2024.
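The relationship between step size and average acceptance rate described in the abstract above can be illustrated with a minimal random-walk Metropolis sketch. It assumes a one-dimensional standard normal target, which is a regular (identifiable) case and not one of the non-identifiable models analyzed in the paper; the function names are hypothetical, and the step is given as a standard deviation rather than a variance.

```python
import math
import random

def log_target(x):
    # Log-density of a standard normal up to an additive constant
    # (an illustrative target, not taken from the paper).
    return -0.5 * x * x

def metropolis_acceptance_rate(step_std, n_steps=20000, seed=0):
    """Run random-walk Metropolis and return the fraction of accepted proposals."""
    rng = random.Random(seed)
    x = 0.0
    accepted = 0
    for _ in range(n_steps):
        # Gaussian candidate distribution centred on the current state.
        proposal = x + rng.gauss(0.0, step_std)
        log_ratio = log_target(proposal) - log_target(x)
        # Accept with probability min(1, target(proposal) / target(x)).
        if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
            x = proposal
            accepted += 1
    return accepted / n_steps

# Acceptance rate for a small, moderate, and large step.
rates = {s: metropolis_acceptance_rate(s) for s in (0.1, 1.0, 10.0)}
```

On this target, small steps are accepted almost always but explore slowly, while large steps are mostly rejected, which is why the acceptance rate serves as a tuning guideline.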
-
Bayesian Inference for Small-Angle Scattering Data in Core-Shell Samples
Authors:
Keigo Oyama,
Yui Hayashi,
Shigeo Kuwamoto,
Shun Katakami,
Kenji Nagata,
Masaichiro Mizumaki,
Masato Okada
Abstract:
Small-angle scattering (SAS) techniques, which utilize neutrons and X-rays, are employed in various scientific fields, including materials science, biochemistry, and polymer physics. During the analysis of SAS data, model parameters that contain information about the sample are estimated by fitting the observational data to a model of the sample. Previous research has demonstrated the effectiveness of Bayesian inference in analyzing SAS data using a sphere model. However, compared with the sphere model, the core-shell model, which represents functional nanoparticles, offers higher application potential and greater analytical value. Therefore, in this study, we propose an analytical method for the more complex and practical core-shell model based on Bayesian inference. Through numerical experiments, we evaluated the performance of this method under different conditions, including measurement times, number of data points, and differences in scattering length density. As a result, we clarify the conditions under which accurate estimations are possible.
Submitted 12 May, 2024;
originally announced May 2024.
-
Basis Function Dependence of Estimation Precision for Synchrotron-Radiation-Based Mössbauer Spectroscopy
Authors:
Binsheu Shieh,
Ryo Masuda,
Satoshi Tsutsui,
Shun Katakami,
Kenji Nagata,
Masaichiro Mizumaki,
Masato Okada
Abstract:
Mössbauer spectroscopy is a technique employed to investigate the microscopic properties of materials using transitions between energy levels in the nuclei. Conventionally, in synchrotron-radiation-based Mössbauer spectroscopy, the measurement window is decided by the researcher heuristically, although this decision has a significant impact on the shape of the measurement spectra. In this paper, we propose a method for evaluating the precision of the spectral position by introducing Bayesian estimation. The proposed method makes it possible to select the best measurement window by calculating the precision of Mössbauer spectroscopy from the data. Based on the results, the precision of the Mössbauer center shifts improved by more than three times compared with the results achieved with the conventional simple fitting method using the Lorentzian function.
Submitted 22 April, 2024;
originally announced April 2024.
-
Rapid and Robust construction of an ML-ready peak feature table from X-ray diffraction data using Bayesian peak-top fitting
Authors:
Ryo Murakami,
Taisuke T. Sasaki,
Hideki Yoshikawa,
Yoshitaka Matsushita,
Keitaro Sodeyama,
Tadakatsu Ohkubo,
Hiroshi Shinotsuka,
Kenji Nagata
Abstract:
To advance the development of materials through data-driven scientific methods, appropriate methods for building machine learning (ML)-ready feature tables from measured and computed data must be established. In materials development, X-ray diffraction (XRD) is an effective technique for analysing crystal structures and other microstructural features that have information that can explain material properties. Therefore, the fully automated extraction of peak features from XRD data without the bias of an analyst is a significant challenge. This study aimed to establish an efficient and robust approach for constructing peak feature tables that follow ML standards (ML-ready) from XRD data. We address peak feature extraction in the situation where only the peak function profile is known a priori, without knowledge of the measurement material or crystal structure factor. We utilized Bayesian estimation to extract peak features from XRD data and subsequently performed Bayesian regression analysis with feature selection to predict the material property. The proposed method focused only on the tops of peaks within localized regions of interest (ROIs) and extracted peak features quickly and accurately. This process facilitated the rapid extraction of major peak features from the XRD data and the construction of an ML-ready feature table. We then applied Bayesian linear regression to the maximum energy product $(BH)_{max}$, using the extracted peak features as the explanatory variables. The outcomes yielded reasonable and robust regression results. Thus, the findings of this study indicated that \textit{004} peak height and area were important features for predicting $(BH)_{max}$.
Submitted 6 February, 2024;
originally announced March 2024.
-
Quantitative Selection of Sample Structures in Small-Angle Scattering Using Bayesian Methods
Authors:
Yui Hayashi,
Shun Katakami,
Shigeo Kuwamoto,
Kenji Nagata,
Masaichiro Mizumaki,
Masato Okada
Abstract:
Small-angle scattering (SAS) is a key experimental technique for analyzing nano-scale structures in various materials. In SAS data analysis, selecting an appropriate mathematical model for the scattering intensity is critical, as it generates a hypothesis of the structure of the experimental sample. Traditional model selection methods either rely on qualitative approaches or are prone to overfitting. This paper introduces an analytical method that applies Bayesian model selection to SAS measurement data, enabling a quantitative evaluation of the validity of mathematical models. We assess the performance of our method through numerical experiments using artificial data for multicomponent spherical materials, demonstrating that our proposed analysis approach yields highly accurate and interpretable results. We also discuss the ability of our method to analyze a range of mixing ratios and particle size ratios for mixed components, along with its precision in model evaluation by the degree of fitting. Our proposed method effectively facilitates quantitative analysis of nano-scale sample structures in SAS, which has traditionally been challenging, and is expected to significantly contribute to advancements in a wide range of fields.
Submitted 18 January, 2024;
originally announced January 2024.
-
Nearly homogeneous and isotropic turbulence generated by the interaction of supersonic jets
Authors:
Takahiro Mori,
Tomoaki Watanabe,
Koji Nagata
Abstract:
This study reports the development and characterization of a multiple-supersonic-jet wind tunnel designed to investigate the decay of nearly homogeneous and isotropic turbulence in a compressible regime. The interaction of 36 supersonic jets generates turbulence that decays in the streamwise direction. The velocity field is measured with particle image velocimetry by seeding tracer particles with ethanol condensation. Various velocity statistics are evaluated to diagnose decaying turbulence generated by the supersonic jet interaction. The flow is initially inhomogeneous and anisotropic and possesses intermittent large-scale velocity fluctuations. The flow evolves into a statistically homogeneous and isotropic state as the mean velocity profile becomes uniform. In the nearly homogeneous and isotropic region, the ratio of root-mean-squared velocity fluctuations in the streamwise and vertical directions is about 1.08, the longitudinal integral scales are also similar in these directions, and the large-scale intermittency becomes insignificant. The turbulent kinetic energy per unit mass decays according to a power law with an exponent of about 2, larger than those reported for incompressible grid turbulence. The energy spectra in the inertial subrange agree well with other turbulent flows when normalized by the dissipation rate and kinematic viscosity. The non-dimensional dissipation rate is within a range of 0.51--0.87, which is also consistent with incompressible grid turbulence. These results demonstrate that the multiple-supersonic-jet wind tunnel is helpful in the investigation of decaying homogeneous isotropic turbulence whose generation process is strongly influenced by fluid compressibility.
Submitted 14 January, 2024;
originally announced January 2024.
-
Bayesian inference to identify crystalline structures for XRD
Authors:
Ryo Murakami,
Yoshitaka Matsushita,
Kenji Nagata,
Hayaru Shouno,
Hideki Yoshikawa
Abstract:
Crystalline phase structure is essential for understanding the performance and properties of a material. Therefore, this study identified and quantified the crystalline phase structure of a sample based on the diffraction pattern observed when the crystalline sample was irradiated with electromagnetic waves such as X-rays. Conventional analysis necessitates experienced and knowledgeable researchers to shorten the list from many candidate crystalline phase structures. However, conventional diffraction pattern analysis is highly analyst-dependent and not objective. Additionally, there is no established method for discussing the confidence intervals of the analysis results. Thus, this study aimed to establish a method for automatically inferring crystalline phase structures from diffraction patterns using Bayesian inference. Our method successfully identified true crystalline phase structures with a high probability from 50 candidate crystalline phase structures. Further, the mixing ratios of selected crystalline phase structures were estimated with a high degree of accuracy. This study provided reasonable results for well-crystallized samples that clearly identified the crystalline phase structures.
Submitted 26 September, 2023;
originally announced September 2023.
-
Sequential Experimental Design for Spectral Measurement: Active Learning Using a Parametric Model
Authors:
Tomohiro Nabika,
Kenji Nagata,
Shun Katakami,
Masaichiro Mizumaki,
Masato Okada
Abstract:
In this study, we demonstrate a sequential experimental design for spectral measurements by active learning using parametric models as predictors. In spectral measurements, it is necessary to reduce the measurement time because of sample fragility and high energy costs. To improve the efficiency of experiments, sequential experimental designs are proposed, in which the subsequent measurement is designed by active learning using the data obtained before the measurement. Conventionally, parametric models are employed in data analysis; when employed for active learning, they are expected to afford a sequential experimental design that improves the accuracy of data analysis. However, due to the complexity of the formulas, a sequential experimental design using general parametric models has not been realized. Therefore, we applied Bayesian inference-based data analysis using the exchange Monte Carlo method to realize a sequential experimental design with general parametric models. In this study, we evaluated the effectiveness of the proposed method by applying it to Bayesian spectral deconvolution and Bayesian Hamiltonian selection in X-ray photoelectron spectroscopy. Using numerical experiments with artificial data, we demonstrated that the proposed method improves the accuracy of model selection and parameter estimation while reducing the measurement time compared with the results achieved without active learning or with active learning using the Gaussian process regression.
Submitted 11 May, 2023;
originally announced May 2023.
-
Bayesian Inference for Small-Angle Scattering Data
Authors:
Yui Hayashi,
Shun Katakami,
Shigeo Kuwamoto,
Kenji Nagata,
Masaichiro Mizumaki,
Masato Okada
Abstract:
In this paper, we propose a method for estimating model parameters using Small-Angle Scattering (SAS) data based on Bayesian inference. Conventional SAS data analyses involve processes of manual parameter adjustment by analysts or optimization using gradient methods. These analysis processes tend to involve heuristic approaches and may lead to local solutions. Furthermore, it is difficult to evaluate the reliability of the results obtained by conventional analysis methods. Our method solves these problems by estimating model parameters as probability distributions from SAS data using the framework of Bayesian inference. We evaluate the performance of our method through numerical experiments using artificial data of representative measurement target models. From the results of the numerical experiments, we show that our method provides not only high accuracy and reliability of estimation, but also perspectives on the transition point of estimability with respect to the measurement time and the lower bound of the angular domain of the measured data.
Submitted 28 July, 2023; v1 submitted 8 March, 2023;
originally announced March 2023.
-
Bayesian Inference of Absorption Spectra Based on Binomial Distribution
Authors:
Tomohiro Nabika,
Kenji Nagata,
Shun Katakami,
Masaichiro Mizumaki,
Masato Okada
Abstract:
In this paper, we propose a Bayesian spectral deconvolution method for absorption spectra. In conventional analysis, the noise mechanism of absorption spectral data is never considered appropriately. In that analysis, the least-squares method, which assumes Gaussian noise from the perspective of Bayesian statistics, is frequently used. Since Bayesian inference is possible by introducing an appropriate noise model for the data, we consider the absorption process of a single photon to be a Bernoulli trial and develop a Bayesian spectral deconvolution method based on binomial distribution. We have evaluated our method on artificial data under several conditions by numerical experiments. The results show that our method not only allows us to estimate parameters with high accuracy from absorption spectral data, but also to infer them even from absorption spectral data with large absorption rates where the spectral structure is flattened, which was previously impossible to analyze.
Submitted 20 April, 2023; v1 submitted 14 December, 2022;
originally announced December 2022.
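The noise model described in the abstract above, in which each photon's absorption is a Bernoulli trial, leads to a binomial likelihood for the number of absorbed photons. The following is a minimal sketch under stated assumptions: a single energy channel, hypothetical photon counts, and a grid search over the absorption probability in place of the Bayesian deconvolution the paper develops.

```python
import math

def binomial_log_likelihood(k, n, p):
    """Log-probability of k absorbed photons out of n incident photons,
    each absorbed independently with probability p."""
    # Log binomial coefficient computed with log-gamma for numerical stability.
    log_coeff = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    return log_coeff + k * math.log(p) + (n - k) * math.log(1.0 - p)

# Hypothetical counts for one energy channel (not data from the paper).
n_incident = 1000
n_absorbed = 300

# Evaluate the likelihood on a grid of absorption probabilities 0.01 .. 0.99
# and keep the value that maximizes it.
grid = [i / 100 for i in range(1, 100)]
best_p = max(grid, key=lambda p: binomial_log_likelihood(n_absorbed, n_incident, p))
```

The maximizer coincides with the observed fraction of absorbed photons; in a full spectral model, p would instead be a function of energy and of the peak parameters being inferred.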
-
Bayesian Inference on Hamiltonian Selections for Mössbauer Spectroscopy
Authors:
Ryota Moriguchi,
Satoshi Tsutsui,
Shun Katakami,
Kenji Nagata,
Masaichiro Mizumaki,
Masato Okada
Abstract:
Mössbauer spectroscopy, which provides knowledge related to electronic states in materials, has been applied to various fields such as condensed matter physics and material sciences. In conventional spectral analyses based on least-square fitting, hyperfine interactions in materials have been determined from the shape of observed spectra. In conventional spectral analyses, it is difficult to discuss the validity of the hyperfine interactions and the estimated values. We propose a spectral analysis method based on Bayesian inference for the selection of hyperfine interactions and the estimation of Mössbauer parameters. An appropriate Hamiltonian has been selected by comparing Bayesian free energy among possible Hamiltonians. We have estimated the Mössbauer parameters and evaluated their estimated values by calculating the posterior distribution of each Mössbauer parameter with confidence intervals. We have also discussed the accuracy of the spectral analyses to elucidate the noise intensity dependence of numerical experiments.
Submitted 2 May, 2022;
originally announced May 2022.
-
Bayesian Spectral Deconvolution of X-Ray Absorption Near Edge Structure Discriminating High- and Low-Energy Domains
Authors:
Shuhei Kashiwamura,
Shun Katakami,
Ryo Yamagami,
Kazunori Iwamitsu,
Hiroyuki Kumazoe,
Kenji Nagata,
Toshihiro Okajima,
Ichiro Akai,
Masato Okada
Abstract:
In this paper, we propose a Bayesian spectral deconvolution considering the properties of peaks in different energy domains. Bayesian spectral deconvolution regresses spectral data into the sum of multiple basis functions. Conventional methods use a model that treats all peaks equally. However, in X-ray absorption near edge structure (XANES) spectra, the properties of the peaks differ depending on the energy domain, and the specific energy domain of XANES is essential in condensed matter physics. We propose a model that discriminates between the low- and high-energy domains. We also propose a prior distribution that reflects the physical properties. We compare the conventional and proposed models in terms of computational efficiency, estimation accuracy, and model evidence. We demonstrate that our method effectively estimates the number of transition components in the important energy domain, on which the material scientists focus for mapping the electronic transition analysis by first-principles simulation.
Submitted 11 July, 2022; v1 submitted 18 March, 2022;
originally announced March 2022.
-
Finite-density lattice QCD and sign problem: current status and open problems
Authors:
Keitaro Nagata
Abstract:
This is an English translation of a review of finite-density lattice QCD. The original version in Japanese appeared in Soryushiron Kenkyu Vol 31 (2020) No. 1.
Submitted 26 August, 2021;
originally announced August 2021.
-
Fast Bayesian Deconvolution using Simple Reversible Jump Moves
Authors:
Koki Okajima,
Kenji Nagata,
Masato Okada
Abstract:
We propose a Markov chain Monte Carlo-based deconvolution method designed to estimate the number of peaks in spectral data, along with the optimal parameters of each radial basis function. Assuming cases where the number of peaks is unknown, and a sweep simulation on all candidate models is computationally unrealistic, the proposed method efficiently searches over the probable candidates via trans-dimensional moves assisted by annealing effects from replica exchange Monte Carlo moves. Through simulation using synthetic data, the proposed method demonstrates its advantages over conventional sweep simulations, particularly in model selection problems. Application to a set of olivine reflectance spectral data with varying forsterite and fayalite mixture ratios reproduced results obtained from previous mineralogical research, indicating that our method is applicable to deconvolution on real data sets.
Submitted 26 November, 2020;
originally announced November 2020.
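The replica exchange Monte Carlo moves mentioned in the abstract above rely on a standard swap rule between two replicas at different inverse temperatures. The following is a minimal sketch of that rule only, assuming each replica samples a density proportional to exp(-beta * energy); the function name and numerical values are hypothetical, and the trans-dimensional (reversible jump) moves of the paper are not shown.

```python
import math

def swap_acceptance_probability(beta_i, beta_j, energy_i, energy_j):
    """Metropolis probability of exchanging the states of two replicas.

    Replica i runs at inverse temperature beta_i and currently holds a state
    with energy energy_i; likewise for replica j.
    """
    # Log of the ratio of joint densities after and before the swap.
    log_ratio = (beta_i - beta_j) * (energy_i - energy_j)
    return 1.0 if log_ratio >= 0 else math.exp(log_ratio)

# The colder replica (beta = 2) holds the higher-energy state: always swap.
p_favourable = swap_acceptance_probability(2.0, 1.0, 3.0, 1.0)

# The colder replica already holds the lower-energy state: swap only sometimes.
p_unfavourable = swap_acceptance_probability(2.0, 1.0, 1.0, 3.0)
```

Swaps of this kind let low-energy states found at high temperature migrate to the low-temperature replicas, which is the annealing effect the abstract refers to.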
-
Large negative magnetoresistance in BaMn$_2$Bi$_2$ antiferromagnet
Authors:
Takuma Ogasawara,
Kim-Khuong Huynh,
Time Tahara,
Takanori Kida,
Masayuki Hagiwara,
Denis Arčon,
Motoi Kimata,
Stephane Yu Matsushita,
Kazumasa Nagata,
Katsumi Tanigaki
Abstract:
A very large negative magnetoresistance (LNMR) is observed in the insulating regime of the antiferromagnet BaMn$_2$Bi$_2$ when a magnetic field is applied perpendicular to the direction of the sublattice magnetization. High perpendicular magnetic field eventually suppresses the insulating behavior and allows BaMn$_2$Bi$_2$ to re-enter a metallic state. This effect is seemingly unrelated to any field induced magnetic phase transition, as measurements of magnetic susceptibility and specific heat did not find any anomaly as a function of magnetic fields at temperatures above $2\,\mathrm{K}$. The LNMR appears in both current-in-plane and current-out-of-plane settings, and Hall effects suggest that its origin lies in an extreme sensitivity of conduction processes of holelike carriers to the infinitesimal field-induced canting of the sublattice magnetization. The LNMR-induced metallic state may thus be associated with the breaking of the antiferromagnetic parity-time symmetry by perpendicular magnetic fields and/or the intricate multi-orbital electronic structure of BaMn$_2$Bi$_2$.
Submitted 17 February, 2021; v1 submitted 2 October, 2020;
originally announced October 2020.
-
Development of a Shape-memorable Adaptive Pin Array Fixture
Authors:
Peihao Shi,
Zhengtao Hu,
Kazuyuki Nagata,
Weiwei Wan,
Yukiyasu Domae,
Kensuke Harada
Abstract:
This paper proposes an adaptive pin-array fixture. The key idea of this research is to use the shape-memorable mechanism of a pin array to fix multiple differently shaped parts with a common pin configuration. The clamping area consists of a matrix of passively slidable pins that conform themselves to the contour of the target object. Vertical motion of the pins enables the fixture to encase the profile of the object. The shape-memorable mechanism is realized by the combination of the rubber bush and the fixing mechanism of a pin. Several physical peg-in-hole tasks are conducted to verify the feasibility of the fixture.
Submitted 20 May, 2020;
originally announced May 2020.
-
Functionally Divided Manipulation Synergy for Controlling Multi-fingered Hands
Authors:
Kazuki Higashi,
Keisuke Koyama,
Ryuta Ozawa,
Kazuyuki Nagata,
Weiwei Wan,
Kensuke Harada
Abstract:
Synergy supplies a practical approach for expressing various postures of a multi-fingered hand. However, a conventional synergy defined for reproducing grasping postures cannot perform general-purpose tasks expected for a multi-fingered hand. Locking the position of particular fingers is essential for a multi-fingered hand to manipulate an object. When using conventional synergy based control to manipulate an object, which requires locking some fingers, the coordination of joints is heavily restricted, decreasing the dexterity of the hand. We propose a functionally divided manipulation synergy (FDMS) method, which provides a synergy-based control to achieves both dimensionality reduction and in-hand manipulation. In FDMS, first, we define the function of each finger of the hand as either "manipulation" or "fixed." Then, we apply synergy control only to the fingers having the manipulation function, so that dexterous manipulations can be realized with few control inputs. The effectiveness of our proposed approach is experimentally verified.
△ Less
Submitted 25 March, 2020;
originally announced March 2020.
-
Nonparametric Regression Quantum Neural Networks
Authors:
Do Ngoc Diep,
Koji Nagata,
Tadao Nakamura
Abstract:
In two previous papers \cite{dndiep3}, \cite{dndiep4}, the first author constructed the least squares quantum neural networks (LS-QNN), the polynomial interpolation quantum neural networks (PI-QNN), and parametrico-statistical QNNs such as linear regression quantum neural networks (LR-QNN), polynomial regression quantum neural networks (PR-QNN), and chi-squared quantum neural networks ($χ^2$-QNN). We observed that the method also works when nonparametric statistics are used. In this paper we analyze and implement nonparametric tests on QNNs, such as linear nonparametric regression quantum neural networks (LNR-QNN) and polynomial nonparametric regression quantum neural networks (PNR-QNN). The implementation is constructed through the Gauss-Jordan elimination quantum neural networks (GJE-QNN). The training rule is to use the high-probability confidence regions or intervals.
Submitted 7 February, 2020;
originally announced February 2020.
-
Incompleteness in the Bell Theorem Using Non-contextual Local Realistic Model
Authors:
Koji Nagata,
Tadao Nakamura,
Han Geurdes
Abstract:
Here, we consider the Bell experiment for a system described by multipartite states in the case where n dichotomic observables are measured per site. If n is two, we consider a two-setting Bell experiment. If n is three, we consider a three-setting Bell experiment. The two-setting model is an explicit local realistic model for the values of a correlation function given in a two-setting Bell experiment. The three-setting model is an explicit local realistic model for the values of a correlation function given in a three-setting Bell experiment. In the non-contextual scenario, there is no difference between the three-setting model and the two-setting model, and we cannot classify local realistic theories in this case. This says that we can construct the three-setting model from the two-setting model. Surprisingly, we can discuss incompleteness in the Bell theorem using a non-contextual model. On the other hand, in the contextual scenario, there is a difference between the three-setting model and the two-setting model. This says that we must distinguish the three-setting model from the two-setting model, and we can classify local realistic theories in this case.
Submitted 10 January, 2020;
originally announced January 2020.
-
Intrinsic regularization effect in Bayesian nonlinear regression scaled by observed data
Authors:
Satoru Tokuda,
Kenji Nagata,
Masato Okada
Abstract:
Occam's razor is a guiding principle that models should be simple enough to describe observed data. While Bayesian model selection (BMS) embodies it by the intrinsic regularization effect (IRE), how observed data scale the IRE has not been fully understood. In the nonlinear regression with conditionally independent observations, we show that the IRE is scaled by observations' fineness, defined by the amount and quality of observed data. We introduce an observable that quantifies the IRE, referred to as the Bayes specific heat, inspired by the correspondence between statistical inference and statistical physics. We derive its scaling relation to observations' fineness. We demonstrate that the optimal model chosen by the BMS changes at critical values of observations' fineness, accompanying the IRE's variation. The changes are from choosing a coarse-grained model to a fine-grained one as observations' fineness increases. Our findings expand an understanding of BMS's typicality when observed data are insufficient.
Submitted 6 December, 2022; v1 submitted 5 January, 2020;
originally announced January 2020.
-
Triple decomposition of velocity gradient tensor in homogeneous isotropic turbulence
Authors:
Ryosuke Nagata,
Tomoaki Watanabe,
Koji Nagata,
Carlos B. da Silva
Abstract:
The triple decomposition of a velocity gradient tensor is studied with direct numerical simulations of homogeneous isotropic turbulence, where the velocity gradient tensor is decomposed into three components representing an irrotational straining motion, a rigid-body rotation, and a shearing motion. The strength of these motions can be quantified with the decomposed components. A procedure of the triple decomposition is proposed for three-dimensional flows, where the decomposition is applied in a basic reference frame identified by examining a finite number of reference frames obtained by three sequential rotational transformations of a Cartesian coordinate. Even though more than one basic reference frame may be available for the triple decomposition, the results of the decomposition depend little on the choice of basic reference frame. In homogeneous isotropic turbulence, regions with strong rigid-body rotations or straining motions are highly intermittent in space, while most flow regions exhibit moderately strong shearing motions in the absence of straining motions and rigid-body rotations. The shear tensor is also used for detecting intense shear layers.
Submitted 20 November, 2019; v1 submitted 1 November, 2019;
originally announced November 2019.
-
Bayesian Spectral Deconvolution Based on Poisson Distribution: Bayesian Measurement and Virtual Measurement Analytics (VMA)
Authors:
Kenji Nagata,
Yoh-ichi Mototake,
Rei Muraoka,
Takehiko Sasaki,
Masato Okada
Abstract:
In this paper, we propose a new method of Bayesian measurement for spectral deconvolution, which regresses spectral data into the sum of unimodal basis functions such as Gaussian or Lorentzian functions. Bayesian measurement is a framework for considering not only the target physical model but also the measurement model as a probabilistic model, and enables us to estimate the parameters of a physical model with their confidence intervals through a Bayesian posterior distribution given a measurement data set. Measurement with Poisson noise is one of the most effective systems to which our proposed method can be applied. Since the measurement time is strongly related to the signal-to-noise ratio for the Poisson noise model, Bayesian measurement with the Poisson noise model enables us to clarify the relationship between the measurement time and the limit of estimation. In this study, we establish the probabilistic model with Poisson noise for spectral deconvolution. Bayesian measurement enables us to perform virtual computer simulations of a given measurement through the established probabilistic model. This property is called "Virtual Measurement Analytics (VMA)" in this paper. We also show that the relationship between the measurement time and the limit of estimation can be extracted by using the proposed method in a simulation of synthetic data and real data for XPS measurement of MoS$_2$.
Submitted 11 December, 2018;
originally announced December 2018.
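The link between measurement time and signal-to-noise ratio mentioned in the abstract above follows from a basic property of Poisson counts: the mean and the variance are equal. The following is a minimal sketch, assuming a constant count rate; the function name and the numbers are illustrative and are not taken from the paper.

```python
import math

def poisson_snr(rate: float, measurement_time: float) -> float:
    """Signal-to-noise ratio of a Poisson count with a constant rate.

    The expected count is rate * time and its standard deviation is
    sqrt(rate * time), so SNR = mean / std = sqrt(rate * time).
    """
    expected_count = rate * measurement_time
    return expected_count / math.sqrt(expected_count)

# Illustrative values (assumptions): 100 counts per unit time.
snr_short = poisson_snr(rate=100.0, measurement_time=1.0)  # 10.0
snr_long = poisson_snr(rate=100.0, measurement_time=4.0)   # 20.0
```

Quadrupling the measurement time only doubles the signal-to-noise ratio, which is why a quantitative relationship between measurement time and the limit of estimation is worth extracting.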
-
Tool Exchangeable Grasp/Assembly Planner
Authors:
Kensuke Harada,
Kento Nakayama,
Weiwei Wan,
Kazuyuki Nagata,
Natsuki Yamanobe,
Ixchel G. Ramirez-Alpizar
Abstract:
This paper proposes a novel assembly planner for a manipulator which can simultaneously plan assembly sequence, robot motion, grasping configuration, and exchange of grippers. Our assembly planner assumes multiple grippers and can automatically select a feasible one to assemble a part. For a given AND/OR graph of an assembly task, we consider generating the assembly graph from which assembly motion of a robot can be planned. The edges of the assembly graph are composed of three kinds of paths, i.e., transfer/assembly paths, transit paths and tool exchange paths. In this paper, we first explain the proposed method for planning assembly motion sequence including the function of gripper exchange. Finally, the effectiveness of the proposed method is confirmed through some numerical examples and a physical experiment.
Submitted 23 May, 2018;
originally announced May 2018.
-
Experiments on Learning Based Industrial Bin-picking with Iterative Visual Recognition
Authors:
Kensuke Harada,
Weiwei Wan,
Tokuo Tsuji,
Kohei Kikuchi,
Kazuyuki Nagata,
Hiromu Onda
Abstract:
This paper shows experimental results on learning-based randomized bin-picking combined with iterative visual recognition. We use a random forest to predict whether or not a robot will successfully pick an object for given depth images of the pile, taking the collision between a finger and a neighboring object into account. For the discriminator to be accurate, we consider estimating objects' poses by merging multiple depth images of the pile captured from different points of view by using a depth sensor attached at the wrist. We show that, even if a robot is predicted to fail in picking an object with a single depth image due to its large occluded area, it is finally predicted to succeed after merging multiple depth images. In addition, we show that the random forest can be trained with a small amount of training data.
Submitted 22 May, 2018;
originally announced May 2018.
-
Complex Langevin calculations in finite density QCD at large $μ/T$ with the deformation technique
Authors:
Keitaro Nagata,
Jun Nishimura,
Shinji Shimasaki
Abstract:
It is well known that investigating QCD at finite density by standard Monte Carlo methods is extremely difficult due to the sign problem. Some years ago, the complex Langevin method with gauge cooling was shown to work at high temperature, i.e., in the deconfined phase. The same method was also applied to QCD in the so-called heavy dense limit in the whole temperature region. In this paper we attempt to apply this method to the large $μ/T$ regime with moderate quark mass using four-flavor staggered fermions on a $4^3\times 8$ lattice. While a straightforward application faces the singular-drift problem, which spoils the validity of the method, we overcome this problem by the deformation technique proposed earlier. Explicit results for the quark number density and the chiral condensate obtained in this way for $3.2\leq μ/T\leq 5.6$ are compared with the results for the phase-quenched model obtained by the standard rational hybrid Monte Carlo calculation. This reveals a clear difference, which is qualitatively consistent with the Silver Blaze phenomenon.
Submitted 14 January, 2019; v1 submitted 10 May, 2018;
originally announced May 2018.
-
Testing the criterion for correct convergence in the complex Langevin method
Authors:
Keitaro Nagata,
Jun Nishimura,
Shinji Shimasaki
Abstract:
Recently the complex Langevin method (CLM) has been attracting attention as a solution to the sign problem, which occurs in Monte Carlo calculations when the effective Boltzmann weight is not real positive. An undesirable feature of the method, however, is that in some parameter regions it can yield wrong results even if the Langevin process reaches equilibrium without any problem. In our previous work, we proposed a practical criterion for correct convergence based on the probability distribution of the drift term that appears in the complex Langevin equation. Here we demonstrate the usefulness of this criterion in two solvable theories with many dynamical degrees of freedom, i.e., two-dimensional Yang-Mills theory with a complex coupling constant and the chiral Random Matrix Theory for finite density QCD, which were studied by the CLM before. Our criterion can indeed tell the parameter regions in which the CLM gives correct results.
Submitted 9 May, 2018; v1 submitted 6 February, 2018;
originally announced February 2018.
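The drift-based criterion in the abstract above can be illustrated on a much simpler system than the two theories the authors study. The following is a minimal sketch, assuming a one-variable Gaussian action $S(z) = σ z^2/2$ with a complex coupling $σ$ (a toy model chosen here for illustration, not taken from the paper). For this action the drift is $-σ z$ and the exact result is $\langle z^2\rangle = 1/σ$. The step size, run length, and thresholds are arbitrary assumptions.

```python
import math
import random

def complex_langevin_gaussian(sigma: complex, dt: float = 0.01,
                              n_steps: int = 400_000, n_therm: int = 10_000,
                              seed: int = 0):
    """Euler-discretized complex Langevin process for S(z) = sigma * z**2 / 2.

    Returns the estimate of <z^2> and the sampled drift magnitudes.
    """
    rng = random.Random(seed)
    z = 0j
    noise_scale = math.sqrt(2.0 * dt)
    z2_sum = 0j
    drift_magnitudes = []
    for step in range(n_therm + n_steps):
        drift = -sigma * z                      # drift term: -dS/dz
        if step >= n_therm:                     # measure after thermalization
            z2_sum += z * z
            drift_magnitudes.append(abs(drift))
        # The noise is real; only the drift pushes z into the complex plane.
        z = z + drift * dt + noise_scale * rng.gauss(0.0, 1.0)
    return z2_sum / n_steps, drift_magnitudes

def tail_fraction(values, threshold):
    """Fraction of samples whose magnitude exceeds the threshold."""
    return sum(v > threshold for v in values) / len(values)

z2_mean, drifts = complex_langevin_gaussian(sigma=1.0 + 1.0j)
exact = 1.0 / (1.0 + 1.0j)                      # 0.5 - 0.5j
tails = [tail_fraction(drifts, u) for u in (1.0, 3.0, 10.0)]
```

In this toy case the drift is a linear function of a Gaussian-distributed variable, so the fractions in `tails` drop off very quickly and the estimate `z2_mean` agrees with `exact` within statistical error. A slowly decaying tail would be the warning sign the criterion looks for.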
-
Complex Langevin simulation of QCD at finite density and low temperature using the deformation technique
Authors:
Keitaro Nagata,
Jun Nishimura,
Shinji Shimasaki
Abstract:
We study QCD at finite density and low temperature by using the complex Langevin method. We employ the gauge cooling to control the unitarity norm and introduce a deformation parameter in the Dirac operator to avoid the singular-drift problem. The reliability of the obtained results is judged by the probability distribution of the magnitude of the drift term. By making extrapolations with respect to the deformation parameter using only the reliable results, we obtain results for the original system. We perform simulations on a $4^3\times 3$ lattice and show that our method works well even in the region where the reweighting method fails due to the severe sign problem. As a result we observe a delayed onset of the baryon number density as compared with the phase-quenched model, which is a clear sign of the Silver Blaze phenomenon.
Submitted 20 October, 2017;
originally announced October 2017.
-
A note on the possibility of incomplete theory
Authors:
Han Geurdes,
Koji Nagata,
Tadao Nakamura,
Ahmed Farouk
Abstract:
In the paper it is demonstrated that Bell's theorem is an unprovable theorem.
Submitted 2 May, 2019; v1 submitted 2 April, 2017;
originally announced April 2017.
-
On the condition for correct convergence in the complex Langevin method
Authors:
Shinji Shimasaki,
Keitaro Nagata,
Jun Nishimura
Abstract:
The complex Langevin method (CLM) provides a promising way to perform the path integral with a complex action using a stochastic equation for complexified dynamical variables. It is known, however, that the method gives wrong results in some cases, while it works, for instance, in finite density QCD in the deconfinement phase or in the heavy dense limit. Here we revisit the argument for justification of the CLM and point out a subtlety in using the time-evolved observables, which play a crucial role in the argument. This subtlety requires that the probability distribution of the drift term should fall off exponentially or faster at large magnitude. We demonstrate our claim in some examples such as chiral Random Matrix Theory and show that our criterion is indeed useful in judging whether the results obtained by the CLM are trustable or not.
Submitted 30 November, 2016;
originally announced November 2016.
-
Gauge cooling for the singular-drift problem in the complex Langevin method --- an application to finite density QCD
Authors:
Keitaro Nagata,
Hideo Matsufuru,
Jun Nishimura,
Shinji Shimasaki
Abstract:
We study full QCD at finite density and low temperature with light quark mass using the complex Langevin method. Since the singular-drift problem turns out to be mild on the $4^3 \times 8$ lattice we use, the gauge cooling is performed only to control the unitarity norm in this exploratory study. We report on our preliminary data obtained from the complex Langevin simulation up to a certain Langevin time. While the data are still noisy due to lack of statistics, the onset of the baryon number density seems to occur at larger $μ$ than half the pion mass, which is the value for the phase-quenched QCD. The validity of our simulation is tested by the recently proposed criterion based on the probability distribution of the drift term.
Submitted 24 November, 2016;
originally announced November 2016.
-
Assembly Sequence Planning for Motion Planning
Authors:
Weiwei Wan,
Kensuke Harada,
Kazuyuki Nagata
Abstract:
This paper develops a planner to find an optimal assembly sequence to assemble several objects. The input to the planner is the mesh models of the objects, the relative poses between the objects in the assembly, and the final pose of the assembly. The output is an optimal assembly sequence, namely (1) in which order should one assemble the objects, (2) from which directions should the objects be dropped, and (3) candidate grasps of each object. The proposed planner finds the optimal solution by automatically permuting, evaluating, and searching the possible assembly sequences considering stability, graspability, and assemblability qualities. It is expected to guide robots to do assembly using translational motion. The output provides initial and goal configurations to motion planning algorithms. It is ready to be used by robots and is demonstrated using several simulations and real-world executions.
Submitted 10 September, 2016;
originally announced September 2016.
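The permute-evaluate-search loop described in the abstract above can be sketched in a few lines. This is only an illustration under stated assumptions: the part names, the weights, and `toy_score` are hypothetical placeholders, and the paper's stability, graspability, and assemblability qualities are not reproduced here.

```python
from itertools import permutations

def best_assembly_sequence(parts, score):
    """Exhaustively permute the parts and return the highest-scoring order."""
    return max(permutations(parts), key=score)

# Hypothetical part weights, used only to build a stand-in quality measure.
weights = {"base": 3.0, "bracket": 2.0, "cap": 1.0}

def toy_score(sequence):
    """Count adjacent pairs in which the earlier part is at least as heavy.

    A placeholder for a real quality measure: it rewards placing
    heavier parts first.
    """
    return sum(weights[a] >= weights[b] for a, b in zip(sequence, sequence[1:]))

best = best_assembly_sequence(list(weights), toy_score)
# best == ("base", "bracket", "cap")
```

Exhaustive permutation grows factorially with the number of parts, so a sketch like this is only practical for small assemblies.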
-
Entanglement entropy for pure gauge theories in 1+1 dimensions using the lattice regularization
Authors:
Sinya Aoki,
Etsuko Itou,
Keitaro Nagata
Abstract:
We study the entanglement entropy (EE) for pure gauge theories in 1+1 dimensions with the lattice regularization. Using the definition of the EE for lattice gauge theories proposed in a previous paper [1] (S. Aoki, T. Iritani, M. Nozaki, T. Numasawa, N. Shiba and H. Tasaki, JHEP 1506 (2015) 187), we calculate the EE for arbitrary pure as well as mixed states in terms of eigenstates of the transfer matrix in 1+1 dimensional lattice gauge theory. We find that the EE of an arbitrary pure state does not depend on the lattice spacing, thus giving the EE in the continuum limit, and show that the EE for an arbitrary pure state is independent of the real (Minkowski) time evolution. We also explicitly demonstrate the dependence of EE on the gauge fixing at the boundaries between two subspaces, which was pointed out for general cases in the paper [1]. In addition, we calculate the EE at zero as well as finite temperature by the replica method, and show that our result in the continuum limit corresponds to the result obtained before in the continuum theory, with a specific value of the counter term, which is otherwise arbitrary in the continuum calculation. We confirm the gauge dependence of the EE also for the replica method.
Submitted 31 August, 2016;
originally announced August 2016.
-
A Mid-level Planning System for Object Reorientation
Authors:
Weiwei Wan,
Hisashi Igawa,
Kensuke Harada,
Zepei Wu,
Hiromu Onda,
Kazuyuki Nagata,
Natsuki Yamanobe
Abstract:
This paper presents a mid-level planning system for object reorientation. It includes a grasp planner, a placement planner, and a regrasp sequence solver. Given the initial and goal poses of an object, the mid-level planning system finds a sequence of hand configurations that reorient the object from the initial pose to the goal pose. This mid-level planning system is open to low-level motion planning algorithms by providing two end-effector poses as the input. It is also open to high-level symbolic planners by providing interface functions like placing an object at a given position with a given rotation. The planning system is demonstrated with several simulation examples and real-robot executions using a Kawada Hiro robot and Robotiq 85 grippers.
Submitted 10 August, 2016;
originally announced August 2016.
-
Iterative Visual Recognition for Learning Based Randomized Bin-Picking
Authors:
Kensuke Harada,
Weiwei Wan,
Tokuo Tsuji,
Kohei Kikuchi,
Kazuyuki Nagata,
Hiromu Onda
Abstract:
This paper proposes an iterative visual recognition system for learning-based randomized bin-picking. Since the configuration of randomly stacked objects during the current picking trial is only partially different from the configuration during the previous picking trial, we consider detecting the poses of objects using only the part of the visual image taken at the current picking trial that differs from the visual image taken at the previous picking trial. By using this method, we do not need to try to detect the poses of all objects included in the pile at every picking trial. Assuming a 3D vision sensor attached at the wrist of a manipulator, we first explain a method to determine the pose of the 3D vision sensor that maximizes the visibility of randomly stacked objects. Then, we explain a method for detecting the poses of randomly stacked objects. The effectiveness of our proposed approach is confirmed by experiments using a dual-arm manipulator where a 3D vision sensor and a two-fingered hand are attached at the right and the left wrists, respectively.
Submitted 1 August, 2016;
originally announced August 2016.
-
Simultaneous Estimation of Noise Variance and Number of Peaks in Bayesian Spectral Deconvolution
Authors:
Satoru Tokuda,
Kenji Nagata,
Masato Okada
Abstract:
The heuristic identification of peaks from noisy complex spectra often leads to misunderstanding of the physical and chemical properties of matter. In this paper, we propose a framework based on Bayesian inference, which enables us to separate multipeak spectra into single peaks statistically and consists of two steps. The first step is estimating both the noise variance and the number of peaks as hyperparameters based on Bayes free energy, which generally is not analytically tractable. The second step is fitting the parameters of each peak function to the given spectrum by calculating the posterior density, which has a problem of local minima and saddles since multipeak models are nonlinear and hierarchical. Our framework enables the escape from local minima or saddles by using the exchange Monte Carlo method and calculates Bayes free energy via the multiple histogram method. We discuss a simulation demonstrating how efficient our framework is and show that estimating both the noise variance and the number of peaks prevents overfitting, overpenalizing, and misunderstanding the precision of parameter estimation.
Submitted 15 December, 2016; v1 submitted 26 July, 2016;
originally announced July 2016.
-
Initial Experiments on Learning-Based Randomized Bin-Picking Allowing Finger Contact with Neighboring Objects
Authors:
Kensuke Harada,
Weiwei Wan,
Tokuo Tsuji,
Kohei Kikuchi,
Kazuyuki Nagata,
Hiromu Onda
Abstract:
This paper proposes a novel method for randomized bin-picking based on learning. When a two-fingered gripper tries to pick an object from the pile, a finger often contacts a neighboring object. Even if a finger contacts a neighboring object, the target object will be successfully picked depending on the configuration of neighboring objects. In our proposed method, we use the visual information on neighboring objects to train the discriminator. Corresponding to a grasping posture of an object, the discriminator predicts whether or not the pick will be successful even if a finger contacts a neighboring object. We examine two learning algorithms, the linear support vector machine (SVM) and the random forest (RF) approaches. By using both methods, we demonstrate that the picking success rate is significantly higher than with conventional methods without learning.
Submitted 11 July, 2016;
originally announced July 2016.
-
The argument for justification of the complex Langevin method and the condition for correct convergence
Authors:
Keitaro Nagata,
Jun Nishimura,
Shinji Shimasaki
Abstract:
The complex Langevin method is a promising approach to the complex-action problem based on a fictitious time evolution of complexified dynamical variables under the influence of a Gaussian noise. Although it is known to have a restricted range of applicability, the use of gauge cooling made it applicable to various interesting cases including finite density QCD in certain parameter regions. In this paper, we revisit the argument for justification of the method. In particular, we point out a subtlety in the use of time-evolved observables, which play a crucial role in the previous argument. This requires that the probability of the drift term should fall off exponentially or faster at large magnitude. We argue that this is actually a necessary and sufficient condition for the method to be justified. Using two simple examples, we show that our condition tells us clearly whether the results obtained by the method are trustable or not. We also discuss a new possibility for the gauge cooling, which can reduce the magnitude of the drift term directly.
Submitted 11 December, 2016; v1 submitted 24 June, 2016;
originally announced June 2016.
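The "fictitious time evolution of complexified dynamical variables under the influence of a Gaussian noise" mentioned in the abstract above can be illustrated on a toy model. The following is a minimal sketch, not the authors' setup: it assumes a single-variable Gaussian action S(z) = sigma * z**2 / 2 with complex sigma (Re sigma > 0), a plain Euler discretization, and real noise. The function name and all parameter values are hypothetical. For this linear drift the exact answer is <z^2> = 1/sigma, which gives something to compare against.

```python
import random


def complex_langevin_gaussian(sigma, dt=0.01, n_steps=400_000,
                              n_thermal=10_000, seed=0):
    """Estimate <z^2> for the toy action S(z) = sigma * z**2 / 2.

    The real variable is complexified to z = x + i*y and evolved with the
    drift -dS/dz plus real Gaussian noise (Euler discretization).
    """
    rng = random.Random(seed)
    z = 0j                              # start on the real axis
    noise_scale = (2.0 * dt) ** 0.5     # standard deviation of the noise per step
    total = 0j
    for step in range(n_thermal + n_steps):
        drift = -sigma * z              # drift term -dS/dz, complex when sigma is
        z = z + drift * dt + noise_scale * rng.gauss(0.0, 1.0)
        if step >= n_thermal:           # discard the thermalization steps
            total += z * z
    return total / n_steps


sigma = 1.0 + 1.0j                      # hypothetical complex coupling
estimate = complex_langevin_gaussian(sigma)
exact = 1.0 / sigma                     # exact <z^2> for the Gaussian model
print("estimate:", estimate)
print("exact:   ", exact)
```

In this toy case the drift grows only linearly with |z|, so large drift values are rare; the abstract's condition concerns models where that is not guaranteed.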
-
Gauge cooling for the singular-drift problem in the complex Langevin method --a test in Random Matrix Theory for finite density QCD
Authors:
Keitaro Nagata,
Jun Nishimura,
Shinji Shimasaki
Abstract:
Recently, the complex Langevin method has been applied successfully to finite density QCD either in the deconfinement phase or in the heavy dense limit with the aid of a new technique called the gauge cooling. In the confinement phase with light quarks, however, convergence to wrong limits occurs due to the singularity in the drift term caused by small eigenvalues of the Dirac operator including the mass term. We propose that this singular-drift problem should also be overcome by the gauge cooling with different criteria for choosing the complexified gauge transformation. The idea is tested in chiral Random Matrix Theory for finite density QCD, where exact results are reproduced at zero temperature with light quarks. It is shown that the gauge cooling indeed changes drastically the eigenvalue distribution of the Dirac operator measured during the Langevin process. Despite its non-holomorphic nature, this eigenvalue distribution has a universal diverging behavior at the origin in the chiral limit due to a generalized Banks-Casher relation as we confirm explicitly.
Submitted 6 May, 2016; v1 submitted 26 April, 2016;
originally announced April 2016.
-
Test for a universal behavior of Dirac eigenvalues in the complex Langevin method
Authors:
Terukazu Ichihara,
Keitaro Nagata,
Kouji Kashiwa
Abstract:
We apply the complex Langevin (CL) method to a chiral random matrix theory (ChRMT) at non-zero chemical potential and study the nearest neighbor spacing (NNS) distribution of the Dirac eigenvalues. The NNS distribution is extracted using an unfolding procedure for the Dirac eigenvalues obtained in the CL method. For large quark mass, we find that the NNS distribution obeys the Ginibre ensemble as expected. For small quark mass, the NNS distribution follows the Wigner surmise in the correct-convergence case, while it follows the Ginibre ensemble in the wrong-convergence case. The Wigner surmise is physically reasonable given the chemical potential independence of the ChRMT. The Ginibre ensemble is known to be favored in phase-quenched QCD at finite chemical potential. Our result suggests the possibility that the originally universal behavior of the NNS distribution is preserved even in the CL method in the correct-convergence case.
Submitted 31 March, 2016;
originally announced March 2016.
-
Entanglement in Four-Dimensional SU(3) Gauge Theory
Authors:
Etsuko Itou,
Keitaro Nagata,
Yoshiyuki Nakagawa,
Atsushi Nakamura,
V. I. Zakharov
Abstract:
We investigate the quantum entanglement entropy for the four-dimensional Euclidean SU(3) gauge theory. We present the first non-perturbative calculation of the entropic $c$-function ($C(l)$) of SU(3) gauge theory in lattice Monte Carlo simulation using the replica method. For $0 \leqslant l \leqslant 0.7$ fm, where $l$ is the length of the subspace, the entropic $c$-function is almost constant, indicating conformally invariant dynamics. The value of the constant agrees with that obtained perturbatively from free gluons, with a 20% discrepancy. When $l$ is close to the hadronic scale, the entropic $c$-function decreases smoothly, and it is consistent with zero within error bars at $l \gtrsim 0.9$ fm.
Submitted 4 December, 2015;
originally announced December 2015.
-
Testing a generalized cooling procedure in the complex Langevin simulation of chiral Random Matrix Theory
Authors:
Keitaro Nagata,
Jun Nishimura,
Shinji Shimasaki
Abstract:
The complex Langevin method has been attracting much attention as a solution to the sign problem since the method was shown to work in finite density QCD in the deconfined phase by using the so-called gauge cooling procedure. Whether it works also in the confined phase with light quarks is still an open question, though. In order to shed light on this question, we apply the method to the chiral Random Matrix Theory, which describes the epsilon regime of finite density QCD. Earlier works reported that a naive implementation of the method fails to reproduce the known exact results and that the problem can be solved by choosing a suitable coordinate. In this work we stick to the naive implementation, and show that a generalized gauge cooling procedure can be used to avoid the problem.
Submitted 27 November, 2015;
originally announced November 2015.
-
Justification of the complex Langevin method with the gauge cooling procedure
Authors:
Keitaro Nagata,
Jun Nishimura,
Shinji Shimasaki
Abstract:
Recently there has been remarkable progress in the complex Langevin method, which aims at solving the complex action problem by complexifying the dynamical variables in the original path integral. In particular, a new technique called the gauge cooling was introduced and the full QCD simulation at finite density has been made possible in the high temperature (deconfined) phase or with heavy quarks. Here we provide a rigorous justification of the complex Langevin method including the gauge cooling procedure. We first show that the gauge cooling can be formulated as an extra term in the complex Langevin equation involving a gauge transformation parameter, which is chosen appropriately as a function of the configuration before cooling. The probability distribution of the complexified dynamical variables is modified by this extra term. However, this modification is shown not to affect the Fokker-Planck equation for the corresponding complex weight as far as observables are restricted to gauge invariant ones. Thus we demonstrate explicitly that the gauge cooling can be used as a viable technique to satisfy the convergence conditions for the complex Langevin method. We also discuss the "gauge cooling" in 0-dimensional systems such as vector models or matrix models.
Submitted 18 September, 2015; v1 submitted 10 August, 2015;
originally announced August 2015.
-
Development of FD-SOI MOSFET amplifiers for integrated read-out circuit of superconducting-tunnel-junction single-photon-detectors
Authors:
Kenji Kiuchi,
Shinhong Kim,
Yuji Takeuchi,
Kenichi Takemasa,
Kazuki Nagata,
Kota Kasahara,
Koya Moriuchi,
Ren Senzaki,
Shunsuke Yagi,
Hirokazu Ikeda,
Shuji Matsuura,
Takehiko Wada,
Hirokazu Ishino,
Atsuko Kibayashi,
Hiromi Sato,
Satoru Mima,
Takuo Yoshida,
Ryuta Hirose,
Yukihiro Kato,
Masasi Hazumi,
Yasuo Arai,
Ikuo Kurachi,
Erik Ramgerg,
Mark Kozlovsky,
Paul Rubinov
, et al. (6 additional authors not shown)
Abstract:
We proposed a new high-resolution single-photon infrared spectrometer to search for the radiative decay of the cosmic neutrino background (C$ν$B). Superconducting tunnel junctions (STJs) are used as single-photon-counting devices. Each STJ consists of Nb/Al/Al${}_{\mathrm{x}}$O${}_{\mathrm{y}}$/Al/Nb layers, and their thicknesses are optimized for an operating temperature of 370 mK, cooled by a ${}^{3}$He sorption refrigerator. Our STJs achieved a leak current of 250 pA, and the measured data imply that a smaller-area STJ fulfills our requirement. FD-SOI MOSFETs are employed to amplify the STJ signal current in order to increase the signal-to-noise ratio (S/N). FD-SOI MOSFETs can be operated at a cryogenic temperature of 370 mK, which reduces the noise of the signal amplification system. FD-SOI MOSFET characteristics are measured at cryogenic temperature. The Id-Vgs curve shows a sharper turn-on with a higher threshold voltage, and the Id-Vds curve shows a nonlinear shape in the linear region at cryogenic temperature. Taking these effects into account, FD-SOI MOSFETs are usable for the read-out circuit of STJ detectors. The bias voltage for the STJ detectors is 0.4 mV, and it must be well stabilized to deliver high performance. We proposed an FD-SOI MOSFET-based charge-integrated amplifier design as a read-out circuit for STJ detectors. The requirements for the operational amplifier used in this design are estimated using SPICE simulation. The op-amp is required to have a fast response (GBW $\geq$ 100 MHz) and low power dissipation compared with the cooling power of the refrigerator.
Submitted 27 July, 2015;
originally announced July 2015.
-
A filtering technique for the temporally reduced matrix of the Wilson fermion determinant
Authors:
Yasunori Futamura,
Shoji Hashimoto,
Akira Imakura,
Keitaro Nagata,
Tetsuya Sakurai
Abstract:
The Wilson fermion determinant can be written in the form of a series expansion in fugacity $ξ=\exp(μ/T)$, provided that the eigenmodes of the temporally reduced operator are obtained. Since the calculation of all eigenmodes rapidly becomes prohibitive for larger volumes, we develop a method to calculate only the low-energy eigenmodes of the reduced matrix using a matrix filtering technique. This provides a basis for an approximation to neglect uninteresting ultraviolet contributions.
Submitted 16 November, 2014;
originally announced November 2014.
-
Lee-Yang zero distribution of high temperature QCD and Roberge-Weiss phase transition
Authors:
Keitaro Nagata,
Kouji Kashiwa,
Atsushi Nakamura,
Shinsuke M. Nishigaki
Abstract:
Canonical partition functions and Lee-Yang zeros of QCD at finite density and high temperature are studied. Recent lattice simulations have confirmed that the free energy of QCD is a quartic function of quark chemical potential at temperature slightly above pseudo-critical temperature $T_c$, as in the case with a gas of free massless fermions.
We present an analytic derivation of the canonical partition functions and Lee-Yang zeros for this type of free energy using the saddle point approximation. We also perform lattice QCD simulations in a canonical approach using the fugacity expansion of the fermion determinant, and carefully examine its reliability. By comparing the analytic and numerical results, we conclude that the canonical partition functions follow the Gaussian distribution of the baryon number, and that the accumulation of Lee-Yang zeros of these canonical partition functions exhibits the first-order Roberge-Weiss phase transition. We discuss the validity and applicable range of the result and its implications for both theoretical and experimental studies.
Submitted 16 May, 2015; v1 submitted 3 October, 2014;
originally announced October 2014.
-
Oscillations in Spurious States of the Associative Memory Model with Synaptic Depression
Authors:
Shin Murata,
Yosuke Otsubo,
Kenji Nagata,
Masato Okada
Abstract:
The associative memory model is a typical neural network model, which can store discretely distributed fixed-point attractors as memory patterns. When the network stores the memory patterns extensively, however, the model has other attractors besides the memory patterns. These attractors are called spurious memories. Both spurious states and memory states are equilibria, so there is little difference between their dynamics. Recent physiological experiments have shown that a short-term dynamic synapse mechanism called synaptic depression decreases the transmission efficacy to postsynaptic neurons according to the activities of presynaptic neurons. Previous studies have shown that synaptic depression induces oscillation in the network and decreases the storage capacity at finite temperature. How synaptic depression affects spurious states, however, is still unclear. We investigate the effect of synaptic depression on spurious states through Monte Carlo simulation. The results demonstrate that synaptic depression does not affect the memory states but mainly destabilizes the spurious states and induces periodic oscillations.
Submitted 9 May, 2014;
originally announced May 2014.
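The statement in the abstract above that the associative memory model stores fixed-point attractors as memory patterns can be made concrete with a small example. This is only a sketch of the baseline model under stated assumptions: Hebbian couplings, zero temperature, synchronous updates, and no synaptic depression (the effect the paper actually studies is not modeled here). The two 8-neuron patterns and all function names are hypothetical and chosen to be mutually orthogonal so that recall is exact.

```python
def sign(x):
    """Binary neuron output: +1 for non-negative input, -1 otherwise."""
    return 1 if x >= 0 else -1


def hebbian_weights(patterns):
    """Build Hebbian couplings J_ij = (1/N) * sum_mu xi_i^mu xi_j^mu, with J_ii = 0."""
    n = len(patterns[0])
    w = [[0.0] * n for _ in range(n)]
    for p in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    w[i][j] += p[i] * p[j] / n
    return w


def update(w, state):
    """One synchronous zero-temperature update of all neurons."""
    return [sign(sum(w_ij * s_j for w_ij, s_j in zip(row, state))) for row in w]


# Two orthogonal +/-1 memory patterns (illustrative values).
patterns = [
    [1, 1, 1, 1, -1, -1, -1, -1],
    [1, 1, -1, -1, 1, 1, -1, -1],
]
w = hebbian_weights(patterns)

# Corrupt one neuron of the first pattern and let the network relax.
noisy = list(patterns[0])
noisy[0] = -noisy[0]
recalled = update(w, noisy)

print("noisy input:", noisy)
print("recalled:   ", recalled)
print("is fixed point:", update(w, patterns[0]) == patterns[0])
```

With many random patterns stored, the same dynamics also settles into mixtures of patterns, which are the spurious states discussed in the abstract.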
-
Heavy quark potential at finite imaginary chemical potential
Authors:
Junichi Takahashi,
Takahiro Sasaki,
Keitaro Nagata,
Takuya Saito,
Hiroaki Kouno,
Atsushi Nakamura,
Masanobu Yahiro
Abstract:
We investigate chemical-potential ($μ$) dependence of the static-quark free energies in both the real and imaginary $μ$ regions, using the clover-improved two-flavor Wilson fermion action and the renormalization-group improved Iwasaki gauge action. Static-quark potentials are evaluated from Polyakov-loop correlators in the deconfinement phase and the imaginary $μ=iμ_{\rm I}$ region and extrapolated to the real $μ$ region with analytic continuation. As the analytic continuation, the potential calculated at imaginary $μ=iμ_{\rm I}$ is expanded into a Taylor-expansion series of $iμ_{\rm I}/T$ up to 4th order and the pure imaginary variable $iμ_{\rm I}/T$ is replaced by the real one $μ_{\rm R}/T$. At real $μ$, the 4th-order term weakens $μ$ dependence of the potential sizably. Also, the color-Debye screening mass is extracted from the color-singlet potential at imaginary $μ$, and the mass is extrapolated to real $μ$ by analytic continuation. The screening mass thus obtained has stronger $μ$ dependence than the prediction of the leading-order thermal perturbation theory at both real and imaginary $μ$.
Submitted 24 March, 2014; v1 submitted 29 October, 2013;
originally announced October 2013.
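The analytic continuation described in the abstract above can be written out explicitly. This is a sketch under assumptions: the coefficient names $v_0$, $v_2$, $v_4$ are illustrative, and keeping only even powers assumes the potential is even in $μ$, which the abstract does not state.

```latex
\begin{align}
V(i\mu_{\rm I}) &\simeq v_0 + v_2 \left(\frac{i\mu_{\rm I}}{T}\right)^{2}
                       + v_4 \left(\frac{i\mu_{\rm I}}{T}\right)^{4}
                 = v_0 - v_2 \left(\frac{\mu_{\rm I}}{T}\right)^{2}
                       + v_4 \left(\frac{\mu_{\rm I}}{T}\right)^{4}, \\
V(\mu_{\rm R})  &\simeq v_0 + v_2 \left(\frac{\mu_{\rm R}}{T}\right)^{2}
                       + v_4 \left(\frac{\mu_{\rm R}}{T}\right)^{4}.
\end{align}
```

Replacing $iμ_{\rm I}/T$ by $μ_{\rm R}/T$ flips the sign of the second-order contribution relative to the imaginary-$μ$ data while the fourth-order contribution keeps its sign, so the two terms can partly offset each other at real $μ$.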
-
Scalar Transfer across a Turbulent/non-turbulent Interface in a Planar Jet
Authors:
Tomoaki Watanabe,
Yasuhiko Sakai,
Kouji Nagata,
Osamu Terashima,
Yasumasa Ito,
Toshiyuki Hayase
Abstract:
This fluid dynamics video is an entry for the Gallery of Fluid Motion of the 66th Annual Meeting of the APS-DFD. In this video, the scalar transfer across the turbulent/non-turbulent (T/NT) interface in a planar jet is investigated by using a direct numerical simulation. Visualization of the scalar flux across the T/NT interface shows that the diffusive species premixed in the ambient flow is transferred into the turbulent region mainly across the leading edge (Here, the leading edge is the T/NT interface across which the turbulent fluid turns into the non-turbulent fluid in the streamwise direction).
Submitted 2 October, 2013;
originally announced October 2013.