-
The time periodic problem for the Navier-Stokes equations in exterior domains in weighted spaces
Authors:
Reinhard Farwig,
Kazuyuki Tsuda
Abstract:
The paper considers the time periodic problem of the Navier-Stokes system in an exterior domain under time periodic external forces. Existence of periodic mild solutions is obtained in the critical scale invariant space $C(\mathbb{R};L^n)$ if the external force is small without exploiting any divergence form as in the study of Okabe and Tsutsui (2017) for the whole space case in Lorentz spaces. Pr…
▽ More
The paper considers the time periodic problem of the Navier-Stokes system in an exterior domain under time periodic external forces. Existence of periodic mild solutions is obtained in the critical scale invariant space $C(\mathbb{R};L^n)$ if the external force is small without exploiting any divergence form as in the study of Okabe and Tsutsui (2017) for the whole space case in Lorentz spaces. Previous studies mainly rely on either potential theoretical estimates or time-space integral estimates in Lorentz spaces introduced by Yamazaki (Math. Ann.(2000)). To the best of our knowledge, there are no results using Muckenhoupt weights in $L^q$ class for $1< q <\infty$ to construct time periodic solutions of the Navier-Stokes equations in the exterior domain case. In this article, a new method based on radially symmetric Muckenhoupt weights in space is used. To apply these weights, we reconsider weighted $L^p$-$L^q$ decay estimates for the Stokes semigroup. This important result was announced by Kobayashi and Kubo (2012-2015) about ten years ago with a sketch of the proof by Kubo. In this paper, we give a rigorous proof of the result and, as an important application, solve the time periodic problem for the Navier-Stokes equations on an exterior domain.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
Preference-Optimized Pareto Set Learning for Blackbox Optimization
Authors:
Zhang Haishan,
Diptesh Das,
Koji Tsuda
Abstract:
Multi-Objective Optimization (MOO) is an important problem in real-world applications. However, for a non-trivial problem, no single solution exists that can optimize all the objectives simultaneously. In a typical MOO problem, the goal is to find a set of optimum solutions (Pareto set) that trades off the preferences among objectives. Scalarization in MOO is a well-established method for finding…
▽ More
Multi-Objective Optimization (MOO) is an important problem in real-world applications. However, for a non-trivial problem, no single solution exists that can optimize all the objectives simultaneously. In a typical MOO problem, the goal is to find a set of optimum solutions (Pareto set) that trades off the preferences among objectives. Scalarization in MOO is a well-established method for finding a finite set approximation of the whole Pareto set (PS). However, in real-world experimental design scenarios, it's beneficial to obtain the whole PS for flexible exploration of the design space. Recently Pareto set learning (PSL) has been introduced to approximate the whole PS. PSL involves creating a manifold representing the Pareto front of a multi-objective optimization problem. A naive approach includes finding discrete points on the Pareto front through randomly generated preference vectors and connecting them by regression. However, this approach is computationally expensive and leads to a poor PS approximation. We propose to optimize the preference points to be distributed evenly on the Pareto front. Our formulation leads to a bilevel optimization problem that can be solved by e.g. differentiable cross-entropy methods. We demonstrated the efficacy of our method for complex and difficult black-box MOO problems using both synthetic and real-world benchmark data.
△ Less
Submitted 19 August, 2024;
originally announced August 2024.
-
Molecule Graph Networks with Many-body Equivariant Interactions
Authors:
Zetian Mao,
Jiawen Li,
Chen Liang,
Diptesh Das,
Masato Sumita,
Koji Tsuda
Abstract:
Message passing neural networks have demonstrated significant efficacy in predicting molecular interactions. Introducing equivariant vectorial representations augments expressivity by capturing geometric data symmetries, thereby improving model accuracy. However, two-body bond vectors in opposition may cancel each other out during message passing, leading to the loss of directional information on…
▽ More
Message passing neural networks have demonstrated significant efficacy in predicting molecular interactions. Introducing equivariant vectorial representations augments expressivity by capturing geometric data symmetries, thereby improving model accuracy. However, two-body bond vectors in opposition may cancel each other out during message passing, leading to the loss of directional information on their shared node. In this study, we develop Equivariant N-body Interaction Networks (ENINet) that explicitly integrates equivariant many-body interactions to preserve directional information in the message passing scheme. Experiments indicate that integrating many-body equivariant representations enhances prediction accuracy across diverse scalar and tensorial quantum chemical properties. Ablation studies show an average performance improvement of 7.9% across 11 out of 12 properties in QM9, 27.9% in forces in MD17, and 11.3% in polarizabilities (CCSD) in QM7b.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Boltzmann sampling with quantum annealers via fast Stein correction
Authors:
Ryosuke Shibukawa,
Ryo Tamura,
Koji Tsuda
Abstract:
Despite the attempts to apply a quantum annealer to Boltzmann sampling, it is still impossible to perform accurate sampling at arbitrary temperatures. Conventional distribution correction methods such as importance sampling and resampling cannot be applied, because the analytical expression of sampling distribution is unknown for a quantum annealer. Stein correction (Liu and Lee, 2017) can correct…
▽ More
Despite the attempts to apply a quantum annealer to Boltzmann sampling, it is still impossible to perform accurate sampling at arbitrary temperatures. Conventional distribution correction methods such as importance sampling and resampling cannot be applied, because the analytical expression of sampling distribution is unknown for a quantum annealer. Stein correction (Liu and Lee, 2017) can correct the samples by weighting without the knowledge of the sampling distribution, but the naive implementation requires the solution of a large-scale quadratic program, hampering usage in practical problems. In this letter, a fast and approximate method based on random feature map and exponentiated gradient updates is developed to compute the sample weights, and used to correct the samples generated by D-Wave quantum annealers. In benchmarking problems, it is observed that the residual error of thermal average calculations is reduced significantly. If combined with our method, quantum annealers may emerge as a viable alternative to long-established Markov chain Monte Carlo methods.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Feature Importance Measurement based on Decision Tree Sampling
Authors:
Chao Huang,
Diptesh Das,
Koji Tsuda
Abstract:
Random forest is effective for prediction tasks but the randomness of tree generation hinders interpretability in feature importance analysis. To address this, we proposed DT-Sampler, a SAT-based method for measuring feature importance in tree-based model. Our method has fewer parameters than random forest and provides higher interpretability and stability for the analysis in real-world problems.…
▽ More
Random forest is effective for prediction tasks but the randomness of tree generation hinders interpretability in feature importance analysis. To address this, we proposed DT-Sampler, a SAT-based method for measuring feature importance in tree-based model. Our method has fewer parameters than random forest and provides higher interpretability and stability for the analysis in real-world problems. An implementation of DT-Sampler is available at https://github.com/tsudalab/DT-sampler.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Efficient Model Selection for Predictive Pattern Mining Model by Safe Pattern Pruning
Authors:
Takumi Yoshida,
Hiroyuki Hanada,
Kazuya Nakagawa,
Kouichi Taji,
Koji Tsuda,
Ichiro Takeuchi
Abstract:
Predictive pattern mining is an approach used to construct prediction models when the input is represented by structured data, such as sets, graphs, and sequences. The main idea behind predictive pattern mining is to build a prediction model by considering substructures, such as subsets, subgraphs, and subsequences (referred to as patterns), present in the structured data as features of the model.…
▽ More
Predictive pattern mining is an approach used to construct prediction models when the input is represented by structured data, such as sets, graphs, and sequences. The main idea behind predictive pattern mining is to build a prediction model by considering substructures, such as subsets, subgraphs, and subsequences (referred to as patterns), present in the structured data as features of the model. The primary challenge in predictive pattern mining lies in the exponential growth of the number of patterns with the complexity of the structured data. In this study, we propose the Safe Pattern Pruning (SPP) method to address the explosion of pattern numbers in predictive pattern mining. We also discuss how it can be effectively employed throughout the entire model building process in practical data analysis. To demonstrate the effectiveness of the proposed method, we conduct numerical experiments on regression and classification problems involving sets, graphs, and sequences.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
Virtual Human Generative Model: Masked Modeling Approach for Learning Human Characteristics
Authors:
Kenta Oono,
Nontawat Charoenphakdee,
Kotatsu Bito,
Zhengyan Gao,
Yoshiaki Ota,
Shoichiro Yamaguchi,
Yohei Sugawara,
Shin-ichi Maeda,
Kunihiko Miyoshi,
Yuki Saito,
Koki Tsuda,
Hiroshi Maruyama,
Kohei Hayashi
Abstract:
Identifying the relationship between healthcare attributes, lifestyles, and personality is vital for understanding and improving physical and mental conditions. Machine learning approaches are promising for modeling their relationships and offering actionable suggestions. In this paper, we propose Virtual Human Generative Model (VHGM), a machine learning model for estimating attributes about healt…
▽ More
Identifying the relationship between healthcare attributes, lifestyles, and personality is vital for understanding and improving physical and mental conditions. Machine learning approaches are promising for modeling their relationships and offering actionable suggestions. In this paper, we propose Virtual Human Generative Model (VHGM), a machine learning model for estimating attributes about healthcare, lifestyles, and personalities. VHGM is a deep generative model trained with masked modeling to learn the joint distribution of attributes conditioned on known ones. Using heterogeneous tabular datasets, VHGM learns more than 1,800 attributes efficiently. We numerically evaluate the performance of VHGM and its training techniques. As a proof-of-concept of VHGM, we present several applications demonstrating user scenarios, such as virtual measurements of healthcare attributes and hypothesis verifications of lifestyles.
△ Less
Submitted 14 August, 2023; v1 submitted 18 June, 2023;
originally announced June 2023.
-
NIMS-OS: An automation software to implement a closed loop between artificial intelligence and robotic experiments in materials science
Authors:
Ryo Tamura,
Koji Tsuda,
Shoichi Matsuda
Abstract:
NIMS-OS (NIMS Orchestration System) is a Python library created to realize a closed loop of robotic experiments and artificial intelligence (AI) without human intervention for automated materials exploration. It uses various combinations of modules to operate autonomously. Each module acts as an AI for materials exploration or a controller for a robotic experiments. As AI techniques, Bayesian opti…
▽ More
NIMS-OS (NIMS Orchestration System) is a Python library created to realize a closed loop of robotic experiments and artificial intelligence (AI) without human intervention for automated materials exploration. It uses various combinations of modules to operate autonomously. Each module acts as an AI for materials exploration or a controller for a robotic experiments. As AI techniques, Bayesian optimization (PHYSBO), boundless objective-free exploration (BLOX), phase diagram construction (PDC), and random exploration (RE) methods can be used. Moreover, a system called NIMS automated robotic electrochemical experiments (NAREE) is available as a set of robotic experimental equipment. Visualization tools for the results are also included, which allows users to check the optimization results in real time. Newly created modules for AI and robotic experiments can be added easily to extend the functionality of the system. In addition, we developed a GUI application to control NIMS-OS.To demonstrate the operation of NIMS-OS, we consider an automated exploration for new electrolytes. NIMS-OS is available at https://github.com/nimsos-dev/nimsos.
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
On a linear fused Gromov-Wasserstein distance for graph structured data
Authors:
Dai Hai Nguyen,
Koji Tsuda
Abstract:
We present a framework for embedding graph structured data into a vector space, taking into account node features and topology of a graph into the optimal transport (OT) problem. Then we propose a novel distance between two graphs, named linearFGW, defined as the Euclidean distance between their embeddings. The advantages of the proposed distance are twofold: 1) it can take into account node featu…
▽ More
We present a framework for embedding graph structured data into a vector space, taking into account node features and topology of a graph into the optimal transport (OT) problem. Then we propose a novel distance between two graphs, named linearFGW, defined as the Euclidean distance between their embeddings. The advantages of the proposed distance are twofold: 1) it can take into account node feature and structure of graphs for measuring the similarity between graphs in a kernel-based framework, 2) it can be much faster for computing kernel matrix than pairwise OT-based distances, particularly fused Gromov-Wasserstein, making it possible to deal with large-scale data sets. After discussing theoretical properties of linearFGW, we demonstrate experimental results on classification and clustering tasks, showing the effectiveness of the proposed linearFGW.
△ Less
Submitted 9 March, 2022;
originally announced March 2022.
-
Coexisting Z-type charge and bond order in metallic NaRu$_2$O$_4$
Authors:
Arvind Kumar Yogi,
Alexander Yaresko,
C. I. Sathish,
Hasung Sim,
Daisuke Morikawa,
J. Nuss,
Kenji Tsuda,
Y. Noda,
Daniel I. Khomskii,
Je-Geun Park
Abstract:
How particular bonds form in quantum materials has been a long-standing puzzle. Two key concepts dealing with charge degrees of freedom are dimerization (forming metal-metal bonds) and charge ordering (CO). Since the 1930s, these two concepts have been frequently invoked to explain numerous exciting quantum materials, typically insulators. Here we report dimerization and CO within the dimers coexi…
▽ More
How particular bonds form in quantum materials has been a long-standing puzzle. Two key concepts dealing with charge degrees of freedom are dimerization (forming metal-metal bonds) and charge ordering (CO). Since the 1930s, these two concepts have been frequently invoked to explain numerous exciting quantum materials, typically insulators. Here we report dimerization and CO within the dimers coexisting in metallic NaRu$_2$O$_4$. By combining high-resolution x-ray diffraction studies and theoretical calculations, we demonstrate that this unique phenomenon occurs through a new type of bonding, which we call Z-type ordering. The low-temperature superstructure has strong dimerization in legs of zigzag ladders, with short dimers in legs connected by short zigzag bonds, forming Z-shape clusters: simultaneously, site-centered charge ordering also appears. Our results demonstrate the yet unknown flexibility of quantum materials with the intricate interplay among orbital, charge, and lattice degrees of freedom.
△ Less
Submitted 13 February, 2022;
originally announced February 2022.
-
Bayesian optimization package: PHYSBO
Authors:
Yuichi Motoyama,
Ryo Tamura,
Kazuyoshi Yoshimi,
Kei Terayama,
Tsuyoshi Ueno,
Koji Tsuda
Abstract:
PHYSBO (optimization tools for PHYSics based on Bayesian Optimization) is a Python library for fast and scalable Bayesian optimization. It has been developed mainly for application in the basic sciences such as physics and materials science. Bayesian optimization is used to select an appropriate input for experiments/simulations from candidate inputs listed in advance in order to obtain better out…
▽ More
PHYSBO (optimization tools for PHYSics based on Bayesian Optimization) is a Python library for fast and scalable Bayesian optimization. It has been developed mainly for application in the basic sciences such as physics and materials science. Bayesian optimization is used to select an appropriate input for experiments/simulations from candidate inputs listed in advance in order to obtain better output values with the help of machine learning prediction. PHYSBO can be used to find better solutions for both single and multi-objective optimization problems. At each cycle in the Bayesian optimization, a single proposal or multiple proposals can be obtained for the next experiments/simulations. These proposals can be obtained interactively for use in experiments. PHYSBO is available at https://github.com/issp-center-dev/PHYSBO.
△ Less
Submitted 24 May, 2022; v1 submitted 15 October, 2021;
originally announced October 2021.
-
Probing conformational dynamics of antibodies with geometric simulations
Authors:
Andrejs Tucs,
Koji Tsuda,
Adnan Sljoka
Abstract:
This chapter describes the application of constrained geometric simulations for prediction of antibody structural dynamics. We utilize constrained geometric simulations method FRODAN, which is a low computational complexity alternative to Molecular Dynamics (MD) simulations that can rapidly explore flexible motions in protein structures. FRODAN is highly suited for conformational dynamics analysis…
▽ More
This chapter describes the application of constrained geometric simulations for prediction of antibody structural dynamics. We utilize constrained geometric simulations method FRODAN, which is a low computational complexity alternative to Molecular Dynamics (MD) simulations that can rapidly explore flexible motions in protein structures. FRODAN is highly suited for conformational dynamics analysis of large proteins, complexes, intrinsically disordered proteins and dynamics that occurs on longer biologically relevant time scales which are normally inaccessible to classical MD simulations. This approach predicts protein dynamics at an all-atom scale while retaining realistic covalent bonding, maintaining dihedral angles in energetically good conformations while avoiding steric clashes in addition to performing other geometric and stereochemical criteria checks. In this chapter, we apply FRODAN to showcase its applicability for probing functionally relevant dynamics of IgG2a, including large amplitude domain-domain motions and motions of complementarity determining region (CDR) loops. As was suggested in previous experimental studies, our simulations show that antibodies can explore a large range of conformational space.
△ Less
Submitted 29 September, 2021;
originally announced September 2021.
-
Fast and More Powerful Selective Inference for Sparse High-order Interaction Model
Authors:
Diptesh Das,
Vo Nguyen Le Duy,
Hiroyuki Hanada,
Koji Tsuda,
Ichiro Takeuchi
Abstract:
Automated high-stake decision-making such as medical diagnosis requires models with high interpretability and reliability. As one of the interpretable and reliable models with good prediction ability, we consider Sparse High-order Interaction Model (SHIM) in this study. However, finding statistically significant high-order interactions is challenging due to the intrinsic high dimensionality of the…
▽ More
Automated high-stake decision-making such as medical diagnosis requires models with high interpretability and reliability. As one of the interpretable and reliable models with good prediction ability, we consider Sparse High-order Interaction Model (SHIM) in this study. However, finding statistically significant high-order interactions is challenging due to the intrinsic high dimensionality of the combinatorial effects. Another problem in data-driven modeling is the effect of "cherry-picking" a.k.a. selection bias. Our main contribution is to extend the recently developed parametric programming approach for selective inference to high-order interaction models. Exhaustive search over the cherry tree (all possible interactions) can be daunting and impractical even for a small-sized problem. We introduced an efficient pruning strategy and demonstrated the computational efficiency and statistical power of the proposed method using both synthetic and real data.
△ Less
Submitted 9 June, 2021;
originally announced June 2021.
-
A generative model for molecule generation based on chemical reaction trees
Authors:
Dai Hai Nguyen,
Koji Tsuda
Abstract:
Deep generative models have been shown powerful in generating novel molecules with desired chemical properties via their representations such as strings, trees or graphs. However, these models are limited in recommending synthetic routes for the generated molecules in practice. We propose a generative model to generate molecules via multi-step chemical reaction trees. Specifically, our model first…
▽ More
Deep generative models have been shown powerful in generating novel molecules with desired chemical properties via their representations such as strings, trees or graphs. However, these models are limited in recommending synthetic routes for the generated molecules in practice. We propose a generative model to generate molecules via multi-step chemical reaction trees. Specifically, our model first propose a chemical reaction tree with predicted reaction templates and commercially available molecules (starting molecules), and then perform forward synthetic steps to obtain product molecules. Experiments show that our model can generate chemical reactions whose product molecules are with desired chemical properties. Also, the complete synthetic routes for these product molecules are provided.
△ Less
Submitted 7 June, 2021;
originally announced June 2021.
-
Continuous black-box optimization with quantum annealing and random subspace coding
Authors:
Syun Izawa,
Koki Kitai,
Shu Tanaka,
Ryo Tamura,
Koji Tsuda
Abstract:
A black-box optimization algorithm such as Bayesian optimization finds extremum of an unknown function by alternating inference of the underlying function and optimization of an acquisition function. In a high-dimensional space, such algorithms perform poorly due to the difficulty of acquisition function optimization. Herein, we apply quantum annealing (QA) to overcome the difficulty in the contin…
▽ More
A black-box optimization algorithm such as Bayesian optimization finds extremum of an unknown function by alternating inference of the underlying function and optimization of an acquisition function. In a high-dimensional space, such algorithms perform poorly due to the difficulty of acquisition function optimization. Herein, we apply quantum annealing (QA) to overcome the difficulty in the continuous black-box optimization. As QA specializes in optimization of binary problems, a continuous vector has to be encoded to binary, and the solution of QA has to be translated back. Our method has the following three parts: 1) Random subspace coding based on axis-parallel hyperrectangles from continuous vector to binary vector. 2) A quadratic unconstrained binary optimization (QUBO) defined by acquisition function based on nonnegative-weighted linear regression model which is solved by QA. 3) A penalization scheme to ensure that the QA solution can be translated back. It is shown in benchmark tests that its performance using D-Wave Advantage$^{\rm TM}$ quantum annealer is competitive with a state-of-the-art method based on the Gaussian process in high-dimensional problems. Our method may open up a new possibility of quantum annealing and other QUBO solvers including quantum approximate optimization algorithm (QAOA) using a gated-quantum computers, and expand its range of application to continuous-valued problems.
△ Less
Submitted 30 April, 2021;
originally announced April 2021.
-
Structural-transition-driven antiferromagnetic to spin-glass transition in Cd-Mg-Tb 1/1 approximants
Authors:
Farid Labib,
Daisuke Okuyama,
Nobuhisa Fujita,
Tsunetomo Yamada,
Satoshi Ohhashi,
Daisuke Morikawa,
Kenji Tsuda,
Taku J. Sato,
An-Pang Tsai
Abstract:
The magnetic susceptibility of the 1/1 approximants to icosahedral quasicrystals in a series of Cd85-xMgxTb15 (x = 5, 10, 15, 20) alloys was investigated in detail. The occurrence of antiferromagnetic to spin-glass-like transition was noticed by increasing Mg. Transmission electron microscopy analysis evidenced a correlation between the magnetic transition and suppression of the monoclinic superla…
▽ More
The magnetic susceptibility of the 1/1 approximants to icosahedral quasicrystals in a series of Cd85-xMgxTb15 (x = 5, 10, 15, 20) alloys was investigated in detail. The occurrence of antiferromagnetic to spin-glass-like transition was noticed by increasing Mg. Transmission electron microscopy analysis evidenced a correlation between the magnetic transition and suppression of the monoclinic superlattice ordering with respect to the orientation of the Cd4 tetrahedron at T > 100 K. The possible origins of this phenomenon were discussed in detail. The occurrence of the antiferromagnetic to spin-glass -like magnetic transition is associated with the combination of chemical disorder due to a randomized substitution of Cd with Mg and the orientational disorder of the Cd4 tetrahedra.
△ Less
Submitted 28 July, 2020;
originally announced July 2020.
-
Optimization of heterogeneous ternary Li3PO4-Li3BO3-Li2SO4 mixture for Li-ion conductivity by machine learning
Authors:
Kenji Homma,
Yu Liu,
Masato Sumita,
Ryo Tamura,
Naoki Fushimi,
Junichi Iwata,
Koji Tsuda,
Chioko Kaneta
Abstract:
Mixing heterogeneous Li-ion conductive materials is one of potential ways to enhance the Li-ion conductivity more than that of the parent materials. However, the development of the mixtures had not exhibited significant progress because it is a formidable task to cover the vast possible composition of the parent materials using traditional ways. Here, we introduce a fashion based on machine learni…
▽ More
Mixing heterogeneous Li-ion conductive materials is one of potential ways to enhance the Li-ion conductivity more than that of the parent materials. However, the development of the mixtures had not exhibited significant progress because it is a formidable task to cover the vast possible composition of the parent materials using traditional ways. Here, we introduce a fashion based on machine learning to optimize the composition ratio of ternary Li3PO4-Li3BO3-Li2SO4 mixture for its Li-ion conductivity. According to our results, the optimum composition of the ternary mixture system is 25:14:61 (Li3PO4: Li3BO3: Li2SO4 in mol%), whose Li-ion conductivity is measured as 4.9 x 10E-4 S/cm at 300 °C. Our X-ray structure analysis indicates that Li-ion conductivity in the mixing systems is enhanced by virtue of the coexistence of two or more phases. Although the mechanism enhancing Li-ion conductivity is not simple, our results demonstrate the effectiveness of machine learning for the development of materials.
△ Less
Submitted 28 November, 2019;
originally announced November 2019.
-
Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Authors:
Xiaolin Sun,
Zhufeng Hou,
Masato Sumita,
Shinsuke Ishihara,
Ryo Tamura,
Koji Tsuda
Abstract:
Machine learning applications in materials science are often hampered by shortage of experimental data. Integration with legacy data from past experiments is a viable way to solve the problem, but complex calibration is often necessary to use the data obtained under different conditions. In this paper, we present a novel calibration-free strategy to enhance the performance of Bayesian optimization…
▽ More
Machine learning applications in materials science are often hampered by shortage of experimental data. Integration with legacy data from past experiments is a viable way to solve the problem, but complex calibration is often necessary to use the data obtained under different conditions. In this paper, we present a novel calibration-free strategy to enhance the performance of Bayesian optimization with preference learning. The entire learning process is solely based on pairwise comparison of quantities (i.e., higher or lower) in the same dataset, and experimental design can be done without comparing quantities in different datasets. We demonstrate that Bayesian optimization is significantly enhanced via addition of legacy data for organic molecules and inorganic solid-state materials.
△ Less
Submitted 25 October, 2019;
originally announced October 2019.
-
Asymptotic profile for diffusion wave terms of the compressible Navier-Stokes-Korteweg system
Authors:
Takayuki Kobayashi,
Masashi Misawa,
Kazuyuki Tsuda
Abstract:
Asymptotic profile for diffusion wave terms of solutions to the compressible Navier-Stokes-Korteweg system is studied on $R^2$. The diffusion wave with time decay estimate is studied by Hoff and Zumbrun (1995, 1997), Kobayashi and Shibata (2002) and Kobayashi and Tsuda (2018) for the compressible Navier-Stokes system and the compressible Navier-Stokes-Korteweg system. Our main assertion in this pa…
▽ More
Asymptotic profile for diffusion wave terms of solutions to the compressible Navier-Stokes-Korteweg system is studied on $R^2$. The diffusion wave with time decay estimate is studied by Hoff and Zumbrun (1995, 1997), Kobayashi and Shibata (2002) and Kobayashi and Tsuda (2018) for the compressible Navier-Stokes system and the compressible Navier-Stokes-Korteweg system. Our main assertion in this paper is that, for some initial conditions given by the Hardy space, asymptotic behaviors in space-time $L^2$ of the diffusion wave parts are essentially different between density and the potential flow part of the momentum. Even though measuring by $L^2$ on space, a decay of the potential flow part is slower than that of the Stokes flow part of the momentum. The proof is based on a modified version of Morawetz's energy estimate, and the Fefferman-Stein inequality on the duality between the Hardy space and functions of bounded mean oscillation.
△ Less
Submitted 9 July, 2019;
originally announced July 2019.
-
Deep learning-based quality filtering of mechanically exfoliated 2D crystals
Authors:
Yu Saito,
Kento Shin,
Kei Terayama1,
Shaan Desai,
Masaru Onga,
Yuji Nakagawa,
Yuki M. Itahashi,
Yoshihiro Iwasa,
Makoto Yamada,
Koji Tsuda
Abstract:
Two-dimensional (2D) crystals are attracting growing interest in various research fields such as engineering, physics, chemistry, pharmacy and biology owing to their low dimensionality and dramatic change of properties compared to the bulk counterparts. Among the various techniques used to manufacture 2D crystals, mechanical exfoliation has been essential to practical applications and fundamental…
▽ More
Two-dimensional (2D) crystals are attracting growing interest in various research fields such as engineering, physics, chemistry, pharmacy and biology owing to their low dimensionality and dramatic change of properties compared to the bulk counterparts. Among the various techniques used to manufacture 2D crystals, mechanical exfoliation has been essential to practical applications and fundamental research. However, mechanically exfoliated crystals on substrates contain relatively thick flakes that must be found and removed manually, limiting high-throughput manufacturing of atomic 2D crystals and van der Waals heterostructures. Here we present a deep learning-based method to segment and identify the thickness of atomic layer flakes from optical microscopy images. Through carefully designing a neural network based on U-Net, we found that our neural network based on U-net trained only with the data based on 24 images successfully distinguish monolayer and bilayer MoS2 with a success rate of 70%, which is a practical value in the first screening process for choosing monolayer and bilayer flakes of MoS2 of all flakes on substrates without human eye. The remarkable results highlight the possibility that a large fraction of manual laboratory work can be replaced by AI-based systems, boosting productivity.
△ Less
Submitted 7 July, 2019;
originally announced July 2019.
-
Time decay estimate with diffusion wave property and smoothing effect for solutions to the compressible Navier-Stokes-Korteweg system
Authors:
Takayuki KOBAYASHI,
Kazuyuki TSUDA
Abstract:
Time decay estimate of solutions to the compressible Navier-Stokes-Korteweg system is studied. Concerning the linearized problem, the decay estimate with diffusion wave property for an initial data is derived. As an application, the time decay estimate of solutions to the nonlinear problem is given. In contrast to the compressible Navier-Stokes system, for linear system regularities of the initial…
▽ More
Time decay estimate of solutions to the compressible Navier-Stokes-Korteweg system is studied. Concerning the linearized problem, the decay estimate with diffusion wave property for an initial data is derived. As an application, the time decay estimate of solutions to the nonlinear problem is given. In contrast to the compressible Navier-Stokes system, for linear system regularities of the initial data are lower and independent of the order of derivative of solutions owing to smoothing effect from the Korteweg tensor. Furthermore, for the nonlinear system diffusion wave property is obtained with an initial data having lower regularity than that of study of the compressible Navier-Stokes system.
△ Less
Submitted 31 May, 2019;
originally announced May 2019.
-
Global existence and time decay estimate of solutions to the compressible Navier-Stokes-Korteweg system under critical condition
Authors:
Kobayashi Takayuki,
Kazuyuki Tsuda
Abstract:
Global existence of solutions to the compressible Navier-Stokes-Korteweg system around a constant state is studied. This system describes liquid-vapor two phase flow with phase transition as diffuse interface model. In previous works they assume that the pressure is a monotone function for change of density similarly to the usual compressible Navier-Stokes system. On the other hand, due to phase t…
▽ More
Global existence of solutions to the compressible Navier-Stokes-Korteweg system around a constant state is studied. This system describes liquid-vapor two phase flow with phase transition as diffuse interface model. In previous works they assume that the pressure is a monotone function for change of density similarly to the usual compressible Navier-Stokes system. On the other hand, due to phase transition the pressure is accurately non-monotone function and the linearized system loses symmetry in a critical case such that the derivative of pressure is 0 at the given constant state. It is shown that in the critical case for small data whose momentum has derivative form there exist global $L^2$ solutions and the parabolic type decay rate of the solutions is obtained. The proof is based on decomposition method for solutions to a low frequency part and a high frequency part.
△ Less
Submitted 9 May, 2019;
originally announced May 2019.
-
Expanding the horizon of automated metamaterials discovery via quantum annealing
Authors:
Koki Kitai,
Jiang Guo,
Shenghong Ju,
Shu Tanaka,
Koji Tsuda,
Junichiro Shiomi,
Ryo Tamura
Abstract:
Complexity of materials designed by machine learning is currently limited by the inefficiency of classical computers. We show how quantum annealing can be incorporated into automated materials discovery and conduct a proof-of-principle study on designing complex thermofunctional metamaterials consisting of SiO2, SiC, and Poly(methyl methacrylate). Empirical computing time of our quantum-classical…
▽ More
Complexity of materials designed by machine learning is currently limited by the inefficiency of classical computers. We show how quantum annealing can be incorporated into automated materials discovery and conduct a proof-of-principle study on designing complex thermofunctional metamaterials consisting of SiO2, SiC, and Poly(methyl methacrylate). Empirical computing time of our quantum-classical hybrid algorithm involving a factorization machine, a rigorous coupled wave analysis, and a D-Wave 2000Q quantum annealer was insensitive to the problem size, while a classical counterpart experienced rapid increase. Our method was used to design complex structures of wavelength selective radiators showing much better concordance with the thermal atmospheric transparency window in comparison to existing human-designed alternatives. Our result shows that quantum annealing provides scientists gigantic computational power that may change how materials are designed.
△ Less
Submitted 18 February, 2019;
originally announced February 2019.
-
Efficient Construction Method for Phase Diagrams Using Uncertainty Sampling
Authors:
Kei Terayama,
Ryo Tamura,
Yoshitaro Nose,
Hidenori Hiramatsu,
Hideo Hosono,
Yasushi Okuno,
Koji Tsuda
Abstract:
We develop a method to efficiently construct phase diagrams using machine learning. Uncertainty sampling (US) in active learning is utilized to intensively sample around phase boundaries. Here, we demonstrate constructions of three known experimental phase diagrams by the US approach. Compared with random sampling, the US approach decreases the number of sampling points to about 20%. In particular…
▽ More
We develop a method to efficiently construct phase diagrams using machine learning. Uncertainty sampling (US) in active learning is utilized to intensively sample around phase boundaries. Here, we demonstrate constructions of three known experimental phase diagrams by the US approach. Compared with random sampling, the US approach decreases the number of sampling points to about 20%. In particular, the reduction rate is pronounced in more complicated phase diagrams. Furthermore, we show that using the US approach, undetected new phase can be rapidly found, and smaller number of initial sampling points are sufficient. Thus, we conclude that the US approach is useful to construct complicated phase diagrams from scratch and will be an essential tool in materials science.
△ Less
Submitted 5 December, 2018;
originally announced December 2018.
-
Transductive Boltzmann Machines
Authors:
Mahito Sugiyama,
Koji Tsuda,
Hiroyuki Nakahara
Abstract:
We present transductive Boltzmann machines (TBMs), which firstly achieve transductive learning of the Gibbs distribution. While exact learning of the Gibbs distribution is impossible by the family of existing Boltzmann machines due to combinatorial explosion of the sample space, TBMs overcome the problem by adaptively constructing the minimum required sample space from data to avoid unnecessary ge…
▽ More
We present transductive Boltzmann machines (TBMs), which firstly achieve transductive learning of the Gibbs distribution. While exact learning of the Gibbs distribution is impossible by the family of existing Boltzmann machines due to combinatorial explosion of the sample space, TBMs overcome the problem by adaptively constructing the minimum required sample space from data to avoid unnecessary generalization. We theoretically provide bias-variance decomposition of the KL divergence in TBMs to analyze its learnability, and empirically demonstrate that TBMs are superior to the fully visible Boltzmann machines and popularly used restricted Boltzmann machines in terms of efficiency and effectiveness.
△ Less
Submitted 21 May, 2018;
originally announced May 2018.
-
Population-based de novo molecule generation, using grammatical evolution
Authors:
Naruki Yoshikawa,
Kei Terayama,
Teruki Honma,
Kenta Oono,
Koji Tsuda
Abstract:
Automatic design with machine learning and molecular simulations has shown a remarkable ability to generate new and promising drug candidates. Current models, however, still have problems in simulation concurrency and molecular diversity. Most methods generate one molecule at a time and do not allow multiple simulators to run simultaneously. Additionally, better molecular diversity could boost the…
▽ More
Automatic design with machine learning and molecular simulations has shown a remarkable ability to generate new and promising drug candidates. Current models, however, still have problems in simulation concurrency and molecular diversity. Most methods generate one molecule at a time and do not allow multiple simulators to run simultaneously. Additionally, better molecular diversity could boost the success rate in the subsequent drug discovery process. We propose a new population-based approach using grammatical evolution named ChemGE. In our method, a large population of molecules are updated concurrently and evaluated by multiple simulators in parallel. In docking experiments with thymidine kinase, ChemGE succeeded in generating hundreds of high-affinity molecules whose diversity is better than that of known inding molecules in DUD-E.
△ Less
Submitted 6 April, 2018;
originally announced April 2018.
-
Legendre Decomposition for Tensors
Authors:
Mahito Sugiyama,
Hiroyuki Nakahara,
Koji Tsuda
Abstract:
We present a novel nonnegative tensor decomposition method, called Legendre decomposition, which factorizes an input tensor into a multiplicative combination of parameters. Thanks to the well-developed theory of information geometry, the reconstructed tensor is unique and always minimizes the KL divergence from an input tensor. We empirically show that Legendre decomposition can more accurately re…
▽ More
We present a novel nonnegative tensor decomposition method, called Legendre decomposition, which factorizes an input tensor into a multiplicative combination of parameters. Thanks to the well-developed theory of information geometry, the reconstructed tensor is unique and always minimizes the KL divergence from an input tensor. We empirically show that Legendre decomposition can more accurately reconstruct tensors than other nonnegative tensor decomposition methods.
△ Less
Submitted 29 October, 2018; v1 submitted 13 February, 2018;
originally announced February 2018.
-
ChemTS: An Efficient Python Library for de novo Molecular Generation
Authors:
Xiufeng Yang,
Jinzhe Zhang,
Kazuki Yoshizoe,
Kei Terayama,
Koji Tsuda
Abstract:
Automatic design of organic materials requires black-box optimization in a vast chemical space. In conventional molecular design algorithms, a molecule is built as a combination of predetermined fragments. Recently, deep neural network models such as variational auto encoders (VAEs) and recurrent neural networks (RNNs) are shown to be effective in de novo design of molecules without any predetermi…
▽ More
Automatic design of organic materials requires black-box optimization in a vast chemical space. In conventional molecular design algorithms, a molecule is built as a combination of predetermined fragments. Recently, deep neural network models such as variational auto encoders (VAEs) and recurrent neural networks (RNNs) are shown to be effective in de novo design of molecules without any predetermined fragments. This paper presents a novel python library ChemTS that explores the chemical space by combining Monte Carlo tree search (MCTS) and an RNN. In a benchmarking problem of optimizing the octanol-water partition coefficient and synthesizability, our algorithm showed superior efficiency in finding high-scoring molecules. ChemTS is available at https://github.com/tsudalab/ChemTS.
△ Less
Submitted 29 September, 2017;
originally announced October 2017.
-
Machine learning reveals orbital interaction in crystalline materials
Authors:
Tien Lam Pham,
Hiori Kino,
Kiyoyuki Terakura,
Takashi Miyake,
Ichigaku Takigawa,
Koji Tsuda,
Hieu Chi Dam
Abstract:
We propose a novel representation of crystalline materials named orbital-field matrix (OFM) based on the distribution of valence shell electrons. We demonstrate that this new representation can be highly useful in mining material data. Our experiment shows that the formation energies of crystalline materials, the atomization energies of molecular materials, and the local magnetic moments of the co…
▽ More
We propose a novel representation of crystalline materials named orbital-field matrix (OFM) based on the distribution of valence shell electrons. We demonstrate that this new representation can be highly useful in mining material data. Our experiment shows that the formation energies of crystalline materials, the atomization energies of molecular materials, and the local magnetic moments of the constituent atoms in transition metal--rare-earth metal bimetal alloys can be predicted with high accuracy using the OFM. Knowledge regarding the role of coordination numbers of transition-metal and rare-earth metal elements in determining the local magnetic moment of transition metal sites can be acquired directly from decision tree regression analyses using the OFM.
△ Less
Submitted 3 May, 2017; v1 submitted 2 May, 2017;
originally announced May 2017.
-
Tensor Balancing on Statistical Manifold
Authors:
Mahito Sugiyama,
Hiroyuki Nakahara,
Koji Tsuda
Abstract:
We solve tensor balancing, rescaling an Nth order nonnegative tensor by multiplying N tensors of order N - 1 so that every fiber sums to one. This generalizes a fundamental process of matrix balancing used to compare matrices in a wide range of applications from biology to economics. We present an efficient balancing algorithm with quadratic convergence using Newton's method and show in numerical…
▽ More
We solve tensor balancing, rescaling an Nth order nonnegative tensor by multiplying N tensors of order N - 1 so that every fiber sums to one. This generalizes a fundamental process of matrix balancing used to compare matrices in a wide range of applications from biology to economics. We present an efficient balancing algorithm with quadratic convergence using Newton's method and show in numerical experiments that the proposed algorithm is several orders of magnitude faster than existing ones. To theoretically prove the correctness of the algorithm, we model tensors as probability distributions in a statistical manifold and realize tensor balancing as projection onto a submanifold. The key to our algorithm is that the gradient of the manifold, used as a Jacobian matrix in Newton's method, can be analytically obtained using the Moebius inversion formula, the essential of combinatorial mathematics. Our model is not limited to tensor balancing, but has a wide applicability as it includes various statistical and machine learning models such as weighted DAGs and Boltzmann machines.
△ Less
Submitted 29 October, 2018; v1 submitted 26 February, 2017;
originally announced February 2017.
-
Designing nanostructures for interfacial phonon transport via Bayesian optimization
Authors:
Shenghong Ju,
Takuma Shiga,
Lei Feng,
Zhufeng Hou,
Koji Tsuda,
Junichiro Shiomi
Abstract:
We demonstrate optimization of thermal conductance across nanostructures by developing a method combining atomistic Green's function and Bayesian optimization. With an aim to minimize and maximize the interfacial thermal conductance (ITC) across Si-Si and Si-Ge interfaces by means of Si/Ge composite interfacial structure, the method identifies the optimal structures from calculations of only a few…
▽ More
We demonstrate optimization of thermal conductance across nanostructures by developing a method combining atomistic Green's function and Bayesian optimization. With an aim to minimize and maximize the interfacial thermal conductance (ITC) across Si-Si and Si-Ge interfaces by means of Si/Ge composite interfacial structure, the method identifies the optimal structures from calculations of only a few percent of the entire candidates (over 60,000 structures). The obtained optimal interfacial structures are non-intuitive and impacting: the minimum-ITC structure is an aperiodic superlattice that realizes 50% reduction from the best periodic superlattice. The physical mechanism of the minimum ITC can be understood in terms of crossover of the two effects on phonon transport: as the layer thickness in superlattice increases, the impact of Fabry-Pérot interference increases, and the rate of reflection at the layer-interfaces decreases. Aperiodic superlattice with spatial variation in the layer thickness has a degree of freedom to realize optimal balance between the above two competing mechanism. Furthermore, aperiodicity breaks the constructive phonon interference between the interfaces inhibiting the coherent phonon transport. The present work shows the effectiveness and advantage of material informatics in designing nanostructures to control heat conduction, which can be extended to other interfacial structures.
△ Less
Submitted 16 September, 2016;
originally announced September 2016.
-
Selective Inference Approach for Statistically Sound Predictive Pattern Mining
Authors:
Shinya Suzumura,
Kazuya Nakagawa,
Mahito Sugiyama,
Koji Tsuda,
Ichiro Takeuchi
Abstract:
Discovering statistically significant patterns from databases is an important challenging problem. The main obstacle of this problem is in the difficulty of taking into account the selection bias, i.e., the bias arising from the fact that patterns are selected from extremely large number of candidates in databases. In this paper, we introduce a new approach for predictive pattern mining problems t…
▽ More
Discovering statistically significant patterns from databases is an important challenging problem. The main obstacle of this problem is in the difficulty of taking into account the selection bias, i.e., the bias arising from the fact that patterns are selected from extremely large number of candidates in databases. In this paper, we introduce a new approach for predictive pattern mining problems that can address the selection bias issue. Our approach is built on a recently popularized statistical inference framework called selective inference. In selective inference, statistical inferences (such as statistical hypothesis testing) are conducted based on sampling distributions conditional on a selection event. If the selection event is characterized in a tractable way, statistical inferences can be made without minding selection bias issue. However, in pattern mining problems, it is difficult to characterize the entire selection process of mining algorithms. Our main contribution in this paper is to solve this challenging problem for a class of predictive pattern mining problems by introducing a novel algorithmic framework. We demonstrate that our approach is useful for finding statistically significant patterns from databases.
△ Less
Submitted 9 March, 2016; v1 submitted 15 February, 2016;
originally announced February 2016.
-
Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining
Authors:
Kazuya Nakagawa,
Shinya Suzumura,
Masayuki Karasuyama,
Koji Tsuda,
Ichiro Takeuchi
Abstract:
In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern pruning (SPP) for a class of predictive pattern mining problems. The SPP method allows us to efficiently find a superset of all the predictive patterns in the da…
▽ More
In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern pruning (SPP) for a class of predictive pattern mining problems. The SPP method allows us to efficiently find a superset of all the predictive patterns in the database that are needed for the optimal predictive model. The advantage of the SPP method over existing boosting-type method is that the former can find the superset by a single search over the database, while the latter requires multiple searches. The SPP method is inspired by recent development of safe feature screening. In order to extend the idea of safe feature screening into predictive pattern mining, we derive a novel pruning rule called safe pattern pruning (SPP) rule that can be used for searching over the tree defined among patterns in the database. The SPP rule has a property that, if a node corresponding to a pattern in the database is pruned out by the SPP rule, then it is guaranteed that all the patterns corresponding to its descendant nodes are never needed for the optimal predictive model. We apply the SPP method to graph mining and item-set mining problems, and demonstrate its computational advantage.
△ Less
Submitted 14 February, 2016;
originally announced February 2016.
-
Information Decomposition on Structured Space
Authors:
Mahito Sugiyama,
Hiroyuki Nakahara,
Koji Tsuda
Abstract:
We build information geometry for a partially ordered set of variables and define the orthogonal decomposition of information theoretic quantities. The natural connection between information geometry and order theory leads to efficient decomposition algorithms. This generalization of Amari's seminal work on hierarchical decomposition of probability distributions on event combinations enables us to…
▽ More
We build information geometry for a partially ordered set of variables and define the orthogonal decomposition of information theoretic quantities. The natural connection between information geometry and order theory leads to efficient decomposition algorithms. This generalization of Amari's seminal work on hierarchical decomposition of probability distributions on event combinations enables us to analyze high-order statistical interactions arising in neuroscience, biology, and machine learning.
△ Less
Submitted 5 May, 2016; v1 submitted 21 January, 2016;
originally announced January 2016.
-
Redesigning pattern mining algorithms for supercomputers
Authors:
Kazuki Yoshizoe,
Aika Terada,
Koji Tsuda
Abstract:
Upcoming many core processors are expected to employ a distributed memory architecture similar to currently available supercomputers, but parallel pattern mining algorithms amenable to the architecture are not comprehensively studied. We present a novel closed pattern mining algorithm with a well-engineered communication protocol, and generalize it to find statistically significant patterns from p…
▽ More
Upcoming many core processors are expected to employ a distributed memory architecture similar to currently available supercomputers, but parallel pattern mining algorithms amenable to the architecture are not comprehensively studied. We present a novel closed pattern mining algorithm with a well-engineered communication protocol, and generalize it to find statistically significant patterns from personal genome data. For distributing communication evenly, it employs global load balancing with multiple stacks distributed on a set of cores organized as a hypercube with random edges. Our algorithm achieved up to 1175-fold speedup by using 1200 cores for solving a problem with 11,914 items and 697 transactions, while the naive approach of separating the search space failed completely.
△ Less
Submitted 27 October, 2015;
originally announced October 2015.
-
A ferroelectric-like structural transition in a metal
Authors:
Youguo Shi,
Yanfeng Guo,
Xia Wang,
Andrew J. Princep,
Dmitry Khalyavin,
Pascal Manuel,
Yuichi Michiue,
Akira Sato,
Kenji Tsuda,
Shan Yu,
Masao Arai,
Yuichi Shirako,
Masaki Akaogi,
Nanlin Wang,
Kazunari Yamaura,
Andrew T. Boothroyd
Abstract:
Metals cannot exhibit ferroelectricity because static internal electric fields are screened by conduction electrons, but in 1965, Anderson and Blount predicted the possibility of a ferroelectric metal, in which a ferroelectric-like structural transition occurs in the metallic state. Up to now, no clear example of such a material has been identified. Here we report on a centrosymmetric (R-3c) to no…
▽ More
Metals cannot exhibit ferroelectricity because static internal electric fields are screened by conduction electrons, but in 1965, Anderson and Blount predicted the possibility of a ferroelectric metal, in which a ferroelectric-like structural transition occurs in the metallic state. Up to now, no clear example of such a material has been identified. Here we report on a centrosymmetric (R-3c) to non-centrosymmetric (R3c) transition in metallic LiOsO3 that is structurally equivalent to the ferroelectric transition of LiNbO3. The transition involves a continuous shift in the mean position of Li+ ions on cooling below 140K. Its discovery realizes the scenario described by Anderson and Blount, and establishes a new class of materials whose properties may differ from those of normal metals.
△ Less
Submitted 6 September, 2015;
originally announced September 2015.
-
Safe Feature Pruning for Sparse High-Order Interaction Models
Authors:
Kazuya Nakagawa,
Shinya Suzumura,
Masayuki Karasuyama,
Koji Tsuda,
Ichiro Takeuchi
Abstract:
Taking into account high-order interactions among covariates is valuable in many practical regression problems. This is, however, computationally challenging task because the number of high-order interaction features to be considered would be extremely large unless the number of covariates is sufficiently small. In this paper, we propose a novel efficient algorithm for LASSO-based sparse learning…
▽ More
Taking into account high-order interactions among covariates is valuable in many practical regression problems. This is, however, computationally challenging task because the number of high-order interaction features to be considered would be extremely large unless the number of covariates is sufficiently small. In this paper, we propose a novel efficient algorithm for LASSO-based sparse learning of such high-order interaction models. Our basic strategy for reducing the number of features is to employ the idea of recently proposed safe feature screening (SFS) rule. An SFS rule has a property that, if a feature satisfies the rule, then the feature is guaranteed to be non-active in the LASSO solution, meaning that it can be safely screened-out prior to the LASSO training process. If a large number of features can be screened-out before training the LASSO, the computational cost and the memory requirment can be dramatically reduced. However, applying such an SFS rule to each of the extremely large number of high-order interaction features would be computationally infeasible. Our key idea for solving this computational issue is to exploit the underlying tree structure among high-order interaction features. Specifically, we introduce a pruning condition called safe feature pruning (SFP) rule which has a property that, if the rule is satisfied in a certain node of the tree, then all the high-order interaction features corresponding to its descendant nodes can be guaranteed to be non-active at the optimal solution. Our algorithm is extremely efficient, making it possible to work, e.g., with 3rd order interactions of 10,000 original covariates, where the number of possible high-order interaction features is greater than 10^{12}.
△ Less
Submitted 26 June, 2015;
originally announced June 2015.
-
An Efficient Post-Selection Inference on High-Order Interaction Models
Authors:
S. Suzumura,
K. Nakagawa,
K. Tsuda,
I. Takeuchi
Abstract:
Finding statistically significant high-order interaction features in predictive modeling is important but challenging task. The difficulty lies in the fact that, for a recent applications with high-dimensional covariates, the number of possible high-order interaction features would be extremely large. Identifying statistically significant features from such a huge pool of candidates would be highl…
▽ More
Finding statistically significant high-order interaction features in predictive modeling is important but challenging task. The difficulty lies in the fact that, for a recent applications with high-dimensional covariates, the number of possible high-order interaction features would be extremely large. Identifying statistically significant features from such a huge pool of candidates would be highly challenging both in computational and statistical senses. To work with this problem, we consider a two stage algorithm where we first select a set of high-order interaction features by marginal screening, and then make statistical inferences on the regression model fitted only with the selected features. Such statistical inferences are called post-selection inference (PSI), and receiving an increasing attention in the literature. One of the seminal recent advancements in PSI literature is the works by Lee et al. where the authors presented an algorithmic framework for computing exact sampling distributions in PSI. A main challenge when applying their approach to our high-order interaction models is to cope with the fact that PSI in general depends not only on the selected features but also on the unselected features, making it hard to apply to our extremely high-dimensional high-order interaction models. The goal of this paper is to overcome this difficulty by introducing a novel efficient method for PSI. Our key idea is to exploit the underlying tree structure among high-order interaction features, and to develop a pruning method of the tree which enables us to quickly identify a group of unselected features that are guaranteed to have no influence on PSI. The experimental results indicate that the proposed method allows us to reliably identify statistically significant high-order interaction features with reasonable computational cost.
△ Less
Submitted 26 June, 2015;
originally announced June 2015.
-
Discovery of low thermal conductivity compounds with first-principles anharmonic lattice dynamics calculations and Bayesian optimization
Authors:
Atsuto Seko,
Atsushi Togo,
Hiroyuki Hayashi,
Koji Tsuda,
Laurent Chaput,
Isao Tanaka
Abstract:
Compounds of low lattice thermal conductivity (LTC) are essential for seeking thermoelectric materials with high conversion efficiency. Some strategies have been used to decrease LTC. However, such trials have yielded successes only within a limited exploration space. Here we report the virtual screening of a library containing 54,779 compounds. Our strategy is to search the library through Bayesi…
▽ More
Compounds of low lattice thermal conductivity (LTC) are essential for seeking thermoelectric materials with high conversion efficiency. Some strategies have been used to decrease LTC. However, such trials have yielded successes only within a limited exploration space. Here we report the virtual screening of a library containing 54,779 compounds. Our strategy is to search the library through Bayesian optimization using for the initial data the LTC obtained from first-principles anharmonic lattice dynamics calculations for a set of 101 compounds. We discovered 221 materials with very low LTC. Two of them have even an electronic band gap < 1 eV, what makes them exceptional candidates for thermoelectric applications. In addition to those newly discovered thermoelectric materials, the present strategy is believed to be powerful for many other applications in which chemistry of materials are required to be optimized.
△ Less
Submitted 21 June, 2015;
originally announced June 2015.
-
Van der Waals epitaxial growth of topological insulator Bi$_{2-x}$Sb$_x$Te$_{3-y}$Se$_y$ ultrathin nanoplate on electrically insulating fluorophlogopite mica
Authors:
Ngoc Han Tu,
Yoichi Tanabe,
Khuong Kim Huynh,
Yohei Sato,
Hidetoshi Oguro,
Satoshi Heguri,
Kenji Tsuda,
Masami Terauchi,
Kazuo Watanabe,
Katsumi Tanigaki
Abstract:
We report the growth of high quality Bi$_{2-x}$Sb$_x$Te$_{3-y}$Se$_y$ ultrathin nanoplates (BSTS-NPs) on an electrically insulating fluorophlogopite mica substrate using a catalyst-free vapor solid method. Under an optimized pressure and suitable Ar gas flow rate, we control the thickness, the size and the composition of BSTS-NPs. Raman spectra showing systematic change indicate that the thickness…
▽ More
We report the growth of high quality Bi$_{2-x}$Sb$_x$Te$_{3-y}$Se$_y$ ultrathin nanoplates (BSTS-NPs) on an electrically insulating fluorophlogopite mica substrate using a catalyst-free vapor solid method. Under an optimized pressure and suitable Ar gas flow rate, we control the thickness, the size and the composition of BSTS-NPs. Raman spectra showing systematic change indicate that the thicknesses and compositions of BSTS-NPs are indeed accurately controlled. Electrical transport demonstrates that a robust Dirac cone carrier transport in BSTS-NPs. Since BSTS-NPs provide superior dominant surface transport of the tunable Dirac cone surface states with negligible contribution of the conduction of the bulk states, BSTS-NPs provide an ideal platform to explore intrinsic physical phenomena as well as technological applications of 3-dimensional topological insulators in the future.
△ Less
Submitted 29 July, 2014;
originally announced July 2014.
-
Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single and binary component solids
Authors:
Atsuto Seko,
Tomoya Maekawa,
Koji Tsuda,
Isao Tanaka
Abstract:
A combination of systematic density functional theory (DFT) calculations and machine learning techniques has a wide range of potential applications. This study presents an application of the combination of systematic DFT calculations and regression techniques to the prediction of the melting temperature for single and binary compounds. Here we adopt the ordinary least-squares regression (OLSR), pa…
▽ More
A combination of systematic density functional theory (DFT) calculations and machine learning techniques has a wide range of potential applications. This study presents an application of the combination of systematic DFT calculations and regression techniques to the prediction of the melting temperature for single and binary compounds. Here we adopt the ordinary least-squares regression (OLSR), partial least-squares regression (PLSR), support vector regression (SVR) and Gaussian process regression (GPR). Among the four kinds of regression techniques, the SVR provides the best prediction. In addition, the inclusion of physical properties computed by the DFT calculation to a set of predictor variables makes the prediction better. Finally, a simulation to find the highest melting temperature toward the efficient materials design using kriging is demonstrated. The kriging design finds the compound with the highest melting temperature much faster than random designs. This result may stimulate the application of kriging to efficient materials design for a broad range of applications.
△ Less
Submitted 6 February, 2014; v1 submitted 6 October, 2013;
originally announced October 2013.
-
LGM: Mining Frequent Subgraphs from Linear Graphs
Authors:
Yasuo Tabei,
Daisuke Okanohara,
Shuichi Hirose,
Koji Tsuda
Abstract:
A linear graph is a graph whose vertices are totally ordered. Biological and linguistic sequences with interactions among symbols are naturally represented as linear graphs. Examples include protein contact maps, RNA secondary structures and predicate-argument structures. Our algorithm, linear graph miner (LGM), leverages the vertex order for efficient enumeration of frequent subgraphs. Based on t…
▽ More
A linear graph is a graph whose vertices are totally ordered. Biological and linguistic sequences with interactions among symbols are naturally represented as linear graphs. Examples include protein contact maps, RNA secondary structures and predicate-argument structures. Our algorithm, linear graph miner (LGM), leverages the vertex order for efficient enumeration of frequent subgraphs. Based on the reverse search principle, the pattern space is systematically traversed without expensive duplication checking. Disconnected subgraph patterns are particularly important in linear graphs due to their sequential nature. Unlike conventional graph mining algorithms detecting connected patterns only, LGM can detect disconnected patterns as well. The utility and efficiency of LGM are demonstrated in experiments on protein contact maps.
△ Less
Submitted 5 March, 2011; v1 submitted 22 February, 2011;
originally announced February 2011.
-
A New Relation between Lamb Shift Energies
Authors:
Hiroaki Kubo,
Takehisa Fujita,
Naohiro Kanda,
Hiroshi Kato,
Yasunori Munakata,
Sachiko Oshima,
Kazuhiro Tsuda
Abstract:
We derive a new relation between the observed Lamb shift energies of hydrogen and muonium atoms. The relation is based on the non-relativistic description of the Lamb shift, and the proper treatment of the reduced mass of electron and target particles (proton and muon) leads to the new formula which is expressed as…
▽ More
We derive a new relation between the observed Lamb shift energies of hydrogen and muonium atoms. The relation is based on the non-relativistic description of the Lamb shift, and the proper treatment of the reduced mass of electron and target particles (proton and muon) leads to the new formula which is expressed as $\displaystyle{{ΔE^{(H)}_{2s_{1/2}}\over ΔE^{(μ)}_{2s_{1/2}}} =({1+{m_e\over m_μ}\over 1+{m_e\over M_p}})^3}$. This relation achieves an excellent agreement with experiment and presents an important QED test free from the cutoff momentum $Λ$.
△ Less
Submitted 26 March, 2010;
originally announced March 2010.
-
Efficient Construction of Neighborhood Graphs by the Multiple Sorting Method
Authors:
Takeaki Uno,
Masashi Sugiyama,
Koji Tsuda
Abstract:
Neighborhood graphs are gaining popularity as a concise data representation in machine learning. However, naive graph construction by pairwise distance calculation takes $O(n^2)$ runtime for $n$ data points and this is prohibitively slow for millions of data points. For strings of equal length, the multiple sorting method (Uno, 2008) can construct an $ε$-neighbor graph in $O(n+m)$ time, where…
▽ More
Neighborhood graphs are gaining popularity as a concise data representation in machine learning. However, naive graph construction by pairwise distance calculation takes $O(n^2)$ runtime for $n$ data points and this is prohibitively slow for millions of data points. For strings of equal length, the multiple sorting method (Uno, 2008) can construct an $ε$-neighbor graph in $O(n+m)$ time, where $m$ is the number of $ε$-neighbor pairs in the data. To introduce this remarkably efficient algorithm to continuous domains such as images, signals and texts, we employ a random projection method to convert vectors to strings. Theoretical results are presented to elucidate the trade-off between approximation quality and computation time. Empirical results show the efficiency of our method in comparison to fast nearest neighbor alternatives.
△ Less
Submitted 20 April, 2009;
originally announced April 2009.
-
New Renormalization Scheme of Vacuum Polarization in QED
Authors:
T. Fujita,
N. Kanda,
H. Kato,
H. Kubo,
Y. Munakata,
S. Oshima,
K. Tsuda
Abstract:
We examine the vacuum polarization contribution in the renormalization scheme of QED. Normally, the quadratic divergence term is discarded under the condition that the counter term of the Lagrangian density should be gauge invariant. Here, it is shown that the whole contribution of the photon self-energy should not be considered for the renormalization procedure. In fact, the finite contribution…
▽ More
We examine the vacuum polarization contribution in the renormalization scheme of QED. Normally, the quadratic divergence term is discarded under the condition that the counter term of the Lagrangian density should be gauge invariant. Here, it is shown that the whole contribution of the photon self-energy should not be considered for the renormalization procedure. In fact, the finite contribution of the renormalization in the vacuum polarization is shown to give rise to the hyperfine splitting energy which disagrees with the experimental observation in hydrogen atom. For the treatment of the vacuum polarization, we present a new renormalization scheme of the photon self-energy diagram.
△ Less
Submitted 22 January, 2009;
originally announced January 2009.
-
Anisotropic ground states of the quantum Hall system with currents
Authors:
Kazumi Tsuda
Abstract:
Anisotropic states at half-filled third and higher Landau levels are investigated in the system with a finite electric current. We study the response of the striped Hall state and the anisotropic charge density wave (ACDW) state against the injected current using the effective action. Current distributions and a current dependence of the total energy are determined for both states. With no injec…
▽ More
Anisotropic states at half-filled third and higher Landau levels are investigated in the system with a finite electric current. We study the response of the striped Hall state and the anisotropic charge density wave (ACDW) state against the injected current using the effective action. Current distributions and a current dependence of the total energy are determined for both states. With no injected current, the energy of the ACDW state is lower than that of the striped Hall state. We find that the energy of the ACDW state increases faster than that of the striped Hall state as the injected current increases. Hence, the striped Hall state becomes the lower energy state when the current exceeds the critical value. The critical value is estimated at about 0.04 - 0.05 nA which is much smaller than the current used in the experiments. Our calculations are performed using a block diagonalization technique on a von Neumann lattice. We review this technique in this thesis.
△ Less
Submitted 10 January, 2009;
originally announced January 2009.
-
Unphysical Gauge Fixing in Higgs Mechanism
Authors:
T. Fujita,
A. Kusaka,
K. Tsuda,
S. Oshima
Abstract:
The unitary gauge in the Higgs mechanism is to impose the condition of $φ=φ^\dagger $ on the Higgs fields. However, this is not the gauge fixing but simply a procedure for producing the massive vector boson fields by hand. The Lagrangian density of the weak interactions should be reconsidered by starting from the massive vector boson fields which couple to the fermion currents as the initial ing…
▽ More
The unitary gauge in the Higgs mechanism is to impose the condition of $φ=φ^\dagger $ on the Higgs fields. However, this is not the gauge fixing but simply a procedure for producing the massive vector boson fields by hand. The Lagrangian density of the weak interactions should be reconsidered by starting from the massive vector boson fields which couple to the fermion currents as the initial ingredients.
△ Less
Submitted 18 June, 2008;
originally announced June 2008.
-
Current induced transition of anisotropic quantum Hall states
Authors:
Kazumi Tsuda,
Nobuki Maeda,
Kenzo Ishikawa
Abstract:
We compare the energies of the striped Hall state and the anisotropic charge density wave (ACDW) state at half-filled third and higher Landau levels in the system with injected currents. With no injected current, the ACDW state has a lower energy. We find that the striped Hall state becomes the lower energy state when the injected current exceeds a critical value. The critical value is estimated…
▽ More
We compare the energies of the striped Hall state and the anisotropic charge density wave (ACDW) state at half-filled third and higher Landau levels in the system with injected currents. With no injected current, the ACDW state has a lower energy. We find that the striped Hall state becomes the lower energy state when the injected current exceeds a critical value. The critical value is estimated as about 0.04-0.05 nA.
△ Less
Submitted 23 February, 2007;
originally announced February 2007.
-
Anisotropic ground states of the quantum Hall system with currents
Authors:
Kazumi Tsuda,
Nobuki Maeda,
Kenzo Ishikawa
Abstract:
Anisotropic states at half-filled higher Landau levels are investigated in the system with a finite electric current. We study the response of the striped Hall state and the anisotropic charge density wave (ACDW) state against the injected current using the effective action. Current distributions and a current dependence of the total energy are determined for both states. With no injected curren…
▽ More
Anisotropic states at half-filled higher Landau levels are investigated in the system with a finite electric current. We study the response of the striped Hall state and the anisotropic charge density wave (ACDW) state against the injected current using the effective action. Current distributions and a current dependence of the total energy are determined for both states. With no injected current, the energy of the ACDW state is lower than that of the striped Hall state. We find that the energy of the ACDW state increases faster than that of the striped Hall state as the injected current increases. Hence, the striped Hall state becomes the lower energy state when the current exceeds the critical value. The critical value is estimated at about 0.04-0.07 nA, which is much smaller than the current used in the experiments.
△ Less
Submitted 26 July, 2007; v1 submitted 14 February, 2007;
originally announced February 2007.
-
Approximating Incomplete Kernel Matrices by the em Algorithm
Authors:
Koji Tsuda,
Shotaro Akaho,
Kiyoshi Asai
Abstract:
In biological data, it is often the case that observed data are available only for a subset of samples. When a kernel matrix is derived from such data, we have to leave the entries for unavailable samples as missing. In this paper, we make use of a parametric model of kernel matrices, and estimate missing entries by fitting the model to existing entries. The parametric model is created as a set…
▽ More
In biological data, it is often the case that observed data are available only for a subset of samples. When a kernel matrix is derived from such data, we have to leave the entries for unavailable samples as missing. In this paper, we make use of a parametric model of kernel matrices, and estimate missing entries by fitting the model to existing entries. The parametric model is created as a set of spectral variants of a complete kernel matrix derived from another information source. For model fitting, we adopt the em algorithm based on the information geometry of positive definite matrices. We will report promising results on bacteria clustering experiments using two marker sequences: 16S and gyrB.
△ Less
Submitted 7 November, 2002;
originally announced November 2002.