-
The nucleosynthetic fingerprint of the outermost protoplanetary disk and early Solar System dynamics
Authors:
Elishevah van Kooten,
Xuchao Zhao,
Ian Franchi,
Po-Yen Tung,
Simon Fairclough,
John Walmsley,
Isaac Onyett,
Martin Schiller,
Martin Bizzarro
Abstract:
Knowledge of the nucleosynthetic isotope composition of the outermost protoplanetary disk is critical to understand the formation and early dynamical evolution of the Solar System. We report the discovery of outer disk material preserved in a pristine meteorite based on its chemical composition, organic-rich petrology, and 15N-rich, deuterium-rich, and 16O-poor isotope signatures. We infer that th…
▽ More
Knowledge of the nucleosynthetic isotope composition of the outermost protoplanetary disk is critical to understand the formation and early dynamical evolution of the Solar System. We report the discovery of outer disk material preserved in a pristine meteorite based on its chemical composition, organic-rich petrology, and 15N-rich, deuterium-rich, and 16O-poor isotope signatures. We infer that this outer disk material originated in the comet-forming region. The nucleosynthetic Fe, Mg, Si and Cr compositions of this material reveal that, contrary to current belief, the isotope signature of the comet-forming region is ubiquitous amongst outer Solar System bodies, possibly reflecting an important planetary building block in the outer Solar System. This nucleosynthetic component represents fresh material added to the outer disk by late accretion streamers connected to the ambient molecular cloud. Our results show that most Solar System carbonaceous asteroids accreted material from the comet-forming region, a signature lacking in the terrestrial planet region.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Doubly relaxed forward-Douglas--Rachford splitting for the sum of two nonconvex and a DC function
Authors:
Minh N. Dao,
Tan Nhat Pham,
Phan Thanh Tung
Abstract:
In this paper, we consider a class of structured nonconvex nonsmooth optimization problems whose objective function is the sum of three nonconvex functions, one of which is expressed in a difference-of-convex (DC) form. This problem class covers several important structures in the literature including the sum of three functions and the general DC program. We propose a splitting algorithm and prove…
▽ More
In this paper, we consider a class of structured nonconvex nonsmooth optimization problems whose objective function is the sum of three nonconvex functions, one of which is expressed in a difference-of-convex (DC) form. This problem class covers several important structures in the literature including the sum of three functions and the general DC program. We propose a splitting algorithm and prove the subsequential convergence to a stationary point of the problem. The full sequential convergence, along with convergence rates for both the iterates and objective function values, is then established without requiring differentiability of the concave part. Our analysis not only extends but also unifies and improves recent convergence analyses in nonconvex settings. We benchmark our proposed algorithm with notable algorithms in the literature to show its competitiveness on both synthetic data and real power system load data.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Derivative-free tree optimization for complex systems
Authors:
Ye Wei,
Bo Peng,
Ruiwen Xie,
Yangtao Chen,
Yu Qin,
Peng Wen,
Stefan Bauer,
Po-Yen Tung
Abstract:
A tremendous range of design tasks in materials, physics, and biology can be formulated as finding the optimum of an objective function depending on many parameters without knowing its closed-form expression or the derivative. Traditional derivative-free optimization techniques often rely on strong assumptions about objective functions, thereby failing at optimizing non-convex systems beyond 100 d…
▽ More
A tremendous range of design tasks in materials, physics, and biology can be formulated as finding the optimum of an objective function depending on many parameters without knowing its closed-form expression or the derivative. Traditional derivative-free optimization techniques often rely on strong assumptions about objective functions, thereby failing at optimizing non-convex systems beyond 100 dimensions. Here, we present a tree search method for derivative-free optimization that enables accelerated optimal design of high-dimensional complex systems. Specifically, we introduce stochastic tree expansion, dynamic upper confidence bound, and short-range backpropagation mechanism to evade local optimum, iteratively approximating the global optimum using machine learning models. This development effectively confronts the dimensionally challenging problems, achieving convergence to global optima across various benchmark functions up to 2,000 dimensions, surpassing the existing methods by 10- to 20-fold. Our method demonstrates wide applicability to a wide range of real-world complex systems spanning materials, physics, and biology, considerably outperforming state-of-the-art algorithms. This enables efficient autonomous knowledge discovery and facilitates self-driving virtual laboratories. Although we focus on problems within the realm of natural science, the advancements in optimization techniques achieved herein are applicable to a broader spectrum of challenges across all quantitative disciplines.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
SemiMemes: A Semi-supervised Learning Approach for Multimodal Memes Analysis
Authors:
Pham Thai Hoang Tung,
Nguyen Tan Viet,
Ngo Tien Anh,
Phan Duy Hung
Abstract:
The prevalence of memes on social media has created the need to sentiment analyze their underlying meanings for censoring harmful content. Meme censoring systems by machine learning raise the need for a semi-supervised learning solution to take advantage of the large number of unlabeled memes available on the internet and make the annotation process less challenging. Moreover, the approach needs t…
▽ More
The prevalence of memes on social media has created the need to sentiment analyze their underlying meanings for censoring harmful content. Meme censoring systems by machine learning raise the need for a semi-supervised learning solution to take advantage of the large number of unlabeled memes available on the internet and make the annotation process less challenging. Moreover, the approach needs to utilize multimodal data as memes' meanings usually come from both images and texts. This research proposes a multimodal semi-supervised learning approach that outperforms other multimodal semi-supervised learning and supervised learning state-of-the-art models on two datasets, the Multimedia Automatic Misogyny Identification and Hateful Memes dataset. Building on the insights gained from Contrastive Language-Image Pre-training, which is an effective multimodal learning technique, this research introduces SemiMemes, a novel training method that combines auto-encoder and classification task to make use of the resourceful unlabeled data.
△ Less
Submitted 16 May, 2023; v1 submitted 31 March, 2023;
originally announced April 2023.
-
Deep-XFCT: Deep learning 3D-mineral liberation analysis with micro X-ray fluorescence and computed tomography
Authors:
Patrick Kin Man Tung,
Amalia Yunita Halim,
Huixin Wang,
Anne Rich,
Christopher Marjo,
Klaus Regenauer-Lieb
Abstract:
The rapid development of X-ray micro-computed tomography (micro-CT) opens new opportunities for 3D analysis of particle and grain-size characterisation, determination of particle densities and shape factors, estimation of mineral associations and liberation and locking. Current practices in mineral liberation analysis are based on 2D representations leading to systematic errors in the extrapolatio…
▽ More
The rapid development of X-ray micro-computed tomography (micro-CT) opens new opportunities for 3D analysis of particle and grain-size characterisation, determination of particle densities and shape factors, estimation of mineral associations and liberation and locking. Current practices in mineral liberation analysis are based on 2D representations leading to systematic errors in the extrapolation to volumetric properties. New quantitative methods based on tomographic data are therefore urgently required for characterisation of mineral deposits, mineral processing, characterisation of tailings, rock typing, stratigraphic refinement, reservoir characterisation for applications in the resource industry, environmental and material sciences. To date, no simple non-destructive method exists for 3D mineral liberation analysis. We present a new development based on combining micro-CT with micro-X-ray fluorescence (micro-XRF) using deep learning. We demonstrate successful semi-automated multi-modal analysis of a crystalline magmatic rock where the new technique overcomes the difficult task of differentiating feldspar from quartz in micro-CT data set. The approach is universal and can be extended to any multi-modal and multi-instrument analysis for further refinement. We conclude that the combination of micro-CT and micro-XRF already provides a new opportunity for robust 3D mineral liberation analysis in both field and laboratory applications.
△ Less
Submitted 25 May, 2022;
originally announced May 2022.
-
Machine learning-enabled high-entropy alloy discovery
Authors:
Ziyuan Rao,
PoYen Tung,
Ruiwen Xie,
Ye Wei,
Hongbin Zhang,
Alberto Ferrari,
T. P. C. Klaver,
Fritz Körmann,
Prithiv Thoudden Sukumar,
Alisson Kwiatkowski da Silva,
Yao Chen,
Zhiming Li,
Dirk Ponge,
Jörg Neugebauer,
Oliver Gutfleisch,
Stefan Bauer,
Dierk Raabe
Abstract:
High-entropy alloys are solid solutions of multiple principal elements, capable of reaching composition and feature regimes inaccessible for dilute materials. Discovering those with valuable properties, however, relies on serendipity, as thermodynamic alloy design rules alone often fail in high-dimensional composition spaces. Here, we propose an active-learning strategy to accelerate the design of…
▽ More
High-entropy alloys are solid solutions of multiple principal elements, capable of reaching composition and feature regimes inaccessible for dilute materials. Discovering those with valuable properties, however, relies on serendipity, as thermodynamic alloy design rules alone often fail in high-dimensional composition spaces. Here, we propose an active-learning strategy to accelerate the design of novel high-entropy Invar alloys in a practically infinite compositional space, based on very sparse data. Our approach works as a closed-loop, integrating machine learning with density-functional theory, thermodynamic calculations, and experiments. After processing and characterizing 17 new alloys (out of millions of possible compositions), we identified 2 high-entropy Invar alloys with extremely low thermal expansion coefficients around 2*10-6 K-1 at 300 K. Our study thus opens a new pathway for the fast and automated discovery of high-entropy alloys with optimal thermal, magnetic and electrical properties.
△ Less
Submitted 28 February, 2022;
originally announced February 2022.
-
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling
Authors:
Shangeth Rajaa,
Pham Van Tung,
Chng Eng Siong
Abstract:
Speaker profiling, which aims to estimate speaker characteristics such as age and height, has a wide range of applications inforensics, recommendation systems, etc. In this work, we propose a semisupervised learning approach to mitigate the issue of low training data for speaker profiling. This is done by utilizing external corpus with speaker information to train a better representation which can…
▽ More
Speaker profiling, which aims to estimate speaker characteristics such as age and height, has a wide range of applications inforensics, recommendation systems, etc. In this work, we propose a semisupervised learning approach to mitigate the issue of low training data for speaker profiling. This is done by utilizing external corpus with speaker information to train a better representation which can help to improve the speaker profiling systems. Specifically, besides the standard supervised learning path, the proposed framework has two more paths: (1) an unsupervised speaker representation learning path that helps to capture the speaker information; (2) a consistency training path that helps to improve the robustness of the system by enforcing it to produce similar predictions for utterances of the same speaker.The proposed approach is evaluated on the TIMIT and NISP datasets for age, height, and gender estimation, while the Librispeech is used as the unsupervised external corpus. Trained both on single-task and multi-task settings, our approach was able to achieve state-of-the-art results on age estimation on the TIMIT Test dataset with Root Mean Square Error(RMSE) of6.8 and 7.4 years and Mean Absolute Error(MAE) of 4.8 and5.0 years for male and female speakers respectively.
△ Less
Submitted 24 October, 2021;
originally announced October 2021.
-
Weak convergence of delay SDEs with applications to Carathéodory approximation
Authors:
T. C. Son,
N. T. Dung,
N. V. Tan,
T. M. Cuong,
H. T. P. Thao,
P. D. Tung
Abstract:
In this paper, we consider a fundamental class of stochastic differential equations with time delays. Our aim is to investigate the weak convergence with respect to delay parameter of the solutions. Based on the techniques of Malliavin calculus, we obtain an explicit estimate for the rate of convergence. An application to the Carathéodory approximation scheme of stochastic differential equations i…
▽ More
In this paper, we consider a fundamental class of stochastic differential equations with time delays. Our aim is to investigate the weak convergence with respect to delay parameter of the solutions. Based on the techniques of Malliavin calculus, we obtain an explicit estimate for the rate of convergence. An application to the Carathéodory approximation scheme of stochastic differential equations is provided as well.
△ Less
Submitted 4 September, 2021;
originally announced September 2021.
-
E2E-based Multi-task Learning Approach to Joint Speech and Accent Recognition
Authors:
Jicheng Zhang,
Yizhou Peng,
Pham Van Tung,
Haihua Xu,
Hao Huang,
Eng Siong Chng
Abstract:
In this paper, we propose a single multi-task learning framework to perform End-to-End (E2E) speech recognition (ASR) and accent recognition (AR) simultaneously. The proposed framework is not only more compact but can also yield comparable or even better results than standalone systems. Specifically, we found that the overall performance is predominantly determined by the ASR task, and the E2E-bas…
▽ More
In this paper, we propose a single multi-task learning framework to perform End-to-End (E2E) speech recognition (ASR) and accent recognition (AR) simultaneously. The proposed framework is not only more compact but can also yield comparable or even better results than standalone systems. Specifically, we found that the overall performance is predominantly determined by the ASR task, and the E2E-based ASR pretraining is essential to achieve improved performance, particularly for the AR task. Additionally, we conduct several analyses of the proposed method. First, though the objective loss for the AR task is much smaller compared with its counterpart of ASR task, a smaller weighting factor with the AR task in the joint objective function is necessary to yield better results for each task. Second, we found that sharing only a few layers of the encoder yields better AR results than sharing the overall encoder. Experimentally, the proposed method produces WER results close to the best standalone E2E ASR ones, while it achieves 7.7% and 4.2% relative improvement over standalone and single-task-based joint recognition methods on test set for accent recognition respectively.
△ Less
Submitted 15 June, 2021;
originally announced June 2021.
-
A multimodal operando neutron study of the phase evolution in a graphite electrode
Authors:
Monica-Elisabeta Lăcătuşu,
Luise Theil Kuhn,
Rune E. Johnsen,
Patrick K. M. Tung,
Søren Schmidt,
Takenao Shinohara,
Ryoji Kiyanagi,
Anton S. Tremsin,
Nancy Elewa,
Robin Woracek,
Markus Strobl
Abstract:
Obtaining a complete picture of local processes still poses a significant challenge in battery research. Here we demonstrate an in-situ combination of multimodal neutron imaging with neutron diffraction for spatially resolved operando observations of the lithiation-delithiation of a graphite electrode in a Li-ion battery cell. Throughout the lithiation-delithiation process we image the Li distribu…
▽ More
Obtaining a complete picture of local processes still poses a significant challenge in battery research. Here we demonstrate an in-situ combination of multimodal neutron imaging with neutron diffraction for spatially resolved operando observations of the lithiation-delithiation of a graphite electrode in a Li-ion battery cell. Throughout the lithiation-delithiation process we image the Li distribution based on the local beam attenuation. Simultaneously, we observe the development of the lithiated graphite phases as a function of cycling time and electrode thickness and integral throughout its volume by diffraction contrast imaging and diffraction, respectively. While the conventional imaging data allows to observe the Li uptake in graphite already during the formation of the solid electrolyte interphase, diffraction indicates the onset and development of the Li insertion/extraction globally, which supports the local structural transformation observations by diffraction contrast imaging.
△ Less
Submitted 8 April, 2021;
originally announced April 2021.