Large Interferometer For Exoplanets (LIFE). XIV. Finding terrestrial protoplanets in the galactic neighborhood

Authors: Lorenzo Cesario, Tim Lichtenberg, Eleonora Alei, Óscar Carrión-González, Felix A. Dannert, Denis Defrère, Steve Ertel, Andrea Fortier, A. García Muñoz, Adrian M. Glauser, Jonah T. Hansen, Ravit Helled, Philipp A. Huber, Michael J. Ireland, Jens Kammerer, Romain Laugier, Jorge Lillo-Box, Franziska Menti, Michael R. Meyer, Lena Noack, Sascha P. Quanz, Andreas Quirrenbach, Sarah Rugheimer, Floris van der Tak, Haiyang S. Wang, et al. (40 additional authors not shown)

Abstract: The increased brightness temperature of young rocky protoplanets during their magma ocean epoch makes them potentially amenable to atmospheric characterization to distances from the solar system far greater than thermally equilibrated terrestrial exoplanets, offering observational opportunities for unique insights into the origin of secondary atmospheres and the near surface conditions of prebiotic environments. The Large Interferometer For Exoplanets (LIFE) mission will employ a space-based mid-infrared nulling interferometer to directly measure the thermal emission of terrestrial exoplanets. Here, we seek to assess the capabilities of various instrumental design choices of the LIFE mission concept for the detection of cooling protoplanets with transient high-temperature magma ocean atmospheres, in young stellar associations in particular. Using the LIFE mission instrument simulator (LIFEsim) we assess how specific instrumental parameters and design choices, such as wavelength coverage, aperture diameter, and photon throughput, facilitate or disadvantage the detection of protoplanets. We focus on the observational sensitivities of distance to the observed planetary system, protoplanet brightness temperature using a blackbody assumption, and orbital distance of the potential protoplanets around both G- and M-dwarf stars. Our simulations suggest that LIFE will be able to detect (S/N $\geq$ 7) hot protoplanets in young stellar associations up to distances of $\approx$100 pc from the solar system for reasonable integration times (up to $\sim$hours). Detection of an Earth-sized protoplanet orbiting a solar-sized host star at 1 AU requires less than 30 minutes of integration time. M-dwarfs generally need shorter integration times. The contribution from wavelength regions $<$6 $\mu$m is important for decreasing the detection threshold and discriminating emission temperatures.
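The blackbody assumption mentioned in the abstract above can be illustrated with a minimal sketch of the Planck function. The two temperatures used in the usage note (1500 K for a hot protoplanet, 288 K for a thermally equilibrated Earth analogue) and the function names are illustrative assumptions, not values or code from the paper or from LIFEsim.

```python
import math

H = 6.62607015e-34   # Planck constant, J s
C = 2.99792458e8     # speed of light, m/s
K_B = 1.380649e-23   # Boltzmann constant, J/K


def planck_radiance(wavelength_m, temperature_k):
    """Blackbody spectral radiance B_lambda in W m^-3 sr^-1."""
    x = H * C / (wavelength_m * K_B * temperature_k)
    return 2.0 * H * C**2 / wavelength_m**5 / math.expm1(x)


def brightness_ratio(wavelength_um, t_hot_k, t_cool_k):
    """Ratio of blackbody radiance of a hot body to a cool body at one wavelength.

    For equal radii this is also the ratio of emitted flux density.
    """
    lam = wavelength_um * 1e-6  # micrometres to metres
    return planck_radiance(lam, t_hot_k) / planck_radiance(lam, t_cool_k)
```

With the assumed temperatures, `brightness_ratio(5.0, 1500.0, 288.0)` is larger than `brightness_ratio(10.0, 1500.0, 288.0)`, which is one way to see why short mid-infrared wavelengths help to discriminate emission temperatures.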

Submitted 17 October, 2024; originally announced October 2024.

Comments: 18 pages, 19 figures; accepted for publication in A&A

arXiv:2410.10933 [pdf, other]

Water depletion and 15NH3 in the atmosphere of the coldest brown dwarf observed with JWST/MIRI

Authors: H. Kühnle, P. Patapis, P. Mollière, P. Tremblin, E. Matthews, A. M. Glauser, N. Whiteford, M. Vasist, O. Absil, D. Barrado, M. Min, P. -O. Lagage, L. B. F. M. Waters, M. Guedel, Th. Henning, B. Vandenbussche, P. Baudoz, L. Decin, J. P. Pye, P. Royer, E. F. van Dishoeck, G. Östlin, T. P. Ray, G. Wright

Abstract: With a temperature of $\sim 285$ K WISE0855 is the coldest brown dwarf observed so far. Using the James Webb Space Telescope (JWST) we obtained observations that allow us to characterize WISE0855's atmosphere focusing on vertical variation in the water vapor abundance, measuring trace gas abundances and deriving bulk parameters for this cold object. We observed the ultra cool dwarf WISE0855 using the Mid-Infrared Instrument Medium Resolution Spectrometer (MIRI/MRS) onboard JWST at a spectral resolution of up to 3750. We combined the observation with published data from the Near Infrared Spectrograph (NIRSpec) G395M and PRISM modes yielding a spectrum ranging from 0.8 to 22 $\mu$m. We apply atmospheric retrievals using petitRADTRANS to measure atmospheric abundances, the pressure-temperature structure, radius and gravity of the brown dwarf. We also employ publicly available clear and cloudy self-consistent grid models to estimate bulk properties of the atmosphere such as the effective temperature, radius, gravity and metallicity. Atmospheric retrievals constrain a variable water abundance profile in the atmosphere, as predicted by equilibrium chemistry. We detect the 15NH3 isotopologue and infer a ratio of mass fraction of 14NH3/15NH3 = $332^{+63}_{-43}$ for the clear retrieval. We measure the bolometric luminosity by integrating the presented spectrum and obtain a value of log(L/L$_{\odot}$) = $-7.291 \pm 0.008$. The detected water depletion indicates that water condenses out in the upper atmosphere due to the very low effective temperature of WISE0855. The height in the atmosphere where this occurs is covered by the MIRI/MRS data, and thus demonstrates the potential of MIRI to characterize cold gas giants' atmospheres. Comparing the data to retrievals and self-consistent grid models, we do not detect signs of water ice clouds, although their spectral features have been predicted in previous studies.
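As a worked illustration of the quoted bolometric luminosity, the sketch below converts log(L/L$_{\odot}$) = -7.291 with its 0.008 dex uncertainty into watts. It assumes the IAU 2015 nominal solar luminosity of 3.828e26 W; the function name is hypothetical and the conversion is not taken from the paper.

```python
import math

L_SUN_W = 3.828e26  # assumed: IAU 2015 nominal solar luminosity, in watts


def log_lum_to_watts(log_l, sigma_dex):
    """Convert log10(L/L_sun) with a symmetric dex error to watts.

    Returns (central, lower, upper) luminosities in watts.
    """
    central = L_SUN_W * 10.0 ** log_l
    lower = L_SUN_W * 10.0 ** (log_l - sigma_dex)
    upper = L_SUN_W * 10.0 ** (log_l + sigma_dex)
    return central, lower, upper


def dex_to_fractional(sigma_dex):
    """First-order fractional uncertainty corresponding to an error in dex."""
    return math.log(10.0) * sigma_dex


central, lower, upper = log_lum_to_watts(-7.291, 0.008)
print(f"L = {central:.3e} W (range {lower:.3e} to {upper:.3e} W)")
print(f"fractional uncertainty ~ {dex_to_fractional(0.008):.3f}")
```

Under this assumption the luminosity comes out a little below 2e19 W, and a 0.008 dex error corresponds to a relative uncertainty of roughly 2 percent.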

Submitted 14 October, 2024; originally announced October 2024.

Comments: Submitted to A&A, 29 pages, 21 figures

arXiv:2410.08436 [pdf, other]

Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models

Authors: Zi'ou Zheng, Christopher Malon, Martin Renqiang Min, Xiaodan Zhu

Abstract: When performing complex multi-step reasoning tasks, the ability of Large Language Models (LLMs) to derive structured intermediate proof steps is important for ensuring that the models truly perform the desired reasoning and for improving models' explainability. This paper is centred around a focused study: whether the current state-of-the-art generalist LLMs can leverage the structures in a few examples to better construct the proof structures with \textit{in-context learning}. Our study specifically focuses on structure-aware demonstration and structure-aware pruning. We demonstrate that they both help improve performance. A detailed analysis is provided to help understand the results.

Submitted 10 October, 2024; originally announced October 2024.

Comments: Accepted by EMNLP2024 main conference

arXiv:2410.08207 [pdf, other]

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

Authors: Xiaoxiao He, Ligong Han, Quan Dao, Song Wen, Minhao Bai, Di Liu, Han Zhang, Martin Renqiang Min, Felix Juefei-Xu, Chaowei Tan, Bo Liu, Kang Li, Hongdong Li, Junzhou Huang, Faez Ahmed, Akash Srivastava, Dimitris Metaxas

Abstract: Discrete diffusion models have achieved success in tasks like image generation and masked language modeling but face limitations in controlled content editing. We introduce DICE (Discrete Inversion for Controllable Editing), the first approach to enable precise inversion for discrete diffusion models, including multinomial diffusion and masked generative models. By recording noise sequences and masking patterns during the reverse diffusion process, DICE enables accurate reconstruction and flexible editing of discrete data without the need for predefined masks or attention manipulation. We demonstrate the effectiveness of DICE across both image and text domains, evaluating it on models such as VQ-Diffusion, Paella, and RoBERTa. Our results show that DICE preserves high data fidelity while enhancing editing capabilities, offering new opportunities for fine-grained content manipulation in discrete spaces. For project webpage, see https://hexiaoxiao-cs.github.io/DICE/.

Submitted 10 October, 2024; originally announced October 2024.

arXiv:2410.05561 [pdf, other]

DDES Study of Confined and Unconfined NACA Wing Sections Using Spectral Elements

Authors: Vishal Kumar, Ananias Tomboulides, Paul Fischer, Misun Min

Abstract: We develop hybrid RANS-LES strategies within the spectral element code Nek5000 based on the $k-\tau$ class of turbulence models. We chose airfoil sections at small flight configurations as our target problem to comprehensively test the solver accuracy and performance. We present verification and validation results of an unconfined NACA0012 wing section in a pure RANS and in a hybrid RANS-LES setup for an angle of attack ranging from 0 to 90 degrees. The RANS results show good corroboration with existing experimental and numerical datasets for low incoming flow angles. A small discrepancy appears at higher angles in comparison with the experiments, which is in line with our expectations from a RANS formulation. On the other hand, DDES captures both the attached and separated flow dynamics well when compared with available numerical datasets. We demonstrate that for the hybrid turbulence modeling approach a high-order spectral element discretization converges faster (i.e., with less resolution) and captures the flow dynamics more accurately than representative low-order finite-volume and finite-difference approaches. We also revise some of the guidelines on sample size requirements for statistics convergence. Furthermore, we analyze some of the observed discrepancies of our unconfined DDES at higher angles with the experiments by evaluating the side wall "blocking" effect. We carry out additional simulations in a confined 'numerical wind tunnel' and assess the observed differences as a function of Reynolds number.

Submitted 7 October, 2024; originally announced October 2024.

Comments: 28 pages, 10 figures, 3 tables

MSC Class: 35-4 ACM Class: G.4; I.6

arXiv:2410.00202 [pdf, other]

Spectral Element Simulation of Liquid Metal Magnetohydrodynamics

Authors: Yichen Guo, Paul Fischer, Misun Min

Abstract: A spectral-element-based formulation of incompressible MHD is presented in the context of the open-source fluid-thermal code, Nek5000/RS. The formulation supports magnetic fields in a solid domain that surrounds the fluid domain. Several steady-state and time-transient model problems are presented as part of the code verification process. Nek5000/RS is designed for large-scale turbulence simulations, which will be the next step with this new MHD capability.

Submitted 30 September, 2024; originally announced October 2024.

Comments: 26 pages, 2 tables, 14 figures

MSC Class: 35-04 ACM Class: G.4; I.6

arXiv:2410.00147 [pdf, other]

Modeling Turbulence in the Atmospheric Boundary Layer with Spectral Element and Finite Volume Methods

Authors: Ananias Tomboulides, Matthew Churchfield, Paul Fischer, Michael Sprague, Misun Min

Abstract: We present large-eddy-simulation (LES) modeling approaches for the simulation of atmospheric boundary layer turbulence that are of direct relevance to wind energy production. In this paper, we study a GABLS benchmark problem using high-order spectral element code Nek5000/RS and a block-structured second-order finite-volume code AMR-Wind which are supported under the DOE's Exascale Computing Project (ECP) Center for Efficient Exascale Discretizations (CEED) and ExaWind projects, respectively, targeting application simulations on various acceleration-device based exascale computing platforms. As for Nek5000/RS we demonstrate our newly developed subgrid-scale (SGS) models based on mean-field eddy viscosity (MFEV), high-pass filter (HPF), and Smagorinsky (SMG) with traction boundary conditions. For the traction boundary conditions, a novel analytical approach is presented that solves for the surface friction velocity and surface kinematic temperature flux. For AMR-Wind, standard SMG is used, and the traction boundary conditions for convergence are discussed in detail. We provide low-order statistics, convergence and turbulent structure analysis. Verification and convergence studies were performed for both codes at various resolutions and it was found that Nek5000/RS demonstrates convergence with resolution for all ABL bulk parameters, including boundary layer and low level jet (LLJ) height. Extensive comparisons are presented with simulation data from the literature.

Submitted 30 September, 2024; originally announced October 2024.

Comments: 35 pages, 24 figures, 1 table

MSC Class: 35-04 ACM Class: G.2; I.6

arXiv:2409.19119 [pdf, other]

Exascale Simulations of Fusion and Fission Systems

Authors: Misun Min, Yu-Hsiang Lan, Paul Fischer, Elia Merzari, Tri Nguyen, Haomin Yuan, Patrick Shriwise, Stefan Kerkemeier, Andrew Davis, Aleksandr Dubas, Rupert Eardly, Rob Akers, Thilina Rathnayake, Tim Warburton

Abstract: We discuss pioneering heat and fluid flow simulations of fusion and fission energy systems with NekRS on exascale computing facilities, including Frontier and Aurora. The Argonne-based code, NekRS, is a highly-performant open-source code for the simulation of incompressible and low-Mach fluid flow, heat transfer, and combustion with a particular focus on turbulent flows in complex domains. It is based on rapidly convergent high-order spectral element discretizations that feature minimal numerical dissipation and dispersion. State-of-the-art multilevel preconditioners, efficient high-order time-splitting methods, and runtime-adaptive communication strategies are built on a fast OCCA-based kernel library, libParanumal, to provide scalability and portability across the spectrum of current and future high-performance computing platforms. On Frontier, Nek5000/RS has achieved an unprecedented milestone in breaching over 1 trillion degrees of freedom with the spectral element methods for the simulation of the CHIMERA fusion technology testing platform. We also demonstrate for the first time the use of high-order overset grids at scale.

Submitted 27 September, 2024; originally announced September 2024.

Comments: 10 pages, 3 figures, 3 tables

MSC Class: 35-04 ACM Class: G.4; I.6

arXiv:2409.18181 [pdf, other]

ExoLyn: a golden mean approach to multi-species cloud modelling in atmospheric retrieval

Authors: Helong Huang, Chris W. Ormel, Michiel Min

Abstract: Context. Clouds are ubiquitous in exoplanets' atmospheres and play an important role in setting the opacity and chemical inventory of the atmosphere. Understanding clouds is a critical step in interpreting exoplanets' spectroscopic data. Aims. The aim is to model the multi-species nature of clouds in atmospheric retrieval studies. To this end, we develop ExoLyn - a 1D cloud model that balances physical consistency with computational efficiency. Methods. ExoLyn solves the transport equation of cloud particles and vapor under cloud condensation rates that are self-consistently calculated from thermodynamics. ExoLyn is a standalone, open source package that can be combined with \texttt{optool} to calculate solid opacities and with \texttt{petitRADTRANS} to generate transmission or emission spectra. Results. With ExoLyn we find that the compositional structure of clouds in hot Jupiter planets' atmospheres is layered with a cloud dominated by magnesium-silicates on top of an iron cloud. This finding is consistent with more complex cloud formation models but can be obtained with ExoLyn in only a few seconds. The composition of the cloud particles can be constrained from the spectrum, for example, MgSiO3 and Mg2SiO4 components give rise to an absorption feature at 8-10 $\mu$m. We investigate the dependence of the cloud structure on the bulk elemental composition of the planet and find that SiO2-dominated clouds form on metal-rich planets and Fe clouds with a strong extinction effect form on C-rich planets. Conclusions. Designed towards maximum flexibility, ExoLyn can also be used in retrieval analysis of sub-Neptunes and self-luminous planets. The efficiency of ExoLyn opens the possibility of joint retrieval of exoplanets' gas and cloud components.

Submitted 26 September, 2024; originally announced September 2024.

Comments: 17 pages, 12 figures, accepted by A&A

arXiv:2409.16145 [pdf, other]

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

Authors: Yuxiao Chen, Kai Li, Wentao Bao, Deep Patel, Yu Kong, Martin Renqiang Min, Dimitris N. Metaxas

Abstract: Learning to localize temporal boundaries of procedure steps in instructional videos is challenging due to the limited availability of annotated large-scale training videos. Recent works focus on learning the cross-modal alignment between video segments and ASR-transcripted narration texts through contrastive learning. However, these methods fail to account for the alignment noise, i.e., irrelevant narrations to the instructional task in videos and unreliable timestamps in narrations. To address these challenges, this work proposes a novel training framework. Motivated by the strong capabilities of Large Language Models (LLMs) in procedure understanding and text summarization, we first apply an LLM to filter out task-irrelevant information and summarize task-related procedure steps (LLM-steps) from narrations. To further generate reliable pseudo-matching between the LLM-steps and the video for training, we propose the Multi-Pathway Text-Video Alignment (MPTVA) strategy. The key idea is to measure alignment between LLM-steps and videos via multiple pathways, including: (1) step-narration-video alignment using narration timestamps, (2) direct step-to-video alignment based on their long-term semantic similarity, and (3) direct step-to-video alignment focusing on short-term fine-grained semantic similarity learned from general video domains. The results from different pathways are fused to generate reliable pseudo step-video matching. We conducted extensive experiments across various tasks and problem settings to evaluate our proposed method. Our approach surpasses state-of-the-art methods in three downstream tasks: procedure step grounding, step localization, and narration grounding by 5.9\%, 3.1\%, and 2.8\%.

Submitted 22 September, 2024; originally announced September 2024.

Comments: Accepted to ECCV 2024

arXiv:2409.01121 [pdf, other]

doi 10.1051/0004-6361/202450526

Why heterogeneous cloud particles matter: Iron-bearing species and cloud particle morphology affects exoplanet transmission spectra

Authors: Sven Kiefer, Dominic Samra, David A. Lewis, Aaron D. Schneider, Michiel Min, Ludmila Carone, Leen Decin, Christiane Helling

Abstract: The possibility of observing spectral features in exoplanet atmospheres with space missions like JWST and ARIEL necessitates the accurate modelling of cloud particle opacities. In exoplanet atmospheres, cloud particles can be made from multiple materials and be considerably chemically heterogeneous. Therefore, assumptions on the morphology of cloud particles are required to calculate their opacities. The aim of this work is to analyse how different approaches to calculate the opacities of heterogeneous cloud particles affect cloud particle optical properties. We calculate cloud particle optical properties using seven different mixing treatments: four effective medium theories (EMTs: Bruggeman, Landau-Lifshitz-Looyenga (LLL), Maxwell-Garnett, and Linear), core-shell, and two homogeneous cloud particle approximations. We study the mixing behaviour of 21 commonly considered cloud particle materials for exoplanets. To analyse the impact on observations, we study the transmission spectra of HATS-6b, WASP-39b, WASP-76b, and WASP-107b. Materials with large refractive indices, like iron-bearing species or carbon, can change the optical properties of cloud particles when they comprise less than 1\% of the total particle volume. The mixing treatment of heterogeneous cloud particles also has an observable effect on transmission spectroscopy. Assuming core-shell or homogeneous cloud particles results in less muting of molecular features and retains the cloud spectral features of the individual cloud particle materials. The predicted transit depths for core-shell and homogeneous cloud particle materials are similar for all planets used in this work. If EMTs are used, cloud spectral features are broader and cloud spectral features of the individual cloud particle materials are not retained. Using LLL leads to fewer molecular features in transmission spectra compared to Bruggeman.

Submitted 2 September, 2024; originally announced September 2024.

Comments: 21 pages, 15 figures, Accepted by A&A

Journal ref: A&A 690, A244 (2024)

arXiv:2408.16367 [pdf, other]

Dust mineralogy and variability of the inner PDS 70 disk

Authors: Hyerin Jang, Rens Waters, Till Kaeufer, Akemi Tamanai, Giulia Perotti, Valentin Christiaens, Inga Kamp, Thomas Henning, Michiel Min, Aditya M. Arabhavi, David Barrado, Ewine F. van Dishoeck, Danny Gasman, Sierra L. Grant, Manuel Güdel, Pierre-Olivier Lagage, Fred Lahuis, Kamber Schwarz, Benoît Tabone, Milou Temmink

Abstract: The inner disk of the young star PDS 70 may be a site of rocky planet formation, with two giant planets detected further out. Solids in the inner disk may inform us about the origin of this inner disk water and nature of the dust in the rocky planet-forming regions. We aim to constrain the chemical composition, lattice structure, and grain sizes of small silicate grains in the inner disk of PDS 70, observed both in JWST/MIRI MRS and Spitzer IRS. We use a dust fitting model, called DuCK, based on a two-layer disk model. We use Gaussian Random Field and Distribution of Hollow Spheres models to obtain two sets of dust opacities. The third set of opacities is obtained from aerosol spectroscopy. We use stoichiometric amorphous silicates, forsterite, and enstatite in our analysis. We also used iron-rich and magnesium-rich amorphous silicate and fayalite dust species to study the iron content. The Gaussian Random Field opacity agrees well with the observed spectrum. In both MIRI and Spitzer spectra, amorphous silicates are the dominant dust species. Crystalline silicates are dominated by iron-poor olivine. We do not find strong evidence for enstatite. Moreover, the MIRI spectrum indicates larger grain sizes than the Spitzer spectrum, indicating a time-variable small grain reservoir. The inner PDS 70 disk is dominated by a variable reservoir of optically thin warm amorphous silicates. We suggest that the small grains detected in the inner PDS 70 disk are likely transported inward from the outer disk as a result of filtration and fragmentation at the ice line. In addition, the variation between MIRI and Spitzer data can be explained by the grain growth over 15 years and a dynamical inner disk where opacity changes occur resulting from the highly variable hot innermost dust reservoir.

Submitted 29 August, 2024; originally announced August 2024.

Comments: 18 pages, 13 figures, Accepted by A&A

arXiv:2408.06077 [pdf, other]

doi 10.1051/0004-6361/202450891

Disentangling the dust and gas contributions of the JWST/MIRI spectrum of Sz28

Authors: T. Kaeufer, P. Woitke, I. Kamp, J. Kanwar, M. Min

Abstract: Recent spectra of protoplanetary disks around very low-mass stars (VLMS), captured by the Mid-InfraRed Instrument (MIRI) on board the James Webb Space Telescope (JWST), reveal a rich carbon chemistry. Current interpretations of these spectra are based on 0D slab models and provide valuable estimates for molecular emission temperatures and column densities in the innermost disk. However, the established fitting procedures and simplified models are challenged by the many overlapping gas features. We aim to simultaneously determine the molecular and the dust composition of the disk around the VLMS Sz28 in a Bayesian way. We model the JWST/MIRI spectrum of Sz28 up to $17\,\rm \mu m$ using the Dust Continuum Kit with Line emission from Gas (DuCKLinG). Systematically excluding different molecules from the Bayesian analysis allows for an evidence determination of all investigated molecules and isotopologues. We continue by examining the emission conditions and locations of all molecules, analysing the differences to previous 0D slab fitting, and analysing the dust composition. We find very strong Bayesian evidence for the presence of C2H2, HCN, C6H6, CO2, HC3N, C2H6, C3H4, C4H2, and CH4 in the JWST/MIRI spectrum of Sz28. Additionally, we identify CH3 and find tentative indications for NH3. There is no evidence for water in the spectrum. However, we show that column densities of up to $2\times10^{17}\,\rm cm^{-2}$ could be hidden in the observational noise if assuming similar emission conditions of water as the detected hydrocarbons. Contrary to previous 0D slab results, a C4H2 quasi-continuum is robustly identified. We expect some of the stated differences to previous 0D slab fitting results to arise from an updated data reduction of the spectrum, but also due to the different modelling process. The latter reason underpins the need for more advanced models and fitting procedures.

Submitted 12 August, 2024; originally announced August 2024.

Comments: accepted by Astronomy & Astrophysics

Journal ref: A&A 690, A100 (2024)

arXiv:2406.05447 [pdf, other]

The PLATO Mission

Authors: Heike Rauer, Conny Aerts, Juan Cabrera, Magali Deleuil, Anders Erikson, Laurent Gizon, Mariejo Goupil, Ana Heras, Jose Lorenzo-Alvarez, Filippo Marliani, Cesar Martin-Garcia, J. Miguel Mas-Hesse, Laurence O'Rourke, Hugh Osborn, Isabella Pagano, Giampaolo Piotto, Don Pollacco, Roberto Ragazzoni, Gavin Ramsay, Stéphane Udry, Thierry Appourchaux, Willy Benz, Alexis Brandeker, Manuel Güdel, Eduardo Janot-Pacheco, et al. (801 additional authors not shown)

Abstract: PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observations from the ground, planets will be characterised for their radius, mass, and age with high accuracy (5 %, 10 %, 10 % for an Earth-Sun combination respectively). PLATO will provide us with a large-scale catalogue of well-characterised small planets up to intermediate orbital periods, relevant for a meaningful comparison to planet formation theories and to better understand planet evolution. It will make possible comparative exoplanetology to place our Solar System planets in a broader context. In parallel, PLATO will study (host) stars using asteroseismology, allowing us to determine the stellar properties with high accuracy, substantially enhancing our knowledge of stellar structure and evolution. The payload instrument consists of 26 cameras with 12 cm aperture each. For at least four years, the mission will perform high-precision photometric measurements. Here we review the science objectives, present PLATO's target samples and fields, provide an overview of expected core science performance as well as a description of the instrument and the mission profile at the beginning of the serial production of the flight cameras. PLATO is scheduled for launch at the end of 2026. This overview therefore provides a summary of the mission to the community in preparation for the upcoming operational phases.

Submitted 8 June, 2024; originally announced June 2024.

arXiv:2406.01006 [pdf, other]

SemCoder: Training Code Language Models with Comprehensive Semantics

Authors: Yangruibo Ding, Jinjun Peng, Marcus J. Min, Gail Kaiser, Junfeng Yang, Baishakhi Ray

Abstract: Code Large Language Models (Code LLMs) have excelled at tasks like code completion but often miss deeper semantics such as execution effects and dynamic states. This paper aims to bridge the gap between Code LLMs' reliance on static text data and the need for thorough semantic understanding for complex tasks like debugging and program repair. We introduce a novel strategy to train Code LLMs with comprehensive semantics, encompassing high-level functional descriptions, local execution effects of individual statements, and overall input/output behavior, thereby linking static code text with dynamic execution states. We begin by collecting PyX, a clean code corpus of fully executable samples with functional descriptions and execution tracing. We propose training Code LLMs to write code and represent and reason about execution behaviors using natural language, mimicking human verbal debugging. This approach led to the development of SemCoder, a Code LLM with only 6.7B parameters, which shows competitive performance with GPT-3.5-turbo on code generation and execution reasoning tasks. SemCoder achieves 81.1% on HumanEval (GPT-3.5-turbo: 76.8%) and 54.5% on CRUXEval-I (GPT-3.5-turbo: 50.3%). We also study the effectiveness of SemCoder's monologue-style execution reasoning compared to concrete scratchpad reasoning, showing that our approach integrates semantics from multiple dimensions more smoothly. Finally, we demonstrate the potential of applying learned semantics to improve Code LLMs' debugging and self-refining capabilities.

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.14781 [pdf, other]

Unified Neural Backdoor Removal with Only Few Clean Samples through Unlearning and Relearning

Authors: Nay Myat Min, Long H. Pham, Jun Sun

Abstract: The application of deep neural network models in various security-critical applications has raised significant security concerns, particularly the risk of backdoor attacks. Neural backdoors pose a serious security threat as they allow attackers to maliciously alter model behavior. While many defenses have been explored, existing approaches are often bounded by model-specific constraints, or necessitate complex alterations to the training process, or fall short against diverse backdoor attacks. In this work, we introduce a novel method for comprehensive and effective elimination of backdoors, called ULRL (short for UnLearn and ReLearn for backdoor removal). ULRL requires only a small set of clean samples and works effectively against all kinds of backdoors. It first applies unlearning for identifying suspicious neurons and then targeted neural weight tuning for backdoor mitigation (i.e., by promoting significant weight deviation on the suspicious neurons). Evaluated against 12 different types of backdoors, ULRL is shown to significantly outperform state-of-the-art methods in eliminating backdoors whilst preserving the model utility.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.12061 [pdf, other]

doi 10.1051/0004-6361/202347124

Constraining the formation of WASP-39b using JWST transit spectroscopy

Authors: N. Khorshid, M. Min, J. Polman, L. B. F. M. Waters

Abstract: Understanding the formation history of planets is one of the goals of studying exoplanet atmospheres. The atmospheric composition of planets can provide insights into the formation pathways of planets. Even though the mapping of the atmospheric composition onto a formation pathway is not unambiguous, with the increasing sensitivity of modern instruments, we can derive promising constraints. In this work, we aim to understand the formation pathway of WASP-39b. We discuss whether the detection of SO2 in its atmosphere would impact our understanding of the formation of the planet and whether it enables us to determine the formation pathway of the planet with greater accuracy. We used the JWST transit observation of the planet together with the available HST and Spitzer observations. We used a formation model coupled with a radiative transfer retrieval model to derive the planet's atmospheric characteristics and formation history. Furthermore, we used a photochemical model to derive the impact of photochemistry on the atmosphere of the planet. In this work, we show that the planet is most likely to have initiated beyond the CO2 ice line of its natal disk. Furthermore, the planet is likely to have accreted some planetesimals during its formation. We show that the sulfur abundance in the atmosphere of the planet is probably lower than $2.27 \times 10^{-4}$. This abundance indicates that the planet is likely to exhibit a lower metallicity than suggested by the retrievals.
Furthermore, such an abundance for sulfur is more likely if WASP-39b had been formed beyond the CO ice line of its natal disk.

Submitted 20 May, 2024; originally announced May 2024.

Comments: 10 pages, 7 figures, 5 tables

Journal ref: A&A, volume 685, Article A64, Year 2024

arXiv:2405.06486 [pdf, other]

doi 10.1051/0004-6361/202449936

Bayesian Analysis of Molecular Emission and Dust Continuum of Protoplanetary Disks

Authors: T. Kaeufer, M. Min, P. Woitke, I. Kamp, A. M. Arabhavi

Abstract: The Mid-InfraRed Instrument (MIRI) on board the James Webb Space Telescope (JWST) probes the chemistry and dust mineralogy of the inner regions of protoplanetary disks. The observed spectra are unprecedented in their detail, complicating interpretations which are mainly based on manual continuum subtraction and 0D slab models. We investigate the physical conditions under which the gas emits in protoplanetary disks. Based on MIRI spectra, we apply a full Bayesian analysis that provides the posterior distributions of dust and molecular properties. For doing so, we introduce the Dust Continuum Kit with Line emission from Gas (DuCKLinG), a model describing the molecular line emission and the dust continuum simultaneously without large computational cost. The dust model is based on work by Juhasz et al. (2009, 2010). The molecular emission is based on LTE slab models, but with radial gradients in column densities and temperatures. The model is compared to observations using Bayesian analysis. We benchmark this model to a complex thermo-chemical ProDiMo model and fit the MIRI spectrum of GWLup. We find that the retrieved molecular conditions from DuCKLinG fall within the true values from ProDiMo. The column densities retrieved by Grant et al. (2023) fall within the retrieved ranges in this study for all examined molecules (CO2, H2O, HCN, and C2H2). Similar overlap is found for the temperatures with only the temperature range of HCN not including the previously found value.
This discrepancy may be due to the simultaneous fitting of all molecules compared to the step-by-step fitting of the previous study. There is statistically significant evidence for radial temperature and column density gradients for H2O and CO2 compared to the constant temperature and column density assumed in the 0D slab models. Additionally, HCN and C2H2 emit from a small region with near constant conditions.

Submitted 10 May, 2024; originally announced May 2024.

Comments: accepted by Astronomy & Astrophysics

Journal ref: A&A 687, A209 (2024)

arXiv:2403.18746 [pdf, other]

CYCLE: Learning to Self-Refine the Code Generation

Authors: Yangruibo Ding, Marcus J. Min, Gail Kaiser, Baishakhi Ray

Abstract: Pre-trained code language models have achieved promising performance in code generation and improved the programming efficiency of human developers. However, their self-refinement capability is typically overlooked by the existing evaluations of code LMs, which focus only on the accuracy of the one-time prediction. For the cases when code LMs fail to implement the correct program, developers actually find it hard to debug and fix the faulty prediction since it is not written by the developers themselves. Unfortunately, our study reveals that code LMs cannot efficiently self-refine their faulty generations either. In this paper, we propose the CYCLE framework, which learns to self-refine the faulty generation according to the available feedback, such as the execution results reported by the test suites. We evaluate CYCLE on three popular code generation benchmarks, HumanEval, MBPP, and APPS. The results reveal that CYCLE successfully maintains, sometimes improves, the quality of one-time code generation, while significantly improving the self-refinement capability of code LMs. We implement four variants of CYCLE with varied numbers of parameters across 350M, 1B, 2B, and 3B, and the experiments show that CYCLE consistently boosts the code generation performance, by up to 63.5%, across benchmarks and varied model sizes. We also notice that CYCLE outperforms code LMs that have 3$\times$ more parameters in self-refinement.

Submitted 27 March, 2024; originally announced March 2024.

Comments: Camera-ready for OOPSLA'24

arXiv:2403.12848 [pdf, other]

Planner3D: LLM-enhanced graph prior meets 3D indoor scene explicit regularization

Authors: Yao Wei, Martin Renqiang Min, George Vosselman, Li Erran Li, Michael Ying Yang

Abstract: Compositional 3D scene synthesis has diverse applications across a spectrum of industries such as robotics, films, and video games, as it closely mirrors the complexity of real-world multi-object environments. Conventional works typically employ shape retrieval based frameworks which naturally suffer from limited shape diversity. Recent progress has been made in object shape generation with generative models such as diffusion models, which increases the shape fidelity. However, these approaches treat 3D shape generation and layout generation separately. The synthesized scenes are usually hampered by layout collision, which suggests that the scene-level fidelity is still under-explored. In this paper, we aim at generating realistic and reasonable 3D indoor scenes from a scene graph. To enrich the priors of the given scene graph inputs, a large language model is utilized to aggregate the global-wise features with local node-wise and edge-wise features. With a unified graph encoder, graph features are extracted to guide joint layout-shape generation. Additional regularization is introduced to explicitly constrain the produced 3D layouts. Benchmarked on the SG-FRONT dataset, our method achieves better 3D scene synthesis, especially in terms of scene-level fidelity. The source code will be released after publication.

Submitted 26 August, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

Comments: 16 pages, 10 figures

arXiv:2403.02782 [pdf, other]

Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos

Authors: Kumaranage Ravindu Yasas Nagasinghe, Honglu Zhou, Malitha Gunawardhana, Martin Renqiang Min, Daniel Harari, Muhammad Haris Khan

Abstract: In this paper, we explore the capability of an agent to construct a logical sequence of action steps, thereby assembling a strategic procedural plan. This plan is crucial for navigating from an initial visual observation to a target visual outcome, as depicted in real-life instructional videos. Existing works have attained partial success by extensively leveraging various sources of information available in the datasets, such as heavy intermediate visual observations, procedural names, or natural language step-by-step instructions, for features or supervision signals. However, the task remains formidable due to the implicit causal constraints in the sequencing of steps and the variability inherent in multiple feasible plans. To tackle these intricacies that previous efforts have overlooked, we propose to enhance the capabilities of the agent by infusing it with procedural knowledge. This knowledge, sourced from training procedure plans and structured as a directed weighted graph, equips the agent to better navigate the complexities of step sequencing and its potential variations. We coin our approach KEPP, a novel Knowledge-Enhanced Procedure Planning system, which harnesses a probabilistic procedural knowledge graph extracted from training data, effectively acting as a comprehensive textbook for the training domain. Experimental evaluations across three widely-used datasets under settings of varying complexity reveal that KEPP attains superior, state-of-the-art results while requiring only minimal supervision.

Submitted 15 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

Comments: 8 pages, 6 figures, (supplementary material: 9 pages, 5 figures), accepted to CVPR 2024

Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024, Pages 18816-18826

arXiv:2402.01717 [pdf, other]

From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process

Authors: Jaewoong Kim, Moohong Min

Abstract: Regulatory compliance in the pharmaceutical industry entails navigating through complex and voluminous guidelines, often requiring significant human resources. To address these challenges, our study introduces a chatbot model that utilizes generative AI and the Retrieval Augmented Generation (RAG) method. This chatbot is designed to search for guideline documents relevant to the user inquiries and provide answers based on the retrieved guidelines. Recognizing the inherent need for high reliability in this domain, we propose the Question and Answer Retrieval Augmented Generation (QA-RAG) model. In comparative experiments, the QA-RAG model demonstrated a significant improvement in accuracy, outperforming all other baselines including conventional RAG methods. This paper details QA-RAG's structure and performance evaluation, emphasizing its potential for the regulatory compliance domain in the pharmaceutical industry and beyond. We have made our work publicly available for further research and development.

Submitted 26 January, 2024; originally announced February 2024.

Comments: Total number of pages: 9. Total number of figures: 2. For the source code and experimental results of this paper, see https://github.com/jwoongkim11/QA-RAG. For the dataset used in training and evaluating the model, see https://huggingface.co/datasets/Jaymax/FDA Pharmaceuticals FAQ

ACM Class: I.2.7; I.2.1; J.3

arXiv:2401.11225 [pdf, ps, other]

Protecting Personalized Trajectory with Differential Privacy under Temporal Correlations

Authors: Mingge Cao, Haopeng Zhu, Minghui Min, Yulu Li, Shiyin Li, Hongliang Zhang, Zhu Han

Abstract: Location-based services (LBSs) in vehicular ad hoc networks (VANETs) offer users numerous conveniences. However, the extensive use of LBSs raises concerns about the privacy of users' trajectories, as adversaries can exploit temporal correlations between different locations to extract personal information. Additionally, users have varying privacy requirements depending on the time and location. To address these issues, this paper proposes a personalized trajectory privacy protection mechanism (PTPPM). This mechanism first uses the temporal correlation between trajectory locations to determine the possible location set for each time instant. We identify a protection location set (PLS) for each location by employing the Hilbert curve-based minimum distance search algorithm. This approach incorporates the complementary features of geo-indistinguishability and distortion privacy. We put forth a novel Permute-and-Flip mechanism for location perturbation, which maps its initial application in data publishing privacy protection to a location perturbation mechanism. This mechanism generates fake locations with smaller perturbation distances while improving the balance between privacy and quality of service (QoS). Simulation results show that our mechanism outperforms the benchmark by providing enhanced privacy protection while meeting users' QoS requirements.

Submitted 20 January, 2024; originally announced January 2024.

arXiv:2401.04168 [pdf, other]

FlopPITy: Enabling self-consistent exoplanet atmospheric retrievals with machine learning

Authors: Francisco Ardévol Martínez, Michiel Min, Daniela Huppenkothen, Inga Kamp, Paul I. Palmer

Abstract: Interpreting the observations of exoplanet atmospheres to constrain physical and chemical properties is typically done using Bayesian retrieval techniques. Because these methods require many model computations, a compromise is made between model complexity and run time. Reaching this compromise leads to the simplification of many physical and chemical processes (e.g. parameterised temperature structure). Here we implement and test sequential neural posterior estimation (SNPE), a machine learning inference algorithm, for exoplanet atmospheric retrievals. The goal is to speed up retrievals so they can be run with more computationally expensive atmospheric models, such as those computing the temperature structure using radiative transfer. We generate 100 synthetic observations using ARCiS (ARtful Modeling Code for exoplanet Science, an atmospheric modelling code with the flexibility to compute models in varying degrees of complexity) and perform retrievals on them to test the faithfulness of the SNPE posteriors. The faithfulness quantifies whether the posteriors contain the ground truth as often as we expect. We also generate a synthetic observation of a cool brown dwarf using the self-consistent capabilities of ARCiS and run a retrieval with self-consistent models to showcase the possibilities that SNPE opens. We find that SNPE provides faithful posteriors and is therefore a reliable tool for exoplanet atmospheric retrievals. We are able to run a self-consistent retrieval of a synthetic brown dwarf spectrum using only 50,000 forward model evaluations.
We find that SNPE can speed up retrievals between $\sim2\times$ and $\geq10\times$ depending on the computational load of the forward model, the dimensionality of the observation, and the signal-to-noise ratio of the observation. We make the code publicly available for the community on Github.

Submitted 8 January, 2024; originally announced January 2024.

Comments: Accepted for publication at A&A

arXiv:2312.09888 [pdf, other]

doi 10.1145/3624062.3624159

Scaling Computational Fluid Dynamics: In Situ Visualization of NekRS using SENSEI

Authors: Victor A. Mateevitsi, Mathis Bode, Nicola Ferrier, Paul Fischer, Jens Henrik Göbbert, Joseph A. Insley, Yu-Hsiang Lan, Misun Min, Michael E. Papka, Saumil Patel, Silvio Rizzi, Jonathan Windgassen

Abstract: In the realm of Computational Fluid Dynamics (CFD), the demand for memory and computation resources is extreme, necessitating the use of leadership-scale computing platforms for practical domain sizes. This intensive requirement renders traditional checkpointing methods ineffective due to the significant slowdown in simulations while saving state data to disk. As we progress towards exascale and GPU-driven High-Performance Computing (HPC) and confront larger problem sizes, the choice becomes increasingly stark: to compromise data fidelity or to reduce resolution. To navigate this challenge, this study advocates for the use of in situ analysis and visualization techniques. These allow more frequent data "snapshots" to be taken directly from memory, thus avoiding the need for disruptive checkpointing. We detail our approach of instrumenting NekRS, a GPU-focused thermal-fluid simulation code employing the spectral element method (SEM), and describe varied in situ and in transit strategies for data rendering. Additionally, we provide concrete scientific use-cases and report on runs performed on Polaris, Argonne Leadership Computing Facility's (ALCF) 44 Petaflop supercomputer and Jülich Wizard for European Leadership Science (JUWELS) Booster, Jülich Supercomputing Centre's (JSC) 71 Petaflop High Performance Computing (HPC) system, offering practical insight into the implications of our methodology.

Submitted 18 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

arXiv:2311.15702 [pdf, other]

doi 10.1051/0004-6361/202245469

Retrieving planet formation parameters of WASP-77Ab using SimAb

Authors: N. Khorshid, M. Min, J. M. Désert

Abstract: The atmospheric compositions of planets offer a unique view into their respective formation processes. State-of-the-art observatories and techniques are finally able to provide high-precision data on atmospheric composition that can be used to constrain planet formation. In this context, we focus on the formation of WASP-77Ab based on previous observations of its atmosphere, which have provided precise C/O and metallicity measurements. We use the SimAb planet formation simulation to model the formation of WASP-77Ab. We assume two compositions for the disk WASP-77Ab was formed within: one of a solar composition and one that represents the composition of WASP-77A. In addition, we considered two different scenarios regarding the migration of the planet and we study the possible planet formation paths that reproduce the composition of WASP-77Ab. This work shows that the planet is expected to have formed in a disk where not many planetesimals could be accreted. Moreover, we demonstrate that the most likely migration scenario is disk-free migration, whereby the planet initiates its Type II migration within the CO ice line and ends it beyond the water ice line.

Submitted 27 November, 2023; originally announced November 2023.

Comments: 10 pages, 9 figures

Journal ref: A&A, volume 675, Article A95, Year 2023

arXiv:2311.12515 [pdf, other]

doi 10.1038/s41586-023-06849-0

SO$_2$, silicate clouds, but no CH$_4$ detected in a warm Neptune

Authors: Achrène Dyrek, Michiel Min, Leen Decin, Jeroen Bouwman, Nicolas Crouzet, Paul Mollière, Pierre-Olivier Lagage, Thomas Konings, Pascal Tremblin, Manuel Güdel, John Pye, Rens Waters, Thomas Henning, Bart Vandenbussche, Francisco Ardevol Martinez, Ioannis Argyriou, Elsa Ducrot, Linus Heinke, Gwenael Van Looveren, Olivier Absil, David Barrado, Pierre Baudoz, Anthony Boccaletti, Christophe Cossou, Alain Coulais , et al. (22 additional authors not shown)

Abstract: WASP-107b is a warm ($\sim$740 K) transiting planet with a Neptune-like mass of $\sim$30.5 $M_{\oplus}$ and Jupiter-like radius of $\sim$0.94 $R_{\rm J}$, whose extended atmosphere is eroding. Previous observations showed evidence for water vapour and a thick high-altitude condensate layer in WASP-107b's atmosphere. Recently, photochemically produced sulphur dioxide (SO$_2$) was detected in the atmosphere of a hot ($\sim$1,200 K) Saturn-mass planet from transmission spectroscopy near 4.05 $μ$m, but for temperatures below $\sim$1,000 K sulphur is predicted to preferably form sulphur allotropes instead of SO$_2$. Here we report the 9$σ$-detection of two fundamental vibration bands of SO$_2$, at 7.35 $μ$m and 8.69 $μ$m, in the transmission spectrum of WASP-107b using the Mid-Infrared Instrument (MIRI) of the JWST. This discovery establishes WASP-107b as the second irradiated exoplanet with confirmed photochemistry, extending the temperature range of exoplanets exhibiting detected photochemistry from $\sim$1,200 K down to $\sim$740 K. Additionally, our spectral analysis reveals the presence of silicate clouds, which are strongly favoured ($\sim$7$σ$) over simpler cloud setups. Furthermore, water is detected ($\sim$12$σ$), but methane is not. These findings provide evidence of disequilibrium chemistry and indicate a dynamically active atmosphere with a super-solar metallicity.

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.08054 [pdf, other]

doi 10.1038/s41586-023-06813-y

15NH3 in the atmosphere of a cool brown dwarf

Authors: David Barrado, Paul Mollière, Polychronis Patapis, Michiel Min, Pascal Tremblin, Francisco Ardevol Martinez, Niall Whiteford, Malavika Vasist, Ioannis Argyriou, Matthias Samland, Pierre-Olivier Lagage, Leen Decin, Rens Waters, Thomas Henning, María Morales-Calderón, Manuel Guedel, Bart Vandenbussche, Olivier Absil, Pierre Baudoz, Anthony Boccaletti, Jeroen Bouwman, Christophe Cossou, Alain Coulais, Nicolas Crouzet, René Gastaud , et al. (18 additional authors not shown)

Abstract: Brown dwarfs serve as ideal laboratories for studying the atmospheres of giant exoplanets on wide orbits as the governing physical and chemical processes in them are nearly identical. Understanding the formation of gas giant planets is challenging, often involving the endeavour to link atmospheric abundance ratios, such as the carbon-to-oxygen (C/O) ratio, to formation scenarios. However, the complexity of planet formation requires additional tracers, as the unambiguous interpretation of the measured C/O ratio is fraught with complexity. Isotope ratios, such as deuterium-to-hydrogen and 14N/15N, offer a promising avenue to gain further insight into this formation process, mirroring their utility within the solar system. For exoplanets only a handful of constraints on 12C/13C exist, pointing to the accretion of 13C-rich ice from beyond the disks' CO iceline. Here we report on the mid-infrared detection of the 14NH3 and 15NH3 isotopologues in the atmosphere of a cool brown dwarf with an effective temperature of 380 K in a spectrum taken with the Mid-InfraRed Instrument of the James Webb Space Telescope. As expected, our results reveal a 14N/15N value consistent with star-like formation by gravitational collapse, demonstrating that this ratio can be accurately constrained. Since young stars and their planets should be more strongly enriched in the 15N isotope, we expect that 15NH3 will be detectable in a number of cold, wide-separation exoplanets.

Submitted 14 November, 2023; originally announced November 2023.

Comments: Accepted by Nature. 28 pages, 7 figures, uses nature3.cls

arXiv:2310.14053 [pdf, other]

Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain

Authors: Marcus J. Min, Yangruibo Ding, Luca Buratti, Saurabh Pujar, Gail Kaiser, Suman Jana, Baishakhi Ray

Abstract: Code Large Language Models (Code LLMs) are being increasingly employed in real-life applications, so evaluating them is critical. While the conventional accuracy evaluates the performance of Code LLMs on a set of individual tasks, their self-consistency across different tasks is overlooked. Intuitively, a trustworthy model should be self-consistent when generating natural language specifications for its own code and generating code for its own specifications. Failure to preserve self-consistency reveals a lack of understanding of the shared semantics underlying natural language and programming language, and therefore undermines the trustworthiness of a model. In this paper, we first formally define the self-consistency of Code LLMs and then design a framework, IdentityChain, which effectively and efficiently evaluates the self-consistency and conventional accuracy of a model at the same time. We study eleven Code LLMs and show that they fail to preserve self-consistency, which is indeed a distinct aspect from conventional accuracy. Furthermore, we show that IdentityChain can be used as a model debugging tool to expose weaknesses of Code LLMs by demonstrating three major weaknesses that we identify in current models using IdentityChain. Our code is available at https://github.com/marcusm117/IdentityChain.

Submitted 26 February, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

Comments: ICLR 2024

MSC Class: 68 ACM Class: I.2; D.2

arXiv:2310.07701 [pdf, other]

doi 10.1002/asna.20230075

The sulphur species in hot rocky exoplanet atmospheres

Authors: L. J. Janssen, P. Woitke, O. Herbort, M. Min, K. L. Chubb, Ch. Helling, L. Carone

Abstract: The first JWST observations of hot Jupiters showed an unexpected detection of SO2 in their hydrogen-rich atmospheres. We investigate how much sulphur can be expected in the atmospheres of rocky exoplanets and which sulphur molecules can be expected to be most abundant and detectable by transmission spectroscopy. We run thermo-chemical equilibrium models at the crust-atmosphere interface, considering surface temperatures 500 to 5000 K, surface pressures 1 to 100 bar, and various sets of element abundances based on common rock compositions. Between 1000 K and 2000 K, we find gaseous sulphur concentrations of up to 25 percent above the rock in our models. SO2, SO, H2S and S2 are by far the most abundant sulphur molecules. SO2 shows potentially detectable features in transmission spectra at about 4 micron, between 7 and 8 micron, and beyond 15 micron. In contrast, the sometimes abundant H2S molecule is difficult to detect in these spectra, which are mostly dominated by H2O and CO2. Although the molecule PS only occurs with concentrations below 300 ppm, it can cause a strong absorption feature between 0.3 and 0.65 micron in some of our models for high surface pressures. The detection of sulphur molecules would enable a better characterisation of the planetary surface.

Submitted 31 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

Comments: Typos have been corrected compared to version 1

MSC Class: 85

arXiv:2310.04505 [pdf, other]

doi 10.1051/0004-6361/202346262

Hydrocarbon chemistry in inner regions of planet forming disks

Authors: Jayatee Kanwar, Inga Kamp, Peter Woitke, Christian Rab, Wing-Fai Thi, Michiel Min

Abstract: The analysis of the mid-infrared spectra helps in understanding the composition of the gas in the inner, dense and warm terrestrial planet forming region of disks around young stars. ALMA has detected hydrocarbons in the outer regions of the planet forming disk and Spitzer detected C2H2 in the inner regions. JWST-MIRI provides high spectral resolution observations of C2H2, and a suite of more complex hydrocarbons are now reported. Interpreting the fluxes observed in the spectra is challenging and radiation thermo-chemical codes are needed to properly take into account the disk structure, radiative transfer, chemistry and thermal balance. Various disk physical parameters like the gas-to-dust ratio, dust evolution including radial drift, dust growth and settling can affect the fluxes observed in the mid-IR. Still, thermo-chemical disk models were not always successful in matching all observed molecular emission bands simultaneously. The goal of this project is two-fold. First, we analyse the warm carbon chemistry in the inner regions of the disk, i.e. within 10 au, to find pathways forming C2H2 potentially missing from the existing chemical networks. Second, we analyse the effect of the new chemistry on the line fluxes of acetylene. We use the radiative thermo-chemical disk code ProDiMo to expand the hydrocarbon chemistry that occurs in a typical T Tauri disk. We used the UMIST and the KIDA rate databases for collecting reactions for the species. We include a number of three-body and thermal decomposition reactions from the STAND2020 network. We included isotopomers for the species that were present in the databases. The chemistry is then analysed in the regions that produce observable features in the mid-infrared spectra. The effect of expanding the hydrocarbon chemistry on the mid-infrared spectra is studied. Acetylene is formed via two ....

Submitted 6 October, 2023; originally announced October 2023.

Comments: accepted for publication in A&A

Journal ref: A&A 681, A22 (2024)

arXiv:2309.16381 [pdf, other]

Nek5000/RS Performance on Advanced GPU Architectures

Authors: Misun Min, Yu-Hsiang Lan, Paul Fischer, Thilina Rathnayake, John Holmen

Abstract: We demonstrate NekRS performance results on various advanced GPU architectures. NekRS is a GPU-accelerated version of Nek5000 that targets high performance on exascale platforms. It is being developed in DOE's Center of Efficient Exascale Discretizations, which is one of the co-design centers under the Exascale Computing Project. In this paper, we consider Frontier, Crusher, Spock, Polaris, Perlmutter, ThetaGPU, and Summit. Simulations are performed with 17x17 rod-bundle geometries from small modular reactor applications. We discuss strong-scaling performance and analysis.

Submitted 28 September, 2023; originally announced September 2023.

Comments: 24 pages, 13 figures, 2 tables

MSC Class: 35-04 ACM Class: D.0; F.2; G.2; G.4; I.6; J.2

arXiv:2304.13131 [pdf, other]

Directed Chain Generative Adversarial Networks

Authors: Ming Min, Ruimeng Hu, Tomoyuki Ichiba

Abstract: Real-world data can be multimodal distributed, e.g., data describing the opinion divergence in a community, the interspike interval distribution of neurons, and the oscillators' natural frequencies. Generating multimodal distributed real-world data has become a challenge to existing generative adversarial networks (GANs). For example, neural stochastic differential equations (Neural SDEs), treated as infinite-dimensional GANs, have demonstrated successful performance mainly in generating unimodal time series data. In this paper, we propose a novel time series generator, named directed chain GANs (DC-GANs), which inserts a time series dataset (called a neighborhood process of the directed chain or input) into the drift and diffusion coefficients of the directed chain SDEs with distributional constraints. DC-GANs can generate new time series of the same distribution as the neighborhood process, and the neighborhood process will provide the key step in learning and generating multimodal distributed time series. The proposed DC-GANs are examined on four datasets, including two stochastic models from social sciences and computational neuroscience, and two real-world datasets on stock prices and energy consumption. To the best of our knowledge, DC-GANs are the first work that can generate multimodal time series data and consistently outperform state-of-the-art benchmarks with respect to measures of distribution, data similarity, and predictive ability.

Submitted 4 May, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: Published at ICML 2023

arXiv:2304.12536 [pdf, other]

Exploring Compositional Visual Generation with Latent Classifier Guidance

Authors: Changhao Shi, Haomiao Ni, Kai Li, Shaobo Han, Mingfu Liang, Martin Renqiang Min

Abstract: Diffusion probabilistic models have achieved enormous success in the field of image generation and manipulation. In this paper, we explore a novel paradigm of using the diffusion model and classifier guidance in the latent semantic space for compositional visual tasks. Specifically, we train latent diffusion models and auxiliary latent classifiers to facilitate non-linear navigation of latent representation generation for any pre-trained generative model with a semantic latent space. We demonstrate that such conditional generation achieved by latent classifier guidance provably maximizes a lower bound of the conditional log probability during training. To maintain the original semantics during manipulation, we introduce a new guidance term, which we show is crucial for achieving compositionality. With additional assumptions, we show that the non-linear manipulation reduces to a simple latent arithmetic approach. We show that this paradigm based on latent classifier guidance is agnostic to pre-trained generative models, and present competitive results for both image generation and sequential manipulation of real and synthetic images. Our findings suggest that latent classifier guidance is a promising approach that merits further exploration, even in the presence of other strong competing methods.

Submitted 24 May, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

Comments: Accepted to CVPR Workshop 2023

arXiv:2303.13744 [pdf, other]

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

Authors: Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min

Abstract: Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video starting from an image (e.g., a person's face) and a condition (e.g., an action class label like smile). The key challenge of the cI2V task lies in the simultaneous generation of realistic spatial appearance and temporal dynamics corresponding to the given image and condition. In this paper, we propose an approach for cI2V using novel latent flow diffusion models (LFDM) that synthesize an optical flow sequence in the latent space based on the given condition to warp the given image. Compared to previous direct-synthesis-based works, our proposed LFDM can better synthesize spatial details and temporal motion by fully utilizing the spatial content of the given image and warping it in the latent space according to the generated temporally-coherent flow. The training of LFDM consists of two separate stages: (1) an unsupervised learning stage to train a latent flow auto-encoder for spatial content generation, including a flow predictor to estimate latent flow between pairs of video frames, and (2) a conditional learning stage to train a 3D-UNet-based diffusion model (DM) for temporal latent flow generation. Unlike previous DMs operating in pixel space or latent feature space that couples spatial and temporal information, the DM in our LFDM only needs to learn a low-dimensional latent flow space for motion generation, thus being more computationally efficient. We conduct comprehensive experiments on multiple datasets, where LFDM consistently outperforms prior arts. Furthermore, we show that LFDM can be easily adapted to new domains by simply finetuning the image decoder. Our code is available at https://github.com/nihaomiao/CVPR23_LFDM.

Submitted 23 March, 2023; originally announced March 2023.

Comments: CVPR 2023

arXiv:2303.02162 [pdf, other]

T-Cell Receptor Optimization with Reinforcement Learning and Mutation Policies for Precision Immunotherapy

Authors: Ziqi Chen, Martin Renqiang Min, Hongyu Guo, Chao Cheng, Trevor Clancy, Xia Ning

Abstract: T cells monitor the health status of cells by identifying foreign peptides displayed on their surface. T-cell receptors (TCRs), which are protein complexes found on the surface of T cells, are able to bind to these peptides. This process is known as TCR recognition and constitutes a key step for immune response. Optimizing TCR sequences for TCR recognition represents a fundamental step towards the development of personalized treatments to trigger immune responses killing cancerous or virus-infected cells. In this paper, we formulated the search for these optimized TCRs as a reinforcement learning (RL) problem, and presented a framework TCRPPO with a mutation policy using proximal policy optimization. TCRPPO mutates TCRs into effective ones that can recognize given peptides. TCRPPO leverages a reward function that combines the likelihoods of mutated sequences being valid TCRs measured by a new scoring function based on deep autoencoders, with the probabilities of mutated sequences recognizing peptides from a peptide-TCR interaction predictor. We compared TCRPPO with multiple baseline methods and demonstrated that TCRPPO significantly outperforms all the baseline methods to generate positive binding and valid TCRs. These results demonstrate the potential of TCRPPO for both precision immunotherapy and peptide-recognizing TCR motif discovery.

Submitted 2 March, 2023; originally announced March 2023.

arXiv:2302.04629 [pdf, other]

doi 10.1051/0004-6361/202245461

Analysing the SEDs of protoplanetary disks with machine learning

Authors: T. Kaeufer, P. Woitke, M. Min, I. Kamp, C. Pinte

Abstract: ABRIDGED. The analysis of spectral energy distributions (SEDs) of protoplanetary disks to determine their physical properties is known to be highly degenerate. Hence, a Bayesian analysis is required to obtain parameter uncertainties and degeneracies. The challenge here is computational speed, as one radiative transfer model requires a couple of minutes to compute. We performed a Bayesian analysis for 30 well-known protoplanetary disks to determine their physical disk properties, including uncertainties and degeneracies. To circumvent the computational cost problem, we created neural networks (NNs) to emulate the SED generation process. We created two sets of radiative transfer disk models to train and test two NNs that predict SEDs for continuous and discontinuous disks. A Bayesian analysis was then performed on 30 protoplanetary disks with SED data collected by the DIANA project to determine the posterior distributions of all parameters. We ran this analysis twice, (i) with old distances and additional parameter constraints as used in a previous study, to compare results, and (ii) with updated distances and free choice of parameters to obtain homogeneous and unbiased model parameters. We evaluated the uncertainties in the determination of physical disk parameters from SED analysis, and detected and quantified the strongest degeneracies. The NNs are able to predict SEDs within 1 ms with uncertainties of about 5% compared to the true SEDs obtained by the radiative transfer code. We find parameter values and uncertainties that are significantly different from previous values obtained by $χ^2$ fitting. Comparing the global evidence for continuous and discontinuous disks, we find that 26 out of 30 objects are better described by disks that have two distinct radial zones. Also, we created an interactive tool that instantly returns the SED predicted by our NNs for any parameter combination.

Submitted 9 February, 2023; originally announced February 2023.

Comments: 40 pages, 22 figures, the online tool is available at https://tillkaeufer.github.io/sedpredictor

Journal ref: A&A 672, A30 (2023)

arXiv:2301.02622 [pdf, other]

doi 10.1051/0004-6361/202245689

What does a typical full-disc around a post-AGB binary look like? -- Radiative transfer models reproducing PIONIER, GRAVITY, and MATISSE data

Authors: A. Corporaal, J. Kluska, H. Van Winckel, D. Kamath, M. Min

Abstract: (abridged) Stable circumbinary discs around evolved post-Asymptotic Giant branch (post-AGB) binary systems show many similarities with protoplanetary discs around young stellar objects. These discs can provide constraints on both binary evolution and the formation of macrostructures within circumstellar discs. Here we focus on one post-AGB binary system: IRAS08544-4431. We aim to refine the physical model of IRAS08544-4431 with a radiative transfer treatment and continue the near-infrared and mid-infrared interferometric analysis covering the H-, K-, L-, and N-bands. We aim to capture the previously detected amount of over-resolved flux and the radial intensity profile at and beyond the inner dust disc rim to put constraints on the physical processes in the inner disc regions. We used a Monte Carlo radiative transfer code to investigate the physical structure of the disc by reproducing both the photometry and the multi-wavelength infrared interferometric data set. We developed a strategy to identify the models which perform best to reproduce our data set. We found a family of models that successfully fit the infrared photometric and interferometric data in all bands. Some over-resolved flux component was recovered in all bands but the optimised models still fall short to explain all the over-resolved flux. This suggests that another dusty structure within the system plays a role. Multi-wavelength infrared interferometric observations of circumstellar discs allow to study the inner disc regions in unprecedented detail. The refined physical models can reproduce most of the investigated features, including the photometric characteristics, the radial extent, and the overall shape of the visibility curves. Our multi-wavelength interferometric observations combined with photometry show that the disc is similar to protoplanetary discs with similar dust masses and efficient dust growth.

Submitted 12 January, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

Comments: 18 pages, 13 figures (including appendix), accepted for publication in A&A

Journal ref: A&A 671, A15 (2023)

arXiv:2301.01413 [pdf, other]

Attribute-Centric Compositional Text-to-Image Generation

Authors: Yuren Cong, Martin Renqiang Min, Li Erran Li, Bodo Rosenhahn, Michael Ying Yang

Abstract: Despite the recent impressive breakthroughs in text-to-image generation, generative models have difficulty in capturing the data distribution of underrepresented attribute compositions while over-memorizing overrepresented attribute compositions, which raises public concerns about their robustness and fairness. To tackle this challenge, we propose ACTIG, an attribute-centric compositional text-to-image generation framework. We present an attribute-centric feature augmentation and a novel image-free training scheme, which greatly improve the model's ability to generate images with underrepresented attributes. We further propose an attribute-centric contrastive loss to avoid overfitting to overrepresented attribute compositions. We validate our framework on the CelebA-HQ and CUB datasets. Extensive experiments show that the compositional generalization of ACTIG is outstanding, and our framework outperforms previous works in terms of image quality and text-image consistency.

Submitted 3 January, 2023; originally announced January 2023.

arXiv:2211.00633 [pdf, other]

doi 10.1051/0004-6361/202244939

Clouds form on the hot Saturn JWST ERO target WASP-96b

Authors: Dominic Samra, Christiane Helling, Katy Chubb, Michiel Min, Ludmila Carone, Aaron Schneider

Abstract: WASP-96b is a hot Saturn exoplanet, with an equilibrium temperature well within the regime of thermodynamically expected extensive cloud formation. Prior observations with Hubble/WFC3, Spitzer/IRAC, and VLT/FORS2 have been combined into a single spectrum for which retrievals suggest a cold but cloud-free atmosphere. Recently, the planet was observed with the James Webb Space Telescope (JWST) as part of the Early Release Observations (ERO). 1D profiles are extracted from the 3D GCM expeRT/MITgcm results and used as input for a kinetic, non-equilibrium model to study the formation of mineral cloud particles of mixed composition. The ARCiS retrieval framework is applied to the pre-JWST WASP-96b transit spectra to investigate the apparent contradiction between cloudy models and assumed cloud-free transit spectra. Clouds are predicted to be ubiquitous throughout the atmosphere of WASP-96b. Silicate materials contribute between 40% and 90%, hence, also metal oxides contribute with up to 40% in the low-pressure regimes that affect the spectra. We explore how to match these cloudy models with currently available atmospheric transit spectra. A reduced vertical mixing acts to settle clouds deeper in the atmosphere, and an increased cloud particle porosity reduces the opacity of clouds in the near-IR and optical region. Both these effects allow for clearer molecular features to be observed, while still allowing clouds to be in the atmosphere. The atmosphere of WASP-96b is unlikely to be cloud free. Also retrievals of HST, Spitzer and VLT spectra show that multiple cloudy solutions reproduce the data. JWST observations will be affected by clouds, where within even the NIRISS wavelength range the cloud top pressure varies by an order of magnitude. The long wavelength end of NIRSpec and short end of MIRI may probe atmospheric asymmetries between the limbs of the terminator on WASP-96b.

Submitted 1 November, 2022; originally announced November 2022.

Comments: 17 pages, 13 figures, accepted for publication in A&A

Journal ref: A&A 669, A142 (2023)

arXiv:2210.08171 [pdf, other]

Disentangled Wasserstein Autoencoder for T-Cell Receptor Engineering

Authors: Tianxiao Li, Hongyu Guo, Filippo Grazioli, Mark Gerstein, Martin Renqiang Min

Abstract: In protein biophysics, the separation between the functionally important residues (forming the active site or binding surface) and those that create the overall structure (the fold) is a well-established and fundamental concept. Identifying and modifying those functional sites is critical for protein engineering but computationally non-trivial, and requires significant domain knowledge. To automate this process from a data-driven perspective, we propose a disentangled Wasserstein autoencoder with an auxiliary classifier, which isolates the function-related patterns from the rest with theoretical guarantees. This enables one-pass protein sequence editing and improves the understanding of the resulting sequences and editing actions involved. To demonstrate its effectiveness, we apply it to T-cell receptors (TCRs), a well-studied structure-function case. We show that our method can be used to alter the function of TCRs without changing the structural backbone, outperforming several competing methods in generation quality and efficiency, and requiring only 10% of the running time needed by baseline models. To our knowledge, this is the first approach that utilizes disentangled representations for TCR engineering.

Submitted 16 October, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

arXiv:2210.00904 [pdf, other]

Towards Exascale for Wind Energy Simulations

Authors: Misun Min, Michael Brazell, Ananias Tomboulides, Matthew Churchfield, Paul Fischer, Michael Sprague

Abstract: We examine large-eddy-simulation modeling approaches and computational performance of two open-source computational fluid dynamics codes for the simulation of atmospheric boundary layer (ABL) flows that are of direct relevance to wind energy production. The first is NekRS, a high-order, unstructured-grid, spectral element code. The second, AMR-Wind, is a block-structured, second-order finite-volume code with adaptive-mesh-refinement capabilities. The objective of this study is to co-develop these codes in order to improve model fidelity and performance for each. These features will be critical for running ABL-based applications such as wind farm analysis on advanced computing architectures. To this end, we investigate the performance of NekRS and AMR-Wind on the Oak Ridge Leadership Facility supercomputers Summit, using 4 to 800 nodes (24 to 4,800 NVIDIA V100 GPUs), and Crusher, the testbed for the Frontier exascale system using 18 to 384 Graphics Compute Dies on AMD MI250X GPUs. We compare strong- and weak-scaling capabilities, linear solver performance, and time to solution. We also identify leading inhibitors to parallel scaling.

Submitted 30 September, 2022; originally announced October 2022.

Comments: 13 pages, 7 figures, 6 tables

MSC Class: 35-04 ACM Class: G.4; I.6

arXiv:2208.00469 [pdf, other]

doi 10.1051/0004-6361/202244647

H2S and SO2 detectability in Hot Jupiters: Sulfur species as indicator of metallicity and C/O ratio

Authors: J. Polman, L. B. F. M. Waters, M. Min, Y. Miguel, N. Khorshid

Abstract: The high cosmic abundance and the intermediate volatility and chemical properties of sulfur allow the use of sulfur-bearing species as a tracer of the chemical processes in the atmospheres of hot Jupiter exoplanets. Nevertheless, despite its properties and relevance as a tracer of the giant planets' formation history, little attention has been paid to this species in the context of hot Jupiter atmospheres. Here we provide an overview of the abundances of sulfur-bearing species in hot Jupiter atmospheres under different conditions and explore their observability. We use the photochemical kinetics code VULCAN to model hot Jupiter atmospheric disequilibrium chemistry. Transmission spectra for these atmospheres are created using the modelling framework ARCiS. We vary model parameters such as the diffusion coefficient, and we study the importance of photochemistry on the resulting mixing ratios. Furthermore, we vary the chemical composition of the atmosphere by increasing the metallicity from solar to ~10 times solar. We also explore different C/O ratios. We find that H2S and SO2 are the best candidates for detection between 1 and 10 micron, using a spectral resolution that is representative of the instruments on board the JWST. H2S is easiest to detect at an equilibrium temperature of ~1500 K and C/O ratios between 0.7 and 0.9, with the ideal value increasing slightly for increasing metallicity. SO2 is most likely to be detected at an equilibrium temperature of ~1000 K at low C/O ratios and high metallicities. Nevertheless, among these two molecules, we expect SO2 detection to be more common, as it is detectable in scenarios more favoured by formation models. We conclude that H2S and SO2 will most likely be detected in the coming years with the JWST and that the detection of these species will provide information on atmospheric processes and planet formation scenarios.

Submitted 16 November, 2022; v1 submitted 31 July, 2022; originally announced August 2022.

Comments: Accepted for publication in A&A

arXiv:2207.00739 [pdf, other]

Deep Learning for Systemic Risk Measures

Authors: Yichen Feng, Ming Min, Jean-Pierre Fouque

Abstract: The aim of this paper is to study a new methodological framework for systemic risk measures by applying deep learning as a tool to compute the optimal strategy of capital allocations. Under this new framework, systemic risk measures can be interpreted as the minimal amount of cash that secures the aggregated system by allocating capital to the single institutions before aggregating the individual risks. This problem has no explicit solution except in very limited situations. Deep learning is receiving increasing attention in financial modeling and risk management, and we propose deep learning based algorithms to solve both the primal and dual problems of the risk measures, and thus to learn the fair risk allocations. In particular, our method for the dual problem involves a training philosophy inspired by the well-known Generative Adversarial Networks (GAN) approach and a newly designed direct estimation of the Radon-Nikodym derivative. We close the paper with substantial numerical studies of the subject and provide interpretations of the risk allocations associated with the systemic risk measures. In the particular case of exponential preferences, numerical experiments demonstrate excellent performance of the proposed algorithm when compared with the optimal explicit solution as a benchmark.

Submitted 2 July, 2022; originally announced July 2022.

arXiv:2206.09738 [pdf, other]

doi 10.1051/0004-6361/202142800

Exoplanet atmosphere retrievals in 3D using phase curve data with ARCiS: application to WASP-43b

Authors: Katy L. Chubb, Michiel Min

Abstract: Aims. To create a retrieval framework which encapsulates the 3D nature of exoplanet atmospheres, and to apply it to observed emission phase curve and transmission spectra of hot Jupiter exoplanet WASP-43b. Methods. We present our 3D framework, which is freely available as a stand-alone module from GitHub. We use the atmospheric modelling and Bayesian retrieval package ARCiS (ARtful modelling Code for exoplanet Science) to perform eight 3D retrievals on simultaneous transmission (HST/WFC3) and phase-dependent emission (HST/WFC3, Spitzer/IRAC) observations of WASP-43b as a case study. We assess how input assumptions affect our retrieval outcomes. In particular we look at constraining equilibrium chemistry vs a free molecular retrieval, the case of no clouds vs parametrised clouds, and using Spitzer phase data that have been reduced from two different literature sources. For the free chemistry retrievals, we retrieve abundances of H2O, CH4, CO, CO2, AlO, and NH3 as a function of phase, with many more species considered for the equilibrium chemistry retrievals. Results. We find consistent super-solar C/O (0.6-0.9) and super-solar metallicities (1.7-2.9 dex) for all retrieval setups that assume equilibrium chemistry. Atmospheric heat distribution, hotspot shift (15.6 vs 4.5 for different Spitzer datasets), and temperature structure are influenced by the Spitzer emission phase data. Comparisons are made with other studies of WASP-43b, including available GCM simulations. Conclusions. The parametrised 3D setup we have developed provides a valuable tool to analyse extensive observational datasets such as spectroscopic phase curves. Near-future observations with missions such as the James Webb Space Telescope (JWST) and Ariel will greatly improve our understanding of the atmospheres of exoplanets such as WASP-43b.

Submitted 20 June, 2022; originally announced June 2022.

Comments: 31 pages; published in A&A

Journal ref: A&A 665, A2 (2022)

arXiv:2206.05885 [pdf, other]

Auction-Promoted Trading for Multiple Federated Learning Services in UAV-Aided Networks

Authors: Zhipeng Cheng, Minghui Liwang, Xiaoyu Xia, Minghui Min, Xianbin Wang, Xiaojiang Du

Abstract: Federated learning (FL) represents a promising distributed machine learning paradigm that allows smart devices to collaboratively train a shared model by providing local data sets. However, problems considering multiple co-existing FL services and different types of service providers are rarely studied. In this paper, we investigate a multiple FL service trading problem in Unmanned Aerial Vehicle (UAV)-aided networks, where FL service demanders (FLSDs) aim to purchase various data sets from feasible clients (smart devices, e.g., smartphones, smart vehicles), and model aggregation services from UAVs, to fulfill their requirements. An auction-based trading market is established to facilitate the trading among three parties, i.e., FLSDs acting as buyers, distributed located client groups acting as data-sellers, and UAVs acting as UAV-sellers. The proposed auction is formalized as a 0-1 integer programming problem, aiming to maximize the overall buyers' revenue via investigating winner determination and payment rule design. Specifically, since two seller types (data-sellers and UAV-sellers) are considered, an interesting idea integrating seller pair and joint bid is introduced, which turns diverse sellers into virtual seller pairs. Vickrey-Clarke-Groves (VCG)-based and one-sided matching-based mechanisms are proposed, respectively, where the former achieves the optimal solutions but is computationally intractable, while the latter obtains suboptimal solutions that approach the optimal ones with low computational complexity, especially when a large number of participants is considered. Significant properties such as truthfulness and individual rationality are comprehensively analyzed for both mechanisms. Extensive experimental results verify the properties and demonstrate that our proposed mechanisms outperform representative methods significantly.

Submitted 12 June, 2022; originally announced June 2022.

Comments: 14 pages, 6 figures

arXiv:2203.16650 [pdf, ps, other]

Robust Beamforming for Localization-Aided Millimeter Wave Communication Systems

Authors: Junchang Sun, Shuai Ma, Shiyin Li, Ruixin Yang, Minghui Min, Gonzalo Seco-Granados

Abstract: In this letter, we investigate a robust beamforming problem for localization-aided millimeter wave (mmWave) communication systems. To handle this problem, we propose a novel restriction and relaxation (R&R) method. The proposed R&R method aims at minimizing the total transmit power while the positioning error follows a Gaussian distribution. Specifically, in the restriction phase of R&R, the probabilistic constraint is transformed into a deterministic form by using the Bernstein-type inequality. In the relaxation phase of R&R, the non-convex optimization problem is reformulated into a convex semidefinite program (SDP) by using semidefinite relaxation (SDR) and first-order Taylor expansion methods. To the best of our knowledge, we are the first to consider the impact of the distribution of the positioning error on the channel state information (CSI), which further influences the data rate. Numerical results present the trade-off of the beamforming between communication and positioning.

Submitted 30 March, 2022; originally announced March 2022.

arXiv:2203.15799 [pdf, other]

StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis

Authors: Zhiheng Li, Martin Renqiang Min, Kai Li, Chenliang Xu

Abstract: Although progress has been made for text-to-image synthesis, previous methods fall short of generalizing to unseen or underrepresented attribute compositions in the input text. Lacking compositionality could have severe implications for robustness and fairness, e.g., inability to synthesize the face images of underrepresented demographic groups. In this paper, we introduce a new framework, StyleT2I, to improve the compositionality of text-to-image synthesis. Specifically, we propose a CLIP-guided Contrastive Loss to better distinguish different compositions among different sentences. To further improve the compositionality, we design a novel Semantic Matching Loss and a Spatial Constraint to identify attributes' latent directions for intended spatial region manipulations, leading to better disentangled latent representations of attributes. Based on the identified latent directions of attributes, we propose Compositional Attribute Adjustment to adjust the latent code, resulting in better compositionality of image synthesis. In addition, we leverage the $\ell_2$-norm regularization of identified latent directions (norm penalty) to strike a nice balance between image-text alignment and image fidelity. In the experiments, we devise a new dataset split and an evaluation metric to evaluate the compositionality of text-to-image synthesis models. The results show that StyleT2I outperforms previous approaches in terms of the consistency between the input text and synthesized images and achieves higher fidelity.

Submitted 29 March, 2022; originally announced March 2022.

Comments: CVPR 2022

arXiv:2203.01236 [pdf, other]

doi 10.1051/0004-6361/202142976

Convolutional neural networks as an alternative to Bayesian retrievals

Authors: Francisco Ardevol Martinez, Michiel Min, Inga Kamp, Paul I. Palmer

Abstract: Exoplanet observations are currently analysed with Bayesian retrieval techniques. Due to the computational load of the models used, a compromise is needed between model complexity and computing time. Analysis of data from future facilities will need more complex models, which will increase the computational load of retrievals, prompting the search for a faster approach for interpreting exoplanet observations. Our goal is to compare machine learning retrievals of exoplanet transmission spectra with nested sampling, and understand if machine learning can be as reliable as Bayesian retrievals for a statistically significant sample of spectra while being orders of magnitude faster. We generate grids of synthetic transmission spectra and their corresponding planetary and atmospheric parameters, one using free chemistry models, and the other using equilibrium chemistry models. Each grid is subsequently rebinned to simulate both HST/WFC3 and JWST/NIRSpec observations, yielding four datasets in total. Convolutional neural networks (CNNs) are trained with each of the datasets. We perform retrievals on 1,000 simulated observations for each combination of model type and instrument with nested sampling and machine learning. We also use both methods to perform retrievals on real WFC3 transmission spectra. Finally, we test how robust machine learning and nested sampling are against incorrect assumptions in our models. CNNs reach a lower coefficient of determination between predicted and true values of the parameters. Nested sampling underestimates the uncertainty in ~8% of retrievals, whereas CNNs estimate them correctly. For real WFC3 observations, nested sampling and machine learning agree within $2\sigma$ for ~86% of spectra. When doing retrievals with incorrect assumptions, nested sampling underestimates the uncertainty in ~12% to ~41% of cases, whereas this is always below ~10% for the CNN.

Submitted 3 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

Comments: Accepted for publication in A&A

Journal ref: A&A 662, A108 (2022)

arXiv:2203.00482 [pdf, other]

doi 10.1007/s10686-021-09821-w

A retrieval challenge exercise for the Ariel mission

Authors: Joanna K. Barstow, Quentin Changeat, Katy L. Chubb, Patricio E. Cubillos, Billy Edwards, Ryan J. MacDonald, Michiel Min, Ingo P. Waldmann

Abstract: The Ariel mission, due to launch in 2029, will obtain spectroscopic information for 1000 exoplanets, providing an unprecedented opportunity for comparative exoplanetology. Retrieval codes - parametric atmospheric models coupled with an inversion algorithm - represent the tool of choice for interpreting Ariel data. Ensuring that reliable and consistent results can be produced by these tools is a critical preparatory step for the mission. Here, we present the results of a retrieval challenge. We use five different exoplanet retrieval codes to analyse the same synthetic datasets, and test a) the ability of each to recover the correct input solution and b) the consistency of the results. We find that generally there is very good agreement between the five codes, and in the majority of cases the correct solutions are recovered. This demonstrates the reproducibility of retrievals for transit spectra of exoplanets, even when codes are not previously benchmarked against each other.

Submitted 15 March, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

Comments: 28 pages, 14 figures. Accepted in Experimental Astronomy (2022)

Showing 1–50 of 236 results for author: Min, M