-
A probabilistic framework for learning non-intrusive corrections to long-time climate simulations from short-time training data
Authors:
Benedikt Barthel Sorensen,
Leonardo Zepeda-Núñez,
Ignacio Lopez-Gomez,
Zhong Yi Wan,
Rob Carver,
Fei Sha,
Themistoklis Sapsis
Abstract:
Chaotic systems, such as turbulent flows, are ubiquitous in science and engineering. However, their study remains a challenge due to the large range of scales, and the strong interaction with other, often not fully understood, physics. As a consequence, the spatiotemporal resolution required for accurate simulation of these systems is typically computationally infeasible, particularly for applications of long-term risk assessment, such as the quantification of extreme weather risk due to climate change. While data-driven modeling offers some promise of alleviating these obstacles, the scarcity of high-quality simulations results in limited available data to train such models, which is often compounded by the lack of stability for long-horizon simulations. As such, the computational, algorithmic, and data restrictions generally imply that the probability of rare extreme events is not accurately captured. In this work we present a general strategy for training neural network models to non-intrusively correct under-resolved long-time simulations of chaotic systems. The approach is based on training a post-processing correction operator on under-resolved simulations nudged towards a high-fidelity reference. This enables us to learn the dynamics of the underlying system directly, which allows us to use very little training data, even when the statistics thereof are far from converged. Additionally, through the use of probabilistic network architectures we are able to leverage the uncertainty due to the limited training data to further improve extrapolation capabilities. We apply our framework to severely under-resolved simulations of quasi-geostrophic flow and demonstrate its ability to accurately predict the anisotropic statistics over time horizons more than 30 times longer than the data seen in training.
Submitted 2 August, 2024;
originally announced August 2024.
-
Roadmaps with Gaps over Controllers: Achieving Efficiency in Planning under Dynamics
Authors:
Aravind Sivaramakrishnan,
Sumanth Tangirala,
Edgar Granados,
Noah R. Carver,
Kostas E. Bekris
Abstract:
This paper aims to improve the computational efficiency of motion planning for mobile robots with non-trivial dynamics through the use of learned controllers. Offline, a system-specific controller is first trained in an empty environment. Then, for the target environment, the approach constructs a data structure, a "Roadmap with Gaps," to approximately learn how to solve planning queries using the learned controller. The roadmap nodes correspond to local regions. Edges correspond to applications of the learned controller that approximately connect these regions. Gaps arise as the controller does not perfectly connect pairs of individual states along edges. Online, given a query, a tree sampling-based motion planner uses the roadmap so that the tree's expansion is informed towards the goal region. The tree expansion selects local subgoals given a wavefront on the roadmap that guides towards the goal. When the controller cannot reach a subgoal region, the planner resorts to random exploration to maintain probabilistic completeness and asymptotic optimality. The accompanying experimental evaluation shows that the approach significantly improves the computational efficiency of motion planning on various benchmarks, including physics-based vehicular models on uneven and varying friction terrains as well as a quadrotor under air pressure effects.
Submitted 3 October, 2024; v1 submitted 4 October, 2023;
originally announced October 2023.
-
WeatherBench 2: A benchmark for the next generation of data-driven global weather models
Authors:
Stephan Rasp,
Stephan Hoyer,
Alexander Merose,
Ian Langmore,
Peter Battaglia,
Tyler Russel,
Alvaro Sanchez-Gonzalez,
Vivian Yang,
Rob Carver,
Shreya Agrawal,
Matthew Chantry,
Zied Ben Bouallegue,
Peter Dueben,
Carla Bromberg,
Jared Sisk,
Luke Barrington,
Aaron Bell,
Fei Sha
Abstract:
WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and state-of-the-art models: https://sites.research.google/weatherbench. This paper describes the design principles of the evaluation framework and presents results for current state-of-the-art physical and data-driven weather models. The metrics are based on established practices for evaluating weather forecasts at leading operational weather centers. We define a set of headline scores to provide an overview of model performance. In addition, we also discuss caveats in the current evaluation setup and challenges for the future of data-driven weather forecasting.
Submitted 26 January, 2024; v1 submitted 29 August, 2023;
originally announced August 2023.
-
SEEDS: Emulation of Weather Forecast Ensembles with Diffusion Models
Authors:
Lizao Li,
Rob Carver,
Ignacio Lopez-Gomez,
Fei Sha,
John Anderson
Abstract:
Uncertainty quantification is crucial to decision-making. A prominent example is probabilistic forecasting in numerical weather prediction. The dominant approach to representing uncertainty in weather forecasting is to generate an ensemble of forecasts. This is done by running many physics-based simulations under different conditions, which is a computationally costly process. We propose to amortize the computational cost by emulating these forecasts with deep generative diffusion models learned from historical data. The learned models are highly scalable with respect to high-performance computing accelerators and can sample hundreds to tens of thousands of realistic weather forecasts at low cost. When designed to emulate operational ensemble forecasts, the generated ones are similar to physics-based ensembles in important statistical properties and predictive skill. When designed to correct biases present in the operational forecasting system, the generated ensembles show improved probabilistic forecast metrics. They are more reliable and forecast probabilities of extreme weather events more accurately. While this work demonstrates the utility of the methodology by focusing on weather forecasting, the generative artificial intelligence methodology can be extended for uncertainty quantification in climate modeling, where we believe the generation of very large ensembles of climate projections will play an increasingly important role in climate risk assessment.
Submitted 8 October, 2023; v1 submitted 24 June, 2023;
originally announced June 2023.
-
A Machine Learning Outlook: Post-processing of Global Medium-range Forecasts
Authors:
Shreya Agrawal,
Rob Carver,
Cenk Gazen,
Eric Maddy,
Vladimir Krasnopolsky,
Carla Bromberg,
Zack Ontiveros,
Tyler Russell,
Jason Hickey,
Sid Boukabara
Abstract:
Post-processing typically takes the outputs of a Numerical Weather Prediction (NWP) model and applies linear statistical techniques to produce improved localized forecasts, by including additional observations, or determining systematic errors at a finer scale. In this pilot study, we investigate the benefits and challenges of using non-linear neural network (NN) based methods to post-process multiple weather features -- temperature, moisture, wind, geopotential height, precipitable water -- at 30 vertical levels, globally and at lead times up to 7 days. We show that we can achieve accuracy improvements of up to 12% (RMSE) in a field such as temperature at 850hPa for a 7 day forecast. However, we recognize the need to strengthen foundational work on objectively measuring a sharp and correct forecast. We discuss the challenges of using standard metrics such as root mean squared error (RMSE) or anomaly correlation coefficient (ACC) as we move from linear statistical models to more complex non-linear machine learning approaches for post-processing global weather forecasts.
Submitted 28 March, 2023;
originally announced March 2023.
-
Determining Ionizing Doses in Medium Earth Orbits Using Long-Term GPS Particle Measurements
Authors:
Yue Chen,
Matthew R. Carver,
Steven K. Morley,
Andrew S. Hoover
Abstract:
We use long-term electron and proton in-situ measurements made by the CXD particle instruments, developed by Los Alamos National Laboratory and carried on board GPS satellites, to determine total ionizing dose (TID) values and daily/yearly dose rate (DR) values in medium Earth orbits (MEOs) caused by the natural space radiation environment. Here measurement-based TID and DR values on a simplified sample geometry--a small (with a radius of 0.1 mm) Silicon detector within an Aluminum shielding sphere with a thickness of 100 mil--are compared to those calculated from empirical radiation models. Results over the solar cycle 24 show that electron TID from measurements in GPS orbit is well above the values calculated from the median/mean fluences from AE8 and AE9 models, but close to model fluences at high percentiles. Also, it is confirmed that in MEOs proton contributions to TID are minor and mainly dominated by solar energetic protons. Several factors affecting those dose calculations are discussed and evaluated. Results from this study provide us another out-of-sample test on the reliability of existing empirical space radiation models, and also help estimate the margin factors on calculated dose values in MEOs that pass through the heart of the Earth's outer radiation belt.
Submitted 30 November, 2020;
originally announced December 2020.
-
Global Prompt Proton Sensor Network: Monitoring Solar Energetic Protons based on GPS Satellite Constellation
Authors:
Yue Chen,
Steven K. Morley,
Matthew R. Carver
Abstract:
Energetic particle instruments on board GPS satellites form a powerful global prompt proton sensor network (GPPSn) that provides an unprecedented opportunity to monitor and characterize solar energetic protons targeting the Earth. The medium-Earth-orbits of the GPS constellation have the unique advantage of allowing solar energetic protons to be simultaneously measured from multiple points in both open- and closed-field line regions. Examining two example intervals of solar proton events, we showcase in this study how GPS proton data are prepared, calibrated and utilized to reveal important features of solar protons, including their source, acceleration/scattering by interplanetary shocks, the relative position of Earth when impinged by these shocks, the shape of solar particle fronts, the access of solar protons inside the dynamic geomagnetic field, as well as temporally-varying proton distributions in both energy and space. By comparing to Van Allen Probes data, GPS proton observations are further demonstrated not only to be useful for qualitatively monitoring the dynamics of solar protons, but also for quantitative scientific research including determining cutoff L-shells. Our results establish that this GPPSn can join forces with other existing solar proton monitors and contribute to observing, warning, understanding and ultimately forecasting the incoming solar energetic proton events.
Submitted 8 January, 2020;
originally announced January 2020.
-
Laboratory Experiments, Numerical Simulations, and Astronomical Observations of Deflected Supersonic Jets: Application to HH 110
Authors:
P. Hartigan,
J. M. Foster,
B. H. Wilde,
R. F. Coker,
P. A. Rosen,
J. F. Hansen,
B. E. Blue,
R. J. R. Williams,
R. Carver,
A. Frank
Abstract:
Collimated supersonic flows in laboratory experiments behave in a similar manner to astrophysical jets provided that radiation, viscosity, and thermal conductivity are unimportant in the laboratory jets, and that the experimental and astrophysical jets share similar dimensionless parameters such as the Mach number and the ratio of the density between the jet and the ambient medium. Laboratory jets can be studied for a variety of initial conditions, arbitrary viewing angles, and different times, attributes especially helpful for interpreting astronomical images where the viewing angle and initial conditions are fixed and the time domain is limited. Experiments are also a powerful way to test numerical fluid codes in a parameter range where the codes must perform well. In this paper we combine images from a series of laboratory experiments of deflected supersonic jets with numerical simulations and new spectral observations of an astrophysical example, the young stellar jet HH 110. The experiments provide key insights into how deflected jets evolve in 3-D, particularly within working surfaces where multiple subsonic shells and filaments form, and along the interface where shocked jet material penetrates into and destroys the obstacle along its path. The experiments also underscore the importance of the viewing angle in determining what an observer will see. The simulations match the experiments so well that we can use the simulated velocity maps to compare the dynamics in the experiment with those implied by the astronomical spectra. The experiments support a model where the observed shock structures in HH 110 form as a result of a pulsed driving source rather than from weak shocks that may arise in the supersonic shear layer between the Mach disk and bow shock of the jet's working surface.
Submitted 1 October, 2009;
originally announced October 2009.