Search SciRate

8 results for au:Shafei_B in:stat

Show all abstracts

BoFire: Bayesian Optimization Framework Intended for Real Experiments
Johannes P. Dürholt, Thomas S. Asche, Johanna Kleinekorte, Gabriel Mancino-Ball, Benjamin Schiller, Simon Sung, Julian Keupp, Aaron Osburg, Toby Boyne, Ruth Misener, Rosona Eldred, Wagner Steuer Costa, Chrysoula Kappatou, Robert M. Lee, Dominik Linzner, David Walz, Niklas Wulkow, Behrang Shafei
Aug 12 2024 cs.LG math.OC stat.ML arXiv:2408.05040v1

@misc{2408.05040, author = {Johannes P.~Dürholt and Thomas S.~Asche and Johanna Kleinekorte and Gabriel Mancino-Ball and Benjamin Schiller and Simon Sung and Julian Keupp and Aaron Osburg and Toby Boyne and Ruth Misener and Rosona Eldred and Wagner Steuer Costa and Chrysoula Kappatou and Robert M.~Lee and Dominik Linzner and David Walz and Niklas Wulkow and Behrang Shafei}, title = {{B}o{F}ire: {B}ayesian {O}ptimization {F}ramework {I}ntended for {R}eal {E}xperiments}, year = {2024}, eprint = {2408.05040}, note = {arXiv:2408.05040v1} }
PDF
Our open-source Python package BoFire combines Bayesian Optimization (BO) with other design of experiments (DoE) strategies focusing on developing and optimizing new chemistry. Previous BO implementations, for example as they exist in the literature or software, require substantial adaptation for effective real-world deployment in chemical industry. BoFire provides a rich feature-set with extensive configurability and realizes our vision of fast-tracking research contributions into industrial use via maintainable open-source software. Owing to quality-of-life features like JSON-serializability of problem formulations, BoFire enables seamless integration of BO into RESTful APIs, a common architecture component for both self-driving laboratories and human-in-the-loop setups. This paper discusses the differences between BoFire and other BO implementations and outlines ways that BO research needs to be adapted for real-world use in a chemistry setting.
Scalarisation-based risk concepts for robust multi-objective optimisation
Ben Tu, Nikolas Kantas, Robert M. Lee, Behrang Shafei
May 17 2024 math.OC cs.LG stat.ML arXiv:2405.10221v2

@misc{2405.10221, author = {Ben Tu and Nikolas Kantas and Robert M.~Lee and Behrang Shafei}, title = {{S}calarisation-based risk concepts for robust multi-objective optimisation}, year = {2024}, eprint = {2405.10221}, note = {arXiv:2405.10221v2} }
PDF
Robust optimisation is a well-established framework for optimising functions in the presence of uncertainty. The inherent goal of this problem is to identify a collection of inputs whose outputs are both desirable for the decision maker, whilst also being robust to the underlying uncertainties in the problem. In this work, we study the multi-objective case of this problem. We identify that the majority of all robust multi-objective algorithms rely on two key operations: robustification and scalarisation. Robustification refers to the strategy that is used to account for the uncertainty in the problem. Scalarisation refers to the procedure that is used to encode the relative importance of each objective to a scalar-valued reward. As these operations are not necessarily commutative, the order that they are performed in has an impact on the resulting solutions that are identified and the final decisions that are made. The purpose of this work is to give a thorough exposition on the effects of these different orderings and in particular highlight when one should opt for one ordering over the other. As part of our analysis, we showcase how many existing risk concepts can be integrated into the specification and solution of a robust multi-objective optimisation problem. Besides this, we also demonstrate how one can principally define the notion of a robust Pareto front and a robust performance metric based on our ``robustify and scalarise'' methodology. To illustrate the efficacy of these new ideas, we present two insightful case studies which are based on real-world data sets.
Random Pareto front surfaces
Ben Tu, Nikolas Kantas, Robert M. Lee, Behrang Shafei
May 03 2024 stat.ML cs.LG math.OC stat.ME arXiv:2405.01404v2

@misc{2405.01404, author = {Ben Tu and Nikolas Kantas and Robert M.~Lee and Behrang Shafei}, title = {{R}andom {P}areto front surfaces}, year = {2024}, eprint = {2405.01404}, note = {arXiv:2405.01404v2} }
PDF
The goal of multi-objective optimisation is to identify the Pareto front surface which is the set obtained by connecting the best trade-off points. Typically this surface is computed by evaluating the objectives at different points and then interpolating between the subset of the best evaluated trade-off points. In this work, we propose to parameterise the Pareto front surface using polar coordinates. More precisely, we show that any Pareto front surface can be equivalently represented using a scalar-valued length function which returns the projected length along any positive radial direction. We then use this representation in order to rigorously develop the theory and applications of stochastic Pareto front surfaces. In particular, we derive many Pareto front surface statistics of interest such as the expectation, covariance and quantiles. We then discuss how these can be used in practice within a design of experiments setting, where the goal is to both infer and use the Pareto front surface distribution in order to make effective decisions. Our framework allows for clear uncertainty quantification and we also develop advanced visualisation techniques for this purpose. Finally we discuss the applicability of our ideas within multivariate extreme value theory and illustrate our methodology in a variety of numerical examples, including a case study with a real-world air pollution data set.
Practical Path-based Bayesian Optimization
Jose Pablo Folch, James Odgers, Shiqiang Zhang, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener
Dec 04 2023 cs.LG math.OC stat.ME arXiv:2312.00622v1

@misc{2312.00622, author = {Jose Pablo Folch and James Odgers and Shiqiang Zhang and Robert M Lee and Behrang Shafei and David Walz and Calvin Tsay and Mark van der Wilk and Ruth Misener}, title = {{P}ractical {P}ath-based {B}ayesian {O}ptimization}, year = {2023}, eprint = {2312.00622}, howpublished = {NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World}, note = {arXiv:2312.00622v1} }
PDF
There has been a surge in interest in data-driven experimental design with applications to chemical engineering and drug manufacturing. Bayesian optimization (BO) has proven to be adaptable to such cases, since we can model the reactions of interest as expensive black-box functions. Sometimes, the cost of this black-box functions can be separated into two parts: (a) the cost of the experiment itself, and (b) the cost of changing the input parameters. In this short paper, we extend the SnAKe algorithm to deal with both types of costs simultaneously. We further propose extensions to the case of a maximum allowable input change, as well as to the multi-objective setting.
Multi-objective optimisation via the R2 utilities
Ben Tu, Nikolas Kantas, Robert M. Lee, Behrang Shafei
May 22 2023 math.OC cs.LG stat.ML arXiv:2305.11774v3

@misc{2305.11774, author = {Ben Tu and Nikolas Kantas and Robert M.~Lee and Behrang Shafei}, title = {{M}ulti-objective optimisation via the {R}2 utilities}, year = {2023}, eprint = {2305.11774}, note = {arXiv:2305.11774v3} }
PDF
The goal of multi-objective optimisation is to identify a collection of points which describe the best possible trade-offs between the multiple objectives. In order to solve this vector-valued optimisation problem, practitioners often appeal to the use of scalarisation functions in order to transform the multi-objective problem into a collection of single-objective problems. This set of scalarised problems can then be solved using traditional single-objective optimisation techniques. In this work, we formalise this convention into a general mathematical framework. We show how this strategy effectively recasts the original multi-objective optimisation problem into a single-objective optimisation problem defined over sets. An appropriate class of objective functions for this new problem are the R2 utilities, which are utility functions that are defined as a weighted integral over the scalarised optimisation problems. As part of our work, we show that these utilities are monotone and submodular set functions which can be optimised effectively using greedy optimisation algorithms. We then analyse the performance of these greedy algorithms both theoretically and empirically. Our analysis largely focusses on Bayesian optimisation, which is a popular probabilistic framework for black-box optimisation.
Combining Multi-Fidelity Modelling and Asynchronous Batch Bayesian Optimization
Jose Pablo Folch, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener
Nov 14 2022 cs.LG cs.CE stat.ML arXiv:2211.06149v2

@misc{2211.06149, author = {Jose Pablo Folch and Robert M Lee and Behrang Shafei and David Walz and Calvin Tsay and Mark van der Wilk and Ruth Misener}, title = {{C}ombining {M}ulti-{F}idelity {M}odelling and {A}synchronous {B}atch {B}ayesian {O}ptimization}, year = {2022}, eprint = {2211.06149}, note = {arXiv:2211.06149v2} }
PDF
Bayesian Optimization is a useful tool for experiment design. Unfortunately, the classical, sequential setting of Bayesian Optimization does not translate well into laboratory experiments, for instance battery design, where measurements may come from different sources and their evaluations may require significant waiting times. Multi-fidelity Bayesian Optimization addresses the setting with measurements from different sources. Asynchronous batch Bayesian Optimization provides a framework to select new experiments before the results of the prior experiments are revealed. This paper proposes an algorithm combining multi-fidelity and asynchronous batch methods. We empirically study the algorithm behavior, and show it can outperform single-fidelity batch methods and multi-fidelity sequential methods. As an application, we consider designing electrode materials for optimal performance in pouch cells using experiments with coin cells to approximate battery performance.
Joint Entropy Search for Multi-objective Bayesian Optimization
Ben Tu, Axel Gandy, Nikolas Kantas, Behrang Shafei
Oct 07 2022 cs.LG math.OC stat.ML arXiv:2210.02905v1

@misc{2210.02905, author = {Ben Tu and Axel Gandy and Nikolas Kantas and Behrang Shafei}, title = {{J}oint {E}ntropy {S}earch for {M}ulti-objective {B}ayesian {O}ptimization}, year = {2022}, eprint = {2210.02905}, note = {arXiv:2210.02905v1} }
PDF
Many real-world problems can be phrased as a multi-objective optimization problem, where the goal is to identify the best set of compromises between the competing objectives. Multi-objective Bayesian optimization (BO) is a sample efficient strategy that can be deployed to solve these vector-valued optimization problems where access is limited to a number of noisy objective function evaluations. In this paper, we propose a novel information-theoretic acquisition function for BO called Joint Entropy Search (JES), which considers the joint information gain for the optimal set of inputs and outputs. We present several analytical approximations to the JES acquisition function and also introduce an extension to the batch setting. We showcase the effectiveness of this new approach on a range of synthetic and real-world problems in terms of the hypervolume and its weighted variants.
Tree ensemble kernels for Bayesian optimization with known constraints over mixed-feature spaces
Alexander Thebelt, Calvin Tsay, Robert M. Lee, Nathan Sudermann-Merx, David Walz, Behrang Shafei, Ruth Misener
Jul 05 2022 stat.ML cs.AI cs.LG math.OC arXiv:2207.00879v3

@misc{2207.00879, author = {Alexander Thebelt and Calvin Tsay and Robert M.~Lee and Nathan Sudermann-Merx and David Walz and Behrang Shafei and Ruth Misener}, title = {{T}ree ensemble kernels for {B}ayesian optimization with known constraints over mixed-feature spaces}, year = {2022}, eprint = {2207.00879}, note = {arXiv:2207.00879v3} }
PDF
Tree ensembles can be well-suited for black-box optimization tasks such as algorithm tuning and neural architecture search, as they achieve good predictive performance with little or no manual tuning, naturally handle discrete feature spaces, and are relatively insensitive to outliers in the training data. Two well-known challenges in using tree ensembles for black-box optimization are (i) effectively quantifying model uncertainty for exploration and (ii) optimizing over the piece-wise constant acquisition function. To address both points simultaneously, we propose using the kernel interpretation of tree ensembles as a Gaussian Process prior to obtain model variance estimates, and we develop a compatible optimization formulation for the acquisition function. The latter further allows us to seamlessly integrate known constraints to improve sampling efficiency by considering domain-knowledge in engineering settings and modeling search space symmetries, e.g., hierarchical relationships in neural architecture search. Our framework performs as well as state-of-the-art methods for unconstrained black-box optimization over continuous/discrete features and outperforms competing methods for problems combining mixed-variable feature spaces and known input constraints.