Search SciRate

7 results for au:Tsay_C in:stat

Show all abstracts

Practical Path-based Bayesian Optimization
Jose Pablo Folch, James Odgers, Shiqiang Zhang, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener
Dec 04 2023 cs.LG math.OC stat.ME arXiv:2312.00622v1

@misc{2312.00622, author = {Jose Pablo Folch and James Odgers and Shiqiang Zhang and Robert M Lee and Behrang Shafei and David Walz and Calvin Tsay and Mark van der Wilk and Ruth Misener}, title = {{P}ractical {P}ath-based {B}ayesian {O}ptimization}, year = {2023}, eprint = {2312.00622}, howpublished = {NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World}, note = {arXiv:2312.00622v1} }
PDF
There has been a surge in interest in data-driven experimental design with applications to chemical engineering and drug manufacturing. Bayesian optimization (BO) has proven to be adaptable to such cases, since we can model the reactions of interest as expensive black-box functions. Sometimes, the cost of this black-box functions can be separated into two parts: (a) the cost of the experiment itself, and (b) the cost of changing the input parameters. In this short paper, we extend the SnAKe algorithm to deal with both types of costs simultaneously. We further propose extensions to the case of a maximum allowable input change, as well as to the multi-objective setting.
Combining Multi-Fidelity Modelling and Asynchronous Batch Bayesian Optimization
Jose Pablo Folch, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener
Nov 14 2022 cs.LG cs.CE stat.ML arXiv:2211.06149v2

@misc{2211.06149, author = {Jose Pablo Folch and Robert M Lee and Behrang Shafei and David Walz and Calvin Tsay and Mark van der Wilk and Ruth Misener}, title = {{C}ombining {M}ulti-{F}idelity {M}odelling and {A}synchronous {B}atch {B}ayesian {O}ptimization}, year = {2022}, eprint = {2211.06149}, note = {arXiv:2211.06149v2} }
PDF
Bayesian Optimization is a useful tool for experiment design. Unfortunately, the classical, sequential setting of Bayesian Optimization does not translate well into laboratory experiments, for instance battery design, where measurements may come from different sources and their evaluations may require significant waiting times. Multi-fidelity Bayesian Optimization addresses the setting with measurements from different sources. Asynchronous batch Bayesian Optimization provides a framework to select new experiments before the results of the prior experiments are revealed. This paper proposes an algorithm combining multi-fidelity and asynchronous batch methods. We empirically study the algorithm behavior, and show it can outperform single-fidelity batch methods and multi-fidelity sequential methods. As an application, we consider designing electrode materials for optimal performance in pouch cells using experiments with coin cells to approximate battery performance.
Tree ensemble kernels for Bayesian optimization with known constraints over mixed-feature spaces
Alexander Thebelt, Calvin Tsay, Robert M. Lee, Nathan Sudermann-Merx, David Walz, Behrang Shafei, Ruth Misener
Jul 05 2022 stat.ML cs.AI cs.LG math.OC arXiv:2207.00879v3

@misc{2207.00879, author = {Alexander Thebelt and Calvin Tsay and Robert M.~Lee and Nathan Sudermann-Merx and David Walz and Behrang Shafei and Ruth Misener}, title = {{T}ree ensemble kernels for {B}ayesian optimization with known constraints over mixed-feature spaces}, year = {2022}, eprint = {2207.00879}, note = {arXiv:2207.00879v3} }
PDF
Tree ensembles can be well-suited for black-box optimization tasks such as algorithm tuning and neural architecture search, as they achieve good predictive performance with little or no manual tuning, naturally handle discrete feature spaces, and are relatively insensitive to outliers in the training data. Two well-known challenges in using tree ensembles for black-box optimization are (i) effectively quantifying model uncertainty for exploration and (ii) optimizing over the piece-wise constant acquisition function. To address both points simultaneously, we propose using the kernel interpretation of tree ensembles as a Gaussian Process prior to obtain model variance estimates, and we develop a compatible optimization formulation for the acquisition function. The latter further allows us to seamlessly integrate known constraints to improve sampling efficiency by considering domain-knowledge in engineering settings and modeling search space symmetries, e.g., hierarchical relationships in neural architecture search. Our framework performs as well as state-of-the-art methods for unconstrained black-box optimization over continuous/discrete features and outperforms competing methods for problems combining mixed-variable feature spaces and known input constraints.
OMLT: Optimization & Machine Learning Toolkit
Francesco Ceccon, Jordan Jalving, Joshua Haddad, Alexander Thebelt, Calvin Tsay, Carl D. Laird, Ruth Misener
Feb 08 2022 stat.ML cs.AI cs.LG math.OC arXiv:2202.02414v2

@misc{2202.02414, author = {Francesco Ceccon and Jordan Jalving and Joshua Haddad and Alexander Thebelt and Calvin Tsay and Carl D.~Laird and Ruth Misener}, title = {{OMLT}: {O}ptimization & {M}achine {L}earning {T}oolkit}, year = {2022}, eprint = {2202.02414}, note = {arXiv:2202.02414v2} }
PDF
The optimization and machine learning toolkit (OMLT) is an open-source software package incorporating neural network and gradient-boosted tree surrogate models, which have been trained using machine learning, into larger optimization problems. We discuss the advances in optimization technology that made OMLT possible and show how OMLT seamlessly integrates with the algebraic modeling language Pyomo. We demonstrate how to use OMLT for solving decision-making problems in both computer science and engineering.
Maximizing information from chemical engineering data sets: Applications to machine learning
Alexander Thebelt, Johannes Wiebe, Jan Kronqvist, Calvin Tsay, Ruth Misener
Jan 26 2022 stat.ML cs.AI cs.LG math.OC arXiv:2201.10035v1

@misc{2201.10035, author = {Alexander Thebelt and Johannes Wiebe and Jan Kronqvist and Calvin Tsay and Ruth Misener}, title = {{M}aximizing information from chemical engineering data sets: {A}pplications to machine learning}, year = {2022}, eprint = {2201.10035}, note = {arXiv:2201.10035v1} }
PDF
It is well-documented how artificial intelligence can have (and already is having) a big impact on chemical engineering. But classical machine learning approaches may be weak for many chemical engineering applications. This review discusses how challenging data characteristics arise in chemical engineering applications. We identify four characteristics of data arising in chemical engineering applications that make applying classical artificial intelligence approaches difficult: (1) high variance, low volume data, (2) low variance, high volume data, (3) noisy/corrupt/missing data, and (4) restricted data with physics-based limitations. For each of these four data characteristics, we discuss applications where these data characteristics arise and show how current chemical engineering research is extending the fields of data science and machine learning to incorporate these challenges. Finally, we identify several challenges for future research.
Multi-Objective Constrained Optimization for Energy Applications via Tree Ensembles
Alexander Thebelt, Calvin Tsay, Robert M. Lee, Nathan Sudermann-Merx, David Walz, Tom Tranter, Ruth Misener
Nov 08 2021 stat.ML cs.AI cs.LG math.OC arXiv:2111.03140v1

@misc{2111.03140, author = {Alexander Thebelt and Calvin Tsay and Robert M.~Lee and Nathan Sudermann-Merx and David Walz and Tom Tranter and Ruth Misener}, title = {{M}ulti-{O}bjective {C}onstrained {O}ptimization for {E}nergy {A}pplications via {T}ree {E}nsembles}, year = {2021}, eprint = {2111.03140}, doi = {10.1016/j.apenergy.2021.118061}, note = {arXiv:2111.03140v1} }
PDF
Energy systems optimization problems are complex due to strongly non-linear system behavior and multiple competing objectives, e.g. economic gain vs. environmental impact. Moreover, a large number of input variables and different variable types, e.g. continuous and categorical, are challenges commonly present in real-world applications. In some cases, proposed optimal solutions need to obey explicit input constraints related to physical properties or safety-critical operating conditions. This paper proposes a novel data-driven strategy using tree ensembles for constrained multi-objective optimization of black-box problems with heterogeneous variable spaces for which underlying system dynamics are either too complex to model or unknown. In an extensive case study comprised of synthetic benchmarks and relevant energy applications we demonstrate the competitive performance and sampling efficiency of the proposed algorithm compared to other state-of-the-art tools, making it a useful all-in-one solution for real-world applications with limited evaluation budgets.
Partition-based formulations for mixed-integer optimization of trained ReLU neural networks
Calvin Tsay, Jan Kronqvist, Alexander Thebelt, Ruth Misener
Feb 09 2021 math.OC cs.LG stat.ML arXiv:2102.04373v2

@misc{2102.04373, author = {Calvin Tsay and Jan Kronqvist and Alexander Thebelt and Ruth Misener}, title = {{P}artition-based formulations for mixed-integer optimization of trained {R}e{LU} neural networks}, year = {2021}, eprint = {2102.04373}, note = {arXiv:2102.04373v2} }
PDF
This paper introduces a class of mixed-integer formulations for trained ReLU neural networks. The approach balances model size and tightness by partitioning node inputs into a number of groups and forming the convex hull over the partitions via disjunctive programming. At one extreme, one partition per input recovers the convex hull of a node, i.e., the tightest possible formulation for each node. For fewer partitions, we develop smaller relaxations that approximate the convex hull, and show that they outperform existing formulations. Specifically, we propose strategies for partitioning variables based on theoretical motivations and validate these strategies using extensive computational experiments. Furthermore, the proposed scheme complements known algorithmic approaches, e.g., optimization-based bound tightening captures dependencies within a partition.