-
What is missing in autonomous discovery: Open challenges for the community
Authors:
Phillip M. Maffettone,
Pascal Friederich,
Sterling G. Baird,
Ben Blaiszik,
Keith A. Brown,
Stuart I. Campbell,
Orion A. Cohen,
Tantum Collins,
Rebecca L. Davis,
Ian T. Foster,
Navid Haghmoradi,
Mark Hereld,
Nicole Jung,
Ha-Kyung Kwon,
Gabriella Pizzuto,
Jacob Rintamaki,
Casper Steinmann,
Luca Torresi,
Shijing Sun
Abstract:
Self-driving labs (SDLs) leverage combinations of artificial intelligence, automation, and advanced computing to accelerate scientific discovery. The promise of this field has given rise to a rich community of passionate scientists, engineers, and social scientists, as evidenced by the development of the Acceleration Consortium and recent Accelerate Conference. Despite its strengths, this rapidly…
▽ More
Self-driving labs (SDLs) leverage combinations of artificial intelligence, automation, and advanced computing to accelerate scientific discovery. The promise of this field has given rise to a rich community of passionate scientists, engineers, and social scientists, as evidenced by the development of the Acceleration Consortium and recent Accelerate Conference. Despite its strengths, this rapidly developing field presents numerous opportunities for growth, challenges to overcome, and potential risks of which to remain aware. This community perspective builds on a discourse instantiated during the first Accelerate Conference, and looks to the future of self-driving labs with a tempered optimism. Incorporating input from academia, government, and industry, we briefly describe the current status of self-driving labs, then turn our attention to barriers, opportunities, and a vision for what is possible. Our field is delivering solutions in technology and infrastructure, artificial intelligence and knowledge generation, and education and workforce development. In the spirit of community, we intend for this work to foster discussion and drive best practices as our field grows.
△ Less
Submitted 2 May, 2023; v1 submitted 21 April, 2023;
originally announced April 2023.
-
Response properties of embedded molecules through the polarizable embedding model
Authors:
Casper Steinmann,
Peter Reinholdt,
Morten Steen Nørby,
Jacob Kongsted,
Jógvan Magnus Haugaard Olsen
Abstract:
The polarizable embedding (PE) model is a fragment-based quantum-classical approach aimed at accurate inclusion of environment effects in quantum-mechanical response property calculations. The aim of this tutorial is to give insight into the practical use of the PE model. Starting from a set of molecular structures and until you arrive at the final property, there are many crucial details to consi…
▽ More
The polarizable embedding (PE) model is a fragment-based quantum-classical approach aimed at accurate inclusion of environment effects in quantum-mechanical response property calculations. The aim of this tutorial is to give insight into the practical use of the PE model. Starting from a set of molecular structures and until you arrive at the final property, there are many crucial details to consider in order to obtain trustworthy results in an efficient manner. To lower the threshold for new users wanting to explore the use of the PE model, we describe and discuss important aspects related to its practical use. This includes directions on how to generate input files and how to run a calculation.
△ Less
Submitted 10 April, 2018;
originally announced April 2018.
-
A computational method for the systematic screening of reaction barriers in enzymes: Searching for Bacillus circulans xylanase mutants with greater activity towards a synthetic substrate
Authors:
Martin R. Hediger,
Casper Steinmann,
Luca De Vico,
Jan H. Jensen
Abstract:
We present a semi-empirical (PM6-based) computational method for systematically estimating the effect of all possible single mutants, within a certain radius of the active site, on the barrier height of an enzymatic reaction. The intent of this method is not a quantitative prediction of the barrier heights, but rather to identify promising mutants for further computational or experimental study. T…
▽ More
We present a semi-empirical (PM6-based) computational method for systematically estimating the effect of all possible single mutants, within a certain radius of the active site, on the barrier height of an enzymatic reaction. The intent of this method is not a quantitative prediction of the barrier heights, but rather to identify promising mutants for further computational or experimental study. The method is applied to identify promising single and double mutants of Bacillus circulans xylanase (BCX) with increased hydrolytic activity for the artificial substrate ortho-nitrophenyl β-xylobioside (ONPX$_2$). The estimated reaction barrier for wild-type (WT) BCX is 18.5 kcal/mol, which is in good agreement with the experimental activation free energy value of 17.0 kcal/mol extracted from the observed k$_\text{cat}$ using transition state theory (Joshi et al., Biochemistry 2001, 40, 10115). The PM6 reaction profiles for eight single point mutations are recomputed using FMO-MP2/PCM/6-31G(d) single points. PM6 predicts an increase in barrier height for all eight mutants while FMO predicts an increase for six of the eight mutants. Both methods predict that the largest change in barrier occurs for N35F, where PM6 and FMO predict a 9.0 and 15.8 kcal/mol increase, respectively. We thus conclude that PM6 is sufficiently accurate to identify promising mutants for further study. We prepared a set of all theoretically possible (342) single mutants in which every amino acid of the active site (except for the catalytically active residues E78 and E172) was mutated to every other amino acid. Based on results from the single mutants we construct a set of 111 double mutants consisting of all possible pairs of single mutants with the lowest barrier for a particular position and compute their reaction profile. None of the mutants have, to our knowledge, been prepared experimentally[...].
△ Less
Submitted 26 May, 2013;
originally announced May 2013.
-
Hybrid RHF/MP2 geometry optimizations with the Effective Fragment Molecular Orbital Method
Authors:
Anders S. Christensen,
Casper Steinmann,
Dmitri G. Fedorov,
Jan H. Jensen
Abstract:
The frozen domain effective fragment molecular orbital method is extended to allow for the treatment of a single fragment at the MP2 level of theory. The approach is applied to the conversion of chorismate to prephenate by chorismate mutase, where the substrate is treated at the MP2 level of theory while the rest of the system is treated at the RHF level. MP2 geometry optimization is found to lowe…
▽ More
The frozen domain effective fragment molecular orbital method is extended to allow for the treatment of a single fragment at the MP2 level of theory. The approach is applied to the conversion of chorismate to prephenate by chorismate mutase, where the substrate is treated at the MP2 level of theory while the rest of the system is treated at the RHF level. MP2 geometry optimization is found to lower the barrier by up to 3.5 kcal/mol compared to RHF optimzations and ONIOM energy refinement and leads to a smoother convergence with respect to the basis set for the reaction profile. For double zeta basis sets the increase in CPU time relative to RHF is roughly a factor of two.
△ Less
Submitted 28 October, 2013; v1 submitted 3 May, 2013;
originally announced May 2013.
-
Interface of the polarizable continuum model of solvation with semi-empirical methods in the GAMESS program
Authors:
Casper Steinmann,
Kristoffer L. Blædel,
Anders S. Christensen,
Jan H. Jensen
Abstract:
An interface between semi-empirical methods and the polarized continuum model (PCM) of solvation successfully implemented into GAMESS following the approach by Chudinov et al (Chem. Phys. 1992, 160, 41). The interface includes energy gradients and is parallelized. For large molecules such as ubiquitin a reasonable speedup (up to a factor of six) is observed for up to 16 cores. The SCF convergence…
▽ More
An interface between semi-empirical methods and the polarized continuum model (PCM) of solvation successfully implemented into GAMESS following the approach by Chudinov et al (Chem. Phys. 1992, 160, 41). The interface includes energy gradients and is parallelized. For large molecules such as ubiquitin a reasonable speedup (up to a factor of six) is observed for up to 16 cores. The SCF convergence is greatly improved by PCM for proteins compared to the gas phase.
△ Less
Submitted 22 May, 2013; v1 submitted 19 March, 2013;
originally announced March 2013.
-
Mapping Enzymatic Catalysis using the Effective Fragment Molecular Orbital Method: Towards all ab initio Biochemistry
Authors:
Casper Steinmann,
Dmitri G. Fedorov,
Jan H. Jensen
Abstract:
We extend the Effective Fragment Molecular Orbital (EFMO) method to the frozen domain approach where only the geometry of an active part is optimized, while the many-body polarization effects are considered for the whole system. The new approach efficiently mapped out the entire reaction path of chorismate mutase in less than four days using 80 cores on 20 nodes, where the whole system containing…
▽ More
We extend the Effective Fragment Molecular Orbital (EFMO) method to the frozen domain approach where only the geometry of an active part is optimized, while the many-body polarization effects are considered for the whole system. The new approach efficiently mapped out the entire reaction path of chorismate mutase in less than four days using 80 cores on 20 nodes, where the whole system containing 2398 atoms is treated in the ab initio fashion without using any force fields. The reaction path is constructed automatically with the only assumption of defining the reaction coordinate a priori. We determine the reaction barrier of chorismate mutase to be $18.3\pm 3.5$ kcal mol$^{-1}$ for MP2/cc-pVDZ and $19.3\pm 3.6$ for MP2/cc-pVTZ in an ONIOM approach using EFMO-RHF/6-31G(d) for the high and low layers, respectively.
△ Less
Submitted 26 February, 2013; v1 submitted 26 December, 2012;
originally announced December 2012.
-
FragIt: A Tool to Prepare Input Files for Fragment Based Quantum Chemical Calculations
Authors:
Casper Steinmann,
Mikael W. Ibsen,
Anne S. Hansen,
Jan H. Jensen
Abstract:
Near linear scaling fragment based quantum chemical calculations are becoming increasingly popular for treating large systems with high accuracy and is an active field of research. However, it remains difficult to set up these calculations without expert knowledge. To facilitate the use of such methods, software tools need to be available to support these methods and help to set up reasonable inpu…
▽ More
Near linear scaling fragment based quantum chemical calculations are becoming increasingly popular for treating large systems with high accuracy and is an active field of research. However, it remains difficult to set up these calculations without expert knowledge. To facilitate the use of such methods, software tools need to be available to support these methods and help to set up reasonable input files which will lower the barrier of entry for usage by non-experts. Previous tools relies on specific annotations in structure files for automatic and successful fragmentation such as residues in PDB files. We present a general fragmentation methodology and accompanying tools called FragIt to help setup these calculations. FragIt uses the SMARTS language to locate chemically appropriate fragments in large structures and is applicable to fragmentation of any molecular system given suitable SMARTS patterns. We present SMARTS patterns of fragmentation for proteins, DNA and polysaccharides, specifically for D-galactopyranose for use in cyclodextrins. FragIt is used to prepare input files for the Fragment Molecular Orbital method in the GAMESS program package, but can be extended to other computational methods easily.
△ Less
Submitted 2 August, 2012; v1 submitted 22 May, 2012;
originally announced May 2012.
-
The Effective Fragment Molecular Orbital Method for Fragments Connected by Covalent Bonds
Authors:
Casper Steinmann,
Dmitri G. Fedorov,
Jan H. Jensen
Abstract:
We extend the effective fragment molecular orbital method (EFMO) into treating fragments connected by covalent bonds. The accuracy of EFMO is compared to FMO and conventional ab initio electronic structure methods for polypeptides including proteins. Errors in energy for RHF and MP2 are within 2 kcal/mol for neutral polypeptides and 6 kcal/mol for charged polypeptides similar to FMO but obtained t…
▽ More
We extend the effective fragment molecular orbital method (EFMO) into treating fragments connected by covalent bonds. The accuracy of EFMO is compared to FMO and conventional ab initio electronic structure methods for polypeptides including proteins. Errors in energy for RHF and MP2 are within 2 kcal/mol for neutral polypeptides and 6 kcal/mol for charged polypeptides similar to FMO but obtained two to five times faster. For proteins, the errors are also within a few kcal/mol of the FMO results. We developed both the RHF and MP2 gradient for EFMO. Compared to ab initio, the EFMO optimized structures had an RMSD of 0.40 and 0.44 Å for RHF and MP2, respectively.
△ Less
Submitted 2 June, 2012; v1 submitted 22 February, 2012;
originally announced February 2012.