T. Verstraelen

The Gradient Curves Method: An improved strategy for the derivation of molecular mechanics valence force fields from ab initio data

T. Verstraelen, D. Van Neck, P.W. Ayers, V. Van Speybroeck, M. Waroquier
LECTURE SERIES ON COMPUTER AND COMPUTATIONAL SCIENCES
Volume 7A-B, page 576 -+
2006
P1

Abstract 

A novel force-field parameterization procedure[1] is proposed that surmounts well-known difficulties of the conventional least squares parameterization. The multidimensional ab initio training data are first transformed into individual one-dimensional data sets, each associated with one term in the force-field model. In the second step conventional methods call be used to fit each energy term separately to its corresponding data set. The first step call be completed without any knowledge of the analytical expressions for the energy terms. Moreover the transformed data sets dictate the form of these expressions, which makes the method very suitable for deriving valence force fields. During the transformation in the first step, continuity and least-norm criteria are imposed. The latter facilitate the intuitive physical interpretation of the energy terms that are fitted to the transformed data sets, a prerequisite for transferable force fields. Benchmark parameterizations have been performed oil three small molecules, showing that the new method results in physically intuitive energy terms, exactly when a conventional parameterization would suffer from parameter correlations, i.e. when the number of redundant internal coordinates in the force-field model increases.

The significance of fluctuating charges for molecular polarizability and dispersion coefficients

Y. Cheng, T. Verstraelen
The Journal of Chemical Physics
Volume 159, Issue 9
2023
A1

Abstract 

The inŕuence of ŕuctuating charges or charge ŕow on the dynamic linear response properties of isolated molecules from the TS42 database is evaluated, with particular emphasis on dipole polarizability and C6 dispersion coefficients. Two new descriptors are deőned to quantify the charge-ŕow contribution to response properties, making use of the recoupled dipole polarizability to separate isotropic and anisotropic components. Molecular polarizabilities are calculated using the “frequency-dependent atom-condensed Kohn-Sham density functional theory approximated to second orderž, i.e. the ACKS2ω model. With ACKS2ω, the charge-ŕow contribution can be constructed in two conceptually distinct ways, which appear to yield compatible results. The charge-ŕow contribution is signiőcantly affected by molecular geometry and the presence of polarizable bonds, in line with previous studies. We show that the charge-ŕow contribution qualitatively reproduces the polarizability anisotropy. The contribution to the anisotropic C6 coefficients is less pronounced, but cannot be neglected. The effect of ŕuctuating charges is only negligible for small molecules with at most one non-hydrogen atom. They become important and sometimes dominant for larger molecules or when highly polarizable bonds are present, such as conjugated, double or triple bonds. Charge ŕow contributions cannot be explained in terms of individual atomic properties, because they are affected by non-local features such as chemical bonding and geometry. Therefore, polarizable force őelds and dispersion models can beneőt from the explicit modeling of charge ŕow.

DFT-Quality Adsorption Simulations in Metal–Organic Frameworks Enabled by Machine Learning Potentials

R. Goeminne, L. Vanduyfhuys, V. Van Speybroeck, T. Verstraelen
Journal of Chemical Theory and Computation (JCTC)
19, 18, 6313-6325
2023
A1

Abstract 

Nanoporous materials such as metal–organic frameworks (MOFs) have been extensively studied for their potential for adsorption and separation applications. In this respect, grand canonical Monte Carlo (GCMC) simulations have become a well-established tool for computational screenings of the adsorption properties of large sets of MOFs. However, their reliance on empirical force field potentials has limited the accuracy with which this tool can be applied to MOFs with challenging chemical environments such as open-metal sites. On the other hand, density-functional theory (DFT) is too computationally demanding to be routinely employed in GCMC simulations due to the excessive number of required function evaluations. Therefore, we propose in this paper a protocol for training machine learning potentials (MLPs) on a limited set of DFT intermolecular interaction energies (and forces) of CO2 in ZIF-8 and the open-metal site containing Mg-MOF-74, and use the MLPs to derive adsorption isotherms from first principles. We make use of the equivariant NequIP model which has demonstrated excellent data efficiency, and as such an error on the interaction energies below 0.2 kJ mol–1 per adsorbate in ZIF-8 was attained. Its use in GCMC simulations results in highly accurate adsorption isotherms and heats of adsorption. For Mg-MOF-74, a large dependence of the obtained results on the used dispersion correction was observed, where PBE-MBD performs the best. Lastly, to test the transferability of the MLP trained on ZIF-8, it was applied to ZIF-3, ZIF-4, and ZIF-6, which resulted in large deviations in the predicted adsorption isotherms and heats of adsorption. Only when explicitly training on data for all ZIFs, accurate adsorption properties were obtained. As the proposed methodology is widely applicable to guest adsorption in nanoporous materials, it opens up the possibility for training general-purpose MLPs to perform highly accurate investigations of guest adsorption.

Modeling Electronic Response Properties with an Explicit-Electron Machine Learning Potential

M. Cools-Ceuppens, J. Dambre, T. Verstraelen
Journal of Chemical Theory and Computation
18, 3, 1672-1691
2023
A1

Abstract 

 

Explicit-electron force fields introduce electrons or electron pairs as semiclassical particles in force fields or empirical potentials, which are suitable for molecular dynamics simulations. Even though semiclassical electrons are a drastic simplification compared to a quantum-mechanical electronic wave function, they still retain a relatively detailed electronic model compared to conventional polarizable and reactive force fields. The ability of explicit-electron models to describe chemical reactions and electronic response properties has already been demonstrated, yet the description of short-range interactions for a broad range of chemical systems remains challenging. In this work, we present the electron machine learning potential (eMLP), a new explicit electron force field in which the short-range interactions are modeled with machine learning. The electron pair particles will be located at well-defined positions, derived from localized molecular orbitals or Wannier centers, naturally imposing the correct dielectric and piezoelectric behavior of the system. The eMLP is benchmarked on two newly constructed data sets: eQM7, an extension of the QM7 data set for small molecules, and a data set for the crystalline beta-glycine. It is shown that the eMLP can predict dipole moments, polarizabilities, and IR-spectra of unseen molecules with high precision. Furthermore, a variety of response properties, for example, stiffness or piezoelectric constants, can be accurately reproduced.

Green Open Access

An information-theoretic approach to basis-set fitting of electron densities and other non-negative functions

A. Tehrani, J. S. M. Anderson, D. Chakraborty, J. I. Rodriguez-Hernandez, D. C. Thompson, T. Verstraelen, P. W. Ayers, F. Heidar-Zadeh
Journal of Computational Chemistry
2023
A1

Abstract 

The numerical ill-conditioning associated with approximating an electron density with a convex sum of Gaussian or Slater-type functions is overcome by using the (extended) Kullback–Leibler divergence to measure the deviation between the target and approximate density. The optimized densities are non-negative and normalized, and they are accurate enough to be used in applications related to molecular similarity, the topology of the electron density, and numerical molecular integration. This robust, efficient, and general approach can be used to fit any non-negative normalized functions (e.g., the kinetic energy density and molecular electron density) to a convex sum of non-negative basis functions. We present a fixed-point iteration method for optimizing the Kullback–Leibler divergence and compare it to conventional gradient-based optimization methods. These algorithms are released through the free and open-source BFit package, which also includes a L2-norm squared optimization routine applicable to any square-integrable scalar function.

Green Open Access

Modeling electronic response properties with an explicit-electron machine learning potential

M. Cools-Ceuppens, J. Dambre, T. Verstraelen
Journal of Chemical Theory and Computation
Volume 18, Issue 3, Pages 1672-1691
2023
A1

Abstract 

Explicit-electron force fields introduce electrons or electron pairs as semiclassical particles in force fields or empirical potentials, which are suitable for molecular dynamics simulations. Even though semiclassical electrons are a drastic simplification compared to a quantum-mechanical electronic wave function, they still retain a relatively detailed electronic model compared to conventional polarizable and reactive force fields. The ability of explicit-electron models to describe chemical reactions and electronic response properties has already been demonstrated, yet the description of short-range interactions for a broad range of chemical systems remains challenging. In this work, we present the electron machine learning potential (eMLP), a new explicit electron force field in which the short-range interactions are modeled with machine learning. The electron pair particles will be located at well-defined positions, derived from localized molecular orbitals or Wannier centers, naturally imposing the correct dielectric and piezoelectric behavior of the system. The eMLP is benchmarked on two newly constructed data sets: eQM7, an extension of the QM7 data set for small molecules, and a data set for the crystalline beta-glycine. It is shown that the eMLP can predict dipole moments, polarizabilities, and IR-spectra of unseen molecules with high precision. Furthermore, a variety of response properties, for example, stiffness or piezoelectric constants, can be accurately reproduced.

A Reactive Molecular Dynamics Study of Chlorinated Organic Compounds. Part II: A ChemTraYzer Study of Chlorinated Dibenzofuran Formation and Decomposition Processes

L. Krep, F. Schmalz, F. Solbach, L. Komissarov, T. Nevolianis, W. A. Kopp, T. Verstraelen, K. Leonhard
ChemPhysChem
2023
A1

Abstract 

In our two-paper series, we first present the development of ReaxFF CHOCl parameters using the recently published ParAMS parametrization tool. In this second part, we update the reactive Molecular Dynamics - Quantum Mechanics coupling scheme ChemTraYzer and combine it with our new ReaxFF parameters from Part I to study formation and decomposition processes of chlorinated dibenzofurans. We introduce a self-learning method for recovering failed transition-state searches that improves the overall ChemTraYzer transition-state search success rate by 10 percentage points to a total of 48 %. With ChemTraYzer, we automatically find and quantify more than 500 reactions using transition state theory and DFT. Among the discovered chlorinated dibenzofuran reactions are numerous reactions that are new to the literature. In three case studies, we discuss the set of reactions that are most relevant to the dibenzofuran literature: (i) bimolecular reactions of the chlorinated-dibenzofuran precursors phenoxy radical and 1,3,5-trichlorobenzene, (ii) dibenzofuran chlorination and pyrolysis, and (iii) oxidation of chlorinated dibenzofurans.

A Reactive Molecular Dynamics Study of Chlorinated Organic Compounds. Part I: Force Field Development

L. Komissarov, L. Krep, F. Schmalz, W. A. Kopp, K. Leonhard, T. Verstraelen
ChemPhysChem
2023
A1

Abstract 

This work presents a novel parametrization for the ReaxFF formalism as a means to investigate reaction processes of chlorinated organic compounds. Force field parameters cover the chemical elements C, H, O, Cl and were obtained using a novel optimization approach involving relaxed potential energy surface scans as training targets. The resulting ReaxFF parametrization shows good transferability, as demonstrated on two independent ab initio validation sets. While this first part of our two-paper series focuses on force field parametrization, we apply our parameters to the simulation of chlorinated dibenzofuran formation and decomposition processes in Part II.

Sensitivity Analysis for ReaxFF Reparametrization Using the Hilbert–Schmidt Independence Criterion

M. Freitas Gustavo, M. Hellström, T. Verstraelen
Journal of Chemical Theory and Computation
19, 9, 2557-2573
2023
A1

Abstract 

We apply a global sensitivity method, the Hilbert− Schmidt independence criterion (HSIC), to the reparametrization of a Zn/S/H ReaxFF force field to identify the most appropriate parameters for reparametrization. Parameter selection remains a challenge in this context, as high-dimensional optimizations are prone to overfitting and take a long time but selecting too few parameters leads to poor-quality force fields. We show that the HSIC correctly and quickly identifies the most sensitive parameters and that optimizations done using a small number of sensitive parameters outperform those done using a higher-dimensional reasonable-user parameter selection. Optimizations using only sensitive parameters (1) converge faster, (2) have loss values comparable to those found with the naive selection, (3) have similar accuracy in validation tests, and (4) do not suffer from problems of overfitting. We demonstrate that an HSIC global sensitivity is a cheap optimization preprocessing step that has both qualitative and quantitative benefits which can substantially simplify and speed up ReaxFF reparametrizations.

Impact of Ad Hoc Post-Processing Parameters on the Lubricant Viscosity Calculated with Equilibrium Molecular Dynamics Simulations

G. Toraman, T. Verstraelen, D. Fauconnier
Lubricants
11, 4, 183
2023
A1

Abstract 

Viscosity is a crucial property of liquid lubricants, and it is theoretically a well-defined quantity in molecular dynamics (MD) simulations. However, no standardized protocol has been defined for calculating this property from equilibrium MD simulations. While best practices do exist, the actual calculation depends on several ad hoc decisions during the post-processing of the raw MD data. A common protocol for calculating the viscosity with equilibrium MD simulations is called the time decomposition method (TDM). Although the TDM attempts to standardize the viscosity calculation using the Green–Kubo method, it still relies on certain empirical rules and subjective user observations, e.g., the plateau region of the Green–Kubo integral or the integration cut-off time. It is known that the TDM works reasonably well for low-viscosity fluids, e.g., at high temperatures. However, modified heuristics have been proposed at high pressures, indicating that no single set of rules works well for all circumstances. This study examines the effect of heuristics and ad hoc decisions on the predicted viscosity of a short, branched lubricant molecule, 2,2,4-trimethylhexane. Equilibrium molecular dynamics simulations were performed at various operating conditions (high pressures and temperatures), followed by post-processing with three levels of uncertainty quantification. A new approach, “Enhanced Bootstrapping”, is introduced to assess the effects of individual ad hoc parameters on the viscosity. The results show a strong linear correlation (with a Pearson correlation coefficient of up to 36%) between the calculated viscosity and an ad hoc TDM parameter, which determines the integration cut-off time, under realistic lubrication conditions, particularly at high pressures. This study reveals that ad hoc decisions can lead to potentially misleading conclusions when the post-processing is performed ambiguously.

Green Open Access

Pages

Subscribe to RSS - T. Verstraelen