Accurate Hydration Free Energy Calculations for Diverse Organic Molecules With a Machine Learning Force Field

17 December 2025, Version 2
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

Free energy perturbation (FEP) calculations using classical force fields remain the dominant approach for large-scale, computational drug discovery efforts but the accuracy is fundamentally limited by simplified forms that cannot quantitatively reproduce ab initio methods without significant fine tuning. Machine Learning force fields (MLFFs) offer a promising avenue to retain quantum mechanical accuracy with significantly reduced computational cost compared to ab initio molecular dynamics (AIMD) simulations. Thus far, direct applications of ML force fields to FEP calculations lack systematic protocols and extensive benchmarking. In this work, we take a step in this direction by presenting a general and robust workflow for solvation (hydration) free energy (HFE) calculations which is independent of the details of the particular MLFF architecture used. Combining a broadly trained ML force field, Organic_MPNICE, with sufficient statistical and conformational sampling empowered by the solute-tempering technique, affords sub-kcal/mol average errors in HFE predictions relative to experimental estimates. This approach outperforms state-of-the-art classical force fields and DFT-based implicit solvation models on a diverse set of 59 organic molecules and provides a route to ab initio-quality HFE predictions, advancing the use of ML force fields in thermodynamic property prediction.

Keywords

MLFF
MLIP
Free energy
FEP
Solvation
Hydration
Molecular dynamics
simulation

Supplementary materials

Title
Description
Actions
Title
Supplementary Information
Description
Predictions for each molecular test case, along with experimental reference values, as well as further details on the simulation protocol, implementation, training data, charge transfer error analysis and selection of test systems.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.