Accurate Hydration Free Energy Calculations for Diverse Organic Molecules With a Machine Learning Force Field

Xiaowei Xie; John L. Weber; Mats Svensson; Ryne C. Johnston; Edward D. Harder; Leif D. Jacobson

doi:10.26434/chemrxiv-2025-p7r0r-v2

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Accurate Hydration Free Energy Calculations for Diverse Organic Molecules With a Machine Learning Force Field

17 December 2025, Version 2

Working Paper

Show author details

This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

Free energy perturbation (FEP) calculations using classical force fields remain the dominant approach for large-scale, computational drug discovery efforts but the accuracy is fundamentally limited by simplified forms that cannot quantitatively reproduce ab initio methods without significant fine tuning. Machine Learning force fields (MLFFs) offer a promising avenue to retain quantum mechanical accuracy with significantly reduced computational cost compared to ab initio molecular dynamics (AIMD) simulations. Thus far, direct applications of ML force fields to FEP calculations lack systematic protocols and extensive benchmarking. In this work, we take a step in this direction by presenting a general and robust workflow for solvation (hydration) free energy (HFE) calculations which is independent of the details of the particular MLFF architecture used. Combining a broadly trained ML force field, Organic_MPNICE, with sufficient statistical and conformational sampling empowered by the solute-tempering technique, affords sub-kcal/mol average errors in HFE predictions relative to experimental estimates. This approach outperforms state-of-the-art classical force fields and DFT-based implicit solvation models on a diverse set of 59 organic molecules and provides a route to ab initio-quality HFE predictions, advancing the use of ML force fields in thermodynamic property prediction.

Keywords

Supplementary materials

Title

Description

Actions

Title

Supplementary Information

Description

Predictions for each molecular test case, along with experimental reference values, as well as further details on the simulation protocol, implementation, training data, charge transfer error analysis and selection of test systems.

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Dec 17, 2025 Version 2

Sep 18, 2025 Version 1

Version Notes

Error estimates for acetic acid HFE without REST2, bootstrapped uncertainties for errors in HFE and clarification of overlap statistics for ethion.

Metrics

2,768

1,108

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2025-p7r0r-v2

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Accurate Hydration Free Energy Calculations for Diverse Organic Molecules With a Machine Learning Force Field

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Version Notes

Metrics

License

DOI

Author’s competing interest statement

Ethics

Share