aims-PAX: Parallel Active eXploration Enables Expedited Construction of Machine Learning Force Fields for Molecules and Materials

24 October 2025, Version 2
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

Recent advances in machine learning force fields (MLFF) have significantly extended the reach of atomistic simulations. Continuous progress in this field requires reliable reference datasets, accurate MLFF architectures, and efficient active learning strategies to enable robust modeling of complex molecular and material systems. Here we introduce AIMS - PAX, an expedited, multi-trajectory active learning framework that streamlines the development of stable and accurate MLFFs. Designed for a wide range of researchers, AIMS -PAX offers a modular, high-performance workflow that couples diversified sampling with scalable training across CPU and GPU architectures. Integrated with the widely used ab initio code FHI- AIMS, the framework supports state-of-the-art ML models and dataset generation using general- purpose (or "foundational") force-fields for rapid deployment in diverse systems. We demonstrate the capabilities of AIMS -PAX in various challenging tasks: creating datasets and models for highly flexible peptides, multiple organic molecules at once, explicitly solvated molecules, and for efficiently handling computationally demanding systems such as the CsPbI3 perovskite. We show that AIMS -PAX achieves a reduction of up to three orders of magnitude in the number of required reference calculations, automatically selects challenging systems within a given chemical space, facilitates simulation of solvated molecules with more than thousand atoms, while enabling a ten-fold speedup in active- learning time through optimized resource utilization. This positions AIMS -PAX as a powerful and versatile platform for next-generation atomistic simulations in both academic and industrial settings.

Keywords

Machine Learning Force Fields
Software
Active Learning
Computational Chemistry
Machine Learning
Automation
Molecular Dynamics

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.