Abstract
Fragmentation schemes enable the efficient quantum-chemical treatment of large biomolecular systems, and provide an ideal starting point for the development of accurate machine-learning potentials for proteins. Here, we present a fragment-based method that only uses calculations for single-amino acids and their dimers, and is able to reduce the fragmentation error in total energies to ca. 1 kJ/mol per amino acid for polypeptides and proteins across different structural motifs. This is achieved by combining a two-body extension of the molecular fractionation with conjugate caps (MFCC) scheme with the density-based many-body expansion (db-MBE), thus extending the applicability of the db-MBE from molecular clusters to polypeptides and proteins.
Supplementary materials
Title
Supporting Information
Description
Explicit expressions for the density-based energy correction in the db-MBE, db-MFCC, and db-MFCC-MBE(2) schemes; additional plot of total absolute errors for the protein test set.
Actions
Supplementary weblinks
Title
Data Set
Description
PDB files of all considered molecular structures, Jupyter notebooks for data analysis (including all raw data) and for generating all figures, as well as PyADF input scripts to perform the eb-MFCC, db-MFCC, eb-MFCC-MBE(2) and db-MFCC-MBE(2) calculations
Actions
View 


![Author ORCID: We display the ORCID iD icon alongside authors names on our website to acknowledge that the ORCiD has been authenticated when entered by the user. To view the users ORCiD record click the icon. [opens in a new tab]](https://www.cambridge.org/engage/assets/public/coe/logo/orcid.png)