QuantumPDB: A Workflow for High-Throughput Quantum Cluster Model Generation from Protein Structures

07 January 2026, Version 1
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

Computational modeling of enzymes provides molecular-level insight into catalysis, but the preparation of quantum mechanical (QM) calculations starting from experimental structures is a significant bottleneck for high-throughput studies. Automated tools developed to accelerate this process may fail to generalize across distinct active site chemistries and geometries. To overcome these limitations, we present QuantumPDB, a Python package that automates the generation of hierarchical coordination/interaction spheres around an active center to create QM cluster models directly from raw protein structures. The workflow integrates structure cleaning, protonation state assignment, and QM calculation setup. It uses chemically meaningful models constructed from contact-based interaction spheres derived from Voronoi tessellation, enabling accurate representation of complex active site geometries. We provide an overview of our modular code and describe how it may be employed to automate high-throughput protein screening. To demonstrate its utility, we curated a dataset of 989 holo-enzymes from the PDB and performed QM calculations on 1,673 enzyme cluster models of 842 of these enzymes. Analysis of computed properties suggests that enzyme environments simulated with density functional theory consistently modulate substrate charge toward neutrality and reduce the substrate dipole moment. This phenomenon appears to be general, even in cases where the active site consists predominantly of neutral residues. By automating and standardizing multi-sphere QM model construction, QuantumPDB provides a robust platform for large-scale, data-driven investigations of proteins.

Keywords

non-heme iron
enzymes
DFT
QM
software

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Supporting Information for "QuantumPDB: A Workflow for High-Throughput Quantum Cluster Model Generation from Protein Structures"
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.