Abstract
We introduce rxnutils, a Python package for manipulating chemical reactions, reaction templates, and reaction datasets. The package is built entirely on open-source software such as RDKit and is designed with robustness, extensibility, and reproducibility in mind. It currently consists of three sub-packages: one for working with chemical entities, one that provides pipelining capabilities, and one that provides an end-to-end pipeline for preparing the US patent reaction dataset for modelling. In this software research note, we discuss the design of the package and provide some code examples. The project is open-source under an Apache 2.0 license and is available on GitHub: https://github.com/MolecularAI/reaction_utils


