Abstract
Metal-organic frameworks (MOFs) are a versatile class of materials with applications in gas storage, separations, and catalysis. Despite extensive research, one of the key factors hindering their broader deployment is the absence of reproducible and scalable synthetic protocols. To address this, we introduce FAIR-MOFs, a database designed to be Findable, Accessible, Interoperable, and Reusable (FAIR), comprising 45{,}700 curated experimental structures, 33{,}361 geometry-optimised structures, and 4{,}161 entries linked to reported synthesis conditions. Analysis of the dataset showed that the propensity of open-metal sites in MOFs is statistically associated with reaction temperature, metal salts, ligands, topology, and metal secondary building unit. Furthermore, we developed a retrosynthetic recommender that captures literature co-usage patterns among solvents, metal salts, and ligands. For any given component, the system suggests compatible reagents and retrieves MOFs prepared under similar conditions. Finally, we trained a graph-based neural network integrated with our MOF deconstruction module to predict most probable metal salts, ligands and solvents directly from 3D structures of experimental or hypothetical MOFs. Using this model, we successfully synthesised MOFs randomly selected from hypothetical MOF databases illustrating the potential of FAIR-MOFs to accelerate the discovery and enable data-driven synthesis of MOFs.
Supplementary materials
Title
Electronic Supporting Information
Description
The Electronic Supporting Information provides comprehensive details of the methods, as well as the challenges encountered during the studies. It also includes in-depth results and descriptions, which offers fundamental understandings of the study.
Actions
Supplementary weblinks
Title
cheminteraction
Description
A retrosynthesis recommender, which maps reagent co-usage patterns, provides an API to reagent vendors and offers web-based querying of the FAIR-MOF dataset using all possible search keywords.
Actions
View Title
FAIR-MOFs in NOMAD
Description
Access to geometry-optimised MOF structures from the NOMAD repository is built in, with mofstructure applied to map each crystal structure onto its fundamental building units and geometric properties, enabling systematic exploration and analysis.
Actions
View Title
mofstructure
Description
mofstructure is a Python module for curating MOFs by removing unbound guest molecules, checking for structural errors, computing porosity, identifying open metal sites, and deconstructing MOFs into their building units.
Actions
View Title
fairmofsyncondition
Description
fairmofsyncondition is a Python module that predicts the synthesis conditions of metal–organic frameworks directly from CIF files. Simply run pip install fairmofsyncondition, then execute fairmofsyncondition_syncon my_mof.cif.
The module also includes built-in cheminformatics tools for converting 3D molecular structures into their IUPAC names, InChIKeys, and SMILES representations. It can interconvert between SMILES, InChIKeys, and IUPAC names.
Actions
View 


![Author ORCID: We display the ORCID iD icon alongside authors names on our website to acknowledge that the ORCiD has been authenticated when entered by the user. To view the users ORCiD record click the icon. [opens in a new tab]](https://www.cambridge.org/engage/assets/public/coe/logo/orcid.png)