Abstract
The exploration and mapping of chemical space – the vast, multidimensional set of all small organic molecules – remains a central challenge in modern drug discovery. Traditional compound libraries and databases cover only a minute fraction of this space, limiting the discovery of novel, bioactive, and patentable chemotypes. Here, we present LEGION, an advanced workflow that integrates generative AI, AI-guided screening, and state-of-the-art cheminformatics tools within the Chemistry42 platform to enable comprehensive, large-scale exploration of chemical space around specific drug targets. Using NLRP3, a clinically relevant but structurally complex inflammasome protein, as a case study, LEGION combined ligand- and structure-based design strategies, in-house algorithms for 3D pharmacophore-aware scaffold extraction, and distinct library enumeration methods to identify over 34,000 unique scaffolds and generate approximately 110 million molecular structures. Iterative stages of 2D and 3D filtering, including machine learning and structure-based pharmacophore scoring, ensured the binding relevance and structural feasibility of the enumerated compounds. The resulting workflow proved effective for scaffold hopping, navigating unexplored regions of chemical space, and supporting intellectual property applications through the generation of structurally diverse and synthetically tractable structures. This work demonstrates, for the first time, an AI-enabled framework capable of both massive virtual screening and de novo compound generation, specifically tailored for extensive chemical space coverage around a biological target. LEGION establishes a new paradigm for the intelligent, scalable, and practical exploration of chemical space in drug discovery.



![Author ORCID: We display the ORCID iD icon alongside authors names on our website to acknowledge that the ORCiD has been authenticated when entered by the user. To view the users ORCiD record click the icon. [opens in a new tab]](https://www.cambridge.org/engage/assets/public/coe/logo/orcid.png)