Abstract
Substances transform in environmental and biological matrices to produce diverse transformation products (TPs). Some chemical moieties, termed persistent substructures, are retained in these TPs. This study used literature TPs, literature-derived analogous TPs and predicted TPs to identify known and novel TPs of five selected data-poor compounds using zebrafish embryos. The workflow was then used to identify persistent substructures in a further 36 persistent, mobile and toxic (PMT) compounds. The suspect screening workflow in patRoon was applied to liquid chromatography high-resolution mass spectrometry data. This study identified 90 TPs at confidence levels 1 (confirmed) or 3 (tentative): 33 from data-poor parents and 57 from the PMT compounds, including 13 at Level 1. Among data-poor compounds, 17 TPs (52%) were analogous TPs, whereas 15% were exclusively predicted. Among PMT compounds, most TPs (63%) were predicted exclusively using BioTransformer, while 12.3% were solely literature TPs. The combined approach yielded better TP coverage than any single source, revealing extensive biotransformation even for PMT substances, which indicates that persistence does not exclude downstream transformation. The transformations included phase I and II metabolic reactions, as well as non-enzymatic processes. The 1,3,5-triazine ring, benzotriazole ring and CF3 group were conserved in all TPs of their respective parent compounds, while the 1,2,4-triazole ring was found in most, but not all, triazole TPs. QSAR modelling predictions indicated that several TPs were potentially more persistent, mobile and toxic than their parents. The study shows the significance of including the concept of persistent TP substructures in chemical risk assessments.
Supplementary materials
Title
Supplementary tables file
Description
Contains tables referred to in the manuscript (SI Tables S1-S13)
Title
Supplementary figures file
Description
Contains figures and extra data steps referred to in the manuscript (SI Figures S1-S5)
Title
The ARRIVE 2.0 checklist
Description
The ARRIVE 2.0 checklist for transparent reporting of data in animal research. Ethical approval is not required (zebrafish embryos up to 96 hours post-fertilisation), and the scope of this study does not quite fit the ARRIVE guidelines; however, the authors completed it as best they could.
Supplementary weblinks
Title
The .mzML data files
Description
The .mzML files (ZFPMTTPs and ZPMS90TPs) from the Q Exactive HF Orbitrap mass spectrometer, converted using ProteoWizard for the PMT-TPs suspect screening. More details are given in the manuscript.
Title
R-scripts for data analysis
Description
The R scripts used for suspect screening with patRoon in the manuscript can be found on the GitLab page. More details are given in the manuscript.
Title
S90 | ZEROPMBOX1 | ZeroPM Box 1 Substances
Description
This is the collection associated with list S90 ZEROPMBOX1 ZeroPM Box 1 Substances on the NORMAN Suspect List Exchange (ZPMS90TPs)
Title
S74 | REFTPS
Description
New compounds and transformation products are uploaded here