Hostname: page-component-5db58dd55d-8mwbx Total loading time: 0 Render date: 2026-05-26T14:28:57.013Z Has data issue: false hasContentIssue false

Towards Mass Spectrum Analysis with ASP

Published online by Cambridge University Press:  19 February 2026

NILS KÜCHENMEISTER
Affiliation:
Computer Science, TU Dresden , Germany (e-mails: nils.kuechenmeister@tu-dresden.de, alex.ivliev@tu-dresden.de, markus.kroetzsch@tu-dresden.de)
ALEX IVLIEV
Affiliation:
Computer Science, TU Dresden , Germany (e-mails: nils.kuechenmeister@tu-dresden.de, alex.ivliev@tu-dresden.de, markus.kroetzsch@tu-dresden.de)
MARKUS KRÖTZSCH
Affiliation:
Computer Science, TU Dresden , Germany (e-mails: nils.kuechenmeister@tu-dresden.de, alex.ivliev@tu-dresden.de, markus.kroetzsch@tu-dresden.de)
Rights & Permissions [Opens in a new window]

Abstract

We present a new use of Answer Set Programming (ASP) to discover the molecular structure of chemical samples based on the relative abundance of elements and structural fragments, as measured in mass spectrometry. To constrain the exponential search space for this combinatorial problem, we develop canonical representations of molecular structures and an ASP implementation that uses these definitions. We evaluate the correctness of our implementation over a large set of known molecular structures, and we compare its quality and performance to other ASP symmetry-breaking methods and to a commercial tool from analytical chemistry.

Information

Type
Original Articles
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Fig. 1. User interface of Genmol.

Figure 1

Fig. 2. Hydrogen-suppressed molecular graph of adenine ($\mathit{C}_5\mathit{H}_5\mathit{N}_5$) and corresponding spanning tree with cycle edges (dotted); superscripts indicate correspondence of vertices.

Figure 2

Fig. 3. Canonical molecular tree of threonine ($\mathit{C}_4\mathit{H}_9\mathit{N}\mathit{O}_3$); central vertex is circled.

Figure 3

Fig. 4. Molecular trees of glycine ($\mathit{C}_2\mathit{H}_5\mathit{N}\mathit{O}_2$); central vertices are circled.

Figure 4

Fig. 5. Tree representation of adenine and molecular tree with replaced cycle edges.

Figure 5

Fig. 6. Refinement steps on a molecule with dotted edges in $C$ and solid edges in $T$.

Figure 6

Fig. 7. Centralized maximal refinement (right) of the maximal refinement (left).

Figure 7

Fig. 8. Patterns for detecting shortening cycles.

Figure 8

Fig. 9. Visualization of graph symmetry-breaking encoding.

Figure 9

Fig. 10. Number of models for each compound in the data set (left) and ratio of compounds with model counts within a factor of the gold standard (right).

Figure 10

Fig. 11. Model numbers (top) and runtimes (bottom) for molecules of increasing size.