Hostname: page-component-77f85d65b8-6bnxx Total loading time: 0 Render date: 2026-03-27T01:04:22.443Z Has data issue: false hasContentIssue false

A semantic similarity-based method to support the conversion from EXPRESS to OWL

Published online by Cambridge University Press:  03 November 2023

Yan Liu*
Affiliation:
Department of Computer Science, College of Engineering, Shantou University, Shantou 515063, China
Qingquan Jian
Affiliation:
School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, Jilin, China
Claudia M. Eckert
Affiliation:
School of Engineering and Innovation, The Open University, Walton Hall, Milton Keynes MK7 6AA, UK
*
Corresponding author: Yan Liu; Email: yanliu@stu.edu.cn
Rights & Permissions [Opens in a new window]

Abstract

Product data sharing is fundamental for collaborative product design and development. Although the STandard for Exchange of Product model data (STEP) enables this by providing a unified data definition and description, it lacks the ability to provide a more semantically enriched product data model. Many researchers suggest converting STEP models to ontology models and propose rules for mapping EXPRESS, the descriptive language of STEP, to Web Ontology Language (OWL). In most research, this mapping is a manual process which is time-consuming and prone to misunderstandings. To support this conversion, this research proposes an automatic method based on natural language processing techniques (NLP). The similarities of language elements in the reference manuals of EXPRESS and OWL have been analyzed in terms of three aspects: heading semantics, text semantics, and heading hierarchy. The paper focusses on translating between language elements, but the same approach has also been applied to the definition of the data models. Two forms of the semantic analysis with NLP are proposed: a Combination of Random Walks (RW) and Global Vectors for Word Representation (GloVe) for heading semantic similarity; and a Decoding-enhanced BERT with disentangled attention (DeBERTa) ensemble model for text semantic similarity. The evaluation shows the feasibility of the proposed method. The results not only cover most language elements mapped by current research, but also identify the mappings of the elements that have not been included. It also indicates the potential to identify the OWL segments for the EXPRESS declarations.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. The mapping results of the existing research

Figure 1

Figure 1. Research overview.

Figure 2

Figure 2. The steps of developing the mapping method.

Figure 3

Figure 3. The proposed framework for mapping EXPRESS and OWL.

Figure 4

Figure 4. Words count of (a) before and (b) after text summarization.

Figure 5

Table 2. Pearson correlation results

Figure 6

Table 3. Spearman correlation results

Figure 7

Table 4. The scores of the DeBERTa ensemble

Figure 8

Table 5. Mapping results for simple elements

Figure 9

Table 6. Mapping results for complex elements

Figure 10

Figure 5. A segment of the STEP file of the example product bearing.

Figure 11

Figure 6. Examples OWL expressions for Entity product_definition_formation: (a) using object property and (b) using data property.

Figure 12

Table 7. Mapping result of OWL segments to two EXPRESS entities in the example

Figure 13

Table A1. The label of OWL segments (in the format of Fig. 6a)

Figure 14

Table A2. The mapping results for 12 EXPRESS declarations

Figure 15

Table B1. The label of OWL segments (in the format of Fig. 6b)

Figure 16

Table B2. The mapping results for 12 EXPRESS declarations