Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-08T00:14:10.126Z Has data issue: false hasContentIssue false

Developing a method for creating structured representations of working of systems from natural language descriptions using the SAPPhIRE model of causality

Published online by Cambridge University Press:  23 December 2024

Kausik Bhattacharya*
Affiliation:
Department of Design and Manufacturing, Indian Institute of Science, Bangalore, India
Anubhab Majumder
Affiliation:
Department of Design and Manufacturing, Indian Institute of Science, Bangalore, India
Apoorv Naresh Bhatt
Affiliation:
Department of Design and Manufacturing, Indian Institute of Science, Bangalore, India
Sonal Keshwani
Affiliation:
Human Centered Design, Indraprastha Institute of Information Technology, Delhi Okhla Phase III, New Delhi
Ranjan BSC
Affiliation:
Department of Mechanical Engineering, Dr. H. N. National College of Engineering, Bangalore, India
Srinivasan Venkataraman
Affiliation:
Department of Design, Indian Institute of Technology Delhi (IITD), New Delhi, India
Amaresh Chakrabarti
Affiliation:
Department of Design and Manufacturing, Indian Institute of Science, Bangalore, India
*
Corresponding author: Kausik Bhattacharya; Email: kausikb@iisc.ac.in
Rights & Permissions [Opens in a new window]

Abstract

Due to their significant role in creative design ideation, databases of causal ontology-based models for biological and technical systems have been developed. However, creating structured database entries through system models using a causal ontology requires the time and effort of experts. Researchers have worked toward developing methods that can automatically generate representations of systems from documents using causal ontologies by leveraging machine learning (ML) techniques. However, these methods use limited, hand-annotated data for building the ML models and have manual touchpoints that are not documented. While opportunities exist to improve the accuracy of these ML models, more importantly, it is required to understand the complete process of generating structured representations using causal ontology. This research proposes a new method and a set of rules to extract information relevant to the constructs of the SAPPhIRE model of causality from descriptions of technical systems in natural language and report the performance of this process. This process aims to understand the information in the context of the entire description. The method starts by identifying the system interactions involving material, energy and information and then builds the causal description of each system interaction using the SAPPhIRE ontology. This method was developed iteratively, verifying the improvements through user trials in every cycle. The user trials of this new method and rules with specialists and novice users of the SAPPhIRE modeling showed that the method helps in accurately and consistently extracting the information relevant to the constructs of the SAPPhIRE model from a given natural language description.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© Kausik Bhattacharya, 2024. Published by Cambridge University Press
Figure 0

Figure 1. SAPPhIRE model of causality with an example of heat transfer from a hot body to cool surrounding air (Chakrabarti et al., 2005; Chakrabarti, 2009).

Figure 1

Figure 2. Research cycle.

Figure 2

Table 1. Intercoder reliability study as per current SAPPhIRE modeling

Figure 3

Table 2. Intercoder reliability scores WITHOUT and WITH the new process (preliminary)

Figure 4

Figure 3. Procedure for the user trial.

Figure 5

Figure 4. Process diagram of the new process (intermediate) for the first user trial.

Figure 6

Figure 5. % of the total number of relevant words identified by the participants on average in the first user trial.

Figure 7

Figure 6. Process diagram of the new process (final) for the final user trial.

Figure 8

Table 3. Definitions of terms true positives, false positives and false negatives

Figure 9

Figure 7. Analysis of models in the final user trial.

Figure 10

Figure 8. Number of system interactions identified by the participants in the final user trial.

Figure 11

Figure 9. % Agreement scores for use case 1 and use case 2 in the final user trial.

Supplementary material: File

Bhattacharya et al. supplementary material

Bhattacharya et al. supplementary material
Download Bhattacharya et al. supplementary material(File)
File 385.2 KB