Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-06T21:55:23.501Z Has data issue: false hasContentIssue false

Visualizing design project team and individual progress using NLP: a comparison between latent semantic analysis and Word2Vector algorithms

Published online by Cambridge University Press:  14 June 2023

Matt Chiu*
Affiliation:
Singapore University of Technology and Design, Singapore, Singapore
Siska Lim
Affiliation:
Singapore University of Technology and Design, Singapore, Singapore
Arlindo Silva
Affiliation:
Singapore University of Technology and Design, Singapore, Singapore
*
Corresponding author: Matt Chiu; Email: poheng_chiu@mymail.sutd.edu.sg
Rights & Permissions [Opens in a new window]

Abstract

Design has always been seen as an inherently human activity and hard to automate. It requires a lot of traits that are seldom attributable to machines or algorithms. Consequently, the act of designing is also hard to assess. In particular in an educational context, the assessment of progress of design tasks performed by individuals or teams is difficult, and often only the outcome of the task is assessed or graded. There is a need to better understand, and potentially quantify, design progress. Natural Language Processing (NLP) is one way of doing so. With the advancement in NLP research, some of its models are adopted into the field of design to quantify a design class performance. To quantify and visualize design progress, the NLP models are often deployed to analyze written documentation collected from the class participants at fixed time intervals through the span of a course. This paper will explore several ways of using NLP in assessing design progress, analyze its advantages and shortcomings, and present a case study to demonstrate its application. The paper concludes with some guidelines and recommendations for future development.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NC
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial licence (http://creativecommons.org/licenses/by-nc/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. The dynamics of divergence and convergence (Banathy, 2013).

Figure 1

Figure 2. The spike and synergy diagram (Chiu et al., 2022).

Figure 2

Table 1. Sample output of LSA

Figure 3

Figure 3. A sample t-SNE wordcloud which is produced from a high-dimension data from Word2Vec (each dot represents a unique word in the corpus).

Figure 4

Figure 4. A representation of Centroid determination (left) and Euclidean distance computation (right).

Figure 5

Figure 5. An image representing how tokens are compared.

Figure 6

Figure 6. Overall NLP process flowchart.

Figure 7

Table 2. Sample of data collection done in the MIbD class for three participates

Figure 8

Figure 7. Result of MIbD class with four different NLP combinations using the same dataset (plotted with interval against divergence).

Figure 9

Figure 8. LSA versus W2V using Euclidean distance (plotted with interval against divergence).

Figure 10

Figure 9. LSA versus W2V using Cosine similarity (plotted with interval against divergence).

Figure 11

Table 3. Sample of LSA distribution table using responses from one student (P1)