Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-07T16:03:15.090Z Has data issue: false hasContentIssue false

Prompting for products: investigating design space exploration strategies for text-to-image generative models

Published online by Cambridge University Press:  15 January 2025

Leah Chong
Affiliation:
Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
I-Ping Lo
Affiliation:
Department of Information and Computing Sciences, Utrecht University, Utrecht, Netherlands
Jude Rayan
Affiliation:
Department of Cognitive Science, University of California, San Diego, San Diego, CA, USA
Steven Dow
Affiliation:
Department of Cognitive Science, University of California, San Diego, San Diego, CA, USA
Faez Ahmed
Affiliation:
Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
Ioanna Lykourentzou*
Affiliation:
Department of Information and Computing Sciences, Utrecht University, Utrecht, Netherlands
*
Corresponding author Ioanna Lykourentzou i.lykourentzou@uu.nl
Rights & Permissions [Opens in a new window]

Abstract

Text-to-image models are enabling efficient design space exploration, rapidly generating images from text prompts. However, many generative AI tools are imperfect for product design applications as they are not built for the goals and requirements of product design. The unclear link between text input and image output further complicates their application. This work empirically investigates design space exploration strategies that can successfully yield product images that are feasible, novel and aesthetic – three common goals in product design. Specifically, users’ actions within the global and local editing modes, including their time spent, prompt length, mono versus multi-criteria prompts, and goal orientation of prompts, are analyzed. Key findings reveal the pivotal role of mono versus multi-criteria and goal orientation of prompts in achieving specific design goals over time and prompt length. The study recommends prioritizing the use of multi-criteria prompts for feasibility and novelty during global editing while favoring mono-criteria prompts for aesthetics during local editing. Overall, this article underscores the nuanced relationship between the AI-driven text-to-image models and their effectiveness in product design, urging designers to carefully structure prompts during different editing modes to better meet the unique demands of product design.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Global and local editing modes in Leonardo.AI and their example input and output. The global editing mode is primarily for generating entire images by entering text prompts, while the local editing mode is for more detailed refinement of selected images using features like prompting, masking and erasing.

Figure 1

Figure 2. Example problem in the crowd-sourced image evaluations.

Figure 2

Figure 3. The distribution of time spent in global and local editing modes. On average, the participants spent about 38.5% of their time in the global editing mode, and the other 61.5% in the local editing mode.

Figure 3

Figure 4. Average Prompt Length. The participants use significantly longer prompts in the global editing mode compared to the local editing mode.

Figure 4

Table 1. Descriptions of feasibility, novelty and aesthetics-oriented prompts

Figure 5

Figure 5. Percentage of mono-criteria (feasibility, novelty and aesthetics-oriented), multi-criteria and no-criteria prompts. the participants use much more multi-criteria prompts in the global editing mode than in the local editing mode. Most of the mono-criteria prompts in the global editing mode are feasibility or novelty-oriented, while those in the local editing mode are mostly feasibility-oriented with some being novelty and aesthetics-oriented.

Figure 6

Figure 6. Percentage of Feasibility, Novelty and Aesthetics-Oriented Prompts. The results shown here include both mono and multi-criteria prompts, therefore not adding up to 100%. While targeting all three goals often in the global editing mode, the participants tend to focus more on feasibility and aesthetics than novelty in the local editing mode.

Figure 7

Figure 7. Correlation between the prompting characteristics (mono versus multi and goal orientation) in global and local editing modes and the crowd-sourced design ratings (feasibility, novelty and aesthetics). The values are indicated as (correlation coefficient) (p-value). The results bolded with dashed lines are statistically significant at 5%.

Figure 8

Figure A1. Correlation graphs for all the statistically significant results in Figure 7.

Figure 9

Table A1. Example prompts with different goal orientation(s)