1. Introduction
With the advancement of intelligent manufacturing technologies, products have become increasingly homogeneous in terms of processes, functions, and performance, while exhibiting greater diversity in visual aesthetic features such as color matching, shape, material, texture, and pattern [
1,
2,
3]. Many scholars argue that products with visual aesthetics that align with consumers’ emotional needs are more likely to attract attention and stimulate purchasing desire, ultimately boosting sales [
4,
5,
6]. Therefore, regardless of the product type, visual aesthetic features should be given significant consideration during the product development phase. Among various visual aesthetic features, shape and color matching are the two core design characteristics in product development [
3,
7]. For years, both academia and industry have commonly employed Kansei engineering to establish models that link consumers’ emotional needs with product design features, aiming to translate abstract emotional demands into tangible product shapes and color-matching schemes [
8,
9].
Regarding Kansei design for product shape, Lo et al. [
10] used six form aesthetic-related Kansei words as evaluation indicators and combined genetic algorithms with fuzzy theory to create a shape design and evaluation method. Liu et al. [
11] proposed a methodology for the shape design of cultural and creative products based on Kansei engineering, factor analysis, and triangular fuzzy numbers. Zhou et al. [
12] introduced a car frontal form design and evaluation method using Kansei engineering and convolutional neural networks. Recently, Wu et al. [
13] developed an intelligent design system that uses generative adversarial networks to transform product hand-drawn sketches into designs that match the target imagery. Yuan et al. [
6] established an aesthetic measurement evaluation index consisting of eight form aesthetics Kansei words, and combined Kansei engineering, genetic algorithms, and eye-tracking technology to propose a design method for automotive forms. Lu et al. [
14] introduced an improved form aesthetic quantification formula and combined it with Kansei engineering and the finite structure method to propose a product shape design and evaluation method.
In addition, regarding Kansei design for product color matching, Tsai and Chou [
15] proposed a color-matching design method for two-color combination products based on Kansei engineering, color harmony, color association, and genetic algorithms. Hsiao et al. [
16] developed a model for quantifying product color-matching aesthetics based on parameters such as hue, value, chroma, and the area of colored regions. This model addressed imagery adjective-guided product color-matching design and was applied to the Kansei design of products [
17]. Moreover, Hsiao and Tsai [
18] suggested that the color imageries found in colored images (e.g., natural images and works of famous painters) could be transferred to product color matching in Kansei design. They developed a color-matching design system based on Kansei engineering and fuzzy theory, demonstrating its feasibility. Recently, Lu and Hsiao [
3] expanded the feasibility of the product color-matching aesthetics quantification model and Kansei design by considering the impact of different observation angles on product color matching. Wu et al. [
7] proposed three methodologies to extract colors from natural images for product color matching, utilizing Kansei engineering, AIGC, and color harmony theory.
Table 1 presents a concise comparison of the key characteristics of these studies. In these studies, combining Kansei engineering with product color matching and shape design, imagery adjectives are commonly used as a medium to build models linking consumer emotional needs with product design features.
In recent years, with the development of artificial intelligence, and big data processing, scholars have constructed generative models to address various specific design problems using machine learning techniques like genetic algorithms and neural networks [
13,
19,
20]. Recently, many emerging tech companies have developed various generative large models based on deep learning networks, such as CNNs, GANs, diffusion models, and transformer models. Specifically, generative large models can be categorized based on different types of generated content (i.e., artificial intelligence generated content, AIGC), including text, image, audio, and video generative models [
7]. Currently, the most popular text generative large model globally is ChatGPT (
https://chatgpt.com/), with its latest version updated to GPT-4.5 as of March 2025. Due to its use of large language models (LLMs) and advanced natural language processing techniques during training, ChatGPT is capable of handling complex text generation tasks such as writing, translation, programming, and text analysis [
21,
22].
In addition, Midjourney, Stable Diffusion, and DALL-E are considered the leading image generative large models internationally [
23,
24,
25]. These models are trained using millions of text-to-image examples and complex deep learning networks, enabling them to generate creative and high-quality images. Since the emergence of generative large models, many scholars have actively applied them to product design. Lu et al. [
9] proposed an automotive shape design method based on ChatGPT, Midjourney, and Stable Diffusion, demonstrating the feasibility of collaboration between generative large models and Kansei engineering. The results showed that generative large models can not only understand abstract user needs but also translate imagery adjectives into concrete product shapes. Wu et al. [
7] proposed several color-matching design methods for household vacuum cleaners based on ChatGPT and Midjourney. The findings revealed that Midjourney could generate colored images that align with the imagery adjectives, providing effective color combinations for target imagery-oriented color design. Du et al. [
26] introduced a Kansei design approach for product shape design based on the 2D sketch-to-3D rendering capability of Stable Diffusion, illustrating the effectiveness of the LoRA model (i.e., stylization model) in optimizing design efficiency. Wang et al. [
27] used Midjourney to generate a large number of forms for reference in designing female electric scooter shapes, enhancing the efficiency of product shape design. Wang et al. [
28] explored the feasibility of applying Midjourney to Ming Dynasty furniture design, paving a new path for the innovative design of furniture shapes. These studies have demonstrated that generative large models positively contribute to product color matching and shape design, optimizing traditional design processes and improving design efficiency.
Based on the aforementioned research background, the motivations for this study are as follows: (1) Although many scholars recognize that color matching [
6,
10,
11,
12,
13,
14] and shape [
15,
16,
17,
18] are the two core design features in product development, and various methodologies for color matching or shape design have been proposed based on Kansei engineering, few have addressed both target imagery-oriented color matching and shape design simultaneously [
3,
7]. Therefore, achieving consistency between color matching and shape toward the same target imagery during product development can significantly enhance the efficiency of Kansei design. (2) Recently, although several scholars have demonstrated the contribution of generative large models to improving product design efficiency through case studies [
26,
27,
28], there is still a lack of examples showcasing collaboration among multiple generative large models. Therefore, presenting a case involving the collaboration of multiple generative large models would help inspire product designers to explore more flexible uses of generative models. (3) In the existing research, whether it is color matching design [
7] or shape design [
9,
26,
27,
28] based on generative large models, the importance of evaluating the aesthetics of generated content has not been emphasized, nor have practical quantitative methods for aesthetic evaluation been provided to select the optimal shape or color combinations. Therefore, incorporating scientifically valid quantitative aesthetic evaluation methods would enhance the rigor and usability of design methodologies.
In summary, this study integrates the text-generative large model ChatGPT, the image-generative large model Midjourney, color harmony theory, and quantitative evaluation of shape and color matching to propose a design methodology that considers both product color and shape. Given the importance of tourism IP cultural products in promoting sustainable tourism development, this study uses the color matching and shape design of the Harbor Seal IP cultural product in Dalian as a case study. The structure of this paper is as follows:
Section 2 introduces the theories and research methods used in this study.
Section 3 presents the implementation steps of the generative large model-driven methodology. In
Section 4, the Harbor Seal IP cultural product from Dalian is used as an example to demonstrate the proposed design methodology in detail and confirm its effectiveness.
Section 5 discusses the research results and analyzes this study’s limitations. Finally, the conclusion highlights the theoretical and practical contributions made by this study.
5. Results and Discussion
Section 4 implemented and validated the design methodology based on GLMs using the Harbor Seal IP cultural product example. The first three phases of the methodology (i.e., preparation, shape generation, and color generation) utilized the text-GLM GPT-4o and the image-GLM Midjourney. In the preparation phase, GPT-4o analyzed tourists’ emotional needs for the Harbor Seal IP cultural product and summarized them into 14 imagery adjectives through prompt engineering. Compared to traditional methods for collecting Kansei imageries, the intelligent search method based on GPT-4o can quickly gather tourists’ emotional feedback from the provided online information and summarize it into imagery adjectives, thereby reducing user research costs. In the shape generation phase, Midjourney was used to generate shapes for the Harbor Seal IP cultural product that corresponded to the target imagery, thereby forming a shape database (see
Figure 5). Compared to traditional Kansei design methods for product shape, the shape generation method based on Midjourney can transform tourists’ abstract Kansei needs into a concrete product shape, laying the foundation for detailed shape design. In the color generation phase, GPT-4o served as the prompt generator for Midjourney, while Midjourney was used to create colored images aligned with the target imagery (see
Figure 8). Compared to traditional methods of extracting colors from images for color matching, the color image generation method based on multimodal GLMs can transform abstract target imageries into corresponding natural color images, laying the foundation for color matching design. Overall, in this paper, a Kansei design methodology is presented for both the shape and color matching of tourism IP cultural products based on GLMs, using the Harbor Seal IP cultural product as an example to demonstrate a collaborative design case involving multimodal GLMs. It highlights the empowerment of generative AI for tourism IP cultural product design, promoting the intelligent development of such products.
This paper emphasizes the quantitative evaluation process for content generated by GLMs. To select the optimal shape and representative color images from the GLM-generated content, this study employed quantitative evaluation methods based on QCE and AHP. Specifically, QCE was used as an indicator for evaluating shape aesthetics, where the lower the entropy value of the shape curve, the higher the aesthetic measurement. AHP was used to evaluate whether the color images corresponded to the target imageries, where higher weight values indicated better alignment with the target imagery. The quantitative evaluation not only improved the practicality of the design methodology but also ensured its rigor and scientific basis. In addition, a combination of quantitative and qualitative evaluations was used to select the optimal color-matching solution. The quantitative evaluation applied the color-matching aesthetic measurement formula to determine the aesthetic measurement of alternatives from two observation angles (see
Table 10 and
Table 11). In the qualitative evaluation, a perceptual evaluation questionnaire for tourists was used to obtain scores for the alternatives from two observation angles (see
Table 12 and
Table 13).
Figure 17 presents the scores of the 16 healing alternatives in both quantitative evaluation (i.e., Mp (Observation angle_1), Mp (Observation angle_2), and Mp (Total)) and qualitative evaluation (i.e., score (Observation angle_1), score (Observation angle_2), and score (total)), while
Figure 18 shows the scores for the 16 cyberpunk alternatives. In this paper, Pearson correlation analysis was conducted between the total values of the quantitative evaluation (i.e., Mp (Total)) and the qualitative evaluation (i.e., score (total)) (see
Table 14). The results showed that the correlation coefficients were greater than 0.7, indicating a statistically significant correlation between the quantitative and qualitative evaluation results. Therefore, the optimal design solutions were selected based on a comprehensive consideration of both evaluation results.
6. Conclusions
This paper focuses on tourism IP cultural products and proposes an IP product color-matching and shape design methodology building on GLMs, using tourists’ emotional needs as a medium. The Harbor Seal IP product served as a case study, following the process of “preparation—shape generation—color generation—color matching” to demonstrate the implementation of the proposed methodology and validate its effectiveness through both quantitative and qualitative means. Compared with the literature in
Table 1, this study makes the following academic contributions: (1) unlike existing design methodologies, the proposed GLM-based methodology considers both color matching and shape, promoting the intelligent development of Kansei design; (2) the different design phases of the proposed methodology utilized text and image-generative large models, presenting a collaborative design case involving multimodal GLMs and inspiring designers to explore the potential applications of other GLMs; and (3) a quantitative evaluation method for GLM-generated content based on the macroscopic shape information of shape curves was presented, enhancing the scientific rigor and practicality of GLM-driven design methodologies. In summary, generative large models improve the efficiency of Kansei design for product color matching and shape, paving a new path for intelligent product design.
Although this paper used the Harbor Seal IP cultural product as a case study to demonstrate the implementation of the proposed methodology and validate its effectiveness through quantitative and qualitative evaluations, some research limitations remain. First, the optimal shape was selected from those generated by the image-generative large model based on QCE and expert perceptual questionnaires; however, some scholars may argue that this approach overly relies on the generative model, thus reducing the designer’s subjective involvement. In future research, a shape elements chart could be constructed based on the generated shapes, and shape elements could be recombined into new shapes under other constraints. Second, this paper introduced a QCE-based method for evaluating shape aesthetics, but the evaluation only considered the external contour curves of the product shape without including other internal shape elements. Although the external contour is the most representative shape curve of tourism IP cultural products, future research should selectively consider representative shape curves based on the morphological characteristics of the target product to ensure scientific rigor in shape evaluation. Third, this study used the top ten colors by area ratio in color images as dominant colors to represent their overall imagery. However, colors with higher chroma and value also significantly contribute to the overall imagery of color images. Therefore, future research should develop a more rigorous strategy for selecting dominant colors to improve the feasibility of color imagery transfer. Finally, compared to other functional products, IP cultural products are more akin to decorative items. While their development does not require extensive consideration of functional components, technical design aspects such as material selection and manufacturing processes remain crucial. Therefore, future research could incorporate a technical design phase into the current framework to enhance the feasibility and manufacturability of product designs.