基于多智能体系统的卷烟包装设计预测方法

邵钰雯; 王正原; 王秋领; 张勇; 张峻松; 田海英

doi:10.16135/j.issn1002-0861.2026.0556

基于多智能体系统的卷烟包装设计预测方法

Predicting Cigarette Packaging Design Preferences Using a Multi-Agent System Approach

摘要

摘要: 为解决传统包装设计方法难以刻画用户个性化审美偏好的问题，提出一种融合视觉语言模型（VLM）与大语言模型（LLM）的卷烟包装设计偏好分析方法，并构建多智能体协同框架Box Insight。首先，通过网络爬虫收集卷烟包装评论数据，提取与消费者审美相关的感性词汇，构建包含独特、高端、简洁、奢华和文雅5个维度的用户感性需求体系；其次，建立卷烟包装图像数据集，结合专家评估提炼出开合方式、字体风格、插画设计、背景颜色、构图设计和底纹设计6类共34项设计要素；最后，基于3个基座模型（Qwen2.5-VL-72B、DeepSeek-v3和QWQ-32B）进行领域适配训练，分别构建视觉特征识别模型SP-VL-Qwen、语义推理模型SP-Reasoner-DS和偏好预测模型SP-Pref-QWQ，并通过多智能体协同机制实现包装设计特征解析、用户审美推理与偏好预测。通过构建用户问卷数据集对模型进行验证，并与不引入视觉信息的基线模型进行对比实验。结果表明：在5个感性维度上，Box Insight框架的平均预测准确率达到80.07%，较单模态方法提升17.6百分点，尤其在高端、奢华两个维度上的提升尤为显著。研究结果表明，引入视觉语义信息能够有效提升用户设计偏好推理能力，为快速消费品包装的个性化设计提供了一种新的智能化分析路径。

Abstract: To address the limitation of traditional packaging design methods in capturing users’ personalized aesthetic preferences, this study proposes a cigarette packaging design preference analysis method that integrates a Visual-Language Model (VLM) with a Large Language Model (LLM), and constructs a multi-agent collaborative framework named Box Insight. First, online reviews of cigarette packaging were collected through web crawling, and Kansei-related vocabulary reflecting consumer aesthetic perceptions was extracted to establish a user emotional demand system comprising five dimensions: uniqueness, premium quality, simplicity, luxury, and elegance. Second, a cigarette packaging image dataset was established, and six categories of design elements—opening mechanism, typography style, illustration design, background color, layout composition, and background pattern—were identified through expert evaluation, resulting in a total of 34 detailed design features. On this basis, domain-specific adaptation training was conducted on three foundation models—Qwen2.5-VL-72B, DeepSeek-v3, and QWQ-32B—to develop three task-oriented models: the visual feature recognition model SP-VL-Qwen, the semantic reasoning model SP-Reasoner-DS, and the preference prediction model SP-Pref-QWQ. These models were integrated through a multi-agent collaborative mechanism to enable packaging design feature analysis, user aesthetic reasoning, and preference prediction. A user questionnaire dataset was constructed to evaluate the proposed framework, and comparative experiments were conducted against a baseline model without visual information. The results show that, across the five Kansei dimensions, the Box Insight framework achieved an average prediction accuracy of 80.07%, representing an improvement of 17.6 percentage points over the unimodal approach, with particularly significant improvements in the premium and luxury dimensions. The findings indicate that incorporating visual semantic information can effectively enhance the inference of user design preferences, providing a new intelligent analytical pathway for personalized packaging design in fast-moving consumer goods.

HTML全文

参考文献(18)

施引文献

资源附件(0)