Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners

Shanu Vashishtha; Abhinav Prakash; Lalitesh Morishetti; Kaushiki Nag; Yokila Arora; Sushant Kumar; Kannan Achan

Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners

Shanu Vashishtha, Abhinav Prakash, Lalitesh Morishetti, Kaushiki Nag, Yokila Arora, Sushant Kumar, Kannan Achan

TL;DR

This work demonstrates the use of text-to-image models for generating personalized web banners with dynamic content for online shoppers based on their interactions using a large language model to systematically extract a tuple of attributes from item meta-information.

Abstract

Text-to-image models such as stable diffusion have opened a plethora of opportunities for generating art. Recent literature has surveyed the use of text-to-image models for enhancing the work of many creative artists. Many e-commerce platforms employ a manual process to generate the banners, which is time-consuming and has limitations of scalability. In this work, we demonstrate the use of text-to-image models for generating personalized web banners with dynamic content for online shoppers based on their interactions. The novelty in this approach lies in converting users' interaction data to meaningful prompts without human intervention. To this end, we utilize a large language model (LLM) to systematically extract a tuple of attributes from item meta-information. The attributes are then passed to a text-to-image model via prompt engineering to generate images for the banner. Our results show that the proposed approach can create high-quality personalized banners for users.

Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners

TL;DR

Abstract

Paper Structure (15 sections, 1 equation, 2 figures, 4 tables)

This paper contains 15 sections, 1 equation, 2 figures, 4 tables.

Introduction
Background and Related Work
Image Generation
Large Language Models
Prompt Engineering for Image Generation
Methodology
Item Representation Problem
Image Generation Problem
Personalization Problem
Evaluation Study
Image quality evaluation
Human evaluation
Conclusion and Future Work
Prompt to LLM
Additional examples from image generation survey

Figures (2)

Figure 1: A flow diagram for the proposed method; rectangle denotes input/output and rhombus denotes action.
Figure 2: Average user score along with standard error for different products.

Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners

TL;DR

Abstract

Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners

Authors

TL;DR

Abstract

Table of Contents

Figures (2)