Multi-Object Advertisement Creative Generation

Jialu Gao, Mithun Das Gupta, Qun Li, Raveena Kshatriya, Andrew D. Wilson, Keng-hao Chang, Balasaravanan Thoravi Kumaravel

Abstract

Lifestyle images are photographs that capture environments and objects in everyday settings. In furniture product marketing, advertisers often create lifestyle images containing products to resonate with potential buyers, allowing buyers to visualize how the products fit into their daily lives. While recent advances in Generative Artificial Intelligence (GenAI) have given rise to realistic image content creation, their application in e-commerce advertising is challenging because high-quality ads must authentically represent the products in realistic scenarios. As a result, manual intervention is usually required for individual generations, making it difficult to scale to larger product catalogs. To understand the challenges faced by advertisers using GenAI to create lifestyle images at scale, we conducted evaluations on ad images generated using state-of-the-art image generation models and identified the major challenges. Based on our findings, we present CreativeAds, a multi-product ad creation system that supports scalable automated generation with customized parameter adjustment for individual generations. To ensure automated high-quality ad generation, CreativeAds introduces a pipeline of three modules that address challenges in product pairing, layout generation, and background generation separately. Furthermore, CreativeAds provides an intuitive user interface that lets users oversee generation at scale, and it supports detailed controls on individual generations for user-customized adjustments. We performed a user study on CreativeAds and extensive evaluations of the generated images, demonstrating CreativeAds's ability to create large numbers of high-quality images for advertisers without requiring expertise in GenAI tools.

Paper Structure (30 sections, 13 figures, 5 tables)

This paper contains 30 sections, 13 figures, 5 tables.

Figures (13)

  • Figure 1: The initial workflow design for multi-product ad generation consists of three stages: product sampling, placement, and background generation.
  • Figure 2: Common failure cases when state-of-the-art generation models are applied directly to multi-product ad creation.
  • Figure 3: Overview of the system workflow. CreativeAds can batch-generate high-quality ads given a user-specified room type and generation style. The system consists of three core modules, each designed to address a key challenge in realistic and authentic ad creation. First, the Product Pairing Module selects product combinations that are both semantically and visually compatible, ensuring functional coherence and viewpoint consistency. Next, the Layout Generation Module determines the spatial arrangement and relative sizes of products on a blank canvas to achieve a natural and balanced composition. Finally, the Background Generation Module inpaints the surrounding environment to highlight the selected products while preserving overall visual authenticity.
  • Figure 4: User interface of CreativeAds. The interface is designed to support efficient control and inspection of batch ad generation. On the left panel, users can specify high-level parameters such as the desired room type and visual style. The central panel allows users to select specific product category combinations to guide the generation process. On the right panel, a gallery view presents the generated images in a compact, scrollable layout. Users can click on individual thumbnails to view full-size images, inspect generation details, and make fine-grained adjustments to individual outputs as needed.
  • Figure 5: Illustration of the Product Pairing Module. This module first categorizes products into room types using a vision-language model (VLM) and then pairs semantically compatible items. To ensure visual coherence, it further filters product pairs by inferring the camera tilt from product images, selecting only those captured from similar viewpoints.
  • ...and 8 more figures
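
The pairing logic described in the Figure 5 caption (group products by room type, pair compatible items, then keep only pairs photographed from similar viewpoints) can be illustrated with a minimal sketch. This is not the authors' implementation. The `Product` fields, the `pair_products` helper, and the 10-degree tilt threshold are all hypothetical, the room type and camera tilt are assumed to have been estimated beforehand (for example by a VLM and a tilt estimator, neither shown here), and "same room type, different category" is used only as a simple stand-in for semantic compatibility.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Product:
    # All fields are hypothetical; the paper does not specify a data schema.
    name: str
    category: str    # e.g. "sofa", "coffee table"
    room_type: str   # assumed to be precomputed, e.g. by a VLM classifier
    tilt_deg: float  # assumed camera-tilt estimate for the product image


def pair_products(products, max_tilt_diff=10.0):
    """Return product pairs that share a room type, differ in category,
    and whose estimated camera tilts differ by at most max_tilt_diff degrees.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    pairs = []
    for a, b in combinations(products, 2):
        same_room = a.room_type == b.room_type
        different_category = a.category != b.category
        similar_view = abs(a.tilt_deg - b.tilt_deg) <= max_tilt_diff
        if same_room and different_category and similar_view:
            pairs.append((a, b))
    return pairs


# Made-up catalog entries, for illustration only.
catalog = [
    Product("sofa-01", "sofa", "living room", 12.0),
    Product("table-07", "coffee table", "living room", 15.0),
    Product("lamp-03", "floor lamp", "living room", 40.0),
    Product("bed-02", "bed", "bedroom", 14.0),
]

pairs = pair_products(catalog)
pair_names = [(a.name, b.name) for a, b in pairs]
print(pair_names)
```

In this toy catalog only the sofa and the coffee table survive the filter: the floor lamp shares their room type but was captured at a much steeper tilt, and the bed belongs to a different room. Loosening `max_tilt_diff` admits more pairs at the cost of viewpoint consistency.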