How to Strategize Human Content Creation in the Era of GenAI?
Seyed A. Esmaeili, Kevin Lim, Kshipra Bhawalkar, Zhe Feng, Di Wang, Haifeng Xu
TL;DR
The paper analyzes a dynamic competition between a human content creator and GenAI across $k$ topics, capturing time-sensitive value via per-topic discounts $\gamma_i$ and GenAI learning through discounted counts $N_i(t)$. It proves hardness for time-sensitive domains under the rETH and delivers a near-optimal $(1-\epsilon)/2$-approximation via Myopically-Optimize-then-Pause, while in time-insensitive domains ($\gamma_i=1$) it yields a polynomial-time optimal strategy using a reduced DAG and a longest-path computation with complexity $O(Tk^3)$. The work provides extensive simulations showing the proposed methods outperform baselines and offers insights into when to rely on pausing versus continuous content generation. Together, these results guide platform operators on scheduling human content creation and leveraging GenAI data for training in both decay-prone and timeless content domains.
Abstract
Generative AI (GenAI) will have significant impact on content creation platforms. In this paper, we study the dynamic competition between a GenAI and a human contributor. Unlike the human, the GenAI's content only improves when more contents are created by the human over time; however, GenAI has the advantage of generating content at a lower cost. We study the algorithmic problem in this dynamic competition model about how the human contributor can maximize her utility when competing against the GenAI for content generation over a set of topics. In time-sensitive content domains (e.g., news or pop music creation) where contents' value diminishes over time, we show that there is no polynomial time algorithm for finding the human's optimal (dynamic) strategy, unless the randomized exponential time hypothesis is false. Fortunately, we are able to design a polynomial time algorithm that naturally cycles between myopically optimizing over a short time window and pausing and provably guarantees an approximation ratio of $\frac{1}{2}$. We then turn to time-insensitive content domains where contents do not lose their value (e.g., contents on history facts). Interestingly, we show that this setting permits a polynomial time algorithm that maximizes the human's utility in the long run. Finally, we conduct simulations that demonstrate the advantage of our algorithms in comparison to a collection of baselines.
