Scheduling with Uncertain Holding Costs and its Application to Content Moderation

Caner Gocmen; Thodoris Lykouris; Deeksha Sinha; Wentao Weng

Scheduling with Uncertain Holding Costs and its Application to Content Moderation

Caner Gocmen, Thodoris Lykouris, Deeksha Sinha, Wentao Weng

TL;DR

The paper addresses scheduling with uncertain holding costs arising in content moderation, where view-driven costs evolve according to a Markovian tree. It develops the Opportunity-adjusted Remaining Cost (OaRC) index policy, grounded in a Markovian ski-rental analogy and a fluid relaxation, achieving an $\tilde{O}(\sqrt{N})$ regret that vanishes as system size grows and remains independent of the state-space size. A data-driven variant, HOaRC, uses hindsight approximations to predict future costs, enabling practical deployment. Across synthetic and real data, HOaRC reduces policy-violating views and saves reviewer-hours relative to canonical baselines, demonstrating significant operational impact for large-scale human moderation pipelines.

Abstract

In content moderation for social media platforms, the cost of delaying the review of a content is proportional to its view trajectory, which fluctuates and is apriori unknown. Motivated by such uncertain holding costs, we consider a queueing model where job states evolve based on a Markov chain with state-dependent instantaneous holding costs. We demonstrate that in the presence of such uncertain holding costs, the two canonical algorithmic principles, instantaneous-cost ($cμ$-rule) and expected-remaining-cost ($cμ/θ$-rule), are suboptimal. By viewing each job as a Markovian ski-rental problem, we develop a new index-based algorithm, Opportunity-adjusted Remaining Cost (OaRC), that adjusts to the opportunity of serving jobs in the future when uncertainty partly resolves. We show that the regret of OaRC scales as $\tilde{O}(L^{1.5}\sqrt{N})$, where $L$ is the maximum length of a job's holding cost trajectory and $N$ is the system size. This regret bound shows that OaRC achieves asymptotic optimality when the system size $N$ scales to infinity. Moreover, its regret is independent of the state-space size, which is a desirable property when job states contain contextual information. We corroborate our results with an extensive simulation study based on two holding cost patterns (online ads and user-generated content) that arise in content moderation for social media platforms. Our simulations based on synthetic and real datasets demonstrate that OaRC consistently outperforms existing practice, which is based on the two canonical algorithmic principles.

Scheduling with Uncertain Holding Costs and its Application to Content Moderation

TL;DR

regret that vanishes as system size grows and remains independent of the state-space size. A data-driven variant, HOaRC, uses hindsight approximations to predict future costs, enabling practical deployment. Across synthetic and real data, HOaRC reduces policy-violating views and saves reviewer-hours relative to canonical baselines, demonstrating significant operational impact for large-scale human moderation pipelines.

Abstract

-rule) and expected-remaining-cost (

-rule), are suboptimal. By viewing each job as a Markovian ski-rental problem, we develop a new index-based algorithm, Opportunity-adjusted Remaining Cost (OaRC), that adjusts to the opportunity of serving jobs in the future when uncertainty partly resolves. We show that the regret of OaRC scales as

, where

is the maximum length of a job's holding cost trajectory and

is the system size. This regret bound shows that OaRC achieves asymptotic optimality when the system size

scales to infinity. Moreover, its regret is independent of the state-space size, which is a desirable property when job states contain contextual information. We corroborate our results with an extensive simulation study based on two holding cost patterns (online ads and user-generated content) that arise in content moderation for social media platforms. Our simulations based on synthetic and real datasets demonstrate that OaRC consistently outperforms existing practice, which is based on the two canonical algorithmic principles.

Scheduling with Uncertain Holding Costs and its Application to Content Moderation

TL;DR

Abstract

Scheduling with Uncertain Holding Costs and its Application to Content Moderation

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (48)