iScheduler: Reinforcement Learning-Driven Continual Optimization for Large-Scale Resource Investment Problems
Yi-Xiang Hu, Yuke Wang, Feng Wu, Zirui Huang, Shuli Zeng, Xiang-Yang Li
TL;DR
This work tackles large-scale Resource Investment Problems (RIP) where scheduling precedence-constrained tasks under shared renewable resources incurs provisioning costs. It reframes RIP solving as an iterative Markov Decision Process over decomposed subproblems, enabling RL-driven adaptive ordering of process scheduling and a learning-based mechanism to select among local solution options. Key contributions include the iScheduler framework, a process-level decomposition with a process interaction graph, a reconfiguration-aware training interface, and the L-RIPLIB industrial-scale benchmark. Empirical results show iScheduler achieves competitive resource costs while reducing time-to-feasibility by up to 43x compared to strong baselines, and effectively reused schedules under dynamic updates to lower reconfiguration latency. Overall, the approach demonstrates a scalable, learning-guided continual optimization paradigm for industrial RIP instances with practical impact in data centers and cloud platforms.
Abstract
Scheduling precedence-constrained tasks under shared renewable resources is central to modern computing platforms. The Resource Investment Problem (RIP) models this setting by minimizing the cost of provisioned renewable resources under precedence and timing constraints. Exact mixed-integer programming and constraint programming become impractically slow on large instances, and dynamic updates require schedule revisions under tight latency budgets. We present iScheduler, a reinforcement-learning-driven iterative scheduling framework that formulates RIP solving as a Markov decision process over decomposed subproblems and constructs schedules through sequential process selection. The framework accelerates optimization and supports reconfiguration by reusing unchanged process schedules and rescheduling only affected processes. We also release L-RIPLIB, an industrial-scale benchmark derived from cloud-platform workloads with 1,000 instances of 2,500-10,000 tasks. Experiments show that iScheduler attains competitive resource costs while reducing time to feasibility by up to 43$\times$ against strong commercial baselines.
