Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Andrzej Mizera, Jakub Zarzycki
TL;DR
This work addresses the challenge of designing scalable interventions for cellular reprogramming by framing it as source-target attractor control in BN and PBN models under asynchronous updates. It introduces pbn-STAC, a DRL-based framework that uses pseudo-attractors (PASIP) to identify frequently revisited states during training and an exploration probability boost (EPB) to stabilize learning, with Branching Dueling Q-Networks (BDQ) to handle multi-gene perturbations. The approach delivers control strategies that are competitive with optimal solutions wherever ground truth is available and demonstrates robustness on realistic GRN models of melanoma and IRBB-33, highlighting potential for scalable wet-lab-applicable reprogramming guidance. The work advances scalable, realistic DRL-based control in large GRNs and provides a concrete pathway toward aiding cellular reprogramming experiments through in silico predictions.
Abstract
Cellular reprogramming can be used for both the prevention and cure of different diseases. However, the efficiency of discovering reprogramming strategies with classical wet-lab experiments is hindered by lengthy time commitments and high costs. In this study, we develop a novel computational framework based on deep reinforcement learning that facilitates the identification of reprogramming strategies. For this aim, we formulate a control problem in the context of cellular reprogramming for the frameworks of BNs and PBNs under the asynchronous update mode. Furthermore, we introduce the notion of a pseudo-attractor and a procedure for identification of pseudo-attractor state during training. Finally, we devise a computational framework for solving the control problem, which we test on a number of different models.
