Table of Contents
Fetching ...

Biomanufacturing Harvest Optimization with Small Data

Bo Wang, Wei Xie, Tugce Martagan, Alp Akcay, Bram van Ravenstein

TL;DR

This work tackles optimal fermentation harvesting under severe data scarcity by marrying mechanistic protein/impurity growth with Bayesian learning and Markov decision processes. It develops a model with $p_{t+1}=p_t e^{\Phi_t}$ and $i_{t+1}=i_t e^{\Psi_t}$, where $\Phi_t \sim \mathcal{N}(\mu^{(p)}_c,\sigma^{(p)2}_c)$ and $\Psi_t \sim \mathcal{N}(\mu^{(i)}_c,\sigma^{(i)2}_c)$, and updates unknown parameters via a Normal–Inverse–Gamma prior to obtain posterior predictive distributions $\widetilde{\Phi}_t$ and $\widetilde{\Psi}_t$ with generalized $t$-distributions. The harvesting problem is formulated as an MDP with a knowledge state $\mathcal{I}_t$ capturing posterior parameters and a hyper-state $\mathcal{H}_t=(p_t,i_t,\mathcal{I}_t)$, featuring a control-limit structure on impurity and a myopic policy that can be optimal under perfect information when certain conditions hold. The paper develops RL with model risk (RL with MR) using Bayesian sparse sampling and Thompson sampling to compute near-optimal policies online, and demonstrates a real MSD implementation that yields about a 50% increase in batch yield with reduced variability. Overall, accounting for model risk and leveraging small-data Bayesian learning significantly improves fermentation harvesting decisions and operational performance, with clear pathways for extension to continuous-time settings and broader biomanufacturing contexts. The results underscore the value of data-driven decision-making in early-stage bioprocess development and the practical impact of integrate learning and optimization under uncertainty.

Abstract

In biopharmaceutical manufacturing, fermentation processes play a critical role in productivity and profit. A fermentation process uses living cells with complex biological mechanisms, leading to high variability in the process outputs, namely, the protein and impurity levels. By building on the biological mechanisms of protein and impurity growth, we introduce a stochastic model to characterize the accumulation of the protein and impurity levels in the fermentation process. However, a common challenge in the industry is the availability of only a very limited amount of data, especially in the development and early stages of production. This adds an additional layer of uncertainty, referred to as model risk, due to the difficulty of estimating the model parameters with limited data. In this paper, we study the harvesting decision for a fermentation process (i.e., when to stop the fermentation and collect the production reward) under model risk. We adopt a Bayesian approach to update the unknown parameters of the growth-rate distributions, and use the resulting posterior distributions to characterize the impact of model risk on fermentation output variability. The harvesting problem is formulated as a Markov decision process model with knowledge states that summarize the posterior distributions and hence incorporate the model risk in decision-making. Our case studies at MSD Animal Health demonstrate that the proposed model and solution approach improve the harvesting decisions in real life by achieving substantially higher average output from a fermentation batch along with lower batch-to-batch variability.

Biomanufacturing Harvest Optimization with Small Data

TL;DR

This work tackles optimal fermentation harvesting under severe data scarcity by marrying mechanistic protein/impurity growth with Bayesian learning and Markov decision processes. It develops a model with and , where and , and updates unknown parameters via a Normal–Inverse–Gamma prior to obtain posterior predictive distributions and with generalized -distributions. The harvesting problem is formulated as an MDP with a knowledge state capturing posterior parameters and a hyper-state , featuring a control-limit structure on impurity and a myopic policy that can be optimal under perfect information when certain conditions hold. The paper develops RL with model risk (RL with MR) using Bayesian sparse sampling and Thompson sampling to compute near-optimal policies online, and demonstrates a real MSD implementation that yields about a 50% increase in batch yield with reduced variability. Overall, accounting for model risk and leveraging small-data Bayesian learning significantly improves fermentation harvesting decisions and operational performance, with clear pathways for extension to continuous-time settings and broader biomanufacturing contexts. The results underscore the value of data-driven decision-making in early-stage bioprocess development and the practical impact of integrate learning and optimization under uncertainty.

Abstract

In biopharmaceutical manufacturing, fermentation processes play a critical role in productivity and profit. A fermentation process uses living cells with complex biological mechanisms, leading to high variability in the process outputs, namely, the protein and impurity levels. By building on the biological mechanisms of protein and impurity growth, we introduce a stochastic model to characterize the accumulation of the protein and impurity levels in the fermentation process. However, a common challenge in the industry is the availability of only a very limited amount of data, especially in the development and early stages of production. This adds an additional layer of uncertainty, referred to as model risk, due to the difficulty of estimating the model parameters with limited data. In this paper, we study the harvesting decision for a fermentation process (i.e., when to stop the fermentation and collect the production reward) under model risk. We adopt a Bayesian approach to update the unknown parameters of the growth-rate distributions, and use the resulting posterior distributions to characterize the impact of model risk on fermentation output variability. The harvesting problem is formulated as a Markov decision process model with knowledge states that summarize the posterior distributions and hence incorporate the model risk in decision-making. Our case studies at MSD Animal Health demonstrate that the proposed model and solution approach improve the harvesting decisions in real life by achieving substantially higher average output from a fermentation batch along with lower batch-to-batch variability.

Paper Structure

This paper contains 32 sections, 6 theorems, 74 equations, 8 figures, 3 tables, 2 algorithms.

Key Result

Proposition 1

(i) $\hbox{E}\left[\widetilde{\sigma}_{t}^{(p)2}\right] = \sigma^{(p)2}_c + \dfrac{(2J_t-1)\sigma^{(p)2}_c}{(J_t^2-2J_t)}$ and $\hbox{Var}\left[\widetilde{\sigma}_{t}^{(p)2}\right] = \dfrac{2(J_t^3+J_t^2-J_t-1)\sigma^{(p)4}_c}{J_t^4 - 4J_t^3 + 4J_t^2}$. (ii) Conditional on the historical data $\math

Figures (8)

  • Figure 1: Illustration of fermentation dynamics using industry data from MSD
  • Figure 2: Illustration of how the harvest boundary is affected by the model risk (based on the case study parameters presented in Section \ref{['subsec:overview']}).
  • Figure 3: The frequency of the number of decision epochs at which the harvesting decision is made for the case with $J_0=3$ (out of 100 simulation replications).
  • Figure 4: Optimal harvesting thresholds (above curve denotes harvest region) with $J_0 \in \{3, 20\}$ under the strategies PI-MDP, RL with MR, and CP.
  • Figure 5: (Color online) Performance of different batches of the same product produced over time: Black dots represent performance before implementation. Red dots represent performance after implementation.
  • ...and 3 more figures

Theorems & Definitions (8)

  • Proposition 1
  • Proposition 2
  • Proposition 3
  • Definition 1: Myopic Policy under Perfect Information
  • Proposition 4
  • Definition 2: Myopic Policy under Model Risk
  • Proposition 5
  • Proposition 6