When many noisy genes optimize information flow
Nicholas Lawson, William Bialek
TL;DR
The paper asks how much information a single transcription factor can convey about its concentration when regulating many target genes under a fixed molecular budget. It analyzes a broadcasting regulatory network within an information-theoretic framework, incorporating Poisson mRNA noise and Berg-Purcell arrival fluctuations, and derives how the optimal input distribution and network parameters shape information transmission. Key findings show that information capacity is maximized by distributing resources across many targets, biasing the input toward low $c$, and allowing per-gene noise; the parameter landscape is 'sloppy', enabling near-optimal performance without fine-tuning, and the results are extended with a Boltzmann-like ensemble interpretation. This work provides a principled explanation for the coexistence of noisiness and near-optimal information processing in biology and suggests that diverse, ensemble-like parameter settings can underpin robust information transmission under resource constraints.
Abstract
It often is emphasized that gene expression is noisy. A seemingly contradictory view is that control mechanisms have been optimized to squeeze as much information as possible out of a limited number of molecules. Here we revisit these issues in a simple model where a single transcription factor (TF) controls a large number of target genes. We include only the physically required noise sources: random arrival of TFs at their targets and counting noise in the synthesis and degradation of mRNA. If the cell has a limited total number of mRNA molecules, then the capacity to transmit information about TF concentration is maximized when these resources are distributed across the largest possible number of target genes. To realize this capacity the distribution of TF concentrations must be biased toward smaller values. Thus, in some limits, information transmission is optimized when individual expression levels are noisy. In addition, the dependence of information transmission on the parameters of this multi-gene system has a "sloppy" spectrum, so that optimal performance can co-exist with substantial variability.
