Probing Semantic Routing in Large Mixture-of-Expert Models
Matthew Lyle Olson, Neale Ratzlaff, Musashi Hinck, Man Luo, Sungduk Yu, Chendi Xue, Vasudev Lal
TL;DR
The paper investigates whether large MoE models route information semantically rather than purely token-based. By two controlled probes—Word-in-Context (WiC) and lexical substitution (SWORDS)—and a Cohen-like normalized overlap metric, they reveal statistically significant semantic routing across six MoE models from three families, with stronger effects in middle layers and in larger models. A qualitative case study in DiscoveryWorld shows that specific reasoning patterns map to small sets of experts, suggesting emergent cognitive specialization. These findings advance interpretability and open avenues for targeted control and efficiency in sparse MoE deployments.
Abstract
In the past year, large (>100B parameter) mixture-of-expert (MoE) models have become increasingly common in the open domain. While their advantages are often framed in terms of efficiency, prior work has also explored functional differentiation through routing behavior. We investigate whether expert routing in large MoE models is influenced by the semantics of the inputs. To test this, we design two controlled experiments. First, we compare activations on sentence pairs with a shared target word used in the same or different senses. Second, we fix context and substitute the target word with semantically similar or dissimilar alternatives. Comparing expert overlap across these conditions reveals clear, statistically significant evidence of semantic routing in large MoE models.
