From Molecules to Mixtures: Learning Representations of Olfactory Mixture Similarity using Inductive Biases
Gary Tom, Cher Tian Ser, Ella M. Rajaonson, Stanley Lo, Hyun Suk Park, Brian K. Lee, Benjamin Sanchez-Lengeling
TL;DR
POMMix extends the Principal Odor Map to olfactory mixtures by combining a mono-molecular GraphNets-based representation with a permutation-invariant mixture attention module and a distance-based prediction head. The two-stage training regime leverages a mono-molecular pretraining phase and a mixture-focused fine-tuning phase, achieving state-of-the-art predictive performance on mixture similarity across public datasets and demonstrating generalization to unseen molecules and larger mixture sizes. The work highlights the power of incorporating domain-specific inductive biases in low-data olfactory domains, offers interpretable insights via attention maps, and provides fully open data and code to drive reproducibility. Collectively, POMMix advances the digitization of olfaction and has potential implications for fragrance design, flavor science, and related multi-component sensing tasks.
Abstract
Olfaction -- how molecules are perceived as odors to humans -- remains poorly understood. Recently, the principal odor map (POM) was introduced to digitize the olfactory properties of single compounds. However, smells in real life are not pure single molecules, but complex mixtures of molecules, whose representations remain relatively under-explored. In this work, we introduce POMMix, an extension of the POM to represent mixtures. Our representation builds upon the symmetries of the problem space in a hierarchical manner: (1) graph neural networks for building molecular embeddings, (2) attention mechanisms for aggregating molecular representations into mixture representations, and (3) cosine prediction heads to encode olfactory perceptual distance in the mixture embedding space. POMMix achieves state-of-the-art predictive performance across multiple datasets. We also evaluate the generalizability of the representation on multiple splits when applied to unseen molecules and mixture sizes. Our work advances the effort to digitize olfaction, and highlights the synergy of domain expertise and deep learning in crafting expressive representations in low-data regimes.
