Learning Molecular Chirality via Chiral Determinant Kernels
Runhan Shi, Zhicheng Zhang, Letian Chen, Gufeng Yu, Yang Yang
TL;DR
This work tackles the challenge of learning molecular chirality by introducing ChiDeK, a unified framework that explicitly encodes central and axial stereochemistry through chiral determinant kernels and a cross-attentive chiral transformer. The approach embeds the SE(3)-invariant chirality matrix via a differentiable kernel and propagates stereochemical signals to non-chiral atoms, enabling robust handling of diverse chiral forms. A new axial-chirality benchmark (ACMP) for ECD and OR prediction is introduced, and ChiDeK shows substantial improvements over state-of-the-art baselines, especially for axial chirality (average gains >7%). The paper also provides theoretical analysis of the chiral encoder's invariance properties and includes extensive ablations and robustness analyses, supporting the practical utility of explicit stereochemical encoding in molecular ML.
Abstract
Chirality is a fundamental molecular property that governs stereospecific behavior in chemistry and biology. Capturing chirality in machine learning models remains challenging due to the geometric complexity of stereochemical relationships and the limitations of traditional molecular representations, which often lack explicit stereochemical encoding. Existing approaches to chiral molecular representation primarily focus on central chirality, relying on handcrafted stereochemical tags or limited 3D encodings, and thus fail to generalize to more complex forms such as axial chirality. In this work, we introduce ChiDeK (Chiral Determinant Kernels), a framework that systematically integrates stereogenic information into molecular representation learning. We propose the chiral determinant kernel to encode the SE(3)-invariant chirality matrix and employ cross-attention to integrate stereochemical information from local chiral centers into the global molecular representation. This design enables explicit modeling of chirality-related features within a unified architecture that jointly encodes central and axial chirality. To support the evaluation of axial chirality, we construct a new benchmark for electronic circular dichroism (ECD) and optical rotation (OR) prediction. Across four tasks (R/S configuration classification, enantiomer ranking, ECD spectrum prediction, and OR prediction), ChiDeK achieves substantial improvements over state-of-the-art baselines, most notably an average accuracy gain of over 7% on axially chiral tasks.
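To illustrate why a determinant-based quantity is a natural carrier of chirality, the following minimal sketch computes the signed volume spanned by three neighbors of a stereocenter. It is not the chiral determinant kernel proposed in this work; the function names, the choice of three ordered neighbors, and the toy coordinates are assumptions made purely for illustration. The sketch shows that the determinant is unchanged by rotations and translations (SE(3)) but flips sign under a mirror reflection, which is the property that distinguishes enantiomers.

```python
import numpy as np

def signed_volume(center, neighbors):
    """Determinant of the 3x3 matrix whose rows are (neighbor - center).

    Hypothetical helper: assumes exactly three neighbors in a fixed order.
    """
    rel = np.asarray(neighbors, dtype=float) - np.asarray(center, dtype=float)
    return float(np.linalg.det(rel))

def random_rotation(rng):
    """Sample a proper rotation matrix (orthogonal, det = +1)."""
    q, _ = np.linalg.qr(rng.normal(size=(3, 3)))
    if np.linalg.det(q) < 0:
        q[:, 0] *= -1  # flip one column so the matrix is a rotation, not a reflection
    return q

rng = np.random.default_rng(0)

# Toy stereocenter at the origin with three ordered neighbors (assumed geometry).
center = np.zeros(3)
neighbors = np.array([[1.0, 0.0, 0.0],
                      [0.0, 1.0, 0.0],
                      [0.0, 0.0, 1.0]])

R = random_rotation(rng)   # arbitrary rotation
t = rng.normal(size=3)     # arbitrary translation

vol = signed_volume(center, neighbors)

# Rigid motion: rotate and translate every atom; the determinant is unchanged.
vol_moved = signed_volume(center @ R.T + t, neighbors @ R.T + t)

# Mirror image: reflect through the xy-plane; the determinant changes sign.
mirror = np.diag([1.0, 1.0, -1.0])
vol_mirrored = signed_volume(center @ mirror, neighbors @ mirror)

print(vol, vol_moved, vol_mirrored)
```

Because the sign depends on the order of the neighbors, any practical encoding needs a consistent ordering rule; the fixed order used here is an assumption of the sketch.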
