Conformal Transformation of Kernels: A Geometric Perspective on Text Classification
Ioana Rădulescu, Alexandra Băicoianu, Adela Mihai
TL;DR
This work investigates conformal transformations of kernels within SVMs for high-dimensional text classification. It introduces a Gaussian Cosine kernel tailored to multinomial geometry and applies two conformal transformations (based on cosine and spherical geodesic distances) to improve class separability, particularly for suboptimal kernels. Experiments on the Reuters dataset across one-vs-rest and one-vs-one tasks show that conformal transforms can yield substantial gains in certain settings (e.g., Gaussian and GC kernels), and can reduce model complexity via fewer support vectors, though they do not consistently surpass the best original kernels. The study highlights the geometric interpretation of kernels on multinomial manifolds, offers a practical protocol to exploit these transforms, and suggests future work in alternative distance measures and representation schemes for further gains in text classification.
Abstract
In this article we investigate the effects of conformal transformations on kernel functions used in Support Vector Machines. Our focus lies in the task of text document categorization, which involves assigning each document to a particular category. We introduce a new Gaussian Cosine kernel alongside two conformal transformations. Building upon previous studies that demonstrated the efficacy of conformal transformations in increasing class separability on synthetic and low-dimensional datasets, we extend this analysis to the high-dimensional domain of text data. Our experiments, conducted on the Reuters dataset on two types of binary classification tasks, compare the performance of Linear, Gaussian, and Gaussian Cosine kernels against their conformally transformed counterparts. The findings indicate that conformal transformations can significantly improve kernel performance, particularly for sub-optimal kernels. Specifically, improvements were observed in 60% of the tested scenarios for the Linear kernel, 84% for the Gaussian kernel, and 80% for the Gaussian Cosine kernel. In light of these findings, it becomes clear that conformal transformations play a pivotal role in enhancing kernel performance, offering substantial benefits.
