Towards Vector Optimization on Low-Dimensional Vector Symbolic Architecture
Shijin Duan, Yejia Liu, Gaowen Liu, Ramana Rao Kompella, Shaolei Ren, Xiaolin Xu
TL;DR
This work analyzes gradient-based training for Low-Dimensional Computing (LDC) in binary Vector Symbolic Architecture (VSA), showing that aggressive dimensionality reduction can preserve accuracy when training dynamics are stabilized. It identifies Batch Normalization (BN) and Knowledge Distillation (KD) as key enablers: BN stabilizes feature-vector updates and can be incorporated as a dimension-wise inference threshold, while KD reshapes the learning signal and improves prediction confidence via temperature-controlled distillation. The authors provide extensive analyses and ablations across multiple lightweight datasets, demonstrating that BN+KD substantially boosts inference accuracy and confidence, enabling LDC to reach or approach state-of-the-art accuracy with only a fraction of the memory and latency of high-dimensional VSA models. The results also suggest the approach generalizes to binary neural networks, offering practical strategies for training dynamics and confidence calibration in resource-constrained settings.
Abstract
Vector Symbolic Architecture (VSA) is emerging in machine learning due to its efficiency, but they are hindered by issues of hyperdimensionality and accuracy. As a promising mitigation, the Low-Dimensional Computing (LDC) method significantly reduces the vector dimension by ~100 times while maintaining accuracy, by employing a gradient-based optimization. Despite its potential, LDC optimization for VSA is still underexplored. Our investigation into vector updates underscores the importance of stable, adaptive dynamics in LDC training. We also reveal the overlooked yet critical roles of batch normalization (BN) and knowledge distillation (KD) in standard approaches. Besides the accuracy boost, BN does not add computational overhead during inference, and KD significantly enhances inference confidence. Through extensive experiments and ablation studies across multiple benchmarks, we provide a thorough evaluation of our approach and extend the interpretability of binary neural network optimization similar to LDC, previously unaddressed in BNN literature.
