Squintability and Other Metrics for Assessing Projection Pursuit Indexes, and Guiding Optimization Choices
H. Sherry Zhang, Dianne Cook, Nicolas Langrené, Jessica Wai Yin Leung
TL;DR
The paper tackles optimizing projection pursuit (PP) indexes within the Projection Pursuit Guided Tour framework, addressing challenges from noisy surfaces and small squint angles. It introduces two quantitative index properties—smoothness, captured via a Gaussian process with a Matérn covariance and summarized by the smoothness parameter $\nu$, and squintability, defined as a normalized gain along projection distance using a parametric logistic model—and uses these to analyze optimizer performance. The Jellyfish Search Optimizer (JSO) is proposed and evaluated against Creeping Random Search (CRS) across multiple indexes, dimensions $d$, and hyper-parameters, finding that higher squintability and more jellyfish improve success while smoothness has limited impact. Implemented in the R packages tourr and ferrn, the approach provides a practical toolkit for selecting PP indexes and optimizers, and offers guidance for computing index properties in new indexes.
Abstract
The projection pursuit (PP) guided tour optimizes a criterion function, known as the PP index, to gradually reveal projections of interest from high-dimensional data through animation. Optimization of some PP indexes can be non-trivial, if they are non-smooth functions, or when the optimum has a small "squint angle", detectable only from close proximity. Here, measures for calculating the smoothness and squintability properties of the PP index are defined. These are used to investigate the performance of a recently introduced swarm-based algorithm, Jellyfish Search Optimizer (JSO), for optimizing PP indexes. The performance of JSO in detecting the target pattern (pipe shape) is compared with existing optimizers in PP. Additionally, JSO's performance on detecting the sine-wave shape is evaluated using different PP indexes (hence different smoothness and squintability) across various data dimensions (d = 4, 6, 8, 10, 12) and JSO hyper-parameters. We observe empirically that higher squintability improves the success rate of the PP index optimization, while smoothness has no significant effect. The JSO algorithm has been implemented in the R package, `tourr`, and functions to calculate smoothness and squintability measures are implemented in the `ferrn` package.
