Hybrid Lie semi-group and cascade structures for the generalized Gaussian derivative model for visual receptive fields

Tony Lindeberg

Hybrid Lie semi-group and cascade structures for the generalized Gaussian derivative model for visual receptive fields

Tony Lindeberg

TL;DR

This work tackles the problem of variability in visual image structures caused by viewing-condition–induced geometric transformations by formulating covariant receptive fields through a multi-parameter, generalized Gaussian derivative model. It develops two complementary theoretical strands: (i) infinitesimal relations that resemble hybrid Lie semi-group generators for spatial and spatio-temporal smoothing, expressed via generalized Hermite polynomials, and (ii) macroscopic cascade smoothing relations that describe how coarse-scale receptive field responses can be computed from finer-scale responses. The results cover purely spatial, isotropic spatio-temporal, and affine spatio-temporal smoothing, including time-causal variants via the time-causal limit kernel, and provide explicit parameter-transform relations and incremental kernels for cascade implementations. These theoretical contributions enable efficient bank-based computation of covariant receptive fields and offer foundational insights for modeling simple-cell computations in biological vision, with potential implications for geometric deep learning and robust visual processing across viewing conditions.

Abstract

Because of the variabilities of real-world image structures under the natural image transformations that arise when observing similar objects or spatio-temporal events under different viewing conditions, the receptive field responses computed in the earliest layers of the visual hierarchy may be strongly influenced by such geometric image transformations. One way of handling this variability is by basing the vision system on covariant receptive field families, which expand the receptive field shapes over the degrees of freedom in the image transformations. This paper addresses the problem of deriving relationships between spatial and spatio-temporal receptive field responses obtained for different values of the shape parameters in the resulting multi-parameter families of receptive fields. For this purpose, we derive both (i) infinitesimal relationships, roughly corresponding to a combination of notions from semi-groups and Lie groups, as well as (ii) macroscopic cascade smoothing properties, which describe how receptive field responses at coarser spatial and temporal scales can be computed by applying smaller support incremental filters to the output from corresponding receptive fields at finer spatial and temporal scales, structurally related to the notion of Lie algebras, although with directional preferences. The presented results provide (i) a deeper understanding of the relationships between spatial and spatio-temporal receptive field responses for different values of the filter parameters, which can be used for both (ii) designing more efficient schemes for computing receptive field responses over populations of multi-parameter families of receptive fields, as well as (iii)~formulating idealized theoretical models of the computations of simple cells in biological vision.

Hybrid Lie semi-group and cascade structures for the generalized Gaussian derivative model for visual receptive fields

TL;DR

Abstract

Hybrid Lie semi-group and cascade structures for the generalized Gaussian derivative model for visual receptive fields

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)