Lacunarity Pooling Layers for Plant Image Classification using Texture Analysis
Akshatha Mohan, Joshua Peeples
TL;DR
This work introduces lacunarity pooling layers to capture spatial texture heterogeneity in CNN feature maps for agricultural image classification. By implementing base lacunarity, differential box counting lacunarity, and multi-scale lacunarity—with integration into frozen backbones and fusion with GAP features—the approach enriches texture representation. Experiments on PlantVillage, LeavesTex1200, and DeepWeeds across ConvNeXt, ResNet18, and DenseNet161 demonstrate consistent improvements, with multi-scale lacunarity yielding the strongest gains and clearer class separability in embeddings. The technique offers a practical, parameter-conscious way to improve texture-aware vision models for plant disease and species identification, with potential extensions to other modalities and tasks.
Abstract
Pooling layers (e.g., max and average) may overlook important information encoded in the spatial arrangement of pixel intensity and/or feature values. We propose a novel lacunarity pooling layer that aims to capture the spatial heterogeneity of the feature maps by evaluating the variability within local windows. The layer operates at multiple scales, allowing the network to adaptively learn hierarchical features. The lacunarity pooling layer can be seamlessly integrated into any artificial neural network architecture. Experimental results demonstrate the layer's effectiveness in capturing intricate spatial patterns, leading to improved feature extraction capabilities. The proposed approach holds promise in various domains, especially in agricultural image analysis tasks. This work contributes to the evolving landscape of artificial neural network architectures by introducing a novel pooling layer that enriches the representation of spatial features. Our code is publicly available.
