Table of Contents
Fetching ...

Leaf clustering using circular densities

Luis E. Nieto-Barajas

TL;DR

This work casts leaf shape characterization via centroid contour distances (CCD) into a density-based framework by modeling CCDs as circular densities, enabling scale normalization and rotation alignment through mean or mode shifts. It compares four density-distance measures ($D_1$–$D_4$) within a complete-linkage hierarchical clustering setup, using both mean and mode rotations, and evaluates performance on a motivating 10-leaf set and the OSU Leaf dataset. Key contributions include a practical normalization pipeline, a quartet of distance metrics (with $D_4$ tied to first $2r$ trigonometric moments), and a heterogeneity-based criterion to compare clusterings on OSU data. The approach yields interpretable clusters for distinct leaf groups and provides ready-to-use code and data for researchers.

Abstract

In the biology field of botany, leaf shape recognition is an important task. One way of characterising the leaf shape is through the centroid contour distances (CCD). Each CCD path might have different resolution, so normalisation is done by associating each contour to a circular density. Densities are rotated by subtracting the mean or mode preferred direction. Distance measures between densities are used to produce a hierarchical clustering method to cluster the leaves. We illustrate our approach with a motivating small dataset as well as a larger dataset.

Leaf clustering using circular densities

TL;DR

This work casts leaf shape characterization via centroid contour distances (CCD) into a density-based framework by modeling CCDs as circular densities, enabling scale normalization and rotation alignment through mean or mode shifts. It compares four density-distance measures () within a complete-linkage hierarchical clustering setup, using both mean and mode rotations, and evaluates performance on a motivating 10-leaf set and the OSU Leaf dataset. Key contributions include a practical normalization pipeline, a quartet of distance metrics (with tied to first trigonometric moments), and a heterogeneity-based criterion to compare clusterings on OSU data. The approach yields interpretable clusters for distinct leaf groups and provides ready-to-use code and data for researchers.

Abstract

In the biology field of botany, leaf shape recognition is an important task. One way of characterising the leaf shape is through the centroid contour distances (CCD). Each CCD path might have different resolution, so normalisation is done by associating each contour to a circular density. Densities are rotated by subtracting the mean or mode preferred direction. Distance measures between densities are used to produce a hierarchical clustering method to cluster the leaves. We illustrate our approach with a motivating small dataset as well as a larger dataset.
Paper Structure (5 sections, 5 equations, 5 figures, 1 table)

This paper contains 5 sections, 5 equations, 5 figures, 1 table.

Figures (5)

  • Figure 1: Ten leaves CCD data as circular densities. Without normalisation (left) and with normalisation and mean rotation (right).
  • Figure 2: FRS leaves plot. Before rotation (top) and after mean rotation (bottom). The dot represents the center of the leaf.
  • Figure 3: Ten leaves plot after mean rotation.
  • Figure 4: Ten leaves dataset. Hierarchical clustering dendrograms for the four distances.
  • Figure 5: OSU dataset. Plot of two leaves for each of the six clusters obtained.