Estimating Appearance Models for Image Segmentation via Tensor Factorization
Jeova Farias Sales Rocha Neto
TL;DR
This work tackles the challenge of segmenting images into multiple regions without pre-specified appearance models. It introduces TEAM, a tensor-factorization–based method that estimates region appearances $\theta_1,\dots,\theta_K$ and their proportions $w$ from a single image by exploiting high-order color statistics via moments $\alpha,\beta,\gamma$ and a whitening/diagonalization procedure. When paired with a Markov Random Field graph-cut segmentation (TEAMSEG), the approach yields competitive or superior results on synthetic and real data, while automatically providing region proportions and avoiding filtering-based preprocessing. The method demonstrates robustness across IID-like and textured data, scales favorably in color and image size, and shows promise for applications in remote sensing and biomedical imaging, with future work aimed at linking to topic-modeling frameworks and improving memory efficiency.
Abstract
Image Segmentation is one of the core tasks in Computer Vision and solving it often depends on modeling the image appearance data via the color distributions of each it its constituent regions. Whereas many segmentation algorithms handle the appearance models dependence using alternation or implicit methods, we propose here a new approach to directly estimate them from the image without prior information on the underlying segmentation. Our method uses local high order color statistics from the image as an input to tensor factorization-based estimator for latent variable models. This approach is able to estimate models in multiregion images and automatically output the regions proportions without prior user interaction, overcoming the drawbacks from a prior attempt to this problem. We also demonstrate the performance of our proposed method in many challenging synthetic and real imaging scenarios and show that it leads to an efficient segmentation algorithm.
