Unsupervised Segmentation of Natural Images via Lossy Data Compression

  • John Wright

Computer Vision and Image Understanding (CVIU) |

In this paper, we cast natural-image segmentation as a problem of clustering texure features
as multivariate mixed data. We model the distribution of the texture features using a mixture
of Gaussian distributions. Unlike most existing clustering methods, we allow the mixture
components to be degenerate or nearly-degenerate. We contend that this assumption
is particularly important for mid-level image segmentation, where degeneracy is typically
introduced by using a common feature representation for different textures in an image.We
show that such a mixture distribution can be effectively segmented by a simple agglomerative
clustering algorithm derived from a lossy data compression approach. Using either 2D
texture filter banks or simple fixed-size windows to obtain texture features, the algorithm
effectively segments an image by minimizing the overall coding length of the feature vectors.
We conduct comprehensive experiments to measure the performance of the algorithm
in terms of visual evaluation and a variety of quantitative indices for image segmentation.
The algorithm compares favorably against other well-known image-segmentation methods
on the Berkeley image database.