Web Image Clustering by Consistent Utilization of Visual Features and Surrounding Texts

Bin Gao; Tie-Yan Liu; Xin Zheng; Qian-Sheng Cheng; Wei-Ying Ma; Tao Qin

Web Image Clustering by Consistent Utilization of Visual Features and Surrounding Texts

Bin Gao ,
Tie-Yan Liu ,
Xin Zheng ,
Qian-Sheng Cheng ,
Wei-Ying Ma ,
Tao Qin

Proceedings of the 13th annual ACM international conference on Multimedia | January 2005

Published by Association for Computing Machinery, Inc.

Download BibTex

Image clustering, an important technology for image processing, has been actively researched for a long period of time. Especially in recent years, with the explosive growth of the Web, image clustering has even been a critical technology to help users digest the large amount of online visual information. However, as far as we know, many previous works on image clustering only used either low-level visual features or surrounding texts, but rarely exploited these two kinds of information in the same framework. To tackle this problem, we proposed a novel method named consistent bipartite graph co-partitioning in this paper, which can cluster Web images based on the consistent fusion of the information contained in both low-level features and surrounding texts. In particular, we formulated it as a constrained multi-objective optimization problem, which can be efficiently solved by semi-definite programming (SDP). Experiments on a real-world Web image collection showed that our proposed method outperformed the methods only based on low-level features or surround texts.

Copyright © 2007 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications Dept, ACM Inc., fax +1 (212) 869-0481, or permissions@acm.org. The definitive version of this paper can be found at ACM's Digital Library --http://www.acm.org/dl/.