Image collections clustering in large databases on the basis of recurrent optimization
Abstract
The following paper considers methods for clustering large amounts of data and proposes a modification of the density-based approach to clustering multimedia objects with disturbance. The analysis of the existing DENCLUE method is carried out, and the matrix influence function is introduced, which makes it possible to effectively use this approach in the analysis of multidimensional objects, the collections of images, video and multimedia data in particular. The introduced matrix form makes it possible to increase the speed of clustering due to the absence of vectorization-devectorization of the initial data.
Downloads
References
Han J., Kamber M. Data Mining: Concepts and Techniques., 2-nd ed., San Francisco: Morgan Kaufmann, 2006., 800 p.
Gan G., Ma C., Wu J. Data Clustering: Theory, Algorithms, and Applications., Philadelphia: SIAM, 2007. – 466 p.
Abonyi J., Feil B. Cluster Analysis for Data Mining and System Identification., Basel: Birkhäuser, 2007., 303 p.
Olson D.L., Dursun D. Advanced Data Mining Techniques., Berlin: Springer, 2008., 180 p.
Xu R., Wunsch D.C. Clustering., Hoboken: John Wiley&Sons, 2008., 358 p.
Kohonen T. Self-Organizing Maps., 1-st ed., Berlin: Springer, 1995., 501 p.
Ester M., Kriegel H.-P., Sander J., Xu X. A density-based algorithm for discovering clusters in large spatial database with noise // Proc. Int. Conf. on Knowledge Discovery in Databases and Data Mining., Portlend, Oregon: AAAIO Press, 1996., P. 226-331.
Xu X., Ester M., Kriegel H., Sander J. A distribution-based clustering algorithm for mining in large spatial databases // Proc. 14-th Int. Conf. in Data Clustering “ICDE’98”, Orlando FLA: IEEE Computer Society, 1998, P. 324-331.
Ankerst M., Breunig M., Krilgel H., Sander J. OPTICS: Ordering points to identify the clustering structure // Proc. 1999 ACM-SIGMOD Int. Conf. Management of Data. Philadelphia, PA, 1999, P. 49-60.
Dash M. “1+1>2”: Merging distance and density based clustering // Proc. Int. Conf. on Database systems for Advanced Applications., Hong Kong. AEEE Computer Society, 2001, P. 30-33.
Hu H., Ester M., Sander A. Distribution-based clustering algorithm for mining in large spatial databases // Proc. 14-th Int. Conf. on Data Clustering “ICDE’98”, Orlando: FLA AEEE Computer Society, 1998, P. 324-331.