Not logged in.

Contribution Details

Type Conference or Workshop Paper
Scope Discipline-based scholarship
Published in Proceedings Yes
Title CORE: Nonparametric Clustering of Large Numeric Databases
Organization Unit
Authors
  • Andrej Taliun
  • Michael Hanspeter Böhlen
  • Arturas Mazeika
Presentation Type paper
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Page Range 14 - 25
Event Title SDM 2009: Proceedings of the SIAM International Conference on Data Mining
Event Type conference
Event Location Sparks, Nevada, USA
Event Start Date April 30 - 2009
Event End Date May 2 - 2009
Publisher SIAM (Society for Industrial and Applied Mathematics)
Abstract Text Current clustering techniques are able to identify arbitrarily shaped clusters in the presence of noise, but depend on carefully chosen model parameters. The choice of model parameters is difficult: it depends on the data and the clustering technique at hand, and finding good model parameters often requires time consuming human interaction. In this paper we propose CORE, a new nonparametric clustering technique that explicitly computes the local maxima of the density and represents them with cores. CORE proposes an adaptive grid and gradients to define and compute the cores of clusters. The incrementally constructed adaptive grid and the gradients make the identification of cores robust, scalable, and independent of small density fluctuations. Our experimental studies show that CORE without any carefully chosen model parameters produces better quality clustering than related techniques and is efficient for large datasets.
Official URL http://www.siam.org/proceedings/datamining/2009/dm09_003_taliuna.pdf
Other Identification Number merlin-id:2296
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)