A new partitioning around medoids algorithm
2003; Taylor & Francis; Volume: 73; Issue: 8 Linguagem: Inglês
10.1080/0094965031000136012
ISSN1563-5163
AutoresMark van der Laan, Katherine S. Pollard, Jennifer Bryan,
Tópico(s)Face and Expression Recognition
ResumoKaufman and Rousseeuw (1990) proposed a clustering algorithm Partitioning Around Medoids (PAM) which maps a distance matrix into a specified number of clusters. A particularly nice property is that PAM allows clustering with respect to any specified distance metric. In addition, the medoids are robust representations of the cluster centers, which is particularly important in the common context that many elements do not belong well to any cluster. Based on our experience in clustering gene expression data, we have noticed that PAM does have problems recognizing relatively small clusters in situations where good partitions around medoids clearly exist. In this paper, we propose to partition around medoids by maximizing a criteria "Average Silhouette" defined by Kaufman and Rousseeuw (1990). We also propose a fast-to-compute approximation of "Average Silhouette". We implement these two new partitioning around medoids algorithms and illustrate their performance relative to existing partitioning methods in simulations.
Referência(s)