Hi, I noticed that after k-means clustering, a cluster with only one element is merged into the most similar cluster. My guess at the reason is that a cluster with only one element cannot perform 1-vs-all classification. But is the merge necessary? I think such a cluster could act as a label itself, with its probability given by its parent node's classifier. Did I miss something?
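
To make the question concrete, here is a rough sketch of what I have in mind. It is not the project's actual code: `Node`, `label_probs`, and the hard-coded scores are hypothetical, and I am assuming each internal node's classifier yields one probability per child. A single-element cluster is stored directly as a label under its parent, so it needs no classifier of its own.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Union


@dataclass
class Node:
    # A child is either a sub-cluster (Node) or a bare label (str).
    # A bare label stands in for a single-element cluster.
    children: List[Union["Node", str]] = field(default_factory=list)
    # child_probs[i] is this node's classifier score for children[i]
    # (hard-coded here; a real model would compute it from the input).
    child_probs: List[float] = field(default_factory=list)


def label_probs(node: Node, prior: float = 1.0) -> Dict[str, float]:
    """Multiply classifier scores along each root-to-label path."""
    out: Dict[str, float] = {}
    for child, p in zip(node.children, node.child_probs):
        if isinstance(child, str):
            # Single-element cluster: the parent's score is the final factor.
            out[child] = prior * p
        else:
            out.update(label_probs(child, prior * p))
    return out


# "c" is a single-element cluster attached directly to the root.
inner = Node(children=["a", "b"], child_probs=[0.75, 0.25])
root = Node(children=[inner, "c"], child_probs=[0.5, 0.5])
probs = label_probs(root)
```

In this toy tree, label `c` gets its probability from the root's classifier alone, while `a` and `b` also pick up a factor from the inner node.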