Please see responses inline.
1) Is it possible to set a minimum count per leaf/cluster? (I don't want to create a cluster with just 1 person, since it doesn't really tell me anything.)
Setting a min count for a leaf is not supported. However, to avoid very unbalanced clusters, you can change the split criterion setting to size. This would produce balanced clusters.
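To illustrate why a size-based split criterion tends to give more balanced clusters, here is a minimal Python sketch. It assumes a top-down scheme where one existing cluster is chosen for splitting at each step; the function names and the toy data are hypothetical and are not the product's actual implementation or settings API.

```python
def variance(points):
    """Mean squared distance of the points from their own mean (dispersion)."""
    n = len(points)
    dims = len(points[0])
    means = [sum(p[d] for p in points) / n for d in range(dims)]
    return sum((p[d] - means[d]) ** 2 for p in points for d in range(dims)) / n


def choose_cluster_to_split(clusters, criterion="size"):
    """Return the index of the cluster that would be split next.

    criterion="size"     -> split the cluster with the most members
    criterion="variance" -> split the cluster with the largest dispersion
    """
    if criterion == "size":
        score = len
    elif criterion == "variance":
        score = variance
    else:
        raise ValueError("criterion must be 'size' or 'variance'")
    return max(range(len(clusters)), key=lambda i: score(clusters[i]))


# Toy data (made up for illustration): one large tight cluster, one small spread-out one.
clusters = [
    [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (0.05, 0.05)],
    [(10.0, 10.0), (20.0, 20.0)],
]

by_size = choose_cluster_to_split(clusters, "size")
by_variance = choose_cluster_to_split(clusters, "variance")
```

With a variance criterion the two-member cluster would be split again, leaving single-member leaves; with a size criterion the five-member cluster is split instead.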
2) Is it possible to select K-means by the median rather than the centroid?
K-medoid is currently not supported.
3) Is it possible to view the probability formula for the k-means and O-Cluster?
For k-Means, the probability is based on using Gaussians with the centroids as means and standard deviations based on the dispersion.
For O-Cluster, the probability is based on an internally computed Bayesian model.
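For the k-Means case, the general idea of Gaussians centered on the centroids can be sketched as follows. This is only an illustration under stated assumptions, not the exact internal formula: it assumes independent attributes (one standard deviation per attribute per cluster), equal prior weight for every cluster, and normalization of the densities so they sum to 1. All names and numbers are hypothetical.

```python
import math


def gaussian_log_density(point, centroid, std_devs):
    """Log density of a diagonal Gaussian with mean `centroid` at `point`."""
    total = 0.0
    for x, mu, sigma in zip(point, centroid, std_devs):
        total += -0.5 * math.log(2.0 * math.pi * sigma ** 2)
        total += -((x - mu) ** 2) / (2.0 * sigma ** 2)
    return total


def cluster_probabilities(point, centroids, std_devs):
    """Normalize per-cluster Gaussian densities into probabilities.

    Assumes equal priors for all clusters (an assumption of this sketch).
    """
    log_dens = [
        gaussian_log_density(point, c, s) for c, s in zip(centroids, std_devs)
    ]
    # Subtract the maximum before exponentiating for numerical stability.
    m = max(log_dens)
    weights = [math.exp(ld - m) for ld in log_dens]
    total = sum(weights)
    return [w / total for w in weights]


# Made-up centroids and per-attribute standard deviations for two clusters.
centroids = [(0.0, 0.0), (4.0, 4.0)]
std_devs = [(1.0, 1.0), (1.0, 1.0)]

probs_near_first = cluster_probabilities((1.0, 1.0), centroids, std_devs)
probs_midpoint = cluster_probabilities((2.0, 2.0), centroids, std_devs)
```

A point close to the first centroid gets most of its probability from the first cluster, while a point exactly halfway between two clusters with equal dispersion is split evenly.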