How to estimate K for K-means clustering

Question

wisekily 2016-5-15

Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/284309-how-to-estimate-k-for-k-means-clustring

Commented: Bashar Saad 2019-7-12

I'm working on unsupervised classification (clustering). I want to estimate K, the number of clusters, before starting the k-means algorithm.

Answer 1

Walter Roberson 2016-5-16

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/284309-how-to-estimate-k-for-k-means-clustring#answer_222279

You will probably not find any code already implemented for this purpose.

The theoretical answer for the "best" number of clusters to use is "one cluster for every unique point", as that will always have the best possible fit.
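As a rough illustration of the point above, here is a minimal sketch on synthetic data. The data, the seed, and the list of k values are assumptions chosen for illustration and are not from the original thread. It requires the Statistics and Machine Learning Toolbox. The total within-cluster sum of squared distances keeps shrinking as k grows, and with one cluster per unique point it would reach zero.

```matlab
rng(1);                                  % fixed seed so the run is repeatable
X = [randn(50,2); randn(50,2) + 5];      % assumed data: two well-separated blobs

kValues = [1 2 5 20];                    % assumed list of cluster counts to try
wcss = zeros(size(kValues));             % total within-cluster sum of squares per k

for i = 1:numel(kValues)
    % sumd holds the within-cluster sums of point-to-centroid distances
    % (squared Euclidean by default), one entry per cluster
    [~, ~, sumd] = kmeans(X, kValues(i), 'Replicates', 5);
    wcss(i) = sum(sumd);
end

disp(table(kValues(:), wcss(:), 'VariableNames', {'k', 'wcss'}));
```

Because the fit measure alone always prefers more clusters, it cannot be used by itself to pick k.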

If you do not wish to use one cluster for every unique point, you need to have some kind of penalty term that favors fewer clusters. I read through the theory paper on that a few years ago, and it was clear to me that they were setting the weights arbitrarily (but usefully for the kinds of clustering they were doing), and that there was no way to calculate what the weights should be without some knowledge of the range of number of clusters that would be appropriate for the physical system being examined. The theoretical algorithms were not suitable for "unsupervised learning", only for "supervised learning". The work we were doing at the time required unsupervised learning, so there was no way for us to determine what the proper number of clusters should be.
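The role of the penalty term can be sketched as follows. This is only an illustration under stated assumptions: the synthetic data, the linear penalty lambda*k, and the three lambda values are all hypothetical and are not taken from the paper mentioned above. The sketch shows that the selected k depends on the chosen weight.

```matlab
rng(1);                                  % fixed seed so the run is repeatable
% Assumed data: three blobs centred near (0,0), (5,5) and (5,-5)
X = [randn(50,2); randn(50,2) + 5; [randn(50,1) + 5, randn(50,1) - 5]];

kValues = 1:8;                           % assumed range of cluster counts
wcss = zeros(size(kValues));
for i = 1:numel(kValues)
    [~, ~, sumd] = kmeans(X, kValues(i), 'Replicates', 5);
    wcss(i) = sum(sumd);                 % fit term: smaller is better
end

lambdas = [1 50 500];                    % hypothetical penalty weights
chosenK = zeros(size(lambdas));
for j = 1:numel(lambdas)
    % Penalized objective: fit term plus a cost for every extra cluster
    [~, best] = min(wcss + lambdas(j) * kValues);
    chosenK(j) = kValues(best);
    fprintf('lambda = %g -> chosen k = %d\n', lambdas(j), chosenK(j));
end
```

A larger weight can only keep or lower the selected k, so the answer is driven by how the weight is set, which is the difficulty described above.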

Answer 2

the cyclist 2016-5-15

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/284309-how-to-estimate-k-for-k-means-clustring#answer_222197

This is not really a MATLAB question, but rather a general data science question.

Googling "how to choose k in k means" found this Wikipedia page on the topic (and many others) that might help you.

4 comments (2 earlier comments hidden)

Image Analyst 2016-5-15

There are MATLAB functions for estimating the best k. I don't remember what they were - I'd have to look them up in the Machine Learning course notes.

wisekily 2016-5-15

I'm waiting for your answer

Answer 3

Image Analyst 2016-5-15

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/284309-how-to-estimate-k-for-k-means-clustring#answer_222237

The web page on kmeans explains how you can use silhouette() to determine the best number of clusters, k:

http://www.mathworks.com/help/stats/k-means-clustering.html
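A minimal sketch of the silhouette approach, assuming synthetic data and a candidate range of k that are not from the original thread: cluster the data for each k, compute the mean silhouette value, and keep the k with the highest mean. It requires the Statistics and Machine Learning Toolbox.

```matlab
rng(1);                                  % fixed seed so the run is repeatable
% Assumed data: three blobs centred near (0,0), (5,5) and (5,-5)
X = [randn(50,2); randn(50,2) + 5; [randn(50,1) + 5, randn(50,1) - 5]];

kValues = 2:6;                           % assumed candidates (silhouette needs k >= 2)
meanSil = zeros(size(kValues));

for i = 1:numel(kValues)
    idx = kmeans(X, kValues(i), 'Replicates', 5);
    s = silhouette(X, idx);              % one value per point, in [-1, 1]
    meanSil(i) = mean(s);                % average cluster quality for this k
end

[~, best] = max(meanSil);                % higher mean silhouette is better
bestK = kValues(best);
fprintf('Best k by mean silhouette: %d\n', bestK);
```

Calling silhouette(X, idx) without an output argument draws the silhouette plot instead, which is useful for inspecting individual clusters.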

3 comments (1 earlier comment hidden)

Walter Roberson 2016-5-16

Did you read through the link that Image Analyst posted?

the cyclist 2016-5-16

Which is also the same link that I pointed you to earlier. So, uh, now you have 3 of the top 10 contributors to this forum telling you consistently the same thing.

Answer 4

kira 2019-5-2

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/284309-how-to-estimate-k-for-k-means-clustring#answer_373356

Old question, but I just found a way myself by looking at the MATLAB documentation:

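% X: data matrix, one row per observation. Here it is net.IW{1}, presumably the input weights of a network object net.
X = net.IW{1};
klist = 2:n;                        % numbers of clusters to try; set n to the largest k you want to test
myfunc = @(X,K) kmeans(X, K);       % clustering function that evalclusters calls for each k
eva = evalclusters(X, myfunc, 'CalinskiHarabasz', 'KList', klist)   % scores each k with the Calinski-Harabasz index
classes = kmeans(X, eva.OptimalK);  % final clustering using the best-scoring k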
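For readers who do not have a net object, the same idea can be sketched on synthetic data. The data, the seed, and the range 2:6 are assumptions for illustration only. This variant passes the built-in 'kmeans' option to evalclusters instead of a function handle.

```matlab
rng(1);                                  % fixed seed so the run is repeatable
% Assumed data: three blobs centred near (0,0), (5,5) and (5,-5)
X = [randn(50,2); randn(50,2) + 5; [randn(50,1) + 5, randn(50,1) - 5]];

% Score each candidate k with the Calinski-Harabasz index
eva = evalclusters(X, 'kmeans', 'CalinskiHarabasz', 'KList', 2:6);

bestK = eva.OptimalK;                    % k with the highest criterion value
classes = kmeans(X, bestK, 'Replicates', 5);   % cluster index for every row of X

fprintf('Suggested number of clusters: %d\n', bestK);
```

The criterion values for every tested k are available in eva.CriterionValues, which helps to check how clear the choice is.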

1 comment

Bashar Saad 2019-7-12

Could you help me, please? The code is not clear.
