What does sumd method in k-means clustering function exactly calculate?

Question

Onur Kapucu 2018-5-8

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate

评论： Onur Kapucu 2018-5-8

I am doing basic experiments with kmeans function. As a real simple example, say that I have a data set of 4 items with 1 attribute and this attribute is their value:

Data=[1;2;3;4];

If I want to split this data set into 2 clusters I should get one centroid in 1.5 and another in 3.5:

[idx,C,sumd]=kmeans(Data,2)
C =     
1.5000
3.5000

and I get it. However to my understanding sumd in this case should be:

abs(1-1.5)+abs(2-1.5) or  abs(3-3.5)+abs(4-3.5)
ans =
       1

but I am getting sumd as:

sumd =
      0.5000
      0.5000

for both clusters. Instead of getting 1's for both.

My question is what exactly does sumd calculate?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Ameer Hamza 2018-5-8

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate#answer_319322

编辑：Ameer Hamza 2018-5-8

在 MATLAB Online 中打开

If you look at the documentation of kmeans(), you will know that it uses the square of the Euclidean distance, by default. So you should calculate it like this

abs(1-1.5).^2+abs(2-1.5).^2 or  abs(3-3.5).^2+abs(4-3.5).^2
ans = 
  0.5 (both cases)

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

Onur Kapucu 2018-5-8

Thanks

请先登录，再进行评论。

Answer 2

the cyclist 2018-5-8

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate#answer_319323

It's because the default distance metric used is the squared Euclidean distance (for minimization, and reporting). See the Distance input parameter.

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

Onur Kapucu 2018-5-8

Thanks

请先登录，再进行评论。

What does sumd method in k-means clustering function exactly calculate?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

更多回答（1 个）

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

另请参阅

类别

标签

Community Treasure Hunt

What does sumd method in k-means clustering function exactly calculate?

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

更多回答（1 个）

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

另请参阅

类别

标签

Community Treasure Hunt

WeChat

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论