What's the difference between the two arguments in the kmeans function: MaxIter and Replicates?

Question

ABC EFD 2017-10-30

Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/364126-what-s-the-difference-between-the-two-arguments-in-the-kmeans-function-maxiter-and-replicates

Edited: Deepa Gupta 2020-3-29

Do they both mean how many times a new centroid is to be found?


Answer 1

Deepa Gupta 2020-3-29

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/364126-what-s-the-difference-between-the-two-arguments-in-the-kmeans-function-maxiter-and-replicates#answer_422734

Edited: Deepa Gupta 2020-3-29

I had much the same question. My understanding is that Replicates = r repeats the whole clustering r times, each time starting from a new set of initial centroid positions. MaxIter, by contrast, caps the number of iterations within a single run: each iteration continues from the centroids produced by the previous one rather than restarting from a new starting point, and the run stops early if it converges first.

Most importantly, when Replicates is specified, I believe kmeans returns the best of the r runs, that is, the solution with the lowest total sumd (the within-cluster sums of point-to-centroid distances). MaxIter involves no such selection between solutions; going by the kmeans description in the MATLAB documentation, it only limits how long one run may keep iterating.
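To make the distinction concrete, here is a minimal pure-Python sketch of the two loops. It is not MATLAB's kmeans source: the function names lloyd_run and kmeans_replicates, the seed argument, and the sample data are all made up for illustration, and the sketch simply assumes the behaviour described in the MATLAB documentation (max_iter bounds the iterations inside one run, replicates repeats the whole run from new random starting centroids, and the run with the lowest total squared distance is kept).

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def lloyd_run(points, k, max_iter, rng):
    """One replicate: pick k starting centroids, then iterate at most max_iter times."""
    centroids = rng.sample(points, k)  # random start used by this run only
    iterations = 0
    for iterations in range(1, max_iter + 1):
        # Assignment step: each point goes to its nearest current centroid.
        labels = [min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if not members:
                new_centroids.append(centroids[j])  # keep the centroid of an empty cluster
                continue
            new_centroids.append(tuple(sum(c) / len(members) for c in zip(*members)))
        if new_centroids == centroids:
            break  # converged before reaching max_iter
        centroids = new_centroids
    # Total squared distance from every point to its nearest final centroid.
    total = sum(min(sq_dist(p, c) for c in centroids) for p in points)
    return centroids, total, iterations


def kmeans_replicates(points, k, max_iter=100, replicates=1, seed=0):
    """Repeat lloyd_run `replicates` times and keep the run with the lowest total."""
    rng = random.Random(seed)
    best = None
    for _ in range(replicates):
        result = lloyd_run(points, k, max_iter, rng)
        if best is None or result[1] < best[1]:
            best = result
    return best


# Hypothetical data: two well-separated groups of four points each.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
centroids, total, iterations = kmeans_replicates(points, k=2, max_iter=100, replicates=5)
print(centroids, total, iterations)
```

Even on this tiny data set a single run can settle on a poor solution (for example, when both starting centroids fall in the same group), which is why keeping the best of several replicates helps. In MATLAB itself the corresponding call would look like kmeans(X, k, 'MaxIter', 200, 'Replicates', 5), with the values chosen to suit the data.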

Extra:

I think that's the best way to understand this, though I am open to more discussion. Given the above, it may make sense to set a high number of replicates, at the cost of extra computation and run time. The best way to be sure is to type open kmeans at the MATLAB command line and read the code. I started doing that but ran out of time because of a deadline, and I may revisit it later. For now I run kmeans several times on my data, look for the best solution, and then reuse those Replicates and MaxIter values for the rest of my data from the same domain. Keep in mind that cluster analysis can produce multiple valid solutions; even so, the centroids (prototypes) it finds help a person get an overview of a large sample. With that concept and trade-off in mind, one can use kmeans.
