Setting number of clusters on clustergram object
3 次查看(过去 30 天)
显示 更早的评论
Hi, I'm using the clustergram tool to group some data. I know a priori how many groups I want to classify the data into. Is there a way to set the number of clusters that the clustergram object makes? I know it is possible to set the number of nodes with the dendrogram setting, but this is not working very well for me.
Thanks,
Avi
4 个评论
Arthur Goldsipe
2016-8-25
Sorry, but I'm still a little confused by your question. I should have been clearer with my original comment. What you mean by the "dendrogram setting"? Do you mean that you specify the 'Dendrogram' argument when you call clustergram? Are you happy with this coloring scheme as a way of identifying clusters? If I understand correctly, here's how the coloring works: Each node that is colored black is a "cluster of 1" but other identically colored nodes belong to the same cluster (for the specified threshold). And if I'm understanding everything correctly so far, then I would rephrase your question as follows: "How can I determine the appropriate value of the Dendrogram option to clustergram so that I end up with a specific number of clusters?"
回答(2 个)
Image Analyst
2016-8-25
I don't know the clustergram() function. Not in any of my toolboxes. Why not use kmeans() in the Statistics and Machine Learning Toolbox, where you can tell it how many clusters you want?
2 个评论
Image Analyst
2016-8-25
I don't know. Say "No" on the "Was this helpful" part of the help and someone will read your suggestion.
Arthur Goldsipe
2016-8-26
编辑:Arthur Goldsipe
2016-8-26
I'm not an expert with clustering, but I don't think clustergram was designed with this sort of functionality in mind. I think you have two options: (1) Use trial and error to adjust the Dendogram propety of the resulting clustergram. Or (2) implement your own function that behaves like clustergram but offers the option to determine the number of clusters.
If you want to take approach 1, you could write MATLAB code that automates the trial-and-error process. For example, maybe you can write code to inspect the figure and determine how many clusters were identified and increase or decrease the Dendrogram property value.
If you want to take approach 2, you might find the cluster function useful, since it allows you to specify the number of clusters: http://www.mathworks.com/help/stats/cluster.html
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Expression Analysis 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!