Can we use k-means clustering on a 100×34596 matrix?

5 次查看(过去 30 天)
  • I have a feature matrix with size 100×34596, with class labels.
  • This matrix is very large because I want to cluster these matrix.
  • Is it possible?
  • I don't have experience about k-means.
  • Could you help me?*

回答(1 个)

Walter Roberson
Walter Roberson 2017-11-9
Just make sure to pass data to kmeans such that the rows correspond to observations (points).
The class table will not be of any use for kmeans clustering: kmeans is a method of trying to figure out which values belong together without knowledge of the class labels.
  1 个评论
Image Analyst
Image Analyst 2017-12-5
Hopefully you have 34,596 observations and 100 features (which still seems ridiculously high) and not 100 observations of 34,596 features (which I don't see as possible in most normal/typical scenarios).
Anyway, if you have some of them labeled already, then you can use knnsearch() to label/classify any other, non-labeled data.
kmeans will come up with its own labels, so if they're all labeled already, then all you can do is pass in the feature values, without the labels as Walter said, and see how what kmeans chose differs from whatever method you have that produced the labels in advance (like human judging or whatever). You can then create a confusion matrix or ROC curve.

请先登录,再进行评论。

类别

Help CenterFile Exchange 中查找有关 Statistics and Machine Learning Toolbox 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by