K-means for big data using preconditioning and sparsification: a Matlab implementation. It has three main features:
(1) The code is efficient: in some cases it gives the same accuracy as Matlab's K-means while running 100x faster. It also incorporates recent research, such as using K-Means++ for the initialization (note: as of R2015, Matlab's K-means uses K-Means++ too). The code is well documented and follows the conventions of Matlab's K-means function where possible.
(2) Optionally, you can enable the precondition-and-sample feature, a novel method that allows efficient processing when datasets are extremely large and slow to work with.
(3) For datasets that are a few TB in size, you can use the read-from-disk option so that the entire matrix is never loaded into RAM at once.
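Feature (1) mentions K-Means++ initialization. As a rough illustration of that seeding step (this is not the package's own code), a minimal Matlab sketch might look like the following. The toy data `X`, the choice `k = 2`, and all variable names are assumptions made for the example.

```matlab
% Minimal k-means++ seeding sketch (illustrative only, not the package's code).
rng(0);                                  % fix the seed so the run is repeatable
X = [randn(50, 2); randn(50, 2) + 8];    % toy data: one point per row (assumed)
k = 2;                                   % number of clusters (assumed)

n = size(X, 1);
centers = zeros(k, size(X, 2));
centers(1, :) = X(randi(n), :);          % first center: chosen uniformly at random

% d2(i) = squared distance from point i to its nearest chosen center
d2 = sum(bsxfun(@minus, X, centers(1, :)).^2, 2);

for j = 2:k
    % draw the next center with probability proportional to d2
    cdf = cumsum(d2);
    idx = find(cdf >= rand * cdf(end), 1, 'first');
    centers(j, :) = X(idx, :);
    % refresh each point's distance to its nearest center
    d2 = min(d2, sum(bsxfun(@minus, X, centers(j, :)).^2, 2));
end
```

The rows of `centers` can then serve as starting points for the usual K-means iterations. Points far from every existing center are more likely to be picked, which tends to spread the initial centers across the data.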
Installation is easy: run `setup_kmeans.m` and it will install the mex files for you if necessary and set up the appropriate paths.
Cite As
Stephen Becker (2024). Sparsified K-Means (https://github.com/stephenbeckr/SparsifiedKMeans), GitHub.
Platform Compatibility
Windows macOS Linux
Acknowledgements
Inspired by: KMeans Segmentation - MEX, k-means++, Fast K-means, Fast K-means clustering, Fast kmeans Algorithm Code, K-means clustering
Version | Published | Release Notes
---|---|---
1.0.0.0 | | Fixed typos in the description; no change to code (but the GitHub version is updated regularly)