MDCGen v2

版本 2.0.2 (49.1 KB) 作者: Felix Iglesias
Generator of synthetic n-dimensional datasets for clustering and outlier detection
57.0 次下载
更新时间 2019/6/18

MdcGen allows a high-flexibility for parameterization, implementing clusters with varied shapes and generated by diverse underlying distributions. The tool enables the creation of clusters based on multivariate distributions but also clusters where distributions directly determine cluster intra-distances (i.e., the distance of objects to cluster centroids). Additionally, MDCGen implements classic functionalities, e.g., customization of cluster-separation, overlap control, addition of outliers and noisy features, correlated variables, rotations, and dataset quality evaluations, among others.

In order to allow a broad generation variety and flexibility, some configurations might create meaningless or useless datasets. Therefore, some experience dealing with the parameters is advisable (parameters are widely explained in the documentation). To validate the dataset, Silhouette evaluations provide performance indices to assess if the generated data follows a clear cluster-like structure.

Denis Ojdanic revised and improved MDCGen v1, developing the current MDCGen v2.

引用格式

Felix Iglesias (2024). MDCGen v2 (https://github.com/CN-TU/mdcgen-matlab), GitHub. 检索来源 .

F.Iglesias, T.Zseby, D.Ferreira and A.Zimek. MDCGen: Multidimensional Dataset Generator for Clustering. Journal of Classification (2019). https://doi.org/10.1007/s00357-019-9312-3

MATLAB 版本兼容性
创建方式 R2019a
兼容任何版本
平台兼容性
Windows macOS Linux
类别
Help CenterMATLAB Answers 中查找有关 Statistics and Machine Learning Toolbox 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

无法下载基于 GitHub 默认分支的版本

版本 已发布 发行说明
2.0.2

Typos corrected

2.0.1

MathWorks image added

2.0.0

要查看或报告此来自 GitHub 的附加功能中的问题,请访问其 GitHub 仓库
要查看或报告此来自 GitHub 的附加功能中的问题,请访问其 GitHub 仓库