GapEvaluation
Gap criterion clustering evaluation object
Description
GapEvaluation is an object consisting of sample data (X), clustering data (OptimalY), and gap criterion values
(CriterionValues) used to
evaluate the optimal number of clusters (OptimalK). The gap criterion values
correspond to the difference ExpectedLogW –
LogW, where W is the within-cluster dispersion,
ExpectedLogW is determined by Monte Carlo sampling from a reference
distribution, and LogW is computed from the sample data. The optimal
number of clusters corresponds to the solution with the largest local or global gap value
within a tolerance range (SearchMethod). For
more information, see Gap Value.
Creation
Create a gap criterion clustering evaluation object by using the evalclusters function and specifying the criterion as
"gap".
You can then use compact to create a compact version of the gap
criterion clustering evaluation object. The function removes the contents of the properties
X, OptimalY, and
Missing.
Properties
Object Functions
Examples
More About
References
[1] Tibshirani, R., G. Walther, and T. Hastie. “Estimating the number of clusters in a data set via the gap statistic.” Journal of the Royal Statistical Society: Series B. Vol. 63, Part 2, 2001, pp. 411–423.
Version History
Introduced in R2013b

