selectFeatures

Version 2.2.0.0 (18.5 KB) by Elliot Layden

Improved sequential feature selection for linear or quadratic discriminant analysis.

118 downloads

Updated 2018/4/25

MATLAB's sequentialfs.m provides a fast but arguably suboptimal feature selection algorithm for linear or quadratic discriminant models. This submission provides a generally slower but more thorough forward selection algorithm. It sequentially selects the predictors (features) that improve cross-validated classification accuracy, using a cross-validation method of the user's choosing. The function offers the same cross-validation options as sequentialfs.m (Holdout, KFold, Leaveout), plus an additional customizable option, 'sets' (see the help section within the function).

If two or more candidate features improve the model's classification accuracy by the same amount (a "tie"), the algorithm proceeds to the next "depth" of candidate features, separately for each of the tied features. It keeps descending until one of the tied features unambiguously yields the best accuracy in combination with the features added at greater depths. The user can specify a maximum depth to search for tie-breakers; by default, the depth is unlimited (in practice, usually no more than 3-4 levels are needed). If the specified maximum depth is reached while candidates are still tied, the algorithm greedily selects the first tied feature in order of feature entry.

If, at any point, additional features do not improve the model's classification accuracy, optimization stops. If a tie persists when optimization ends, the first tied feature in order of feature entry is selected.
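The tie-breaking procedure described above can be illustrated with a small language-neutral sketch. It is written in Python rather than MATLAB and is not the submission's code. The names `select_features`, `break_tie`, `best_additions`, and `toy_score` are hypothetical. The `score` callable stands in for the cross-validated accuracy of a discriminant model fitted on a feature subset. Two simplifications are assumed: accuracies are compared with exact equality, and ties that arise inside a look-ahead path are resolved by taking the first feature instead of branching again.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

ScoreFn = Callable[[Tuple[int, ...]], float]  # stand-in for cross-validated accuracy


def best_additions(score: ScoreFn, subset: Tuple[int, ...],
                   candidates: Sequence[int]) -> Tuple[float, List[int]]:
    """Score every one-feature extension of `subset`; return the best score
    and all candidates that reach it, in order of feature entry."""
    best, tied = float("-inf"), []
    for c in candidates:
        s = score(subset + (c,))
        if s > best:
            best, tied = s, [c]
        elif s == best:
            tied.append(c)
    return best, tied


def break_tie(score: ScoreFn, subset: Tuple[int, ...], tied: List[int],
              candidates: Sequence[int], level_score: float,
              max_depth: Optional[int]) -> int:
    """Look ahead one depth at a time until a single tied feature wins."""
    paths: Dict[int, Tuple[int, ...]] = {t: subset + (t,) for t in tied}
    current, depth = level_score, 0
    while len(paths) > 1 and (max_depth is None or depth < max_depth):
        depth += 1
        ahead = {}
        for t, path in paths.items():
            rest = [c for c in candidates if c not in path]
            if rest:
                s, winners = best_additions(score, path, rest)
                ahead[t] = (s, path + (winners[0],))  # assumption: no nested branching
        if not ahead:
            break  # no features left to look ahead with
        top = max(s for s, _ in ahead.values())
        if top <= current:
            break  # deeper features add nothing, so the tie persists
        paths = {t: p for t, (s, p) in ahead.items() if s == top}
        current = top
    return next(iter(paths))  # first remaining tied feature in order of entry


def select_features(score: ScoreFn, n_features: int,
                    max_features: Optional[int] = None,
                    max_depth: Optional[int] = None) -> Tuple[List[int], float]:
    """Greedy forward selection with depth-limited tie-breaking."""
    selected: Tuple[int, ...] = ()
    best_so_far = float("-inf")
    limit = n_features if max_features is None else min(max_features, n_features)
    while len(selected) < limit:
        candidates = [f for f in range(n_features) if f not in selected]
        level_score, tied = best_additions(score, selected, candidates)
        if level_score <= best_so_far:
            break  # no candidate improves accuracy: stop
        if len(tied) == 1:
            choice = tied[0]
        else:
            choice = break_tie(score, selected, tied, candidates, level_score, max_depth)
        selected += (choice,)
        best_so_far = level_score
    return list(selected), best_so_far


# Toy accuracies (made up for illustration): features 0 and 1 tie at the first step.
_TOY = {frozenset({0}): 0.70, frozenset({1}): 0.70, frozenset({2}): 0.60,
        frozenset({0, 1}): 0.75, frozenset({0, 2}): 0.72, frozenset({1, 2}): 0.80,
        frozenset({0, 1, 2}): 0.80}


def toy_score(subset: Tuple[int, ...]) -> float:
    return _TOY[frozenset(subset)]


print(select_features(toy_score, 3))  # ([1, 2], 0.8)
```

In this toy table, features 0 and 1 tie at 0.70, but feature 1 reaches 0.80 one level deeper while feature 0 reaches only 0.75, so the look-ahead picks feature 1. With `max_depth=0` the sketch falls back to the first tied feature, which leads to a larger final subset with the same accuracy.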
--
Bootstrapping is now available to check each selected feature for significance and to generate confidence intervals for feature coefficients. Currently, this option can only be used for two-class classification problems. A specified number of bootstrapped samples (resamples with replacement) are generated, and a discriminant model is fitted to each using the selected features. The 95% confidence interval for each feature is taken as the 2.5th and 97.5th percentiles of its sorted bootstrap coefficient distribution. P-values are calculated for the two-tailed test that each feature's bootstrap distribution is significantly different from 0. Note: features are z-scored within each bootstrap sample so that coefficients are more comparable across features.
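The bootstrap summary can be sketched as follows, again in Python and not as the submission's implementation. Several parts are assumptions: the `mean_difference` function is a simple stand-in for fitting a discriminant model (it returns the per-feature difference of class means, not discriminant coefficients), the percentile is taken by nearest rank, and the two-tailed p-value is computed as twice the smaller tail fraction at zero, a common convention that the description does not confirm. All function names are hypothetical.

```python
import random
import statistics
from typing import Callable, List, Sequence, Tuple

Rows = List[List[float]]
FitFn = Callable[[Rows, List[int]], List[float]]


def zscore_columns(rows: Sequence[Sequence[float]]) -> Rows:
    """Z-score each feature (column) within the given sample."""
    scaled_cols = []
    for col in zip(*rows):
        mu = statistics.fmean(col)
        sd = statistics.pstdev(col) or 1.0  # guard against constant columns
        scaled_cols.append([(v - mu) / sd for v in col])
    return [list(r) for r in zip(*scaled_cols)]


def mean_difference(rows: Rows, labels: List[int]) -> List[float]:
    """Stand-in 'model' (assumption): class-1 mean minus class-0 mean per feature."""
    ones = [r for r, y in zip(rows, labels) if y == 1]
    zeros = [r for r, y in zip(rows, labels) if y == 0]
    return [statistics.fmean(a) - statistics.fmean(b)
            for a, b in zip(zip(*ones), zip(*zeros))]


def bootstrap_coefficients(rows: Rows, labels: List[int], fit: FitFn,
                           n_boot: int, seed: int = 0) -> List[List[float]]:
    """Return one bootstrap coefficient distribution per feature."""
    rng = random.Random(seed)
    n = len(rows)
    draws = []
    while len(draws) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]  # resample with replacement
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # a two-class model needs both classes in the sample
        xs = zscore_columns([rows[i] for i in idx])
        draws.append(fit(xs, ys))
    return [list(col) for col in zip(*draws)]


def summarize(dist: Sequence[float]) -> Tuple[float, float, float]:
    """95% percentile interval and two-tailed p-value against 0."""
    s = sorted(dist)
    n = len(s)
    lo = s[round(0.025 * (n - 1))]  # nearest-rank 2.5th percentile
    hi = s[round(0.975 * (n - 1))]  # nearest-rank 97.5th percentile
    tail = min(sum(v <= 0 for v in s), sum(v >= 0 for v in s))
    return lo, hi, min(1.0, 2 * tail / n)


# Made-up data: feature 0 separates the classes, feature 1 does not.
rows = [[1.0, 5.0], [2.0, 3.0], [3.0, 8.0], [4.0, 1.0],
        [10.0, 4.0], [11.0, 7.0], [12.0, 2.0], [13.0, 6.0]]
labels = [0, 0, 0, 0, 1, 1, 1, 1]

for j, dist in enumerate(bootstrap_coefficients(rows, labels, mean_difference, 200)):
    print(j, summarize(dist))
```

A p-value of exactly 0 from this sketch only means that no resample crossed zero; with a finite number of resamples it is better read as "smaller than about 1 / n_boot".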

Cite As

Elliot Layden (2024). selectFeatures (https://www.mathworks.com/matlabcentral/fileexchange/65716-selectfeatures), MATLAB Central File Exchange. Retrieved November 22, 2024.

MATLAB Release Compatibility

Created with R2017a

Compatible with any release

Platform Compatibility

Windows macOS Linux

Categories

AI and Statistics > Statistics and Machine Learning Toolbox > Dimensionality Reduction and Feature Extraction

selectFeatures(X, classes, discrimType, maxFeatures, verbose, maxDepth, varargin)

Version	Published	Release Notes
2.2.0.0	2018/4/25	Changed title
2.1.0.0	2018/4/25	Fixed bootstrapping waitbar issue
2.0.0.0	2018/4/25	Added a bootstrapping option to calculate 95% confidence intervals and p-values of selected features. The median or mean coefficient from each feature's bootstrap distribution could be taken as a more robust estimate of effect size.
1.3.0.0	2018/1/18	Updated help info
1.2.0.0	2018/1/18	Fixed history output
1.1.0.0	2018/1/17	Corrected verbose output
1.0.0.0	2018/1/14	
