Holdout validation, data taken randomly? 3 questions

In Classification Learner, I got an accuracy of 97% using a Gaussian SVM. I used holdout validation on a data set of 125 observations, with 25% of the data held out as the test set.
Q1: Is this 25% of the data chosen randomly?
Q2: How do I know which observations were taken for testing?
Q3: I have two classes defined. Does that mean half of the 25% comes from class 1 and the other half from class 2?

Accepted Answer

Sal 2015-12-30
When you create the partition, what variable are you supplying to the function? It should be your class labels. That way you can ensure that the training and test sets are "balanced", i.e. each contains roughly the same percentage of data from each class as the original data, rather than an even half-and-half split. And yes, I believe the 25% is taken randomly.
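To see the split for yourself at the command line, here is a minimal sketch using `cvpartition` from the Statistics and Machine Learning Toolbox. The labels are made up for illustration (125 observations, 80 in class 1 and 45 in class 2), and this is an assumption about how to reproduce a holdout split by hand, not a description of what the Classification Learner app does internally.

```matlab
% Hypothetical labels: 125 observations, 80 in class 1 and 45 in class 2.
y = [ones(80, 1); 2 * ones(45, 1)];

rng(1);                                  % fix the seed so the random split is repeatable

% Supplying the class labels makes the 25% holdout split stratified by class.
c = cvpartition(y, 'HoldOut', 0.25);

testIdx  = test(c);                      % logical vector, true for held-out rows
trainIdx = training(c);                  % logical vector, true for training rows

testRows = find(testIdx);                % row numbers of the test observations (Q2)

% Count how many test rows come from each class (Q3).
nTestClass1 = sum(y(testIdx) == 1);
nTestClass2 = sum(y(testIdx) == 2);
fprintf('Test set: %d rows (%d from class 1, %d from class 2)\n', ...
    numel(testRows), nTestClass1, nTestClass2);
```

Here `testRows` lists exactly which observations were held out, and the per-class counts should follow the original class proportions rather than being split evenly. Changing or removing the `rng` call gives a different random split.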
