Holdout validation, data taken randomly? 3 questions

3 次查看(过去 30 天)
In classification learner, I got this accuracy of 97% using gaussian SVM technique. I used holdout validation (125 set of data) with 25% data as test set.
Q1: These 25% data taken randomly? Q2: How do I know which data are taken for testing? I have two classes defined. Q3: Does it mean it will take half of the 25% data from class 1 and other half from class 2?

采纳的回答

Sal
Sal 2015-12-30
When you are doing the partition, what variable are you supplying to the function? This should be your class labels. That way, you can ensure that you have a "balanced" training and testing set e.g. they will contain roughly the same percentage of data from each class as in the original data. Yes, I believe this 25% data are taken randomly.

更多回答(0 个)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by