Splitting data to training and testing without validation

2 次查看(过去 30 天)
Dear all,
my dataset has only 20 sample per class and i don't apply any augmentation in order to build a few shot learning model,
i ask is the results wil be correct ? if i test the data to train and test without validation in orde to i can calculate the generalization error
as i know the generalization error is the disability of the model to recognize new unseen data in training

回答(1 个)

Krishna
Krishna 2023-10-5
Hello Nagwa,
It seems like you're asking whether not using validation error for a few short learning models would lead to a larger generalization error.
The validation error is crucial for assessing the model's performance during training and tuning hyperparameters. The decision to split the data when working with limited data and using short learning models depends on the specific model you are using. In general, having a validation dataset helps prevent overfitting in the model. However, if you already have a small dataset, overfitting might not be a significant concern, so not splitting the data into a validation dataset could be acceptable.
Nevertheless, it is recommended to follow best practices, such as trying cross-validation and different scenarios, to determine what works best for your dataset and the specific few short learning algorithm you are using.
Hope it helps,
Krishna.

类别

Help CenterFile Exchange 中查找有关 Get Started with Statistics and Machine Learning Toolbox 的更多信息

产品


版本

R2022a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by