When use K-cross validation concept to classify images, is the dataset is divided for training and testing? or for used for training and validation?

Question

Fatemah Al Assfor 2023-2-18

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1914775-when-use-k-cross-validation-concept-to-classify-images-is-the-dataset-is-divided-for-training-and-t

编辑： Tejas 2024-11-12

When use K-cross validation concept in pretrained CNN model to classify images, the dataset is split to k folds for example folde=5. Which means the for each fold we have 80% of the dataset used for training the model.

My question is about the rest 20% of the dataset in each fold. Is this rest part of the dataset is used for testing (to evaluate the classifier) in each fold? or it is used to validate the algorithm in each fold?

Actually I asked this question because some of researchers use K- cross validation to train (k-1) parts of the dataset and the rest part to validate the algorithm in each fold. while other researchers use the K- cross validation to test rest part of the dataset in each fold.

I need to use K-cross validation in my research but I confused about using this dataset part. Shall I use it for testing in each fold or use it to validate the algorithm?

As you know there is a big difference between validate dataset and testing dataset.

Thank you very Much

Your Rapid response is highly appreciated

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Tejas 2024-11-12

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1914775-when-use-k-cross-validation-concept-to-classify-images-is-the-dataset-is-divided-for-training-and-t#answer_1543920

编辑：Tejas 2024-11-12

Hello Fatemah,

When using machine learning technique like K-Cross Validation, the best practice is to:

Start by splitting the data into training and test sets.
Divide the training data into K folds.
In each iteration, use K-1 folds to train the model and the remaining fold to validate it.
After this process, a model with the best parameters is obtained, which can then be tested using the test set.

This approach ensures the model is evaluated on unseen data, providing a better assessment of its performance.

Alternate approach is to use the entire dataset as training data:

Split the entire dataset into K folds, without a separate test set.
Again, use K-1 folds for training and the remaining fold for validation.
Here, since there is no separate test set, the validation fold in each iteration serves as the test fold, and the validation accuracy is used to evaluate the model.

Refer to below links for more information on:

K-Cross Validation: https://www.analyticsvidhya.com/blog/2022/02/k-fold-cross-validation-technique-and-its-essentials/
Implementation of K-Cross Validation in MATLAB: https://www.mathworks.com/support/search.html/answers/2149929-how-to-divide-image-datastore-into-training-set-validation-set-and-test-set-for-training-a-cnn-netw.html

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

When use K-cross validation concept to classify images, is the dataset is divided for training and testing? or for used for training and validation?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

Community Treasure Hunt

When use K-cross validation concept to classify images, is the dataset is divided for training and testing? or for used for training and validation?

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论