- Use a regularization technique.
- Make sure each set (training, validation and test) has enough samples, for example a 60%/20%/20% or 70%/15%/15% split for the training, validation and test sets respectively.
- Perform k-fold cross-validation.
- Randomly shuffle the data before doing the split; this helps make the data distribution nearly the same in each set. If your data is in a datastore you can use the 'shuffle' function, otherwise you can use the 'randperm' function.
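
The shuffle, split and k-fold suggestions above can be sketched roughly as follows. This is only a minimal illustration using base MATLAB functions: the data (X, y), the 60/20/20 ratio, the seed and k = 5 are placeholder assumptions, not values from the original question, and the model fitting step is left as a comment.

```matlab
% Hypothetical data: 100 observations, 4 features, 2 classes (placeholders).
rng(0);                          % fix the seed so the split is reproducible
X = rand(100, 4);
y = randi(2, 100, 1);

n   = size(X, 1);
idx = randperm(n);               % random permutation of the row indices

% 60% / 20% / 20% split taken from the shuffled indices
nTrain = floor(0.6 * n);
nVal   = floor(0.2 * n);

trainIdx = idx(1:nTrain);
valIdx   = idx(nTrain+1 : nTrain+nVal);
testIdx  = idx(nTrain+nVal+1 : end);

XTrain = X(trainIdx, :);  yTrain = y(trainIdx);
XVal   = X(valIdx, :);    yVal   = y(valIdx);
XTest  = X(testIdx, :);   yTest  = y(testIdx);

% k-fold assignment over the training + validation rows only (k = 5 here);
% the test rows are never used inside the loop.
k      = 5;
cvIdx  = [trainIdx, valIdx];
foldId = mod(0:numel(cvIdx)-1, k) + 1;    % fold label 1..k for each row
for f = 1:k
    foldVal   = cvIdx(foldId == f);       % held-out rows for this fold
    foldTrain = cvIdx(foldId ~= f);       % remaining rows used for fitting
    % fit the model on X(foldTrain,:) and evaluate it on X(foldVal,:) here
end
```

Keeping the test indices out of the cross-validation loop is a design choice in this sketch: the test set is then only touched once, after all model and parameter choices have been made.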
How to resolve if Validation and Testing accuracy are widely different?
Dear experts,
I wrote a script in MATLAB to run my machine learning analysis (a classification problem). I see a consistent but odd issue in my results: I always get high, reproducible training and validation accuracy, but my test accuracy is always much lower. I checked all five tips mentioned here: https://stackoverflow.com/questions/48718663/validation-and-testing-accuracy-widely-different, but I am still unable to resolve the problem.
I would really appreciate it if someone could help me figure out the solution.
Thanks,
Sahil
Answers (1)
Prince Kumar
2021-11-19
Edited: Prince Kumar
2021-11-19
Hi Sahil Bajaj,
This generally happens when your model memorizes the training data instead of learning the underlying pattern. This scenario is called 'overfitting'.
You can try the following few things: