Doubt about early stopping

Question

jvbx 2024-7-31

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2141631-doubt-about-early-stopping

编辑： jvbx 2024-7-31

Hi,

I'm trying to trainning an neural network, but i'm facing a doubt.

I know that during the trainning of the neural network the error of trainning and data sets decreases and then, after some epochs the error validation set starts to grow and this moment indicates that is better to stop the trainning like this classic picture:

But, in my case, my neural network is showwing this behavior:

where the red line is the error of the validation set and the blue line is the error of the trainnig set during the epochs.

Why this is happening ?

Just to be clear, in my algorithm to each epoch i am:

1) Shuffling the lines inside the validation and trainning set ( The lines of one set don't interfere on the lines of other, they are shuffling the lines inside themselves)

2) processing the validation set through the forward step calculating the error of this set and saving this error

3)processing the trainning set through the forward and backward steps (to update the weights) ,calculating the error of this set and saving this error.

4) if the error is smaller than a value that I can choice, the algorithm stops (In the case of the image above, it don't happens, but I already find some configurations of hyperparameter that reach this value that i can choice)

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Satwik 2024-7-31

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2141631-doubt-about-early-stopping#answer_1492766

Hi,

Based on your description, I understand that you are experiencing unusual behaviour in the validation error and training error. In a typical neural network training process, the training error consistently decreases, while the validation error initially decreases and then starts to increase, indicating overfitting. However, in your case, both the training error (blue line) and the validation error (red line) initially decrease and then both start to increase after some epochs. This suggests a few points to consider and correct in your training procedure:

Shuffling the Validation Set: You mentioned shuffling the validation set before each epoch. This can lead to inconsistent evaluation metrics. The validation set should remain fixed throughout the training process to provide a consistent measure of the model's performance.
Using Validation Set for Training: You mentioned processing the validation set through forward and backward steps to update the weights, which is not standard practice. The validation set should only be used to evaluate the model’s performance, not to train it. Use the validation set solely for evaluation. Do not perform backpropagation or weight updates using the validation set.
Learning Rate and Hyperparameters: An inappropriate learning rate or other hyperparameters might cause unstable training. Experiment with different learning rates, batch sizes, and other hyperparameters.

By addressing these issues, you should be able to achieve more stable and predictable training behaviour, with the validation error and training error behaving as expected.

Hope this helps!

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

jvbx 2024-7-31

Sorry, I wrote wrong my earlier procedure. The validation set pass just through the forward step. The only set that pass by the backward step is the trainning set. Here is the correction:

1) Shuffling the lines inside the validation and trainning set ( The lines of one set don't interfere on the lines of other, they are shuffling the lines inside themselves)

2) processing the validation set through the forward step calculating the error of this set and saving this error

3)processing the trainning set (which I call estimation set too) through the forward and backward steps (to update the weights) ,calculating the error of this set and saving this error.

4) if the error is smaller than a value that I can choice, the algorithm stops (In the case of the image above, it don't happens, but I already find some configurations of hyperparameter that reach this value that i can choice)

I will check if your suggestion number one will produce some effect.

请先登录，再进行评论。

Doubt about early stopping

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

另请参阅

类别

标签

Community Treasure Hunt

Doubt about early stopping

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论