Why do I see a drop (or jump) in my final validation accuracy when training a deep learning network?

7 次查看（过去 30 天）

显示更早的评论

MathWorks Support Team 2019-2-19

1
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/445800-why-do-i-see-a-drop-or-jump-in-my-final-validation-accuracy-when-training-a-deep-learning-network

编辑： MathWorks Support Team 2019-2-19

采纳的回答： MathWorks Support Team

Why do I see a drop (or jump) in my final validation accuracy when training a deep learning network?

请先登录，再回答此问题。

采纳的回答

MathWorks Support Team 2019-2-19

2
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/445800-why-do-i-see-a-drop-or-jump-in-my-final-validation-accuracy-when-training-a-deep-learning-network#answer_361722

If the network contains batch normalization layers, the final validation metrics are often different from the validation metrics evaluated during training. This is because the network undergoes a 'finalization' step after the last iteration to compute the batch normalization layer statistics on the entire training data, while during training the batch normalization statistics are computed from the mini-batches.

If in addition to batch normalization layers the network contains dropout layers, the interaction between these two layers can aggravate this issue, as described here: https://arxiv.org/abs/1801.05134

If one removes the batch normalization (and dropout) layers from the network, the 'final' accuracy should be the same as the last iteration accuracy.

Increasing the size of the mini-batches can also alleviate this issue, since the statistics from a larger mini-batch may be better estimates of the entire training data statistics.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

更多回答（0 个）

请先登录，再回答此问题。

类别

AI and Statistics Deep Learning Toolbox Image Data Workflows

在 Help Center 和 File Exchange 中查找有关 Image Data Workflows 的更多信息

标签

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Translated by