Insuring reproducibility in training YOLOv2 in the Deep Learning Toolbox

Question

Michael Younger 2020-4-30

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/521842-insuring-reproducibility-in-training-yolov2-in-the-deep-learning-toolbox

回答： Ryan Comeau 2020-5-10

I'm using the YOLOv2 network in the Deep Learning Toolbox. We are seeing significant variations in testing results running the same training code more than once.

Is it possible to insure reproducibility in training? If so, what options/flags would need to be set to insure reproducible training?

One option I see already is to set the "Shuffle" option to "none" (its default is "once").

But are there other flags/random seeds that I need to set to insure repeatability?

Thanks!

2 个评论
显示无隐藏无

Mohammad Sami 2020-4-30

编辑：Mohammad Sami 2020-4-30

You can try using rng with a seed as the first step.

I could not find a direct documentation for the training deep learning models, but i am assuming that this applies to training deep learning models as well.

https://www.mathworks.com/help/matlab/math/generate-random-numbers-that-are-repeatable.html

Michael Younger 2020-4-30

Interesting; thank you!

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Ryan Comeau 2020-5-10

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/521842-insuring-reproducibility-in-training-yolov2-in-the-deep-learning-toolbox#answer_431569

Hello,

What you are experiencing is very normal for deep learning. The process of network initialization involves assigning initial weights to each of your layers and activation functions. These initial weights can be fixed by fixing the random seed for initialization as mentioned in the comments above. This may not resolve your problem however. The algorithm which minimizes your loss function is called stochastic gradient descent. A stochastic gradient descent is by definition not deterministic, which means there will always be some variance in your results. This should be seen as a good thing however, we don't want to get stuck in a local minima, which is likely to occurr if our algorithm was deterministic.

If you want to see the performance of deep learning being as deterministic as possible, set the mini batch size to 1. This will remove the ability to not get stuck in local minima and you will see a drop in performance.

The shuffle option you are describing is to shuffle the order of data so that your mini-batches do not always have the same data in them.

Lastly, if you do want to have "consistent" training results, simply redefine what consistent means in this case. Run your training 10 times and the results which occurrs the most frequently will be your replicable results.

Hope this helps,

RC

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

Insuring reproducibility in training YOLOv2 in the Deep Learning Toolbox

2 个评论
显示无隐藏无

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

Insuring reproducibility in training YOLOv2 in the Deep Learning Toolbox

2 个评论 显示 无隐藏 无

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

2 个评论
显示无隐藏无

0 个评论
显示 -2更早的评论隐藏 -2更早的评论