How to fix an overtrained NN that stops early with a validation stop?

I have a dataset of 365 values, one per day. I split the data into windows of 7 days as inputs with the 8th day as the target to train a feedforwardnet. I have tested lots of different hidden-neuron counts, and all of them overtrained; for example, training stops at epoch 16 with a validation stop. I know my network overtrains, but I am confused about how to overcome it. Any suggestion would be highly appreciated.

Answers (1)

Greg Heath, 2016-4-26 (edited 2016-4-26)
You cannot over-train unless the net is over-fit.
The net is over-fit if there are more unknown parameters than there are training equations.
If you accept all default parameters except the number of hidden nodes, H, then the number of unknown parameters for a single-hidden-layer net with I-H-O node topology is just the number of unknown weights. For I-dimensional input vectors and O-dimensional output target vectors, this is
Nw = (I+1)*H+(H+1)*O
With N pairs of input/target training vectors, the default number used for training is
Ntrn = N - 2*round(0.15*N) % ~0.7*N
The corresponding number of training equations is
Ntrneq = Ntrn*O
Therefore, the condition for not over-fitting is
Ntrneq >= Nw = O + (I+O+1)*H
or equivalently,
H <= Hub = (Ntrneq-O)/(I+O+1)
where Hub is the lowest upper-bound.
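For concreteness, here is a minimal sketch of this arithmetic applied to the situation in the question (assuming 7-day input windows over 365 days, so I = 7, O = 1, and N = 358 input/target pairs; these values are inferred from the question, not stated in the original answer):
I = 7;  O = 1;                      % 7-day inputs, 1-day target (assumed)
N = 365 - 7;                        % 358 sliding-window input/target pairs
Ntrn   = N - 2*round(0.15*N)        % 250 training pairs under the default division
Ntrneq = Ntrn*O                     % 250 training equations
Hub    = (Ntrneq - O)/(I + O + 1)   % ~27.7, so H <= 27 avoids over-fitting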
If you need more than Hub hidden nodes, there are two methods that mitigate over-fitting (see the sketch after this list):
1. Validation stopping. The default data division yields
Nval = Ntst = round(0.15*N)
2. Bayesian regularization, which prevents weights from getting too large. This replaces the mse training performance function with the regularized combination
msereg = mse(error) + alpha*mse(weights)
where alpha is automatically determined by the training algorithm. The two ways to implement this are
a. net.trainFcn = 'trainbr'
b. net.performFcn = 'msereg'
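As a minimal sketch of the two options (the variable names x and t and the value of H are assumptions, not part of the original answer):
% Option 1: validation stopping with the default random division
net = fitnet(H);                 % single hidden layer, trainlm by default
net.divideFcn = 'dividerand';    % default 0.70/0.15/0.15 split
net.trainParam.max_fail = 6;     % stop after 6 consecutive validation failures
[net, tr] = train(net, x, t);
% Option 2: Bayesian regularization via trainbr
net = fitnet(H, 'trainbr');      % trainbr minimizes a regularized performance
net.divideFcn = 'dividetrain';   % trainbr is normally used without a validation set
[net, tr] = train(net, x, t);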
Now, to answer your question: I don't know whether you have over-fit the net. However, validation stopping has prevented you from improving training performance at the expense of nontraining performance.
Don't despair.
My approach is to design multiple nets in a double loop: an outer loop over the number of hidden nodes (Hmin:dH:Hmax) and an inner loop over Ntrials random-number states, which dictate the random data division and random initial weights. Typically Ntrials = 10 and numH <= 10.
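A minimal sketch of that double loop (the candidate grid, the use of fitnet, and the model-selection criterion here are assumptions; Greg's posted examples differ in the specifics):
[I, N] = size(x);  O = size(t, 1);            % x: I x N inputs, t: O x N targets (assumed)
Hmin = 1;  dH = 3;  Hmax = 27;  Ntrials = 10; % example search grid
bestperf = Inf;
for H = Hmin:dH:Hmax
    for trial = 1:Ntrials
        rng(trial)                     % fixes the data division and initial weights
        net = fitnet(H);
        [net, tr] = train(net, x, t);  % stops early on validation failures
        y = net(x);
        vperf = mse(net, t(:, tr.valInd), y(:, tr.valInd));  % validation MSE
        if vperf < bestperf
            bestperf = vperf;  bestnet = net;  bestH = H;
        end
    end
end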
I have posted tens of examples. Searching on "Hmin:dH:Hmax Ntrials" yields 70 hits.
Hope this helps.
Thank you for formally accepting my answer
Greg
