The gradient of mini batches

Question

MAHSA YOUSEFI 2020-11-23

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/658543-the-gradient-of-mini-batches

评论： Mahesh Taparia 2020-12-21

采纳的回答： Mahesh Taparia

在 MATLAB Online 中打开

Hi there.

I need your confimation or rejection for this question...

In following code, if the minibatch size is h,

[grad,loss] = dlfeval(@modelGradients,dlnet,dlX_miniBatch,Y_miniBatch);

the grad is the average of gradients of loss over h samples? Does it calculate dradients automatically and at the end with:

grad = 1/h * sum_i=1:h (\nabla loss(y_i,yHat_i)) ??

Following this question, for computing the total loss and geadient (for a full batch), does we should take avarage of losses and averages of gradients (averaging with the number of batches, say 1000 batches each with h size)??

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Mahesh Taparia 2020-12-14

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/658543-the-gradient-of-mini-batches#answer_575280

Hi

The function dlfeval evaluate the custom deep learning models. The loss are calculated based on what has been defined in modelGradients function. So if you are calculating the average loss in this function, then it will return the averaged one. For example, consider this modelGradient function, it is calculating the average cross entropy loss, so it will return the average loss. The gradients are calculated with respect to the loss function defined in for the network.

2 个评论
显示无隐藏无

MAHSA YOUSEFI 2020-12-19

在 MATLAB Online 中打开

In the example you mentioned, there is a mistake.

function [gradients, loss] = modelGradients(parameters, dlX, T)
    % Forward data through the model function.
    dlY = model(parameters,dlX);
    % Compute loss.
    loss = crossentropy(dlX,T);
    % Compute gradients.
    gradients = dlgradient(loss,parameters);
end

dlY must be feed to crossentropy!

Mahesh Taparia 2020-12-21

Yeah, crossentropy loss will be calculated between dlY and T. The documentation page will be updated.

请先登录，再进行评论。

The gradient of mini batches

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

2 个评论
显示无隐藏无

更多回答（0 个）

另请参阅

类别

标签

产品

Community Treasure Hunt

The gradient of mini batches

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

2 个评论 显示 无隐藏 无

更多回答（0 个）

另请参阅

类别

标签

产品

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

2 个评论
显示无隐藏无