Unexpected hidden activation dimensions in convolutional neural network

Question

John Greenhall 2021-4-14

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/802306-unexpected-hidden-activation-dimensions-in-convolutional-neural-network

回答： Hrishikesh Borate 2021-4-20

I am attempting to build a multi-layer convolutional neural network, with multiple conv layers (and pooling, dropout, activation layers in between). However, I am a bit confused about the sizes of the weights and the activations from each conv layer.

For simplicity, let's assume each conv layer consists of M filters of size m x m. I define each conv layer using convolution2dLayer([m,m],M,'Padding','Same').

The first layer takes in a single image and outputs M images (4D array with last dimension M). The first layer also has weights of dimension m x m x 1 x M. This is all what I would expect.

The subsequent layers are where I am getting confused. I expect the 2nd conv layer to take in M images, and apply M filters of size m x m (weight dimension m x m x 1 x M), resulting in an output with M^2 images, as we apply all M filters to each of the M inputs. Instead, the weights have dimensions m x m x M x M, and there are only M output images (according to the "activations" function).

The later conv layers are the same as the 2nd layer, where the weights are size m x m x M x M, and there are only M output images from each layer.

Am I missing something?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Hrishikesh Borate 2021-4-20

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/802306-unexpected-hidden-activation-dimensions-in-convolutional-neural-network#answer_679682

在 MATLAB Online 中打开

Hi,

In a convolution layer, the depth of a filter is equal to the depth of the input or the number of input channels. Hence, the dimension of weights in a convolution layer can be calculated as :-

(filter height) x (filter width) x (input depth or number of input channels) x (number of filters).

For example, if input to a network is an image with single channel and each convolution layer is defined as :-

convolution2dLayer([m,m], M, 'Padding', 'same');

Under the assumption that the network contains only convolution layers, the weights in the first convolution layer will have dimension = m x m x 1 x M (as the input depth = 1) and the output of this layer will have dimension = (input image height) x (input image width) x (number of filters = M). These output activations will be the input to second convolution layer, hence the weights of the second convolution layer will have the following dimension :-

(filter height = m) x (filter width = m) x (input depth = M) x (number of filters = M)

Similarly, the dimension of weights in subsequent convolution layers will be m x m x M x M.

For more information, refer to convolution2dLayer.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

Unexpected hidden activation dimensions in convolutional neural network

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

Unexpected hidden activation dimensions in convolutional neural network

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论