how to find the filter size, kernels, stride and padding for the next convolutional layer in a neural network

Question

Aiman Zara 2023-4-16

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1947993-how-to-find-the-filter-size-kernels-stride-and-padding-for-the-next-convolutional-layer-in-a-neura

回答： Prasanna 2024-12-4，5:11

I want to know how one can find the kernels, filter size in a neural network (say it is Alexnet). How we got filter size 11,11 and kernel 96 in first convolutional layer, then 5,5 and kernel 256 in second layer, also how to find max pooling, stride etc. Is there any formula to get these calculations or it is done with some other way? plz tell by example how to do it?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Prasanna 2024-12-4，5:11

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1947993-how-to-find-the-filter-size-kernels-stride-and-padding-for-the-next-convolutional-layer-in-a-neura#answer_1553193

Hi Aiman,

Designing a neural network architecture like AlexNet involves a mix of empirical experimentation, domain knowledge, and some established principles rather than a formula. The choices of filter size, number of kernels (filters), stride, and pooling are typically made based on the following considerations:

Large filter size captures more spatial information but are computationally expensive.
Small filter size captures finer details and reduce the number of inputs allowing for deeper networks. Eg: AlexNet starts with an 11x11 filter to capture large spatial features in the first layer, followed by smaller 5x5 filters to refine the features in subsequent layers.
Similarly, more kernels capture more features at each layer and typically increases with depth to capture more complex patterns.
The stride controls the step size of the filter as it convolves over the input. Larger strides reduce the spatial dimensions quickly, leading to faster computations but potentially losing spatial resolution.
Pooling reduces spatial dimensions and computational load, while retaining important features.

The output size of a convolutional operation however can be calculated with the formula:

O = ((W – K + 2P) / S) + 1

Where O is the output size, W is the input size, K is the kernal size, P is the padding and S is the stride. This formula helps to determine the dimensions of the output feature map, which is essential for designing and understanding the architecture of a CNN. In practice, designing architectures often involves iterating over different configurations. The choices of designing are guided by empirical testing, computational constraints and domain knowledge.

Hope this helps!

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

how to find the filter size, kernels, stride and padding for the next convolutional layer in a neural network

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

how to find the filter size, kernels, stride and padding for the next convolutional layer in a neural network

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

产品

版本

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论