roiMaxPooling2dLayer
Neural network layer used to output fixed-size feature maps for rectangular ROIs
Description
An ROI max pooling layer outputs fixed size feature maps for every rectangular ROI within the input feature map. Use this layer to create a Fast or Faster R-CNN object detection network.
Given an input feature map of size [H
W
C
N], where C is the number of channels and
N is the number of observations, the output feature map size is
[height
width
C
sum
(M)], where height and
width are the output size. M is a vector of length
N and M(i) is the number of ROIs
associated with the i-th input feature map.
There are two inputs to this layer:
'in'
— The input feature map that will be cropped'roi'
— A list of ROIs to pool
Use the input names when connecting or disconnecting the ROI max pooling layer to other
layers using connectLayers
(Deep Learning Toolbox) or
disconnectLayers
(Deep Learning Toolbox)
(requires Deep Learning Toolbox™).
Creation
Description
layer = roiMaxPooling2dLayer(outputSize)
creates a max pooling
layer for ROIs and sets the OutputSize
property.
Properties
Examples
Version History
Introduced in R2018b
See Also
maxPooling2dLayer
(Deep Learning Toolbox) | dlnetwork
(Deep Learning Toolbox) | connectLayers
(Deep Learning Toolbox) | removeLayers
(Deep Learning Toolbox)
Topics
- Getting Started with R-CNN, Fast R-CNN, and Faster R-CNN
- Deep Learning in MATLAB (Deep Learning Toolbox)
- List of Deep Learning Layers (Deep Learning Toolbox)