试用软件

R-CNN Deep Learning with 3D Data

2 次查看（过去 30 天）

显示更早的评论

AnaM 2020-10-12

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/611496-r-cnn-deep-learning-with-3d-data

评论： AnaM 2021-3-2

采纳的回答： Shashank Gupta

Is it possible to "Train Object Detector Using R-CNN Deep Learning" with 3D data?

3D data: [x,y,z] and not [x,y,channel]

In this case, how do we define de "bounding boxes"?

For the 2D case is something like [x y width height]. And for a set of 2D images (i.e., 3D data)?

Any help please??

Thank you very much in advance!!

Best regards

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

采纳的回答

Shashank Gupta 2020-10-15

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/611496-r-cnn-deep-learning-with-3d-data#answer_514738

Hi Ana,

Generalising the 2d RCNN model to 3d is not so easy as it seems. the relevance that you intent to make from 2d and then trying to create a 3d architecture may not result in good performance, Although you can give a shot. So, there are few things you need to change. The input of 3d data should look something in the format [x,y,z,channel,batch_size] and the bounding boxes here will be cuboid, so the format will look somthing like [XMIN YMIN ZMIN WIDTH HEIGHT DEPTH]. You also need to change the layer to their respective 3d layers and write the custom training loop to train.

Hope this sounds good or atleast I provide you enough information to explore.

Cheers.

4 个评论
显示 2更早的评论隐藏 2更早的评论

AnaM 2021-1-27

OK, thank you! I will try that!!

AnaM 2021-3-2

Hello!!

Still in the context of faster r-cnn, I am trying to train the network (2D), but I always get empty detection results...

- I use a pre-trained backbone network on my data (which has an accuracy of around 70%);

- The images have size [512 512 1] and are uint8 (as well as the input size of the network);

- The bounding boxes are approximately between 30x30 to 60x60;

- I have 2 classes of objects;

- 250 epochs (already varied it but the result is the same) with MB size 64;

- I've tried it with a very low positive overlap range ([0.1 1]);

I used fasterRCNNLayers to create a faster R-CNN object.

(500 images+bounding boxes to train)

1) Is there a problem with the images being grayscale and not in color?

2) Do I have to have bounding boxes from a region other than the object? (as described here: https://www.mathworks.com/matlabcentral/answers/500950-bounding-box-not-drawn-some-variables-are-empty).

I apologize for asking this question in this topic!

Thank you so so much in advance!!!

请先登录，再进行评论。

更多回答（0 个）

请先登录，再回答此问题。

类别

Image Processing and Computer Vision Computer Vision Toolbox Recognition, Object Detection, and Semantic Segmentation Object Detection

在 Help Center 和 File Exchange 中查找有关 Object Detection 的更多信息

标签

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Translated by

试用软件