Video to Image Regression

Hello!
I have 32x32x256 (Height x Width x Frames) greyscale video data that I need to regress to a 32x32 image.
  1. What is the ideal format for saving my input data so it can be read into a NN? Is there an appropriate image format? (I have not been successful using .mat files in an image datastore.)
  2. Should I use a 2-D or 3-D image input layer? I intend to use a U-Net architecture.
Thank you!

Accepted Answer

Shashank Gupta 2020-7-6
Hi Michael,
Since the input to your model is video data, it is appropriate to use a 3-D image datastore. The U-Net architecture you intend to design will also be a 3-D architecture, so a 3-D image input layer (image3dInputLayer) is the appropriate choice.
When dealing with high-dimensional data, storing it in ".mat" files is generally a good choice. In your case, you can write a custom function and assign it to the ReadFcn property of the datastore to read the ".mat" files.
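As a rough illustration of the ReadFcn approach, here is a minimal sketch. The folder name, the file name, and the variable name "video" inside the ".mat" file are assumptions made for the example, not something from the original question, and the synthetic sample is written only so the snippet runs on its own. It requires Deep Learning Toolbox for image3dInputLayer.

```matlab
% Write one synthetic sample so the sketch is self-contained.
% Folder, file, and variable names below are hypothetical.
dataFolder = fullfile(tempdir, 'videoRegressionDemo');
if ~exist(dataFolder, 'dir')
    mkdir(dataFolder);
end
video = rand(32, 32, 256, 'single');   % Height x Width x Frames
save(fullfile(dataFolder, 'sample001.mat'), 'video');

% Custom read function: load the assumed variable 'video' from a MAT-file.
readVideoMat = @(filename) single(getfield(load(filename, 'video'), 'video'));

% Image datastore that accepts .mat files and reads them with the custom function.
imds = imageDatastore(dataFolder, ...
    'FileExtensions', '.mat', ...
    'ReadFcn', readVideoMat);

% Read one observation and check its size.
vol = read(imds);
disp(size(vol))

% 3-D input layer; the size is [height width depth channels].
inputLayer = image3dInputLayer([32 32 256 1], 'Name', 'input');
```

For training, the 32x32 target images would still need to be paired with these inputs, for example through a second datastore; that part is not covered in this sketch.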
I hope this helps.

Release

R2020a
