Neural Network: output representation (output layer)

Question

Karol Stanski 2017-12-25

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/374476-neural-network-output-representation-output-layer

编辑： Maria Duarte Rosa 2018-1-2

I have more of a high-level question. I’m trying to design a neural network for flower detection. The images are 440x440x3 pixels (RGB) and are taken by drone so flowers are white dots (of radius up to 10 px).

I have already annotated over 2000 images but I’m having trouble with the data representation for the output layer. For now, each flower is represented as x-y coordinates representing its centre. Each image contains a different number of flowers. This is not a standard classification/regression task and that's why I’m a bit lost.

I planned to use CNN but I don’t know how to define the output. For now, the input X is: 440 x 440 x 3 x N matrix (height x width x channels x index). This is pretty standard and there's an input layer already defined for that.

However, it gets tricky when I’m trying to design the output layer. I was thinking about representing output Y as: 440 x 440 x 1 x N matrix of binary images where 1 represents a flower (0 otherwise; according to the coordinates from annotated data). However, this approach seems hard to implement. Do you know how would I implement this in MATLAB? Has anyone heard of such an approach?

My other thought was to have flowers annotated as before (x-y coordinates) and have 2 outputs (one for x and one for y coordinates). These would work as probabilities for each of the coordinates and if they’re both acceptably high then we would predict a flower.

Are there any other ways to represent the output for this particular task? Any help would be greatly appreciated :)