How to represent gray scale images as affine subspaces?

4 次查看（过去 30 天）

显示更早的评论

M 2023-10-23

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces

编辑： M 2023-12-11

How to represent gray scale images as affine subspaces?

4 个评论
显示 2更早的评论隐藏 2更早的评论

M 2023-10-24

Hi @Walter Roberson do you have any idea please?

Walter Roberson 2023-10-27

This is not a topic I know anything about.

请先登录，再进行评论。

请先登录，再回答此问题。

回答（4 个）

Image Analyst 2023-10-23

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces#answer_1339101

I don't know what you mean. What's the context? What do you mean by "model"? What do you mean by "affine subspaces"? Do you just want to warp or spatially transform the image?

imwarp

Spatial transformations Defining and applying custom transforms Steve on Image Processing

If you have any more questions, then attach your data and code to read it in with the paperclip icon after you read this:

TUTORIAL: How to ask a question (on Answers) and get a fast answer

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

M 2023-10-23

编辑：M 2023-10-23

@Image Analyst @Matt J In the attached paper they represented the images in Affine subspaces, I am asking generally if there is a popular method/code of representing the image in affine space.

My data is huge attached is a sample.

请先登录，再进行评论。

Matt J 2023-10-23

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces#answer_1339211

编辑：Matt J 2023-10-23

在 MATLAB Online 中打开

One way, I suppose would be to train an affine neural network with the Deep Learning Toolbox, e.g.,

layers=[imageinputLayer(120,160,1);
        convolution2dLayer([120,160],N) )
        regressionLayer];
XTrain=images;
YTrain=zeros(1,1,N,size(XTrain,4));
        
net=trainNetwork(XTrain, YTrain,  layers,.... )

but you would need a truly huge number of images and good regularization for the training to be well-posed. You should probably look at augmentedImageDatastore.

34 个评论
显示 32更早的评论隐藏 32更早的评论

M 2023-10-24

编辑：M 2023-10-24

在 MATLAB Online 中打开

@Matt J The weights of the CNN will provide equations for the best fitting N-dimensional affine subspace to your image data set.

How did you reach to this? what is the mathematics behind it?

net=trainNetwork(XTrain, YTrain, layers,.... )

What is the options that fit this NN?

@Matt J I tried to do the following, but I think what I have done is wrong

But I keep getting the following error: Number of observations in X and Y disagree.

Also, I still didnt get what is the relation between this and affine and grassmann space!!

layers=[    imageInputLayer([120 160 1], 'Normalization', 'rescale-zero-one')
convolution2dLayer([120,160],7200)
regressionLayer];
options = trainingOptions('adam', ...
    'MiniBatchSize',200, ...
    'MaxEpochs',10, ...
    'InitialLearnRate',1e-3, ...
    'LearnRateSchedule','piecewise', ...
    'LearnRateDropFactor',0.1, ...
    'LearnRateDropPeriod',20, ...
    'ValidationData', {images_Test, labels_Test}, ...
    'ValidationFrequency',200, ...
    'Shuffle','every-epoch', ...
    'Plots','training-progress');
Net = trainNetwork(XTrain, YTrain,  layers, options);

M 2023-10-24

编辑：Matt J 2023-10-24

@Matt J, Why did you replace the convolution2dLayer([120,160],N) by fullyConnectedLayer(N)

They do the same thing. You can use analyzeNetwork to verify that the activations are 1x1xN in both cases.

N is the number of image right?

N is the number of rows in the matrix equations A*x=b. So, if you are considering a Grassmanian manifold Gr(n,k) of k-dimensional affine subspaces of

then N=n-k.

Keep in mind that the total number of layer weights will be 120*160*N+N, so you mustn't be too liberal with your choice of N.

Do I have to train 3 neural nets, one for each class and pass the test image through these NN?

Yes. You must train 3 neural nets and each of these neural nets must be trained only with images in the corresponding class.

Also how can I test this neural net? Pass an image?

Yes. If the image is a member of the class that the NN is supposed to detect, the outputs should all be zero, or close to zero, depending on how well the test image agrees with the equations for the affine subspace, A*x(:)+b=0.

Matt J 2023-10-25

编辑：Matt J 2023-10-25

在 MATLAB Online 中打开

Here's a working example of the plane fitting, but with a neural network. I had to turn of L2-regularization to get it to work. Not sure what that will mean for your real use case.

images=randn(3,2)*randn(2,1000) + randn(3,1); %3x1 images

images=reshape(images,3,1,1,[]);

N=1;

layers=[imageInputLayer([3,1,1],'Normalization','none')

fullyConnectedLayer(N,'WeightL2Factor',0,'BiasL2Factor',0 )

regressionLayer];

XTrain=images;

YTrain=zeros(1,1,N,size(XTrain,4));

options = trainingOptions('adam', ...

'MiniBatchSize',100, ...

'MaxEpochs',50, ...

'InitialLearnRate',1, ...

'LearnRateSchedule','piecewise', ...

'LearnRateDropFactor',0.1, ...

'LearnRateDropPeriod',20, ...

'ValidationFrequency',200, ...

'Shuffle','every-epoch');

close all force

net=trainNetwork(XTrain, YTrain, layers,options);

Training on single CPU. |========================================================================================| | Epoch | Iteration | Time Elapsed | Mini-batch | Mini-batch | Base Learning | | | | (hh:mm:ss) | RMSE | Loss | Rate | |========================================================================================| | 1 | 1 | 00:00:00 | 1.78 | 1.6 | 1.0000 | | 5 | 50 | 00:00:00 | 0.06 | 2.0e-03 | 1.0000 | | 10 | 100 | 00:00:00 | 8.13e-03 | 3.3e-05 | 1.0000 | | 15 | 150 | 00:00:00 | 1.83e-03 | 1.7e-06 | 1.0000 | | 20 | 200 | 00:00:00 | 9.71e-05 | 4.7e-09 | 1.0000 | | 25 | 250 | 00:00:01 | 1.20e-05 | 7.2e-11 | 0.1000 | | 30 | 300 | 00:00:01 | 1.44e-06 | 1.0e-12 | 0.1000 | | 35 | 350 | 00:00:01 | 3.81e-08 | 7.3e-16 | 0.1000 | | 40 | 400 | 00:00:01 | 6.86e-09 | 2.4e-17 | 0.1000 | | 45 | 450 | 00:00:01 | 6.34e-09 | 2.0e-17 | 0.0100 | | 50 | 500 | 00:00:01 | 5.71e-09 | 1.6e-17 | 0.0100 | |========================================================================================| Training finished: Max epochs completed.

A=net.Layers(2).Weights; s=norm(A);

A=A/s;

b=net.Layers(2).Bias/s;

norm(A*images(:,:)+b)

ans = single 2.7123e-06

p=planarFit.groundtruth(images(:,:), A,-b);

plot(p)

M 2023-10-26

编辑：M 2023-10-26

在 MATLAB Online 中打开

@Matt J

because the norm of my real data is 1.3900e+05

And

| 200 | 2600 | 00:01:20 | 9.26 | 42.9 | 1.0000e-09 |

My real image which belong to the same class 2608 images are attached in the link without augmentation https://we.tl/t-hDRA25aaJ1

N=15;
 layers=[imageInputLayer([120 120 1],'Normalization', 'rescale-zero-one')
        fullyConnectedLayer(N,'WeightL2Factor',0,'BiasL2Factor',0 ) 
        regressionLayer];
YTrain=zeros(1,1,N,size(XTrain,4));
options = trainingOptions('adam', ...
    'MiniBatchSize',200, ...
    'MaxEpochs',200, ...
    'InitialLearnRate',1, ...
    'LearnRateSchedule','piecewise', ...
    'LearnRateDropFactor',0.1, ...
    'LearnRateDropPeriod',20, ...
    'ValidationFrequency',200, ...
    'Shuffle','every-epoch', ...
    'Plots','training-progress'); 
net=trainNetwork(XTrain, YTrain,  layers,options);
A=net.Layers(2).Weights; 
%s=norm(A);
%A=A/s;
b=net.Layers(2).Bias;

M 2023-11-23

Hi @Matt J, I have a question please, Why did you decide her to use regressionLayer in your Network? And what is the advantage of using this layer? thanks

Matt J 2023-11-23

As opposed to what? What else might we have used?

请先登录，再进行评论。

Matt J 2023-10-27

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces#answer_1342291

编辑：Matt J 2023-10-27

在 MATLAB Online 中打开

Well, in general, we can write the estimation of A,b as the norm minimization problem,

If X can be fit in RAM, you could just use svd() to solve it

N=14;
X=images(:,:)';
vn=vecnorm(X,inf,1);
[~,~,V]=svd([X./vn,  ones(height(X),1)] , 0);
Abt=V(:,end+1-N:end)./vn';
A=Abt(1:end-1,:)';
b=Abt(end,:)';
s=vecnorm(A,2,2);
[A,b]=deal(A./s, b./s);

43 个评论
显示 41更早的评论隐藏 41更早的评论

Torsten 2023-12-5

编辑：Torsten 2023-12-5

So you say you have 3 classes for your images that are known right from the beginning.

Say you determine the affine subspace for each 49000 images that best represents your image and you compute the mutual distance by SVD between these 49000 affine subspaces (which would give a 49000x49000 matrix). Now say you cluster your images according to this distance matrix into 3 (similar) clusters derived from the distance matrix.

The question you should ask yourself is: would these three clusters resemble the 3 classes that you think the images belong to right at the beginning ?

If the answer is no and if you consider the 3 classes from the beginning as fixed, then the distance measure via SVD is not adequate for your application.

M 2023-12-6

编辑：M 2023-12-6

@Matt J unfortunately the pre-normalization, and increasing the number of N didnt improve the performance.

I sure that the problem in the indicitor (norm), other indicators may provide better results as Grassman Kernel and distance. because that's proved in the literature. but still I am looking how can I apply them

请先登录，再进行评论。

Matt J 2023-12-5

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2037381-how-to-represent-gray-scale-images-as-affine-subspaces#answer_1366084

编辑：Matt J 2023-12-5

在 MATLAB Online 中打开

So how Can I get the direction and origins so I can compute the grassman distance?

Here is another variant that gives a Basis/Origin description of the subspace.

N=100; %estimated upper bound on subspace dimension
X=reshape(XTrain,[], size(XTrain,4)); %<----no tranpose
mu=mean(X,2); 
X=X-mu; 
[Q,~,~]=qr(X , 0);
Basis=Q(:,1:N); %Basis direction vectors
Origin=mu-Basis*(Basis.'*mu);  %Orthogonalize

18 个评论
显示 16更早的评论隐藏 16更早的评论

Matt J 2023-12-6

编辑：Matt J 2023-12-6

在 MATLAB Online 中打开

why did you decide to take into consideration only the first N columns of Q

The final columns of a QRP decomposition contribute the least to the decomposition, and might be attributable to noise. For example, here is a wide matrix X of rank N=2 as well as a noise-corrupted version, Xnoisy:

X=repmat(  rand(4,2)   ,1,4)
X = 4×8
    0.8430    0.7861    0.8430    0.7861    0.8430    0.7861    0.8430    0.7861
    0.7153    0.5335    0.7153    0.5335    0.7153    0.5335    0.7153    0.5335
    0.4414    0.4160    0.4414    0.4160    0.4414    0.4160    0.4414    0.4160
    0.0416    0.8772    0.0416    0.8772    0.0416    0.8772    0.0416    0.8772
Xnoisy=X + 1e-4*randn(size(X))
Xnoisy = 4×8
    0.8429    0.7862    0.8430    0.7861    0.8431    0.7862    0.8430    0.7861
    0.7155    0.5336    0.7153    0.5334    0.7151    0.5336    0.7152    0.5335
    0.4414    0.4159    0.4413    0.4160    0.4414    0.4160    0.4414    0.4159
    0.0418    0.8770    0.0415    0.8772    0.0417    0.8771    0.0416    0.8771

When we take the QRP decomposition of X, we see that the last two rows of R are essentially 0, which means that only linear combinations of the first 2 columns of Q are used in the decomposition to regenerate the columns of X. The final two columns of Q can be discarded. This makes sense, because we know that X is rank-2.

[Q,R,~]=qr(X,"econ"); R
R = 4×8
   -1.3583   -0.9308   -1.3583   -0.9308   -0.9308   -1.3583   -0.9308   -1.3583
         0   -0.7432   -0.0000   -0.7432   -0.7432   -0.0000   -0.7432   -0.0000
         0         0    0.0000    0.0000    0.0000    0.0000    0.0000    0.0000
         0         0         0   -0.0000   -0.0000    0.0000   -0.0000    0.0000

When we do this with Xnoisy, the final two rows are still quite small, but the effect of the noise is that they are not as close to zero as they should be. Therefore, the final two columns of Q need to be discarded based on some other criterion besides R(i,:)=0.

[Q,R]=qr(Xnoisy,  "econ"); R
R = 4×8
   -1.1912   -1.0617   -1.1911   -1.0615   -1.1911   -1.0617   -1.1911   -1.0616
         0    0.8472   -0.0003    0.8474   -0.0001    0.8473   -0.0002    0.8473
         0         0    0.0003    0.0000    0.0004   -0.0000    0.0003   -0.0000
         0         0         0   -0.0002   -0.0002   -0.0001   -0.0002   -0.0000

You don't necessarily have to choose N manually, though. You could have used some threshold on the diagonal values of R, e.g.,

N=find(abs(diag(abs(R))> 0.01*max(abs(R(:)))) ,1,'last')
N = 2

Matt J 2023-12-7

编辑：Matt J 2023-12-7

is there here a criteria for selcting the N final V columns of its SVD as you suggested for N initial columns of Q?

The dimension of the subspace that you fit to X will be NumPixels-N, where NumPixels is the total number of pixels in one image.

Also, Regarding Basis/Origin description, can you give me an idea how do we usually use this information for classification?

No, pursuing it was your idea. You said it would help you compute the Grassman distance, whatever that is.

M 2023-12-11

编辑：M 2023-12-11

Dear @Matt J thank you for your suggestions and clarifications.

I reached to a conclusion that representing the images as it is as vectors in a subspace is not a good idea!

Especially if the test images are not typical for the training set.(stacking the images as vectors causes problems!)

I think I have to do some feature extractions of region of interest first then representing the features as subspaces.(whether as matrices or vectors) , Still I am thinking how to do that.

请先登录，再进行评论。

请先登录，再回答此问题。

类别

AI and Statistics Deep Learning Toolbox Automatic Differentiation Custom Training Loops

在 Help Center 和 File Exchange 中查找有关 Custom Training Loops 的更多信息

产品

Image Processing Toolbox

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

How to represent gray scale images as affine subspaces?

4 个评论
显示 2更早的评论隐藏 2更早的评论

回答（4 个）

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

34 个评论
显示 32更早的评论隐藏 32更早的评论

43 个评论
显示 41更早的评论隐藏 41更早的评论

18 个评论
显示 16更早的评论隐藏 16更早的评论

另请参阅

类别

标签

产品

Community Treasure Hunt

How to represent gray scale images as affine subspaces?

4 个评论 显示 2更早的评论隐藏 2更早的评论

回答（4 个）

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

34 个评论 显示 32更早的评论隐藏 32更早的评论

43 个评论 显示 41更早的评论隐藏 41更早的评论

4 个评论
显示 2更早的评论隐藏 2更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

34 个评论
显示 32更早的评论隐藏 32更早的评论

43 个评论
显示 41更早的评论隐藏 41更早的评论