Image recognition: extracting numbers from white paper

10 次查看(过去 30 天)
I have a large dataset of coloured images, all with one person holding a white piece of paper with a printed number. I am trying to extract the number as a class label for each person. By binarizing the image and removing small areas I can create an image as shown. But from here, all implementations of the ocr function I have attempted fail to extract the number. I have also attempted using corner extraction, but this does not work easily as some candidates are obscuring the corners of the paper with their hands. Could anyone provide some tips on how to achieve this?
Code:
clc
clear all
close all
% Load an image
rgbImage = imread('person11.jpeg');
grayImage = rgb2gray(rgbImage);
% Binarize the image.
binaryImage = grayImage > 120;
% Remove small objects.
binaryImage = bwareaopen(binaryImage, 5000);
figure(1)
imshow(binaryImage);
title('Cleaned Binary Image');
% Use the 'CharacterSet' parameter to constrain OCR
results = ocr(binaryImage, 'CharacterSet', '0123456789', 'TextLayout','Block');
results.Text
Output lots of different numbers, not 11!
Image:

采纳的回答

Image Analyst
Image Analyst 2019-2-11
Stephanie:
I'm sure you've got it working by now, but for others (or if you want to compare your algorithm to mine), here is how I would do it (attached). You could make it a lot faster if you didn't have the background be the same color as the sheet of paper the subject is holding. That could save us time because we wouldn't have to spend a lot of time to separate the two with an erosion. Have the background be some vivid color - any color except white or black or gray.
0000 Screenshot.png
  1 个评论
pldr-bny
pldr-bny 2019-2-11
Thank you for providing some very useful pointers in your example! I had not used the strel object before so that alone has helped me a lot in writing my own solution.

请先登录,再进行评论。

更多回答(1 个)

Image Analyst
Image Analyst 2019-2-10
编辑:Image Analyst 2019-2-10
First I would do color segmentation to find neutral colored regions (not what you have done). Use the Color Thresholder app on the Apps tab of the MATLAB tool ribbon to do this.
Then I would crop the region to only the paper they are holding. Then I would call the ocr() function in the Computer Vision System Toolbox.
Attach your original image if you still need help and I might be able to do some of it if you're unable to and I have time.
  2 个评论
pldr-bny
pldr-bny 2019-2-10
编辑:pldr-bny 2019-2-10
Thank you that's really helpful! I will try to implement your suggestions, but how would I define the region to crop given the position in the frame is slightly different for each person? Here is the original image if you are prepared to have a look for me.
Image Analyst
Image Analyst 2019-2-10
I just got back after being away all day. What did you accomplish while I was gone?

请先登录,再进行评论。

类别

Help CenterFile Exchange 中查找有关 Computer Vision with Simulink 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by