ocr problem for reading numbers

Question

Abhishek Kashyap 2018-9-9

Direct link to this question:

https://ww2.mathworks.cn/matlabcentral/answers/418250-ocr-problem-for-reading-numbers

Answered: Sourabh 2025-3-27

I am totally unable to read the numbers from the following image. I tried CharacterSet, regionprops, and imdilate, but not even one number gets recognized. Please help.


Answer 1

Sourabh 2025-3-27

Direct link to this answer:

https://ww2.mathworks.cn/matlabcentral/answers/418250-ocr-problem-for-reading-numbers#answer_1562638


Hey @Abhishek Kashyap,

I was also unable to extract the numbers when I tried MATLAB’s “ocr” function on the given image. The result was an empty string, and it couldn’t recognize the numbers as expected.

I found that “ocr” performs best when the text is located on a uniform background and is formatted like a document with dark text on a light background. When the text appears on a non-uniform dark background, additional pre-processing steps are required to get the best OCR results.

To get more accurate results, try the preprocessing steps below:

Convert to Grayscale

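OCR performs better on grayscale images than on RGB images, and the later filtering and thresholding steps expect a single-channel image:

    grayImage = rgb2gray(rgbImage);   % rgbImage is the input image, e.g. from imread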

Sharpen the Image

To enhance the edges of the text and make it more readable for OCR:

    sharpenedImage = imsharpen(grayImage);

Noise Reduction

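Use a median filter to remove speckle noise while largely preserving edges. A Gaussian filter (“imgaussfilt”) is an alternative, but it blurs edges more. Filter the sharpened image so the previous step is not discarded:

    filteredImage = medfilt2(sharpenedImage, [3 3]); % 3-by-3 median filter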

Binarization (Thresholding)

Convert the grayscale image to a binary image using Otsu's method, which is the default for “imbinarize”:

    binaryImage = imbinarize(filteredImage);
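Otsu's method picks a single global threshold, which can struggle on the kind of non-uniform dark background described above. Here is a rough sketch of one alternative: adaptive thresholding, followed by an inversion so that “ocr” sees dark text on a light background. The synthetic image, the variable name ocrInput, and the 0.5 polarity heuristic are my own assumptions for illustration, not something taken from the original image.

```matlab
% Hypothetical stand-in for the real image: a bright bar ("text") on a
% dark background whose brightness ramps from left to right.
[x, ~] = meshgrid(1:200, 1:100);
grayImage = uint8(20 + 0.4 * x);      % uneven dark background
grayImage(40:60, 30:170) = 220;       % bright region standing in for text

% Adaptive thresholding computes a local threshold around each pixel.
% ForegroundPolarity says the foreground is brighter than the background.
binaryImage = imbinarize(grayImage, 'adaptive', 'ForegroundPolarity', 'bright');

% Heuristic (assumption): if most pixels are black, the text is probably
% the white part, so invert to get dark text on a light background.
if mean(binaryImage(:)) < 0.5
    ocrInput = imcomplement(binaryImage);
else
    ocrInput = binaryImage;
end
```

Whether the adaptive or the global threshold works better depends on the image, so it is worth viewing both results with “imshow” before running “ocr”.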

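Small Component Removal

Note that the line below does not compute a Stroke Width Transform (SWT). It filters connected components by area, keeping only those with at least 30 pixels, which removes small specks that are unlikely to be characters:

    binaryImage = bwareafilt(binaryImage, [30, Inf]); % Keep only large connected components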

Morphological Operations

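Erode with a small structuring element to remove leftover specks and thin the strokes. If the characters come out broken instead, use “imdilate” with the same structuring element to reconnect them:

    se = strel('disk', 1);                  % disk-shaped structuring element, radius 1
    erodeImage = imerode(binaryImage, se);  % shrinks white (foreground) regions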

Finally, run “ocr” on the cleaned image. You can also set “LayoutAnalysis” to “Block” to instruct “ocr” to assume the image contains just one block of text.

    results = ocr(erodeImage, LayoutAnalysis="Block");
    results.Text
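Since the goal is to read numbers only, it may also help to restrict recognition to digits and to discard low-confidence words. The following is a minimal sketch, assuming the Computer Vision Toolbox is installed and that your release's “ocr” accepts “CharacterSet” together with “LayoutAnalysis”. The synthetic test image and the 0.5 confidence cutoff are placeholders of mine, not values from the original post.

```matlab
% Synthetic test image (assumption): insertText draws black text on a
% light box by default, so this is already dark text on a light background.
rgbImage = insertText(ones(120, 360), [20 20], '0123456789', 'FontSize', 48);

grayImage = rgb2gray(rgbImage);
binaryImage = imbinarize(grayImage);

% Restrict recognition to digits and treat the image as one block of text.
results = ocr(binaryImage, CharacterSet="0123456789", LayoutAnalysis="Block");

% Keep only words recognized with reasonable confidence.
% The 0.5 cutoff is an arbitrary illustrative value.
isConfident = results.WordConfidences > 0.5;
digitsFound = results.Words(isConfident);
disp(digitsFound)
```

Looking at “results.WordConfidences” is also a quick way to judge whether a given preprocessing step helped or hurt on your image.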

For more information, please refer to the MATLAB documentation for the “ocr” function.

I hope this helps!
