how to extract data from table image?

4 个评论

I performed ocr. it didnt extract properly
Train the ocr on that character set.
The images are pretty low quality.
I have experienced the OCR having a really rough time with vertical and horizontal lines...

请先登录,再进行评论。

回答(1 个)

You need to have at least 20 lines in the numbers for ocr() to work -- it says so in the help. Also, you can get rid of lines by thresholding and removing the largest blob, but you might want to use them to extract each rectangle one at a time so you can do ocr on just that one rectangle. If you still need help, attach a higher resolution image

2 个评论

Can you elaborate more how to get rid of the lines please?
If the lines are connected to the edge of the image you can do
mask = grayImage > somethreshold;
mask = imclearborder(mask);
otherwise, the grid looks like it would be the biggest blob in the mask you can use bwareafilt
mask = mask & ~bwareafilt(mask, 1);

请先登录,再进行评论。

类别

帮助中心File Exchange 中查找有关 Convert Image Type 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by