Vision API: preprocessing

Hello,

I want to run Vision OCR on a batch of scanned documents and photos of documents, but the image quality, and consequently the quality of the extracted text, is not always good. What preprocessing steps or tools would you recommend to improve the model's results?

Thanks,

Roxana

1 Reply
Hi Roxana, in my experience OpenCV and SciPy are the go-to libraries for image preprocessing. The OCR needs a high-quality image to detect text reliably, so try to use images with a resolution of at least 300 DPI to get acceptable results.