Forum Discussion
SharePoint Capability to do OCR in PDF Documents
I saw your reply to his question on OCR so I wonder if you can help me. I have tens of thousands of PDFs and image files in my onedrive but I'm not sure if all of them are readable so when I do a keyword search on onedrive no files would escape my search. Do I need to identify pdf and image files that are not OCR enabled and convert them into OCR? if so, what would you recommend? finding and converting each file would take me years.
I thank you in advance for any help you can provide with this issue.
Regards
You can use a free audit tool such as https://www.encodian.com/product/indxr/ to determine how many files are missing text layers (even on a page level basis). Indxr provides low fixed cost unlimited OCR for bulk requirements in instances where using OCR via Power Automate is not cost effective. Indxr can have automated run schedules to achieve automated bulk OCR at a fixed price.