Forum Discussion
EricStarker
Oct 21, 2022Community Manager
Generate searchable PDFs with Azure Form Recognizer
PDF documents are widely used in business processes. Digitally created PDFs are very convenient to use. Text can be searched, highlighted, and annotated.
Unfortunately, a lot of PDFs are creat...
isspid
Aug 28, 2023Copper Contributor
This is a very useful usecase. The blog post is a very interesting solution, with the problem that the resulting PDF is much larger in size than the original one. Considering a use case (my experience) where you are dealing with a very large number of PDFs the size can become a very serious problem.
OCRmyPDF (https://github.com/ocrmypdf/OCRmyPDF) do an amazing job of perserving the original size of the PDF, but at the moment they do not support Azure Form Recognizer as an egine.
OCRmyPDF (https://github.com/ocrmypdf/OCRmyPDF) do an amazing job of perserving the original size of the PDF, but at the moment they do not support Azure Form Recognizer as an egine.