Microsoft Syntex AMA
Event Ended
Tuesday, Nov 15, 2022, 10:00 AM PST
Microsoft Syntex is Content AI integrated in the flow of work. Syntex automatically reads, tags, and indexes high volumes of content and connects it where it’s needed—in search, in applications, and ...
EmilyPerina
Updated Nov 15, 2022
Mario_Fulan
Nov 02, 2022
Iron Contributor
Can you explain a bit more about the differences in functionality between Freeform documents (using AI Builder) and Unstructured documents (using document understanding models)? I know the training is different, Freeform doesn't do classification, and there are a few other differences. One question I have is whether both can "deskew" PDFs or scanned images before processing. Freeform seems to handle rotated documents, but Unstructured documents have trouble with OCR text extraction positioning if the image is rotated.
JamesEccles
Microsoft
Nov 15, 2022
This article gives a good overview of the different model types: https://learn.microsoft.com/en-gb/microsoft-365/contentunderstanding/difference-between-document-understanding-and-form-processing-model
Both Unstructured and Freeform models can be used to tackle similar use cases. Limitations on file format and language may push you in one direction or the other. But assuming both are possibilities for your use case, I would start with Freeform, because it takes less effort to train. If that doesn't get at the right data, then shift to Unstructured, which has more of a teaching element and needs more human training.
Both model types use the same OCR engine, so they should behave broadly the same for skewed docs. One thing to note, though, is that Freeform factors layout into the model, whereas Unstructured has to restructure the content into linear text.
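To illustrate why restructuring into linear text can be sensitive to rotation, here is a minimal Python sketch. It is not how Syntex or its OCR engine works internally. The `linearize` and `rotate` helpers, the `(text, x, y)` word format, and the sample page are all hypothetical, chosen only to show the general idea of flattening positioned words into reading order.

```python
import math
from typing import List, Tuple

# Hypothetical word format: (text, x, y), with (x, y) the word's top-left corner.
Word = Tuple[str, float, float]


def linearize(words: List[Word], line_tolerance: float = 5.0) -> str:
    """Flatten positioned words into text: top-to-bottom, then left-to-right."""
    rows: List[List[Word]] = []
    for word in sorted(words, key=lambda w: w[2]):
        # Words whose y is close to the current row's first word share that row.
        if rows and abs(rows[-1][0][2] - word[2]) <= line_tolerance:
            rows[-1].append(word)
        else:
            rows.append([word])
    return "\n".join(
        " ".join(w[0] for w in sorted(row, key=lambda w: w[1])) for row in rows
    )


def rotate(words: List[Word], degrees: float) -> List[Word]:
    """Rotate every word position about the origin, as in a rotated scan."""
    theta = math.radians(degrees)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    return [(t, x * cos_t - y * sin_t, x * sin_t + y * cos_t) for t, x, y in words]


# Hypothetical two-line page.
page: List[Word] = [
    ("Invoice", 0, 0), ("No.", 60, 0), ("1234", 100, 0),
    ("Total", 0, 20), ("due:", 60, 20), ("500", 100, 20),
]

print(linearize(page))              # upright page: lines come out in reading order
print(linearize(rotate(page, 90)))  # rotated page: words from different lines get mixed
```

In this toy setup the upright page linearizes cleanly, while the rotated page groups words from different lines together. A model that keeps layout information has more to work with in that situation than one that only sees the flattened text.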