Event banner
Microsoft Syntex AMA: New document processing pay-as-you-go metered services
Event Ended
Wednesday, Mar 01, 2023, 10:00 AM PSTEvent details
Microsoft Syntex is Content AI integrated in the flow of work. Syntex brings together AI and automation from the Microsoft Cloud to the content and apps you use every day. Whether youโre processing i...
EmilyPerina
Updated Mar 01, 2023
ja_Hor_365
Mar 01, 2023Brass Contributor
As I am sure you are aware, both in Latam and in other regions, there are a variety of types and formats of documents, even with a high percentage in manuscript, what is the strategy to deal with or progress on the subject?
- IanStoryMar 01, 2023
Microsoft
Hi Jaime - can you elaborate on what you mean by manuscript? Are you talking handwriting? Printed documents that need to be scanned? For handwriting, you can use Syntex today to process handwritten documents, in particular I'd look at freeform and structured document processing. If printed documents that need to be scanned, we don't plan to build scanner software as there are many, many software packages that work with scanners today to take paper and turn it into a PDF or TIF, which can then be processed by Syntex (in fact, every scanner made for the last few years includes various software packages to operate the scanner itself).- ja_Hor_365Mar 01, 2023Brass Contributorexact, handwritten With respect to digitization solutions, it is correct, there are several, and we have already covered several functionalities...in my opinion, it would only be necessary to add certain functionalities in the pre-processing of the image to ensure that the best possible image reaches Syntex and facilitate the removal
- IanStoryMar 01, 2023
Microsoft
Thanks Jaime - handwriting is definitely one of the harder things in this space, especially if it is paper (some folks do electronic handwriting on tablets, for instance). In that world, I'd propose the following: 1) Carefully prep the paper to be scanned - this may involve removing staples, tape, etc. and getting the paper ready to be scanned (might also include taping down smaller pieces of paper for scanning, depending on the type of scanner) 2) Scan the documents, and use the variety of scanning software that exists on the market (many, many third parties out there make scanning software, and again, it comes with most scanners), including post-processing and image cleanup, to your point. 3) Take the newly created images (ideally PDF) from the scanner/scanning software and upload those to Microsoft 365 - this can be done either with the scanning software itself, our APIs, our sync client, our migration tools, drag and drop...there are many options for this. 4) Once the images (PDF files) are uploaded, Syntex can process them using models you've created, including freeform and structured, doing handwriting recognition (to the limits of the technology, penmanship comes into play here, the more clear the writing was, the more success you'll have) ๐
- JamesEcclesMar 01, 2023
Microsoft
We continue to expand the portfolio of models in Syntex to handle different types of file. For example unstructured and freeform models can be used for more varied datasets. In addition, we will be releasing the Taxonomy Tagger feature which will do keyword extraction from documents based on term sets, with no training required by users.