Vision
7 Topics

How to process multiple receipts on one scan
Dear Community, I am building a simple receipt recognizer solution, and I run into a problem when users fill a full A4 page with several receipts. As far as I understand, the built-in receipt model does not work with multiple receipts on one page. I tried a few Python libraries (such as cv2) to process these scans, but edge detection generally does not work: often there is no visible edge, and the receipts are rotated or partly tucked under each other. Is there any (AI) solution that can help me first extract the distinct receipts before feeding them to Document Intelligence? I don't need the text at this point, only the receipts as images. Thanks, vm
90 Views, 0 likes, 0 Comments

University Texas Austin Students Complete VizWiz Challenge with Microsoft Azure
Applications that use machine learning have greatly improved human life in many fields. One such application helps people who are blind or have low vision learn about their visual surroundings using computer vision. With such applications, people with visual impairments can more independently accomplish daily tasks such as recognizing the denomination of currency, checking whether their socks match, and identifying which flavor of yogurt they have picked for breakfast. To spur research into such applications, a series of VizWiz datasets was created to support several AI challenges. One such dataset challenge includes images taken by visually impaired users and an associated question about each image.
1.5K Views, 0 likes, 0 Comments

Tying it all together
Hi Everybody! New here on the Microsoft Community scene. The organization for which I work has Office 365, but no one really owns it or knows how to use it or what to do with it beyond the very basic Access, Excel, PowerPoint, and Word usage. And I mean basic. I think it's unbelievably cool what Office has morphed into and I'm looking to transform where I work. Trouble is, I'm not really sure how it all ties together. In short, what is Microsoft's vision for how all these awesome programs are supposed to work together? Any insight is much appreciated! Thanks, Travis
1.1K Views, 0 likes, 2 Comments

ONNX Runtime 0.5 releases with support for hardware optimized inferencing
If you are creating an Intelligent Edge project that uses vision machine learning models in the ONNX (Open Neural Network Exchange) format, the recent ONNX Runtime 0.5 release provides support and tutorials for using the NVIDIA Jetson Nano and Intel's OpenVINO Toolkit for hardware-based optimization.
ONNX Runtime is a performance-focused, complete scoring engine for ONNX models, with an open, extensible architecture that continually addresses the latest developments in AI and deep learning. ONNX Runtime stays up to date with the ONNX standard, with a complete implementation of all ONNX operators, and supports all ONNX releases (1.2+) with both forward and backward compatibility. Please refer to this page for ONNX opset compatibility details.
ONNX is an interoperable format for machine learning models supported by various ML and DNN frameworks and tools. The universal format makes it easier to interoperate between frameworks and to maximize the reach of hardware optimization investments.
Read the complete blog announcement on the Microsoft Open Source Blog: Now available: ONNX Runtime 0.5 with support for edge hardware acceleration
Additional Resources:
Microsoft Open Source Blog
ONNX Runtime GitHub repo
Intel Distribution of OpenVINO Toolkit
NVIDIA Jetson Nano
ONNX
Find more resources for Intelligent Edge device builders at the Intelligent Edge Device Builder Resource Center.
694 Views, 0 likes, 0 Comments

Is Custom Vision a superset of Computer Vision?
If I choose Custom Vision, am I going to benefit from the "knowledge" already in Computer Vision? Or is Custom Vision a separate service that has no prior knowledge and needs to be trained from scratch? For example, if there's a bottle in the picture, will Custom Vision identify it even before I start any training?
1.6K Views, 0 likes, 0 Comments

Formula linking to itself after upgrading
Hi, If anyone can help I would be most grateful. I am currently using a spreadsheet that links to a Vision Cache table, which is in turn linked to SAP 9.0. The Cache Table link is located in:
C:\Program Files\Infor\Query and Analysis\Vision Cache.xla
The system is being upgraded to Windows 10 and SAP 9.3. The new Vision Cache Table link is found in:
C:\Program Files (x86)\Infor\Query and Analysis\LsAgXLB.xla
When I open the document on the Windows 10 machine, the cell shows "#NAME?" and the formula is still looking for the old link:
='C:\Program Files\Infor\Query and Analysis\Vision Cache.xla'!AG_VICSUM("SAP P&L 2019",1,$C$4,$C38,$C$1)*-1
If I find and replace the file path up to the start of the correct formula, it works. However, when I save and close the document and then reopen it, the formula tries to link to the file name of the document itself:
='Management Ac_Master1A_2018-19_2001.xlsm'!AG_VICSUM("SAP P&L 2019",1,$C$4,$C38,$C$1)*-1
This causes "#NAME?" to appear in the cell again. I can find and replace it and the formulas work again. The problem is that it keeps coming back, and I don't know how to stop it from reappearing. Any suggestions would be greatly appreciated. Cheers, Toby
1.5K Views, 0 likes, 1 Comment

Microsoft Vision API - How to transform Extracted data from OCR to a Template?
Hi All, I am working on a solution where we submit a bill receipt to the Vision OCR API and, after receiving the extracted data, need to transform it into a predefined template so that we can pull out specific fields such as total amount, store name, item name, quantity, and price. Your help and guidance is appreciated. Thanks & Regards, Sachin Deorah
948 Views, 0 likes, 0 Comments
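
One possible way to approach the template mapping described above is sketched here in Python. It is only an illustration under stated assumptions: the OCR result is assumed to have already been flattened into a list of text lines, the first non-empty line is assumed to be the store name, and item and total lines are assumed to look like "Milk 2 1.20" and "TOTAL 4.90". The names `ReceiptTemplate`, `LineItem`, and `fill_template` are hypothetical and not part of any Microsoft API; real receipts would need more tolerant rules.

```python
import re
from dataclasses import dataclass, field
from decimal import Decimal
from typing import List, Optional


@dataclass
class LineItem:
    name: str
    qty: int
    price: Decimal


@dataclass
class ReceiptTemplate:
    """Hypothetical target template for the fields named in the post."""
    store_name: Optional[str] = None
    total_amount: Optional[Decimal] = None
    items: List[LineItem] = field(default_factory=list)


# Assumed layouts (not taken from any OCR service):
#   total line: "TOTAL 4.90" or "Total: $4.90"
#   item line:  "<name> <qty> <unit price>", e.g. "Milk 2 1.20"
TOTAL_RE = re.compile(r"^\s*total\b\D*(\d+\.\d{2})\s*$", re.IGNORECASE)
ITEM_RE = re.compile(r"^\s*(.+?)\s+(\d+)\s+(\d+\.\d{2})\s*$")


def fill_template(ocr_lines: List[str]) -> ReceiptTemplate:
    """Map plain OCR text lines onto the template fields."""
    receipt = ReceiptTemplate()
    for raw in ocr_lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if receipt.store_name is None:
            # Assumption: the first non-empty line is the store name.
            receipt.store_name = line
            continue
        total = TOTAL_RE.match(line)
        if total:
            receipt.total_amount = Decimal(total.group(1))
            continue
        item = ITEM_RE.match(line)
        if item:
            receipt.items.append(
                LineItem(item.group(1), int(item.group(2)), Decimal(item.group(3)))
            )
        # Lines that match no rule are ignored in this sketch.
    return receipt
```

For example, `fill_template(["Contoso Market", "Milk 2 1.20", "TOTAL 4.90"])` would fill the store name, one line item, and the total. Keeping the rules as separate regular expressions makes it easier to add patterns per store layout later.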