Forum Discussion
Searching pdf files using Natural Language
I'm new to all the Azure services and have been tasked with finding a solution to search a collection of pdf files using natural language queries. One use case would be to search resumés by saying (or entering) something like. "Find me all the candidates who have worked for XYZ company within the last 5 years".
It looks like Azure Cognitive Search could be very useful for this. I've created a search service, and resource and data source etc and have successfully done a simple text search of my file. The resulting JSON looks like it did a good identifying all the various text concepts (names and places etc). I'm overwhelmed with all the various services that are available. If anyone can give a little newbie guidances as to which direction to go it would be greatly appreciated and would save me insane amounts of time running down rabbit holes looking for the correct services to use for my purposes. Thanks.
- Luis Cabrera-CordonMicrosoft
Here are a few resources to get you started with Azure Cognitive Search.
Cognitive Search related Presentations
- Intro to Cognitive Search (Ignite 2018)
- Session on creating custom skills (Build 2019)
- Session on Knowledge Store and Complex Types (Build 2019)
Cognitive Search Documentation Resources
Full List of documentation resources here, but I recommend technical folks that want to get their feet wet with our technology to start with:
- What is cognitive search?
- Quick-start: Try cognitive search in the portal
- Tutorial: Learn the cognitive search APIs
- Example: create a custom skill
As far as the specific problem you are trying to solve.
You could extract terms such as the name of companies (Organizations) from the Resumes and push that information into a "Companies" field. And then create a query that would filter based on the Organization.
Best of luck,
-Luis Cabrera
PM, Azure Cognitive Search