Event banner
Azure Cognitive Search AMA: Vector search, Azure OpenAI Service, generative apps, plugins & more
Event details
In this session we’ll answer questions about the emerging Retrieval-Augmented Generation pattern and how you can use Azure OpenAI service and Azure Cognitive Search to implement it today in your applications to power ChatGPT-like experiences, generative scenarios, and more. Bring your questions about vector search in Azure Cognitive Search, which is coming to public preview soon, as well as about implementation details, data preparation, integration with large language models, and anything else related to Azure Cognitive Search.
An AMA is a live text-based online event similar to an “Ask Me Anything” on Reddit. This AMA gives you the opportunity to connect with Microsoft product experts who will be on hand to answer your questions and listen to feedback.
Feel free to post your questions anytime in the comments below beforehand, if it fits your schedule or time zone better, though questions will not be answered until the live hour.
116 Comments
- EricStarkerGold Contributor
- morriswongCopper ContributorHow do you manage the cost of using Azure Search as vector store for RAG use cases? I had an experience trying out an azure search example that involves some 90k arvix and covid journals and it costs £500 just to load them. Would there is a way to estimate cost better and how should one approach it?
- fsunavala-msft
Microsoft
Cost ultimately comes down to your SKU and the number of Search Units (SUs) which many factors play a role in the decision making such as document count, availability and reliability, index schema design, count of vector fields, vector field dimensions, etc. The most accurate way to estimate cost is to perform a proof-of-concept index with the desired schema you want and load a sample of documents that will reflect your production workload. Then, you can extrapolate your sample document count and index size with your production document count and get your production index size. You can visit our documentation on service limits by SKU and decide which SKU and Search Unit Count (partitions/replicas) fit your needs best. https://learn.microsoft.com/azure/search/search-limits-quotas-capacity Once you have an estimated SKU and number of search units, you can visit our pricing calculator here, select your region, and see a cost estimate of your Cognitive Search service: https://azure.microsoft.com/pricing/details/search/
- bhupeshm82Copper ContributorDo we have any best practices or case studies for beginners to learn Azure OpenAI & Cognitive Search?
- fsunavala-msft
Microsoft
For Azure Cognitive Search, a good hands-on holistic overview of the whole product is here Knowledge Mining Accelerator, and specifically for Vector Search, Quickstart vector search - Azure Cognitive Search | Microsoft Learn.
For Azure OpenAI, a good start is Quickstart - Deploy a model and generate text using Azure OpenAI Service - Azure OpenAI | Microsoft Learn
- EricStarkerGold ContributorThanks for joining us for this Azure Cognitive Search AMA! The event is now over, but we'll be posting a summary of the questions and answers here soon.
- liamca-msft
Microsoft
Thanks everyone for the amazing questions! It was great speaking with you today!
- Thomas BrownCopper ContributorPlug In environment roadmap - I've seen technical specs. how 3rd parties will build. Specific to parsing/reading/integrating docs. What is timing for integration/rollout? Reference to tech. articles showing partner examples?
- DerekLegenzoffIron Contributor
Plugins support in Azure OpenAI is currently in a limited private preview. For now, I'd recommend checking out these resources on additional capabilities that you can leverage:
- Function calling is now available in Azure OpenAI Service - Microsoft Community Hub- Introducing Azure OpenAI Service On Your Data in Public Preview - Microsoft Community Hub
In particular, function calling allows you to do all the same things you could do with plugins. The main difference is that with functions, you're in control of the orchestration and need to call out to the plugins on the client side.
- AbdelrahmanYassinOccasional ReaderHow can I use this if OpenAI is not currently supported in my region?
- DerekLegenzoffIron Contributor
I'd recommend creating an Azure OpenAI service in a region close to your search service. There will be some added network latency but it should be manageable. You can check out the regional availability of different models here: Azure OpenAI Service models - Azure OpenAI | Microsoft Learn
We're also working hard to add new Azure OpenAI regions!
- gpagliaCopper ContributorWhat is the approach recommended to store a chatbot conversation history for future context?
- liamca-msft
Microsoft
Hi Gerardo, I would highly recommend you check out this blog post from Semantic Kernel which discusses memory management, There is also this video on the topic.
- Murthy582Copper Contributor
OpenAI studio (Chat Playground) is able to connect to data sources only if they are in public network, which not the ideal scenario in corporate environments. Any plans to add private endpoint connectivity to studio?
- gia_mondragon
Microsoft
Supporting private endpoints is part of that feature roadmap and ETA is the coming quarter (Q4 '23) as per the team supporting it.
- jasoneilifCopper ContributorHow will vector search change or improve upon the pattern in the blog post here https://techcommunity.microsoft.com/t5/ai-applied-ai-blog/revolutionize-your-enterprise-data-with-chatgpt-next-gen-apps-w/ba-p/3762087 and sample code here https://github.com/Azure-Samples/azure-search-openai-demo?
- liamca-msft
Microsoft
Jason, you are absolutely correct that vector search will have a very positive impact on this from the ability to get the most relevant results, and we have already updated the sample to leverage vectors (check out the "embedding" field).
In fact, we believe that it is best to leverage Hybrid Search along with the reranking layer of Semantic Search to get the best results. I talk a bit about Hybrid Search in my blog post.
- Faddykenny_09Copper Contributor
Could you EricStarker provide insights into the scalability and performance aspects of the Azure OpenAI interface? How does it handle large-scale AI workloads and ensure efficient resource utilization?
- DerekLegenzoffIron Contributor
The biggest thing to keep in mind when it comes to the scalability of Azure OpenAI service are the quota and limits. I'd start by taking a look at this page that talks about the limits. It's also important to learn how to manage the quota for your service.
In terms of performance, the best thing I can recommend is trying out your prompts in Azure OpenAI to understand the performance. One big factor for performance is the length of your prompts, particularly the number of output tokens.
- Faddykenny_09Copper ContributorSounds great. I will work on that and learn for myself how it works.
- EricStarkerGold ContributorHello! I am not a product expert, just a community manager here, so I'll leave that for the Microsoft folks here to answer when they can.
- Faddykenny_09Copper ContributorOh, Sorry about that misunderstanding. I thought you was an expert.