Event details
Retrieval-augmented generation (RAG) allows you to build GenAI applications that use your own data, to optimize LLM performance.
Join our AMA to ask us about RAG, vector databases, running RAG...
EricStarker
Updated Feb 14, 2024
fsunavala-msft
Microsoft
Feb 14, 2024Q6: Quota limits exist for capacity reasons and to maintain the health of your service. For further information on quota limitations, please visit the Azure OpenAI Service documentation: Azure OpenAI Service quotas and limits - Azure AI services | Microsoft Learn. Additionally, you can find how to manage your quota here: Manage Azure OpenAI Service quota - Azure AI services | Microsoft
CPS
Feb 14, 2024Occasional Reader
Re. Q6, we are hitting the limit with just two human users doing some basic and simple testing in the "Contoso" Chatbot created and deployed by the Studio.
The index was created from a 2000 record CSV, i.e. not a big dataset.
This would make it very unusable for a production environment accessible to the public, even if it has only a few visitors.