RAG with Azure AI: why your retrieval strategy matters AMA

Retrieval-augmented generation (RAG) allows you to build GenAI applications that use your own data, to optimize LLM performance. Join our AMA to ask us about RAG, vector databases, running RAG...

EricStarker

Updated Feb 14, 2024

gyangupta

Copper Contributor

Feb 14, 2024

Besides LLM model size, is there a limit on vector DB size for getting optimal performance?

gia_mondragon

Microsoft

Feb 14, 2024

The vector index size limits in Azure AI Search are documented here: https://learn.microsoft.com/en-us/azure/search/vector-search-index-size. These limits reflect not only technical constraints (which vary by limit) but also the results of performance testing. Beyond size limits, to get optimal retrieval performance from your RAG app, take a look at the best practices listed here: https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-ai-search-outperforming-vector-search-with-hybrid/ba-p/3929167
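As a rough way to reason about vector index size before checking it against the documented limits, the raw vector data can be estimated as number of vectors times dimensions times bytes per component. The sketch below is only a back-of-the-envelope illustration: the function names, the `overhead_factor` parameter, and the example quota are hypothetical and are not taken from the Azure AI Search documentation, and the actual on-service size also depends on index overhead, so treat the linked page as the authority.

```python
def estimate_raw_vector_bytes(num_vectors: int, dimensions: int,
                              bytes_per_component: int = 4) -> int:
    """Estimate storage for the raw vector data only.

    Assumes each component is stored at a fixed width
    (4 bytes for 32-bit floats). Index overhead is NOT included.
    """
    if num_vectors < 0 or dimensions <= 0 or bytes_per_component <= 0:
        raise ValueError("counts and sizes must be positive")
    return num_vectors * dimensions * bytes_per_component


def fits_quota(num_vectors: int, dimensions: int, quota_bytes: int,
               overhead_factor: float = 1.0) -> bool:
    """Check an estimate against a quota.

    overhead_factor is a hypothetical multiplier for index overhead;
    measure it for your own index rather than relying on a guess.
    """
    estimate = estimate_raw_vector_bytes(num_vectors, dimensions) * overhead_factor
    return estimate <= quota_bytes


if __name__ == "__main__":
    # Example: one million chunks embedded as 1536-dimensional float32 vectors.
    raw = estimate_raw_vector_bytes(1_000_000, 1536)
    print(f"raw vector data: {raw / 2**30:.2f} GiB")

    # Placeholder quota for illustration only, not a real service limit.
    example_quota = 10 * 2**30
    print("fits example quota:", fits_quota(1_000_000, 1536, example_quota))
```

An estimate like this only indicates whether you are near a size limit. Retrieval quality is a separate question, which is what the best-practices link above addresses.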
