Event details
Retrieval-augmented generation (RAG) allows you to build GenAI applications that use your own data, to optimize LLM performance.
Join our AMA to ask us about RAG, vector databases, running RAG...
EricStarker
Updated Feb 14, 2024
gyangupta
Feb 14, 2024Copper Contributor
Along with LLM model size, what is the limitation for Vector DB size to get optimal performance?
gia_mondragon
Microsoft
Feb 14, 2024The vector size limits in AI Search can be found here: https://learn.microsoft.com/en-us/azure/search/vector-search-index-size. Limits are set based not only on technical limitations (depending on the limit) but also based on performance testing.
However, to obtain optimal retrieval performance from your RAG app, you can take a look at best practices listed here: https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-ai-search-outperforming-vector-search-with-hybrid/ba-p/3929167