Generative AI and large language models (LLMs) are at the heart of innovation and top of mind for all enterprises. Enterprises are looking to leverage OpenAI models capabilities of content generation, summarization, code generation and semantic search to deliver next generation of user experiences and increase productivity of employees.
"Chat with my data" and "Talk to your docs" are common themes and use cases of semantic search that we discuss with our customers. The most significant challenge for customers across industries is to quickly find the most relevant and accurate information from a vast ocean of knowledge available within their enterprise. In this blog, we will share our perspective on how enterprises can leverage Azure Open AI to develop a platform for smart enterprise knowledge search. We will also discuss other essential components of the platform to build a holistic system which caters for enterprise guardrails.
Enterprise knowledge search / Semantic Search use cases leverage Retrieval Augmented Generation (RAG) pattern that augments and provides relevant content to the LLMs aiding to deliver the outcome defined in the user prompt. In enterprises we have observed implementation of the RAG pattern is repeated by various teams often siloed from each other. Instead, enterprises must aim to build a platform which collates knowledge sources, provides conversational experience to access information & knowledge, standardizes the implementation that adheres to organizational AI Governance processes & practices.
Platform Tenets are the key guiding principles and considerations for defining the technical architecture of the smart enterprise knowledge search platform. Below are few key principles that Platform must deliver to achieve broad adoption, usage and intended value.
Upcoming sections of the platform architecture will address on how the above principles are achieved as part of the implementation of platform components.
The logical architecture view represents the solution components required to build a smart enterprise knowledge search platform which is powered by services such as Azure Open AI, Azure Cognitive Search, AI Content Safety and API management. The components listed below inherently aim to address the platform principles defined in the “Platform Tenets” section.
The content landing zone is where enterprise knowledge sources are collated but logically segregated based on organizational boundaries i.e., line of business (LoB) / product offerings / internal org content etc. The logical segregation is achieved using resource groups which is represented for e.g., as “LoB1 RG” in the architecture view. Below are the key functions of this component:
AI capabilities are better appreciated when they are delivered directly to end users. Teams provides the best interface for surfacing enterprise unified knowledge search for two reasons: Firstly, Teams is the primary business collaboration platform for most enterprises. Secondly, Teams AI library makes it easy to integrate LLM solutions as a pluggable app in Teams. For cases where Teams integration is not trivial, the solutions should be plugged into existing in-house products or business applications.
API Management service provides an entry point for User Interface (Teams, in-house business applications) to integrate with enterprise knowledge search platform. Each UI App will have unique subscription key to identify the application and forwards to the “Search Orchestrator” component as input to determine the relevant knowledge base harvested in the platform that must be served to address the user prompt.
Cosmos DB is used to store the app metadata such as mapping of knowledge sources to the consuming UI App. The metadata is stored separately for each consuming App and must include details of Cognitive Search index, System Prompt (e.g., defining the relevant context, tone, and persona of the AI assistant) and Azure Open AI model deployment endpoints & parameters (e.g., temperature) specific to the consuming App.
The search orchestrator is a key component of the platform which employs AI Orchestrators like Semantic Kernel and Langchain to break down the response flow into tasks, enabling a seamless response from LLM to a user prompt. This component can leverage internal workflows and external skills/plugins for a given prompt. Here are the key tasks performed by the search orchestrator:
Each deployed Azure Open AI model in the platform is associated with the content filtering configuration. The platform can have a default content filtering configuration which applies to all knowledge search use cases. However, in specific end customer facing scenarios the content filtering thresholds and severities can be tailored as appropriate which are associated with a specific GPT model deployment.
Enterprise Knowledge search is a common use case across all enterprises and is best addressed by developing a central platform which delivers on the enterprise guardrails and principles in a consistent manner. In this blog we highlighted essential components required to build enterprise knowledge search platform leveraging the relevant Azure services. Before moving into production, an end-to-end LLMOps implementation is necessary, along with measuring metrics like groundedness, informativeness, performance in latency, and cost of API calls for solutions built on such platforms.
In our upcoming blog, we will discuss platform scalability, specifically quotas and limits of Azure Open AI models. This is crucial when building an enterprise-wide platform to ensure the right level of throughput from deployed models for different consuming apps.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.