Need Guidance on cost breakdown of Microsoft Foundry Agent portal I created

Question

I have developed a complaint handling portal for customers and employees using Azure AI Foundry. The solution is built with Foundry agents, models from the catalog, input/output caching, agent logging/tracing, and other Foundry capabilities. The frontend and orchestration layer are deployed on Azure Container Apps.

While Azure Cost Analysis provides an overview of spending, several parts remain a black box when it comes to accurate estimation, including:

Token consumption assumptions (input/output tokens across different models and agents)
User concurrency, sessions, and behavior patterns
Agent logging and observability costs
Impact of input/output caching
Detailed resource consumption and billing in Azure Container Apps

What is the best way to accurately calculate or estimate the total running cost for such an Azure AI Foundry-based platform with a Container Apps frontend?

Are there official Microsoft documentation, pricing guides, or reference architectures for cost breakdown? How do companies typically present costs for such AI platforms to attract customers (e.g., TCO models or per-user pricing)? I want to know how the platform costs are shown to customers.

Thank you.

surya_narayana · Answer

Hi Tasmia_Monzoor,

This is a very common challenge with AI Foundry solutions right now. Azure Cost Analysis gives overall spend, but detailed AI-agent cost attribution is still not very transparent.

For a Foundry + Container Apps architecture, the main cost drivers are usually:

Model token usage (input/output tokens)
Number of agent calls/tool executions
Concurrent users & session duration
Container Apps scaling (CPU/memory replicas)
Logging/tracing/Application Insights ingestion
Vector/search/storage components
Caching effectiveness

For estimating costs more accurately, most teams combine:

Azure Pricing Calculator
Azure Monitor + Application Insights metrics
Token usage telemetry from models/endpoints
Load testing for concurrency/session patterns

A practical approach is to calculate:

Cost per request/conversation
Average tokens per interaction
Expected monthly active users/concurrency
Infrastructure baseline (Container Apps minimum replicas, monitoring, storage, etc.)

For customer-facing pricing, companies typically present:

Per-user/month
Per-conversation/request
Tiered usage bundles
Or platform + consumption-based pricing

And internally they build a TCO model including:

AI inference
Hosting
Observability
Support/operations
Buffer for scaling peaks

Microsoft does have useful references across:

Azure AI Foundry pricing docs
Azure OpenAI pricing
Azure Container Apps pricing
Well-Architected Framework (Cost Optimization pillar)

But today, there’s still some manual estimation involved, especially around agent orchestration and token behavior.
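To make the per-conversation approach above a bit more concrete, here is a rough Python sketch of how such a model could be put together. Everything in it is an assumption for illustration: the class and function names are hypothetical, the prices are made-up placeholders (not real Azure rates), and the infrastructure baseline is a single number you would fill in from your own Cost Analysis and telemetry data.

```python
from dataclasses import dataclass


@dataclass
class ModelRates:
    """Placeholder per-1,000-token prices in USD. NOT real Azure rates."""
    input_per_1k: float
    output_per_1k: float
    cached_input_per_1k: float  # assumed discounted rate for cached input


@dataclass
class Workload:
    """Assumed usage figures; take these from telemetry or load tests."""
    monthly_active_users: int
    conversations_per_user: float
    input_tokens_per_conversation: int
    output_tokens_per_conversation: int
    cache_hit_ratio: float  # share of input tokens served from cache, 0..1


def cost_per_conversation(rates: ModelRates, w: Workload) -> float:
    """Token cost of one average conversation, split by cached/uncached input."""
    cached = w.input_tokens_per_conversation * w.cache_hit_ratio
    uncached = w.input_tokens_per_conversation - cached
    return (
        uncached / 1000 * rates.input_per_1k
        + cached / 1000 * rates.cached_input_per_1k
        + w.output_tokens_per_conversation / 1000 * rates.output_per_1k
    )


def monthly_estimate(rates: ModelRates, w: Workload,
                     infra_baseline: float, peak_buffer: float = 0.2) -> dict:
    """Combine inference cost, a fixed infrastructure baseline, and a buffer.

    infra_baseline: assumed monthly cost of Container Apps minimum replicas,
    monitoring, storage, and so on. peak_buffer: fraction added for peaks.
    """
    conversations = w.monthly_active_users * w.conversations_per_user
    inference = conversations * cost_per_conversation(rates, w)
    total = (inference + infra_baseline) * (1 + peak_buffer)
    return {
        "inference": inference,
        "infrastructure": infra_baseline,
        "total": total,
        "per_user": total / w.monthly_active_users,
    }


# Example with invented numbers only.
example_rates = ModelRates(input_per_1k=0.002, output_per_1k=0.008,
                           cached_input_per_1k=0.001)
example_workload = Workload(monthly_active_users=1000,
                            conversations_per_user=10,
                            input_tokens_per_conversation=4000,
                            output_tokens_per_conversation=1000,
                            cache_hit_ratio=0.5)
print(monthly_estimate(example_rates, example_workload, infra_baseline=200.0))
```

The per_user figure is what would typically feed a per-user/month price, and the per-conversation cost can feed a consumption-based tier. With several models or agents, you could run the same calculation per model and sum the inference parts.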
