Blog Post

Azure Integration Services Blog
2 MIN READ

Autoscaling Now Available in Azure API Management v2 Tiers

Sreekanth_Thirthala's avatar
Jun 16, 2025

We’re excited to share a major update to Azure API Management: the release of gateway-level metrics and native autoscaling in v2 tiers—Basic v2, Standard v2, and Premium v2.

Gateway-Level Metrics: Deep Insight into Performance

Azure API Management now exposes fine-grained metrics for each Azure API management v2 gateway instance, giving you more control and observability. These enhancements give you deeper visibility into your infrastructure and the ability to scale automatically based on real-time usage—without manual effort.

Key Gateway Metrics

  • CPU Percentage of Gateway – Available in Basic v2, Standard v2, and Premium v2
  • Memory Percentage of Gateway – Available in Basic v2 and Standard v2

These metrics are essential for performance monitoring, diagnostics, and intelligent scaling.

Native Autoscaling: Adaptive, Metric-Driven Scaling

With gateway-level metrics in place, Azure Monitor autoscale rules can now drive automatic scaling of Azure API Management v2 gateways.

How It Works

You define scaling rules that automatically increase or decrease gateway instances based on:

  • CPU percentage 
  • Memory percentage (for Basic v2 and Standard v2)

Autoscale evaluates these metrics against your thresholds and acts accordingly, eliminating the need for manual scaling or complex scripts.

Benefits of Autoscaling in Azure API management v2 tiers

Autoscaling in Azure API Management brings several critical benefits for operational resilience, efficiency, and cost control:

Reliability

Maintain consistent performance by automatically scaling out during periods of high traffic. Your APIs stay responsive and available—even under sudden load spikes.

Operational Efficiency

Automated scaling eliminates manual, error-prone intervention. This allows teams to focus on innovation, not infrastructure management.

Cost Optimization

When traffic drops, auto scale automatically scales in to reduce the number of gateway instances—helping you save on infrastructure costs without sacrificing performance.

Use Case Highlights

Autoscaling is ideal for:

  • APIs with unpredictable or seasonal traffic
  • Enterprise systems needing automated resiliency
  • Teams seeking cost control and governance
  • Premium environments that demand always-on performance

Get Started Today

Enabling autoscaling is easy via the Azure Portal:

  1. Open your API Management instance
  2. Go to Settings > Scale out (Autoscale)
  3. Enable autoscaling and define rules using gateway metrics
  4. Monitor performance in real time via Azure Monitor

Configuration walkthrough: Autoscale your Azure API Management v2 instance

Published Jun 16, 2025
Version 1.0
No CommentsBe the first to comment