Blog Post

AI - Azure AI services Blog
3 MIN READ

Exciting Enhancements in Azure OpenAI Service: Provisioned Deployment and Global Standard Support

AleRico's avatar
AleRico
Former Employee
Nov 19, 2024

We are delighted to bring you two major updates that will significantly enhance your experience with Azure OpenAI Service. First, we are introducing Provisioned deployment support for fine-tuned models, ensuring consistent and reliable AI performance. Secondly, we are rolling out Global Standard support for fine-tuned model inferencing, offering greater flexibility and availability. Let’s dive into what these exciting updates mean for you.

Provisioned Deployment Support for Fine-Tuned Models 

Starting from December, Azure OpenAI Service will offer Provisioned deployment support for fine-tuned models. This new feature ensures that your AI workloads receive the necessary compute resources, providing fast and consistent results regardless of their demands. Whether you’re building a chatbot to handle thousands of users or running complex batch processes, this feature can elevate your application to the next level. 

Models Supported: GPT-4o, GPT-4o-mini 

Regions Supported (Public Preview): North Central US and Sweden Central 

What is Provisioned Throughput? 

Imagine hosting a party where you have a private table just for your guests, separate from the general buffet. Provisioned Throughput is similar for your AI models. It reserves dedicated compute resources in Azure OpenAI Service, so your models don’t have to compete with others for power, resulting in consistent performance. 

What’s New About This Offering? 

Provisioned deployment for fine-tuned models offers the same features as the existing Provisioned deployment for base models, with the added option to deploy a fine-tuned model with the Provisioned SKU.  Overall, provisioned deployment for fine-tuning offers customers the benefits of consistent performance, predictable pricing, cost-effective scaling, and the ability to handle high throughput requirements, making it a valuable option for those looking to optimize their AI model deployments. 

How to Get Started? 

The public preview will be available in early December. Stay tuned for more updates! 

Global Standard Support for Fine-Tuned Model Inferencing 

We are thrilled to announce that Global Standard deployment support for Azure OpenAI Service fine-tuned model inferencing will be available as a Public Preview starting early December. This launch provides greater flexibility and higher availability for our valued customers. 

What Motivated Us? 

Until now, Azure OpenAI Service fine-tuning was limited to specific regions, creating challenges in managing resources and capacity for training and inferencing. Customers often had to copy their fine-tuned models across different regions, which was not ideal. We understand the inconvenience and are excited to address it with this new feature. 

What’s in It for You? 

  • Expanded Regional Support: Azure OpenAI fine-tune capabilities will now be available in additional regions, including East US, West US 3, and various European regions. 
  • Increased Flexibility: You will no longer be constrained by regional capacity limits, allowing for seamless growth in your fine-tuned model usage. 
  • Enhanced Backup and Disaster Recovery: Improved backup and disaster recovery options will be available with multi-LoRA endpoints for a base model enabled across multiple regions. 

 

 

The launch of Provisioned deployment support and Global Standard support for Azure OpenAI fine-tuned model inferencing marks a significant step forward. By ensuring consistent AI performance and addressing regional capacity limits, we aim to provide you with a smoother, more flexible experience. We’re excited to see the positive impact these updates will have on your operations. 

Stay tuned for more updates as we continue to innovate and expand our services. Thank you for being on this journey with us! 

Ready to Get Started? 

  • Watch this Ignite session about new fine-tuning capabilities in Azure OpenAI Service 

 

Updated Nov 20, 2024
Version 2.0
No CommentsBe the first to comment