The gpt-35-turbo and gpt-4 models have been a catalyst for widespread adoption of large language models (LLMs), and we’ve seen tremendous ingenuity from our customers in virtually every industry. We’re excited to announce a few updates to the Azure OpenAI Service:
New versions of gpt-35-turbo and gpt-4 are now generally available.
Price reductions for our most popular models – 75% lower pricing for the state-of-the-art embeddings model and 25% lower pricing on prompt tokens for gpt-35-turbo.
New Models in Azure OpenAI
New versions of both gpt-35-turbo and gpt-4 are now available in Azure OpenAI.
Updates to gpt-35-turbo
There are two new versions of gpt-35-turbo:
gpt-35-turbo (version 0613)
gpt-35-turbo-16k (version 0613)
One of the biggest updates with the 0613 version is enhanced steerability via the system message. This means that the model is better at following the instructions that you outline in the system message which gives you the ability to further customize the behavior of the model or set guardrails to ensure the model is used responsibly.
The 16k version of the model, gpt-35-turbo-16k, offers 4x the context length of the previous gpt-35-turbo model which will help you unlock scenarios requiring longer prompts.
Both new gpt-35-turbo models will now have separate prices for prompt tokens and completion tokens:
$0.0015 per 1,000 tokens
$0.002 per 1,000 tokens
$0.003 per 1,000 tokens
$0.004 per 1,000 tokens
Updates to gpt-4
Both gpt-4 and gpt-4-32k also have a new and improved 0613 model version available that come with the same enhanced steerability. All gpt-4 models are still available by request only. Existing Azure OpenAI customers can apply by filling out this form.
Function calling will be coming soon and will only be available for the new model versions announced today.
To make handling model updates easier, we’ve made some important updates to Azure OpenAI Service. Now when you create a deployment, you can choose whether you want to have the model auto-update to the default version of the model or if you’d like to be pinned to a particular version. You can also go into the Azure AI Studio to update the model version for your deployment at any time.
If you’ve set your deployment to “auto-update to default”, you’ll automatically be upgraded to the new 0613 version of the model in two weeks. Otherwise, your deployment won’t be automatically updated until the model version you’re using is deprecated. You can also choose to set your deployment to never auto-upgrade via our API, although this isn’t recommended as the deployment will start returning 400s after the deprecation date.
When you use Azure services, you are entrusting us with one of your most valuable assets—your data. You trust that the privacy and confidentiality of the data you store and process in Azure services will be protected and that it will be used only in a way that is consistent with your expectations. This means that when you use Azure OpenAI, your data including your prompts and completions, embeddings, training data and finetuned models:
Are never used to improve OpenAI models and is never made available to 3rd parties.
Are never used to improve any Microsoft or 3rd party products or services.
Models you finetune are available exclusively for your use.