We are excited to announce that GPT-4 Turbo with Vision is now available for public preview on Azure OpenAI Service! This advanced multimodal AI model retains all the powerful capabilities of GPT-4 Turbo while introducing the ability to process and analyze image inputs. This provides the opportunity to utilize GPT-4 for a wider range of tasks, including accessibility improvements, visual data interpretation and analysis, and visual question answering (VQA).
All existing Azure OpenAI Service customers now have access to this service. GPT-4 Turbo with Vision can be accessed in the following Azure regions: Australia East, Sweden Central, Switzerland North, and West US.
Additionally, we are releasing curated Azure AI Service enhancements for GPT-4 Turbo with Vision, which introduces an array of advanced functionalities, including:
To deploy GPT-4 Turbo with Vision from the Studio UI, select "GPT-4" and then choose the "vision-preview" version from the dropdown menu. This preview version has a separate quota from the existing GPT-4 versions, which allows you to experiment without affecting your current deployments.
Model |
Input |
Output |
GPT-4 Turbo with Vision1 |
$0.01 per 1000 tokens |
$0.03 per 1000 tokens |
+ Enhanced add-on features for OCR |
$1.50 per 1000 transactions |
|
+ Enhanced add-on features for Object Grounding |
$1.50 per 1000 transactions |
|
+ Enhanced add-on feature for “Add your Image” Image Embedding |
$0.10 per 1000 transactions |
|
+ Enhanced add-on feature for Video prompts integrating Video Retrieval |
$0.05 per minute for indexing $0.25 per 1000 transactions2 |
1GPT-4 Turbo with Vision pricing explained in detail here.
2 Additional input and output tokens for video prompts: Processing videos will involve the use of extra tokens to identify key frames for analysis. The number of these additional tokens will be roughly equivalent to the sum of the tokens in the text input plus 700 tokens.
Guidelines for Crafting Effective System Prompts with GPT-4 Turbo with Vision
To unlock the full potential of GPT-4 Turbo with Vision, it's essential to skillfully tailor system prompt to your specific needs. Here are some guidelines to enhance the accuracy and efficiency of your prompts:
Use Case |
Example System Prompt |
Image Description |
"As an AI assistant, provide a clear, detailed sentence describing the content depicted in this image." |
Image Tagging |
"Identify and list prevalent tags associated with the content of this image." |
Defect Detection |
"Act as a professional defect detector. Compare this test image with a reference image and state 'No defect detected' or 'Defect detected', providing detailed reasoning." |
Car Insurance Damage Report Writing |
"Function as a car insurance and accident expert. Extract detailed information about the car's make, model, damage extent, license plate, airbag deployment status, etc., and present the results in JSON format." |
These guidelines and examples demonstrate how tailored system prompts can significantly enhance the performance of GPT-4 Turbo with Vision, ensuring that the responses are not only accurate but also perfectly suited to the specific context of the task at hand.
The first version of GPT-4 Turbo with Vision, "gpt-4-vision-preview" is in preview and will be replaced with a stable, production-ready release in the coming weeks. Customer deployments using "gpt-4-vision-preview" will be automatically updated to the GA version of GPT-4 Turbo upon the launch of the stable version.
Apply now for access to Azure OpenAI Service
Learn more about GPT-4 Turbo with Vision on Azure OpenAI Service
AI Studio Quickstart: Get started using GPT-4 Turbo with Vision on your images and videos in Azure AI Studio
Azure Open AI Quickstart: Quickstart: Use GPT-4 Turbo with Vision on your images and videos with the Azure Open AI Service
Azure Open AI How-To Guide: How to use the GPT-4 Turbo with Vision model on Azure OpenAI Service
RAG with GPT-4V Turbo with Vision using your own data: Azure OpenAI on your data with images using GPT-4 Turbo with Vision
Use Azure AI Search and GPT-4 Turbo with Vision on your image data (e.g., charts and graphs, like financial reports) using the Retrieval Augmented Generation pattern: GitHub samples repository
GPT-4 Turbo with Vision pricing explained in detail: Text and Image tokens
Responsible AI: Transparency Note for Azure OpenAI Service
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.