Today, we’re excited to announce that Cohere Rerank 4.0 Fast and Rerank 4.0 Pro are now available as direct from Azure models in Microsoft Foundry. Faster, More Accurate Retrieval for Enterprise AI
These new retrieval models deliver state-of-the-art accuracy, multilingual coverage across 100+ languages, and breakthrough performance for enterprise search and retrieval-augmented generation (RAG) systems.
With Rerank 4.0, customers can dramatically improve the quality of search, reduce hallucinations in RAG applications, and strengthen the reasoning capabilities of their AI agents, all with just a few lines of code.
Why Rerank Models Matter for Enterprise AI
Retrieval is the foundation of grounded AI systems. Whether you are building an internal assistant, a customer-facing chatbot, or a domain-specific knowledge engine, the quality of the retrieved documents determines the quality of the final answer.
Traditional embeddings get you close, but reranking is what gets you the right answer.
Rerank improves this step by reading both the query and document together (cross-encoding), producing highly precise semantic relevance scores. This means:
- More accurate search results
- More grounded responses in RAG pipelines
- Lower generative model usage , reducing cost
- Higher trust and quality across enterprise workloads
Introducing Cohere Rerank 4.0 Fast and Rerank 4.0 Pro
Microsoft Foundry now offers two versions of Rerank 4.0 to meet different enterprise needs:
Rerank 4.0 Fast
- Best balance of speed and accuracy
- Same latency as Cohere Rerank 3.5, with significantly higher accuracy
- Ideal for high-traffic applications and real-time systems
Rerank 4.0 Pro
- Highest accuracy across all benchmarks
- Excels at complex, reasoning-heavy, domain-specific retrieval
- Tuned for industries like finance, healthcare, manufacturing, government, and energy
Multilingual & Cross-Domain Performance
Rerank 4.0 delivers unmatched multilingual and cross-domain performance, supporting more than 100 languages and enabling powerful cross-lingual search across complex enterprise datasets.
The models achieve state-of-the-art accuracy in 10 of the world’s most important business languages, including Arabic, Chinese, French, German, Hindi, Japanese, Korean, Portuguese, Russian, and Spanish, making them exceptionally well suited for global organizations with multilingual knowledge bases, compliance archives, or international operations.
Effortless Integration: Add Rerank to Any System
One of the biggest benefits of Rerank 4.0 is how easy it is to adopt.
You can add reranking to:
- Existing enterprise search
- Vector DB pipelines
- Keyword search systems
- Hybrid retrieval setups
- RAG architectures
- Agent workflows
No infrastructure changes required. Just a few lines of code.This makes it one of the fastest ways to meaningfully upgrade grounding, precision, and search quality in enterprise AI systems.
Better RAG, Better Agents, Better Outcomes
In Foundry, customers can pair Cohere Rerank 4.0 with Azure Search, vector databases, Agent Service, Azure Functions, Foundry orchestration, and any LLM—including GPT-4.1, Claude, DeepSeek, and Mistral—to deliver more grounded copilots, higher-fidelity agent actions, and better reasoning from cleaner context windows. This reduces hallucinations, lowers LLM spend, and provides a foundational upgrade for mission-critical AI systems.
Built for Enterprise: Security, Observability, Governance
As a direct from Azure model, Rerank 4.0 is fully integrated with:
- Azure role-based access control (RBAC)
- Virtual network isolation
- Customer-managed keys
- Logging & observability
- Entra ID authentication
- Private deployments
You can run Rerank 4.0 in environments that meet the strictest enterprise security and compliance needs.
Optimized for Enterprise Models & High-Value Industries
Rerank 4.0 is built for sectors where accuracy matters:
- Finance - Delivers precise retrieval for complex disclosures, compliance documents, and regulatory filings.
- Healthcare- Accurately retrieves clinical notes, biomedical literature, and care protocols for safer, more reliable insights.
- Manufacturing- Surfaces the right engineering specs, manuals, and parts data to streamline operations and reduce downtime.
- Government & Public Sector - Improves access to policy documents, case archives, and citizen service information with semantic precision.
- Energy- Understands industrial logs, safety manuals, and technical standards to support safer and more efficient operations.
Pricing
|
Model Name |
Deployment Type |
Azure Resource Region |
Price /1K Search Units |
Availability |
|
Cohere Rerank 4.0 Pro |
Global Standard |
All regions (Check this page for region details) |
$2.50 |
Public Preview, Dec 11, 2025 |
|
Cohere Rerank 4.0 Fast |
Global Standard |
All regions (Check this page for region details) |
$2.00 |
Public Preview, Dec 11, 2025 |
Get Started Today
Cohere Rerank 4.0 Fast and Rerank 4.0 Pro are now available in Microsoft Foundry.
Rerank 4.0 is one of the simplest and highest impact upgrades you can make to your enterprise AI stack, bringing better retrieval, better agents, and more trustworthy AI to every application.