Blog Post

Microsoft Foundry Blog
3 MIN READ

Introducing Cohere Rerank 4.0 in Microsoft Foundry

Naomi Moneypenny's avatar
Dec 11, 2025

Today, we’re excited to announce that Cohere Rerank 4.0 Fast and Rerank 4.0 Pro are now available as direct from Azure models in Microsoft Foundry. Faster, More Accurate Retrieval for Enterprise AI

These new retrieval models deliver state-of-the-art accuracy, multilingual coverage across 100+ languages, and breakthrough performance for enterprise search and retrieval-augmented generation (RAG) systems. 

With Rerank 4.0, customers can dramatically improve the quality of search, reduce hallucinations in RAG applications, and strengthen the reasoning capabilities of their AI agents, all with just a few lines of code. 

 

Why Rerank Models Matter for Enterprise AI 

Retrieval is the foundation of grounded AI systems. Whether you are building an internal assistant, a customer-facing chatbot, or a domain-specific knowledge engine, the quality of the retrieved documents determines the quality of the final answer. 

Traditional embeddings get you close, but reranking is what gets you the right answer. 

Rerank improves this step by reading both the query and document together (cross-encoding), producing highly precise semantic relevance scores. This means: 

  • More accurate search results 
  • More grounded responses in RAG pipelines 
  • Lower generative model usage , reducing cost 
  • Higher trust and quality across enterprise workloads 

Introducing Cohere Rerank 4.0 Fast and Rerank 4.0 Pro 

Microsoft Foundry now offers two versions of Rerank 4.0 to meet different enterprise needs: 

 Rerank 4.0 Fast 

  • Best balance of speed and accuracy 
  • Same latency as Cohere Rerank 3.5, with significantly higher accuracy 
  • Ideal for high-traffic applications and real-time systems 

 Rerank 4.0 Pro 

  • Highest accuracy across all benchmarks 
  • Excels at complex, reasoning-heavy, domain-specific retrieval 
  • Tuned for industries like finance, healthcare, manufacturing, government, and energy 

 

Multilingual & Cross-Domain Performance 

Rerank 4.0 delivers unmatched multilingual and cross-domain performance, supporting more than 100 languages and enabling powerful cross-lingual search across complex enterprise datasets. 

The models achieve state-of-the-art accuracy in 10 of the world’s most important business languages, including Arabic, Chinese, French, German, Hindi, Japanese, Korean, Portuguese, Russian, and Spanish, making them exceptionally well suited for global organizations with multilingual knowledge bases, compliance archives, or international operations. 

Effortless Integration: Add Rerank to Any System 

One of the biggest benefits of Rerank 4.0 is how easy it is to adopt. 

You can add reranking to: 

  • Existing enterprise search 
  • Vector DB pipelines 
  • Keyword search systems 
  • Hybrid retrieval setups 
  • RAG architectures 
  • Agent workflows 

No infrastructure changes required. Just a few lines of code.This makes it one of the fastest ways to meaningfully upgrade grounding, precision, and search quality in enterprise AI systems. 

 

Better RAG, Better Agents, Better Outcomes 

In Foundry, customers can pair Cohere Rerank 4.0 with Azure Search, vector databases, Agent Service, Azure Functions, Foundry orchestration, and any LLM—including GPT-4.1, Claude, DeepSeek, and Mistral—to deliver more grounded copilots, higher-fidelity agent actions, and better reasoning from cleaner context windows. This reduces hallucinations, lowers LLM spend, and provides a foundational upgrade for mission-critical AI systems.  

Built for Enterprise: Security, Observability, Governance 

As a direct from Azure model, Rerank 4.0 is fully integrated with: 

  • Azure role-based access control (RBAC) 
  • Virtual network isolation 
  • Customer-managed keys 
  • Logging & observability 
  • Entra ID authentication 
  • Private deployments 

You can run Rerank 4.0 in environments that meet the strictest enterprise security and compliance needs. 

 

Optimized for Enterprise Models & High-Value Industries 

Rerank 4.0 is built for sectors where accuracy matters: 

  • Finance - Delivers precise retrieval for complex disclosures, compliance documents, and regulatory filings. 
  • HealthcareAccurately retrieves clinical notes, biomedical literature, and care protocols for safer, more reliable insights. 
  • ManufacturingSurfaces the right engineering specs, manuals, and parts data to streamline operations and reduce downtime. 
  • Government & Public Sector - Improves access to policy documents, case archives, and citizen service information with semantic precision. 
  • EnergyUnderstands industrial logs, safety manuals, and technical standards to support safer and more efficient operations. 

 

Pricing 

Model Name 

Deployment Type 

Azure Resource Region 

Price /1K Search Units

Availability 

Cohere Rerank 4.0 Pro 

Global Standard 

 All regions (Check this page for region details) 

$2.50

 Public Preview, Dec 11, 2025

Cohere Rerank 4.0 Fast 

Global Standard 

 All regions (Check this page for region details) 

 $2.00

  Public Preview, Dec 11, 2025

 

Get Started Today 

Cohere Rerank 4.0 Fast and Rerank 4.0 Pro are now available in Microsoft Foundry. 

Rerank 4.0 is one of the simplest and highest impact upgrades you can make to your enterprise AI stack, bringing better retrieval, better agents, and more trustworthy AI to every application.

Updated Dec 11, 2025
Version 2.0
No CommentsBe the first to comment