<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>Azure Arc Blog articles</title>
    <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/bg-p/AzureArcBlog</link>
    <description>Azure Arc Blog articles</description>
    <pubDate>Thu, 04 Jun 2026 05:11:12 GMT</pubDate>
    <dc:creator>AzureArcBlog</dc:creator>
    <dc:date>2026-06-04T05:11:12Z</dc:date>
    <item>
      <title>Your first model deployment on Foundry Local on Azure Local: from catalog to inference in 10 minutes</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/your-first-model-deployment-on-foundry-local-on-azure-local-from/ba-p/4520640</link>
      <description>&lt;P&gt;Foundry Local on Azure Local lets you run open-source models directly on your own Azure Local cluster, behind an OpenAI-compatible API. It's the same experience you've gotten used to in the cloud, but the inference runs on hardware you own.&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Foundry Local on Azure Local is in public preview at the time of this writing.&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;You've installed Foundry Local on your Azure Local cluster. The operator's pods are running, the CRDs are registered, you've checked it twice with &lt;CODE&gt;kubectl get pods&lt;/CODE&gt;. Now what?&lt;/P&gt;
&lt;P&gt;This blog covers the part that comes right before all of that - the lifecycle pattern you'll use to deploy any model on Foundry Local on Azure Local. Our recent &lt;A class="lia-external-url" href="https://aka.ms/build26blog" target="_blank" rel="noopener" data-lia-auto-title="announcement" data-lia-auto-title-active="0"&gt;announcement&lt;/A&gt; covers the bigger picture: multi-node inference, vLLM as a first-class runtime alongside ONNX-GenAI, and an expanded catalog. We'll keep this walkthrough single-node for clarity, but the same &lt;CODE&gt;ModelDeployment&lt;/CODE&gt; pattern scales without changes to your client code or workflow.&lt;/P&gt;
&lt;P&gt;By the end of this walkthrough, you'll have gone from an empty &lt;CODE&gt;kubectl&lt;/CODE&gt; prompt to a working, OpenAI-compatible inference endpoint serving Phi-4. All in about ten minutes, using nothing but&amp;nbsp;&lt;CODE&gt;kubectl&lt;/CODE&gt;, Python, and a small sample script. We'll also show you how to switch that same flow to the new vLLM runtime by changing roughly five lines of YAML.&lt;/P&gt;
&lt;P&gt;All the code lives in &lt;A class="lia-external-url" href="https://github.com/Azure-Samples/foundry-local-model-catalog" target="_blank" rel="noopener"&gt;Azure-Samples/foundry-local-model-catalog&lt;/A&gt;. Clone it and follow along.&lt;/P&gt;
&lt;H2&gt;What you'll build&lt;/H2&gt;
&lt;P&gt;The sample walks through five steps, each driven by the same Python script with different flags:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;Query the model catalog -&lt;/STRONG&gt;&amp;nbsp;read the ConfigMap the operator syncs from the Microsoft Foundry catalog API.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Deploy a model&lt;/STRONG&gt; - create a &lt;CODE&gt;ModelDeployment&lt;/CODE&gt; custom resource pointing at one catalog entry.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Wait for ready&lt;/STRONG&gt; - the operator pulls the model image, schedules pods, and reports state.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Run inference&lt;/STRONG&gt; - call the OpenAI-compatible &lt;CODE&gt;/v1/chat/completions&lt;/CODE&gt; endpoint with an API key the operator generates for you.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Clean up&lt;/STRONG&gt; - delete the deployment.&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;The same five steps apply whether you're serving an ONNX model on a CPU node or a vLLM model on a GPU node. We'll start with the simpler path: Phi-4 on CPU on the ONNX runtime, then show you the vLLM variant at the end.&lt;/P&gt;
&lt;H2&gt;Before you start&lt;/H2&gt;
&lt;P&gt;You'll need:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;An Azure Local cluster (or any Arc-enabled Kubernetes cluster) with the Foundry Local extension installed. If you haven't set this up yet, the Foundry Local on Azure Local install guide walks through the cluster, extension, and resource requirements.&lt;/LI&gt;
&lt;LI&gt;&lt;CODE&gt;kubectl&lt;/CODE&gt; configured against that cluster, with permissions to read ConfigMaps and Secrets and to create &lt;CODE&gt;ModelDeployment&lt;/CODE&gt; resources in the &lt;CODE&gt;foundry-local-operator&lt;/CODE&gt; namespace.&lt;/LI&gt;
&lt;LI&gt;Python 3.9 or later.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Verify the operator is alive before you go further:&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;kubectl get pods -n foundry-local-operator
kubectl get crd | grep foundry&lt;/LI-CODE&gt;
&lt;P&gt;You should see operator pods in &lt;CODE&gt;Running&lt;/CODE&gt; state and at least one CRD named &lt;CODE&gt;modeldeployments.foundrylocal.azure.com&lt;/CODE&gt;. If you don't, the install docs are the right place to back up to.&lt;/P&gt;
&lt;P&gt;Then clone and install:&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;git clone https://github.com/Azure-Samples/foundry-local-model-catalog.git &amp;amp;&amp;amp; cd foundry-local-model-catalog &amp;amp;&amp;amp; python3 -m venv .venv &amp;amp;&amp;amp; source .venv/bin/activate &amp;amp;&amp;amp; pip install -r requirements.txt&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H2&gt;Step 1 - See what's in your catalog&lt;/H2&gt;
&lt;P&gt;Start with the lowest-risk command in the sample: it talks to the cluster but changes nothing.&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;python catalog_sample.py --catalog-only&lt;/LI-CODE&gt;&lt;img /&gt;
&lt;P&gt;A few things to notice in that table:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Most models appear more than once. The same model gets packaged for different runtime/hardware combinations, and the operator picks the right container image based on which entry you reference.&lt;/LI&gt;
&lt;LI&gt;There's a &lt;CODE&gt;RUNTIME&lt;/CODE&gt; column. We'll come back to vLLM at the end of this post; for now, the default &lt;CODE&gt;Phi-4-generic-cpu&lt;/CODE&gt; ONNX entry is what we'll deploy.&lt;/LI&gt;
&lt;LI&gt;The catalog itself is just a ConfigMap. No magic, no hidden registry. The operator syncs it from the Microsoft Foundry catalog API on a CronJob, and the sample reads it the same way you would:&lt;/LI&gt;
&lt;/UL&gt;
&lt;LI-CODE lang="bash"&gt;kubectl get configmap foundry-local-catalog -n foundry-local-operator -o yaml&lt;/LI-CODE&gt;
&lt;P&gt;If you ever wonder what's actually available on your cluster, that's the source of truth.&lt;/P&gt;
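&lt;P&gt;If you want to script that check, the same read works from Python. This is a minimal sketch rather than the sample's actual code: it assumes only that the ConfigMap keeps its entries under the standard Kubernetes &lt;CODE&gt;data&lt;/CODE&gt; field, and the helper names &lt;CODE&gt;configmap_keys&lt;/CODE&gt; and &lt;CODE&gt;fetch_catalog_json&lt;/CODE&gt; are hypothetical. The layout of the individual catalog entries isn't covered here, so the sketch lists keys without interpreting their values.&lt;/P&gt;

```python
import json
import subprocess


def configmap_keys(configmap_json: str) -> list:
    """Return the sorted keys under a ConfigMap's `data` field."""
    doc = json.loads(configmap_json)
    # `data` can be missing on an empty ConfigMap, so fall back to {}.
    return sorted((doc.get("data") or {}).keys())


def fetch_catalog_json() -> str:
    """Run the kubectl command from this step, asking for JSON output."""
    result = subprocess.run(
        [
            "kubectl", "get", "configmap", "foundry-local-catalog",
            "-n", "foundry-local-operator", "-o", "json",
        ],
        check=True,           # raise if kubectl exits non-zero
        capture_output=True,  # collect stdout instead of printing it
        text=True,
    )
    return result.stdout
```

&lt;P&gt;Calling &lt;CODE&gt;configmap_keys(fetch_catalog_json())&lt;/CODE&gt; against a live cluster prints whatever keys the operator has synced. The parsing is split from the &lt;CODE&gt;kubectl&lt;/CODE&gt; call so it can be tested without a cluster.&lt;/P&gt;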
&lt;H2&gt;Step 2 - Deploy a model&lt;/H2&gt;
&lt;P&gt;Now the side-effecting part. We're going to ask the operator to deploy Phi-4 on CPU:&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;python catalog_sample.py --deploy-only&lt;/LI-CODE&gt;
&lt;P&gt;Under the hood, the sample builds and applies a &lt;CODE&gt;ModelDeployment&lt;/CODE&gt; manifest that looks like this:&lt;/P&gt;
&lt;LI-CODE lang="yaml"&gt;apiVersion: foundrylocal.azure.com/v1
kind: ModelDeployment
metadata:
  name: phi-4-generic-cpu
  namespace: foundry-local-operator
spec:
  model:
    catalog:
      name: Phi-4-generic-cpu
  workloadType: generative
  compute: cpu
  replicas: 1
  port: 5000&lt;/LI-CODE&gt;
&lt;P&gt;The operator takes it from there. It pulls the model container image, schedules a pod, generates an API-key Secret named &lt;CODE&gt;phi-4-generic-cpu-api-keys&lt;/CODE&gt;, and walks the deployment through &lt;CODE&gt;Pending&lt;/CODE&gt; → &lt;CODE&gt;Creating&lt;/CODE&gt; → &lt;CODE&gt;Running&lt;/CODE&gt; states. The sample polls until both &lt;CODE&gt;status.state == Running&lt;/CODE&gt; and &lt;CODE&gt;status.deploymentReady == true&lt;/CODE&gt;.&lt;/P&gt;
&lt;img /&gt;&lt;LI-SPOILER label="Heads up!"&gt;
&lt;P&gt;The first deployment of a given model pulls the model image, which can take a few minutes depending on model size and network speed. The image is cached on a PersistentVolume, so the second deploy of the same model is much faster.&lt;/P&gt;
&lt;/LI-SPOILER&gt;
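&lt;P&gt;The readiness check from this step is easy to reproduce in your own automation. The sketch below is an illustration, not the sample's implementation: the function names, the timeout, and the polling interval are assumptions, and &lt;CODE&gt;fetch&lt;/CODE&gt; stands in for whatever call returns the &lt;CODE&gt;ModelDeployment&lt;/CODE&gt; as a dictionary. Only the two status fields named above are taken from this walkthrough.&lt;/P&gt;

```python
import time
from typing import Callable


def is_ready(resource: dict) -> bool:
    """True when status.state is Running and status.deploymentReady is true."""
    status = resource.get("status") or {}
    return status.get("state") == "Running" and status.get("deploymentReady") is True


def wait_until_ready(fetch: Callable[[], dict], timeout_s: float = 600.0,
                     interval_s: float = 5.0) -> dict:
    """Poll `fetch` until the resource is ready or the timeout expires."""
    deadline = time.monotonic() + timeout_s
    while True:
        resource = fetch()
        if is_ready(resource):
            return resource
        # Check the deadline after each fetch so at least one attempt runs.
        if time.monotonic() >= deadline:
            raise TimeoutError("ModelDeployment did not become ready in time")
        time.sleep(interval_s)
```

&lt;P&gt;Passing the fetch function in as a parameter keeps the loop independent of how you talk to the cluster, and makes it testable with canned responses.&lt;/P&gt;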
&lt;H2&gt;Step 3 - Run inference&lt;/H2&gt;
&lt;P&gt;The endpoint is up. Time to actually use it.&lt;/P&gt;
&lt;P&gt;If you're running this script from inside the cluster (say, from a debug pod), the endpoint lives at the in-cluster service DNS and the sample picks that up automatically. Most readers will be running from a laptop, though, so we'll cover that path explicitly.&lt;/P&gt;
&lt;P&gt;In one terminal, port-forward the deployment's service:&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;kubectl port-forward svc/phi-4-generic-cpu 5000:5000 -n foundry-local-operator&lt;/LI-CODE&gt;
&lt;P&gt;In another, run the sample's inference-only mode against the forwarded endpoint:&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;python catalog_sample.py --infer-only --endpoint https://localhost:5000 --insecure
&lt;/LI-CODE&gt;
&lt;P&gt;The sample reads the API key from the auto-generated Secret and sends it as &lt;CODE&gt;Authorization: Bearer &amp;lt;key&amp;gt;&lt;/CODE&gt;, the same pattern as cloud OpenAI.&lt;/P&gt;
&lt;LI-SPOILER label="Heads up!"&gt;
&lt;P&gt;The &lt;CODE&gt;--insecure&lt;/CODE&gt; flag bypasses self-signed TLS. Together with &lt;CODE&gt;kubectl port-forward&lt;/CODE&gt;, this is a smoke-test setup. For real traffic, expose the deployment via Ingress or LoadBalancer.&lt;/P&gt;
&lt;/LI-SPOILER&gt;
&lt;P&gt;The request body is the standard OpenAI chat-completions shape:&lt;/P&gt;
&lt;LI-CODE lang="json"&gt;{
  "model": "Phi-4-generic-cpu:1.0.0",
  "messages": [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the capital of France? Reply in one sentence."}
  ],
  "max_tokens": 256
}&lt;/LI-CODE&gt;&lt;img /&gt;
&lt;P&gt;That response just traveled from your terminal, through&amp;nbsp;&lt;CODE&gt;kubectl port-forward&lt;/CODE&gt;, into a model serving inside your Azure Local cluster, and back.&lt;/P&gt;
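&lt;P&gt;If you'd rather call the endpoint from your own code, the pieces described in this step fit in a few lines of standard-library Python. Treat this as a hedged sketch, not the sample's code: the helper names are hypothetical, and the key name inside the API-key Secret isn't documented in this post, so &lt;CODE&gt;decode_secret_data&lt;/CODE&gt; returns every decoded entry and leaves the choice to you. It relies only on the fact that Kubernetes stores Secret values base64-encoded under &lt;CODE&gt;data&lt;/CODE&gt;.&lt;/P&gt;

```python
import base64
import json
import urllib.request


def decode_secret_data(secret_json: str) -> dict:
    """Decode the base64 values in a Kubernetes Secret's `data` field."""
    doc = json.loads(secret_json)
    return {
        name: base64.b64decode(value).decode("utf-8")
        for name, value in (doc.get("data") or {}).items()
    }


def build_chat_request(endpoint: str, api_key: str, model: str,
                       user_prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST to /v1/chat/completions."""
    body = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_prompt},
        ],
        "max_tokens": 256,
    }
    return urllib.request.Request(
        url=endpoint.rstrip("/") + "/v1/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": "Bearer " + api_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

&lt;P&gt;Sending the request is a call to &lt;CODE&gt;urllib.request.urlopen&lt;/CODE&gt;. With the self-signed certificate from the port-forward setup, that call needs an SSL context that skips verification, which is only appropriate for smoke tests.&lt;/P&gt;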
&lt;H2&gt;Step 4 - Clean up&lt;/H2&gt;
&lt;P&gt;The sample's default flow deletes the deployment on its way out, so if you ran the full &lt;CODE&gt;python catalog_sample.py&lt;/CODE&gt; (no flags) you're already clean. If you used &lt;CODE&gt;--deploy-only&lt;/CODE&gt; or &lt;CODE&gt;--skip-cleanup&lt;/CODE&gt;, drop it explicitly:&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;kubectl delete modeldeployment phi-4-generic-cpu -n foundry-local-operator&lt;/LI-CODE&gt;
&lt;P&gt;The operator garbage-collects the pod, service, and API-key Secret. The cached model image stays on its PersistentVolume, so your next deploy of the same model skips the image pull.&lt;/P&gt;
&lt;H2&gt;Going beyond CPU: vLLM in 5 lines&lt;/H2&gt;
&lt;P&gt;Now for the variant we've been pointing at. The same &lt;CODE&gt;ModelDeployment&lt;/CODE&gt; CR, the same OpenAI-compatible endpoint, switched to a runtime built for concurrent users, demonstrated on a popular open-source model.&lt;/P&gt;
&lt;P&gt;The diff against the manifest from Step 2:&lt;/P&gt;
&lt;LI-CODE lang="yaml"&gt; spec:
-  compute: cpu
+  compute: gpu
+  runtime: vllm
   model:
     catalog:
-      name: Phi-4-generic-cpu
+      name: Mistral-7B-v0.2&lt;/LI-CODE&gt;
&lt;P&gt;The same sample script handles it:&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;python catalog_sample.py \
  --model Mistral-7B-v0.2 \
  --compute gpu \
  --runtime vllm&lt;/LI-CODE&gt;
&lt;P&gt;What you get back is the same OpenAI-compatible endpoint your client code already knows, but now backed by vLLM's PagedAttention, continuous batching, and automatic planner-tuned configuration. Your application code doesn't need to know any of that; it's still &lt;CODE&gt;POST /v1/chat/completions&lt;/CODE&gt; with a Bearer token.&lt;/P&gt;
&lt;P&gt;That's the point of the lifecycle pattern: ONNX or vLLM, CPU or GPU, the platform engineer's deployment loop looks the same. For the architectural "why" behind multi-node, vLLM, and the expanded catalog, read the &lt;A class="lia-external-url" href="https://aka.ms/FoundryLoca_Techcommunity_Build_blog" target="_blank" rel="noopener"&gt;announcement&lt;/A&gt;.&lt;/P&gt;
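&lt;P&gt;To make that concrete, here is a minimal sketch of a manifest builder that covers both variants. It is not the sample's own builder: the function name is hypothetical, and the nesting of fields under &lt;CODE&gt;spec&lt;/CODE&gt; is an assumption inferred from the manifest in Step 2 and the diff above, so check it against the CRD reference before relying on it.&lt;/P&gt;

```python
from typing import Optional


def build_manifest(name: str, catalog_name: str, compute: str = "cpu",
                   runtime: Optional[str] = None) -> dict:
    """Build a ModelDeployment manifest as a plain dictionary."""
    spec = {
        # Assumed layout: the catalog entry is nested under model.catalog.
        "model": {"catalog": {"name": catalog_name}},
        "workloadType": "generative",
        "compute": compute,
        "replicas": 1,
        "port": 5000,
    }
    # Only set `runtime` when asked, so the CPU manifest matches Step 2.
    if runtime is not None:
        spec["runtime"] = runtime
    return {
        "apiVersion": "foundrylocal.azure.com/v1",
        "kind": "ModelDeployment",
        "metadata": {"name": name, "namespace": "foundry-local-operator"},
        "spec": spec,
    }
```

&lt;P&gt;Because JSON is accepted wherever &lt;CODE&gt;kubectl&lt;/CODE&gt; accepts YAML, the result of &lt;CODE&gt;json.dumps(build_manifest(...))&lt;/CODE&gt; can be piped straight into &lt;CODE&gt;kubectl apply -f -&lt;/CODE&gt;. Switching runtimes is then a matter of changing three arguments, which mirrors the five-line diff.&lt;/P&gt;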
&lt;H2&gt;What you've built&lt;/H2&gt;
&lt;P&gt;Ten minutes ago you had an installed-but-empty Foundry Local cluster. You now have:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;A working OpenAI-compatible chat endpoint serving Phi-4.&lt;/LI&gt;
&lt;LI&gt;A clear sense of what the operator manages for you (&lt;CODE&gt;ModelDeployment&lt;/CODE&gt; CR, model image cache, API-key Secret) and what stays in your hands (which model, which runtime, which compute target).&lt;/LI&gt;
&lt;LI&gt;A small, modular script you can wire into three things:
&lt;UL&gt;
&lt;LI&gt;A smoke test for every new cluster you stand up (&lt;CODE&gt;--catalog-only&lt;/CODE&gt; → &lt;CODE&gt;--deploy-only&lt;/CODE&gt; → &lt;CODE&gt;--infer-only&lt;/CODE&gt; is a one-line CI step).&lt;/LI&gt;
&lt;LI&gt;An internal demo when you need to show a team that Foundry Local on Azure Local is real and reachable.&lt;/LI&gt;
&lt;LI&gt;The foundation for your team's own deployment automation - copy the manifest builder, drop the CLI, and wire it into your existing GitOps or platform tooling.&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;H2&gt;From endpoint to chat surface&lt;/H2&gt;
&lt;P&gt;The model deployment above exposes a standard OpenAI-compatible API - enough for any existing chat client to point at. If you'd like to see exactly that, &lt;A class="lia-external-url" href="https://github.com/Azure-Samples/local-chat-with-foundry-local" target="_blank" rel="noopener"&gt;Azure-Samples/local-chat-with-foundry-local&lt;/A&gt;&amp;nbsp;picks up where this walkthrough ends: it wires the running endpoint into the Sovereign Chat Experience starter UI. About ten more minutes from working endpoint to working chat in the browser.&lt;/P&gt;
&lt;H2&gt;Where to go from here&lt;/H2&gt;
&lt;UL&gt;
&lt;LI&gt;Try the rest of the sample: &lt;CODE&gt;--catalog-only&lt;/CODE&gt;, &lt;CODE&gt;--deploy-only&lt;/CODE&gt;, &lt;CODE&gt;--infer-only&lt;/CODE&gt;, and &lt;CODE&gt;--skip-cleanup&lt;/CODE&gt; compose into whichever workflow you're testing.&lt;/LI&gt;
&lt;LI&gt;Read the &lt;A class="lia-external-url" href="https://aka.ms/build26blog" target="_blank" rel="noopener"&gt;announcement&lt;/A&gt;&amp;nbsp;for the architectural why behind multi-node, vLLM, and the expanded catalog.&lt;/LI&gt;
&lt;LI&gt;Read the docs for the full operator and CRD reference.&lt;/LI&gt;
&lt;LI&gt;Feel free to share your feedback with us at &lt;A href="mailto:FoundryLocal_Support@microsoft.com" target="_blank" rel="noopener"&gt;FoundryLocalOnAzure@microsoft.com&lt;/A&gt;. The product is in public preview, so your feedback shapes what ships next.&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Wed, 03 Jun 2026 16:25:52 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/your-first-model-deployment-on-foundry-local-on-azure-local-from/ba-p/4520640</guid>
      <dc:creator>JonathanSchtechel</dc:creator>
      <dc:date>2026-06-03T16:25:52Z</dc:date>
    </item>
    <item>
      <title>Build, deploy, and govern sovereign AI with Foundry Local on Azure Local</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/build-deploy-and-govern-sovereign-ai-with-foundry-local-on-azure/ba-p/4522945</link>
      <description>&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Not every AI workload can run in the cloud. For many of our customers, data needs to stay within defined&amp;nbsp;boundaries,&amp;nbsp;connectivity may be limited or absent, and latency, governance, and auditability are non-negotiable.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;With Foundry Local on Azure Local, you can use the same model catalog, developer workflows, and governance capabilities you know from Azure, while running AI entirely within your own environment where your data&amp;nbsp;resides.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Foundry Local&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;provides the model catalog and developer experience.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Azure Local&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;provides&amp;nbsp;the customer-managed infrastructure.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Azure Arc&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;provides unified policy, governance, and lifecycle management across cloud and local environments.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;This&amp;nbsp;gives&amp;nbsp;developers a consistent way to build, deploy, and&amp;nbsp;operate&amp;nbsp;AI. The same&amp;nbsp;az&amp;nbsp;commands, the same model catalog, the same Arc policies, all&amp;nbsp;running on hardware you control.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H2&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Expansion of Foundry Local on Azure Local&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H2&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;We're expanding the Foundry Local model offering on Azure Local, with support for multi-node deployments and new agents and tools that run locally, in preview.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="21" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;Deploy and run AI models locally&lt;/STRONG&gt;.&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;Run models with Foundry Local in customer-managed environments on Azure Local, across sovereign, private, and edge scenarios, including fully disconnected operation.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="21" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;Choose from a flexible, high-performance&amp;nbsp;model catalog&lt;/STRONG&gt;.&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;Access&amp;nbsp;proprietary and community models through Foundry Local, now expanded with vLLM-optimized models alongside ONNX-based offerings.&amp;nbsp;You explore&amp;nbsp;and deploy through the same&amp;nbsp;catalog&amp;nbsp;API&amp;nbsp;experience, then&amp;nbsp;operate&amp;nbsp;locally on Azure Local.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="21" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;Build for production realities&lt;/STRONG&gt;.&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;Bring governance, identity, and auditability into your applications while keeping execution inside your&amp;nbsp;controlled&amp;nbsp;boundary.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;A class="lia-internal-link lia-internal-url lia-internal-url-content-type-blog" href="https://techcommunity.microsoft.com/blog/azurearcblog/scale-on-prem-ai-with-foundry-local-on-azure-local-multi-node-inference-and-vllm/4516692" target="_blank" rel="noopener" data-lia-auto-title="See what’s new in Foundry Local on Azure Local in the Tech Community blog." data-lia-auto-title-active="0"&gt;See what’s new in Foundry Local on Azure Local in the Tech Community blog.&lt;/A&gt;&lt;/P&gt;
&lt;H2&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;From intelligence to action: agents and tools inside the enterprise boundary&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H2&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Most production AI use cases need two things: grounded answers and the ability to act on them,&amp;nbsp;without sending data outside the environment.&amp;nbsp;Here's&amp;nbsp;how&amp;nbsp;we're&amp;nbsp;enabling that locally.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="25" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;Preview&lt;/STRONG&gt;:&amp;nbsp;&lt;STRONG&gt;Agentic&amp;nbsp;retrieval&amp;nbsp;with Foundry Local&lt;/STRONG&gt;:&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Ground agents in enterprise data using retrieval-augmented generation across local Microsoft 365 services, including Exchange and SharePoint.&amp;nbsp;&lt;/SPAN&gt;&lt;A class="lia-external-url" href="https://aka.ms/AgentsAndToolsBuildBlog2026" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Read the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;T&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;ech&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;C&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;ommunity blog to learn more&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;.&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;Preview&lt;/STRONG&gt;:&amp;nbsp;&lt;STRONG&gt;Agents and tools with Foundry Local&lt;/STRONG&gt;:&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Build AI systems that reason, retrieve information, and take action within customer-controlled environments. &lt;A class="lia-external-url" href="https://aka.ms/AgentsAndToolsBuildBlog2026" target="_blank" rel="noopener"&gt;Learn more. &lt;/A&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;Preview:&amp;nbsp;Developer&amp;nbsp;acceleration&amp;nbsp;templates:&lt;/STRONG&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Jump-start local AI application development with new &lt;STRONG&gt;Foundry solution templates&lt;/STRONG&gt;, including &lt;STRONG&gt;local chat experiences &lt;/STRONG&gt;and&lt;STRONG&gt; video agents, powered by Azure AI Video Indexer&lt;/STRONG&gt;. &lt;A class="lia-external-url" href="https://aka.ms/fl-chat/" target="_blank" rel="noopener"&gt;Read the Tech Community blog to learn more. &lt;/A&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;img /&gt;
&lt;H2&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;GitHub Enterprise Local: Now available in public preview&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H2&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Sovereign AI is also about how systems are built and secured, not just where they run. With GitHub Enterprise Local on Azure Local, you can bring your full software development lifecycle&amp;nbsp;on-premises:&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Source control and repositories&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;CI/CD pipelines&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Security and&amp;nbsp;DevSecOps&amp;nbsp;workflows&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;GitHub Enterprise Local deploys entirely within customer-owned infrastructure, so teams get the developer tools they expect without&amp;nbsp;compromising on&amp;nbsp;data residency or operational control. This extends modern&amp;nbsp;DevSecOps&amp;nbsp;practice into sovereign environments and pairs naturally with the AI development workflows above: build, secure, and ship your AI applications within the same boundary where they run.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-external-url" href="https://aka.ms/GitHubEnterpriseLocalBlog" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Read the tech community blog to learn more about GitHub Enterprise Local&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;&amp;nbsp;and&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;&amp;nbsp;how to&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;join the preview&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/A&gt;&lt;/P&gt;
&lt;H2&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Accelerating High-performance AI at the&amp;nbsp;Edge with NVIDIA &lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H2&gt;
&lt;P&gt;We are expanding our collaboration with NVIDIA to deliver high-performance AI capabilities directly at the edge.&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;At Build, we are bringing:&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Azure Local and Foundry Local&amp;nbsp;on&amp;nbsp;NVIDIA-powered GPUs, including&amp;nbsp;NVIDIA&amp;nbsp;RTX PRO 6000 Blackwell Server Edition, with expanded GPU support coming soon&amp;nbsp;&lt;/LI&gt;
&lt;LI&gt;Integration with&amp;nbsp;Nemotron&amp;nbsp;models,&amp;nbsp;optimized&amp;nbsp;for enterprise performance&amp;nbsp;&lt;/LI&gt;
&lt;LI&gt;A scalable foundation for&amp;nbsp;data-intensive,&amp;nbsp;low-latency&amp;nbsp;workloads&amp;nbsp;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;This partnership ensures that organizations can run advanced AI workloads where data is generated - without dependency on centralized cloud infrastructure.&amp;nbsp;&lt;/P&gt;
&lt;H2&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Hardware options: AI factory configurations are available now in the catalog&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H2&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Alongside our hardware partners, we’re bringing integrated solutions to customers building AI within sovereign environments. The&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://aka.ms/azurelocalcatalog" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Azure Local hardware catalog&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;now includes AI factory configurations from our OEM partners, including NVIDIA-certified 8xH100 systems, with options from&amp;nbsp;DataON, Dell, HPE, and Lenovo. These configurations are sized for the performance that model&amp;nbsp;serving&amp;nbsp;and agentic workloads require on customer-managed infrastructure.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;EM&gt;“Together with Microsoft, we are advancing sovereign AI by bringing the open NVIDIA Nemotron model family to Microsoft Foundry Local on Azure Local. This collaboration gives organizations a production-ready AI platform that enables them to deploy AI where their data resides while maintaining the governance, control, and performance needed to scale AI across the enterprise.”&lt;BR /&gt;&lt;STRONG&gt;Kari Briski, VP Generative AI Software Products, NVIDIA&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;EM&gt;&lt;SPAN data-contrast="none"&gt;“Sovereign&amp;nbsp;AI is becoming increasingly important for governments, regulated industries, and enterprises that want to use AI while&amp;nbsp;maintaining&amp;nbsp;control of their data, location, and operations. Lenovo’s&amp;nbsp;ThinkAgile&amp;nbsp;MX Series delivers trusted, enterprise-grade infrastructure with global deployment&amp;nbsp;expertise&amp;nbsp;to help customers run AI wherever their data&amp;nbsp;resides. Co-engineered with Foundry Local and Azure Local, this solution provides an optimized platform to deploy, run, and scale AI locally with greater simplicity, consistency, and control, while helping meet strict data residency, security, and compliance requirements.”&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;&lt;SPAN data-contrast="none"&gt;Scott Patti - VP Infrastructure Solutions Group (ISG), Lenovo&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;H2&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;From AI models to trusted, mission-critical systems: what this unlocks for developers and operators&lt;/STRONG&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H2&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;AI is evolving from systems that answer questions to systems that plan, reason, and&amp;nbsp;take action&amp;nbsp;across workloads. These capabilities move AI from a cloud-only assumption to something you can deploy where sensitive work&amp;nbsp;actually happens, with governance and operational controls intact. For our customers, this means you can&amp;nbsp;now:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Keep data, identities, and audit trails inside your sovereign boundary.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Run AI inference and agentic workloads in connected, intermittently connected, or fully disconnected modes.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Apply consistent policy and governance across cloud and local environments through Azure Arc.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Use the same Foundry catalog and developer experience you already know, on infrastructure you own.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Build, secure, and ship your AI applications with GitHub Enterprise Local, keeping source control, CI/CD, and&amp;nbsp;DevSecOps&amp;nbsp;workflows inside the same sovereign boundary.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H2&gt;&lt;SPAN data-contrast="auto"&gt;Resources&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H2&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Join us at Build&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;
&lt;UL&gt;
&lt;LI data-olk-copy-source="MailCompose"&gt;OD837 Shipping physical AI to the edge with Azure Local and Foundry Local &lt;A href="https://github.com/microsoft/build26-OD837" target="_blank" rel="noopener"&gt;https://github.com/microsoft/build26-OD837&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;OD839 Foundry Local: AI solutions for industrial and sovereign needs&amp;nbsp;&lt;A href="https://github.com/microsoft/build26-OD839" target="_blank" rel="noopener"&gt;https://github.com/microsoft/build26-OD839&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;LTG425 Expanding horizons: Foundry Local for devices and on-prem &lt;A href="https://build.microsoft.com/en-US/sessions/LTG425?source=sessions" target="_blank" rel="noopener"&gt;https://build.microsoft.com/en-US/sessions/LTG425&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://aka.ms/FoundryLocalAzure_PreviewRequest" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Request to join the Foundry Local on Azure Local preview&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Hands-on walkthrough: &lt;A href="https://techcommunity.microsoft.com/blog/azurearcblog/your-first-model-deployment-on-foundry-local-on-azure-local-from-catalog-to-infe/4520640" target="_blank" rel="noopener"&gt;Your first model deployment on Foundry Local on Azure Local: from catalog to inference in 10 minutes | Microsoft Community Hub&lt;/A&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;Read our Tech Community blogs:&amp;nbsp;&amp;nbsp;
&lt;UL&gt;
&lt;LI&gt;&lt;A class="lia-internal-link lia-internal-url lia-internal-url-content-type-blog" href="https://techcommunity.microsoft.com/blog/azurearcblog/scale-on-prem-ai-with-foundry-local-on-azure-local-multi-node-inference-and-vllm/4516692" target="_blank" rel="noopener" data-lia-auto-title="Foundry Local announcing multi-node and vLLM support" data-lia-auto-title-active="0"&gt;Foundry Local announcing multi-node and vLLM support&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Agentic Retrieval with Foundry Local blog:&amp;nbsp;&lt;A href="https://aka.ms/AgentsAndToolsBuildBlog2026" target="_blank" rel="noopener" aria-label="Link https://aka.ms/AgentsAndToolsBuildBlog2026"&gt;https://aka.ms/AgentsAndToolsBuildBlog2026&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Code sample / model catalog blog: &lt;A href="https://aka.ms/foundry-local-model-catalog-blog" target="_blank" rel="noopener" aria-label="Link https://aka.ms/foundry-local-model-catalog-blog"&gt;https://aka.ms/foundry-local-model-catalog-blog&lt;/A&gt;&lt;SPAN data-ccp-props="{}"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;For more details on the expanded capabilities of Foundry Local for highly secure environments, contact your Microsoft account team&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;&lt;A style="font-style: normal; font-weight: 400; background-color: rgb(255, 255, 255);" href="https://aka.ms/sovereignty" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Discover Microsoft Sovereign Cloud&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;&lt;A style="font-style: normal; font-weight: 400; background-color: rgb(255, 255, 255);" href="https://aka.ms/edge-agentic-retrieval-docs" target="_blank" rel="noopener" aria-label="Link https://aka.ms/edge-agentic-retrieval-docs"&gt;&lt;SPAN class="lia-text-color-21"&gt;Explore product documentation at&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN style="color: rgb(30, 30, 30);"&gt;: &lt;/SPAN&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;Foundry Local models on Azure Local:&amp;nbsp;&lt;A style="font-style: normal; font-weight: 400; background-color: rgb(255, 255, 255);" href="https://aka.ms/FoundryLocalonAzureLocal_documentation" target="_blank" rel="noopener" data-hyperlinktype="0"&gt;&lt;SPAN data-scheme-color="@0078D4,," data-usefontface="false" data-hyperlinkhascustomcolor="true" data-contrast="none"&gt;https://aka.ms/FoundryLocalonAzureLocal_documentation&lt;/SPAN&gt;&lt;/A&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;Local Agentic Retrieval with Foundry Local:&amp;nbsp;&lt;A style="font-style: normal; font-weight: 400; background-color: rgb(255, 255, 255);" href="https://aka.ms/edge-agentic-retrieval-docs" target="_blank" rel="noopener" data-hyperlinktype="0"&gt;&lt;SPAN data-scheme-color="@0078D4,," data-usefontface="false" data-hyperlinkhascustomcolor="true" data-contrast="none"&gt;https://aka.ms/edge-agentic-retrieval-docs&lt;/SPAN&gt;&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Wed, 03 Jun 2026 07:19:19 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/build-deploy-and-govern-sovereign-ai-with-foundry-local-on-azure/ba-p/4522945</guid>
      <dc:creator>insagiv</dc:creator>
      <dc:date>2026-06-03T07:19:19Z</dc:date>
    </item>
    <item>
      <title>Unlock On-Prem Productivity with Agentic Retrieval in Foundry Local</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/unlock-on-prem-productivity-with-agentic-retrieval-in-foundry/ba-p/4523646</link>
      <description>&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;In today’s connected world, customers expect instant, context-rich interactions, even in environments where cloud connectivity&amp;nbsp;isn’t&amp;nbsp;guaranteed.&amp;nbsp;That’s&amp;nbsp;where Retrieval-Augmented Generation at the edge comes in. Since we launched into public preview,&amp;nbsp;we’ve&amp;nbsp;watched teams across regulated, disconnected, and mission-critical environments push this technology into places cloud GenAI simply&amp;nbsp;couldn’t&amp;nbsp;reach.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;What we heard back shaped everything in this release: customers&amp;nbsp;don’t&amp;nbsp;just want retrieval. They want reasoning, they want agency, and they want an end-user experience that feels as natural as the one they already&amp;nbsp;use&amp;nbsp;in the cloud.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Today at Build 2026,&amp;nbsp;we're&amp;nbsp;excited to introduce Agentic Retrieval, the next evolution of our on-prem RAG platform, enabled by Azure Arc and powered by Foundry language models. Agentic Retrieval is part of Microsoft's Adaptive Cloud approach, which extends Azure capabilities to wherever customer data and workloads&amp;nbsp;actually live, with Edge AI focused on bringing reasoning and grounding to on-prem, distributed, and disconnected environments. Together with Foundry Local, Agentic Retrieval continues to shape Microsoft's Foundry Anywhere commitment: flexibility, resilience, and intelligence wherever customers&amp;nbsp;operate.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 20" data-ccp-parastyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;741782aa-652f-563c-bdc7-8d2b1b3f555d|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[469775450,&amp;quot;heading 20&amp;quot;,201340122,&amp;quot;2&amp;quot;,134234082,&amp;quot;true&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;heading20&amp;quot;,335572020,&amp;quot;1&amp;quot;,134224900,&amp;quot;true&amp;quot;,335551500,&amp;quot;3615007&amp;quot;,268442635,&amp;quot;28&amp;quot;,335559739,&amp;quot;140&amp;quot;,335559738,&amp;quot;320&amp;quot;,335560102,&amp;quot;1&amp;quot;,469777841,&amp;quot;Arial&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;Arial&amp;quot;,469777844,&amp;quot;Arial&amp;quot;,469769226,&amp;quot;Arial&amp;quot;]}"&gt;What’s&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 20"&gt;&amp;nbsp;new at Build 2026&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:160}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;This release introduces&amp;nbsp;three&amp;nbsp;major pillars that work independently or together:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="•" data-font="Aptos" data-listid="2" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;•&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Agentic&amp;nbsp;Retrieval&amp;nbsp;engine&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;:&lt;/STRONG&gt; a first-party orchestration runtime for planning, reasoning, conversation state, and tool calls over your local data&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="•" data-font="Aptos" data-listid="2" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;•&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Knowledge&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;:&lt;/STRONG&gt; a dedicated layer for organizing, curating, and governing your grounding data, exposed via MCP and connectable to any agentic retrieval layer&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="•" data-font="Aptos" data-listid="2" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;•&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Chat UI&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;:&lt;/STRONG&gt; a production-ready, polished conversational experience that ships as the default UX for Agentic&amp;nbsp;Retrieval&amp;nbsp;and can also be deployed standalone&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Alongside these pillars, we’re delivering the platform upgrades customers asked for: flexible deployment modes (Agentic-only, Knowledge-only, or Combined), bring your own model (BYOM) with pluggable backends, Foundry Local model catalog integration, Entra ID support, disconnected-ready operation, and hybrid search combined with agentic retrieval.&lt;/SPAN&gt;&lt;/P&gt;
&lt;H4&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 20" data-ccp-parastyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;741782aa-652f-563c-bdc7-8d2b1b3f555d|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[469775450,&amp;quot;heading 20&amp;quot;,201340122,&amp;quot;2&amp;quot;,134234082,&amp;quot;true&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;heading20&amp;quot;,335572020,&amp;quot;1&amp;quot;,134224900,&amp;quot;true&amp;quot;,335551500,&amp;quot;3615007&amp;quot;,268442635,&amp;quot;28&amp;quot;,335559739,&amp;quot;140&amp;quot;,335559738,&amp;quot;320&amp;quot;,335560102,&amp;quot;1&amp;quot;,469777841,&amp;quot;Arial&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;Arial&amp;quot;,469777844,&amp;quot;Arial&amp;quot;,469769226,&amp;quot;Arial&amp;quot;]}"&gt;Agentic&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 20"&gt;Retrieval&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 20"&gt;: From Answering to Reasoning&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:160,&amp;quot;335559740&amp;quot;:259}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Classic RAG retrieves, then generates. Agentic&amp;nbsp;Retrieval&amp;nbsp;plans, reasons, and acts, running multi-step retrieval and tool invocation under a first-party orchestration runtime, entirely on your infrastructure. Under the hood it manages query planning, iterative multi-hop retrieval, tool calls via MCP, conversation state, and mandatory grounding with citations and audit logging built in.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img&gt;Image 1: Agentic reasoning trace: query decomposition, retrievals, tool calls, grounded answer&lt;/img&gt;
&lt;H5 class="lia-indent-padding-left-30px"&gt;&lt;SPAN data-contrast="auto"&gt;What customers can achieve:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H5&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Compliance, policy, and&amp;nbsp;permit&amp;nbsp;workflows for public sector, regulators, and defense operations, with data never leaving sovereign infrastructure&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Multi-document synthesis across standards, technical manuals, contracts, and field procedures for industrial operators&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;An agentic chat experience for regulated and operational teams (engineers, inspectors, analysts) that reasons like a subject-matter expert&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Auditable AI for sovereign and mission-critical environments, with every answer traceable to its source&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H4&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 20" data-ccp-parastyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;741782aa-652f-563c-bdc7-8d2b1b3f555d|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[469775450,&amp;quot;heading 20&amp;quot;,201340122,&amp;quot;2&amp;quot;,134234082,&amp;quot;true&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;heading20&amp;quot;,335572020,&amp;quot;1&amp;quot;,134224900,&amp;quot;true&amp;quot;,335551500,&amp;quot;3615007&amp;quot;,268442635,&amp;quot;28&amp;quot;,335559739,&amp;quot;140&amp;quot;,335559738,&amp;quot;320&amp;quot;,335560102,&amp;quot;1&amp;quot;,469777841,&amp;quot;Arial&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;Arial&amp;quot;,469777844,&amp;quot;Arial&amp;quot;,469769226,&amp;quot;Arial&amp;quot;]}"&gt;Knowledge: A First-Class, Governed Data Layer&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:160}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Great answers start with&amp;nbsp;great knowledge. Knowledge is now a standalone&amp;nbsp;component&amp;nbsp;customers can deploy on&amp;nbsp;its&amp;nbsp;own or alongside Agentic&amp;nbsp;Retrieval, exposed through an MCP wrapper so it can connect to any agentic retrieval layer,&amp;nbsp;ours&amp;nbsp;or yours.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;This release brings Collections (segmented groups of indexed knowledge with granular access permissions), multi-source ingestion across documents, tables, images, and SharePoint (indexed source moving to public preview), high-fidelity parsing for complex enterprise content, Bring Your Own MCP to connect customer-owned data sources directly into Agentic&amp;nbsp;Retrieval&amp;nbsp;and the chat experience, and governance enforced at the data layer itself.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img&gt;&lt;SPAN data-contrast="none"&gt;Image 2: Knowledge management view - collections, sources, and permission scopes&lt;/SPAN&gt;&lt;/img&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H5 class="lia-indent-padding-left-30px"&gt;&lt;SPAN data-contrast="auto"&gt;What customers can achieve:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H5&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Scope knowledge access to different slices of the same corpus, by plant, site, classification, or&amp;nbsp;jurisdiction&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Enforce data sovereignty, residency, and regulatory compliance at the knowledge layer itself&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Ground both first-party Agentic&amp;nbsp;Retrieval&amp;nbsp;and BYO orchestration through a single governed source of truth across distributed sites&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Keep classified, proprietary, and operational data fully on-prem while delivering premium chat experiences&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H4&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 20" data-ccp-parastyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;741782aa-652f-563c-bdc7-8d2b1b3f555d|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[469775450,&amp;quot;heading 20&amp;quot;,201340122,&amp;quot;2&amp;quot;,134234082,&amp;quot;true&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;heading20&amp;quot;,335572020,&amp;quot;1&amp;quot;,134224900,&amp;quot;true&amp;quot;,335551500,&amp;quot;3615007&amp;quot;,268442635,&amp;quot;28&amp;quot;,335559739,&amp;quot;140&amp;quot;,335559738,&amp;quot;320&amp;quot;,335560102,&amp;quot;1&amp;quot;,469777841,&amp;quot;Arial&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;Arial&amp;quot;,469777844,&amp;quot;Arial&amp;quot;,469769226,&amp;quot;Arial&amp;quot;]}"&gt;Chat UI: Production-Ready Conversational Experience&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:160}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Agentic&amp;nbsp;Retrieval&amp;nbsp;now ships with a polished, production-ready Chat UI as its default experience, and the same&amp;nbsp;component&amp;nbsp;can be deployed standalone for customers building their own stack on Foundry Local.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN data-contrast="auto"&gt;Highlights include Entra ID authentication (MSAL login, Bearer tokens, user identity display), pluggable backends across AI Foundry, BYOM, or mock mode with zero code changes, Chain-of-Thought visibility and inline citations that make grounding transparent to end users, standalone frontend deployment via Helm chart and container image, and disconnected-ready operation for air-gapped environments.&lt;/SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img&gt;Image 3: Chat UI answer with Chain-of-Thought panel expanded and inline citations&lt;/img&gt;
&lt;H5 class="lia-indent-padding-left-30px"&gt;&lt;SPAN data-contrast="auto"&gt;What customers can achieve:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H5&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Deliver a polished end-user experience to operators, inspectors, and analysts without building UI from scratch&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Build trust in regulated and industrial workflows through transparent, inspectable reasoning and grounding&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Run the same UI across air-gapped facilities, sovereign clouds, and connected industrial sites&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Accelerate rollout across public sector, defense, manufacturing, and other mission-critical environments&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:40,&amp;quot;335559739&amp;quot;:40,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 20" data-ccp-parastyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;741782aa-652f-563c-bdc7-8d2b1b3f555d|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[469775450,&amp;quot;heading 20&amp;quot;,201340122,&amp;quot;2&amp;quot;,134234082,&amp;quot;true&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;heading20&amp;quot;,335572020,&amp;quot;1&amp;quot;,134224900,&amp;quot;true&amp;quot;,335551500,&amp;quot;3615007&amp;quot;,268442635,&amp;quot;28&amp;quot;,335559739,&amp;quot;140&amp;quot;,335559738,&amp;quot;320&amp;quot;,335560102,&amp;quot;1&amp;quot;,469777841,&amp;quot;Arial&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;Arial&amp;quot;,469777844,&amp;quot;Arial&amp;quot;,469769226,&amp;quot;Arial&amp;quot;]}"&gt;Why This Release Matters&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:160}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Every update to our on-prem RAG platform has moved us toward a simple conviction:&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;GenAI should be useful wherever customers&amp;nbsp;operate, whether regulated or open, connected or disconnected,&amp;nbsp;centralized&amp;nbsp;or distributed.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;With Agentic Retrieval, Knowledge, and Chat UI coming together, backed by Foundry on Arc, BYOM, and fully disconnected support, this is no longer “cloud RAG, but local.” It’s an agentic knowledge platform purpose-built for the realities of enterprise data: on-prem, governed, and increasingly autonomous.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:0,&amp;quot;335559738&amp;quot;:120,&amp;quot;335559739&amp;quot;:120,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H5&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 20" data-ccp-parastyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;741782aa-652f-563c-bdc7-8d2b1b3f555d|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[469775450,&amp;quot;heading 20&amp;quot;,201340122,&amp;quot;2&amp;quot;,134234082,&amp;quot;true&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;heading20&amp;quot;,335572020,&amp;quot;1&amp;quot;,134224900,&amp;quot;true&amp;quot;,335551500,&amp;quot;3615007&amp;quot;,268442635,&amp;quot;28&amp;quot;,335559739,&amp;quot;140&amp;quot;,335559738,&amp;quot;320&amp;quot;,335560102,&amp;quot;1&amp;quot;,469777841,&amp;quot;Arial&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;Arial&amp;quot;,469777844,&amp;quot;Arial&amp;quot;,469769226,&amp;quot;Arial&amp;quot;]}"&gt;Learn More&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:160}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H5&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Explore&lt;/SPAN&gt;&lt;/SPAN&gt;&amp;nbsp;&lt;A href="https://aka.ms/edge-agentic-retrieval-docs" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Agentic retrieval&amp;nbsp;documentation&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;/LI&gt;
&lt;LI data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" aria-setsize="-1" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Read the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Foundry Local on Azure Local model inferencing&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;A class="lia-internal-link lia-internal-url lia-internal-url-content-type-blog" href="https://techcommunity.microsoft.com/blog/azurearcblog/scale-on-prem-ai-with-foundry-local-on-azure-local-multi-node-inference-and-vllm/4516692" data-lia-auto-title="blog post" data-lia-auto-title-active="0" target="_blank"&gt;blog post&lt;/A&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="•" data-font="Aptos" data-listid="2" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;•&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="16" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;For more information, reach out to the team at&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;A href="mailto:FoundryLocalOnAzure@microsoft.com" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;FoundryLocalOnAzure@microsoft.com&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Tue, 02 Jun 2026 19:20:52 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/unlock-on-prem-productivity-with-agentic-retrieval-in-foundry/ba-p/4523646</guid>
      <dc:creator>moran_assaf</dc:creator>
      <dc:date>2026-06-02T19:20:52Z</dc:date>
    </item>
    <item>
      <title>Scale On-Prem AI with Foundry Local on Azure Local: Multi-Node Inference and vLLM Support</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/scale-on-prem-ai-with-foundry-local-on-azure-local-multi-node/ba-p/4516692</link>
      <description>&lt;P&gt;Since announcing the public preview of Foundry Local on Azure Local for single-node, we’ve seen strong adoption in regulated industries and consistent customer demand to expand the platform for scalable deployments.&lt;/P&gt;
&lt;P&gt;Today, we’re expanding the Foundry Local model offering on Azure Local (preview) with three additions that broaden where and how you can use it:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;Multi-node scheduling&lt;/STRONG&gt; - distribute inference workloads across the GPU capacity in your Azure Local cluster, not just a single node&lt;/P&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;vLLM runtime support&lt;/STRONG&gt; - a high-throughput serving engine purpose-built for large language models and concurrent workloads&lt;/P&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;An expanded model catalog&lt;/STRONG&gt; - new models available in vLLM-optimized format alongside the existing ONNX offerings&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Together, these additions let you scale to higher concurrency, serve more users from a single endpoint, and run larger models on-premises. They round out Foundry Local on Azure Local into a more complete, production-grade on-premises inference platform - covering a wider range of model sizes, concurrency profiles, and hardware footprints, while preserving the same Kubernetes-native, OpenAI-compatible patterns you're already using.&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Runs disconnected &lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;&amp;nbsp;no cloud round-trip required&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph" data-ccp-parastyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;276295ed-5df2-5456-988c-5fdbab071054|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[201342446,&amp;quot;1&amp;quot;,201342447,&amp;quot;5&amp;quot;,201342448,&amp;quot;3&amp;quot;,201342449,&amp;quot;1&amp;quot;,469777841,&amp;quot;Aptos&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;游明朝&amp;quot;,469777844,&amp;quot;Aptos&amp;quot;,201341986,&amp;quot;1&amp;quot;,469769226,&amp;quot;Aptos,Arial,游明朝&amp;quot;,268442635,&amp;quot;24&amp;quot;,335559739,&amp;quot;180&amp;quot;,335559738,&amp;quot;180&amp;quot;,469775450,&amp;quot;First Paragraph&amp;quot;,201340122,&amp;quot;2&amp;quot;,134234082,&amp;quot;true&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;FirstParagraph&amp;quot;,335572020,&amp;quot;1&amp;quot;,469775498,&amp;quot;Body Text&amp;quot;,469778324,&amp;quot;Body Text&amp;quot;]}"&gt;Foundry Local on Azure Local is designed to&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;run fully on-premises, including in disconnected and&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;intermittently-connected&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;environments&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&lt;STRONG&gt;. 
&lt;/STRONG&gt;Model weights,&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;prompts, and&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;inference traffic stay entirely inside your Arc-enabled cluster&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;there is no per-request call to Azure, no data exfiltration to the cloud, and no dependency on a live WAN to serve inference.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact" data-ccp-parastyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;eafb41c0-47a4-52dc-ac6b-3b5a7536b560|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[201342446,&amp;quot;1&amp;quot;,201342447,&amp;quot;5&amp;quot;,201342448,&amp;quot;3&amp;quot;,201342449,&amp;quot;1&amp;quot;,469777841,&amp;quot;Aptos&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;游明朝&amp;quot;,469777844,&amp;quot;Aptos&amp;quot;,201341986,&amp;quot;1&amp;quot;,469769226,&amp;quot;Aptos,Arial,游明朝&amp;quot;,268442635,&amp;quot;24&amp;quot;,335559739,&amp;quot;36&amp;quot;,335559738,&amp;quot;36&amp;quot;,469775450,&amp;quot;Compact&amp;quot;,201340122,&amp;quot;2&amp;quot;,134234082,&amp;quot;true&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;Compact&amp;quot;,335572020,&amp;quot;1&amp;quot;,469778324,&amp;quot;Body Text&amp;quot;]}"&gt;Models are cached locally&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;on&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Persistent Volumes&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;after the first pull. 
Once cached, the inference endpoint keeps serving even when the WAN is down&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;across reboots, network outages, and extended disconnected&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;operation&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;API-key authentication continues working uninterrupted&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;during disconnected periods. Microsoft Entra ID auth resumes seamlessly when connectivity returns.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="5" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;The control plane is local to the cluster.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;The Foundry Local operator, the model catalog, and the inference runtimes all live inside Azure Local&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;Arc is used for fleet management and updates, not for the inference data path.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;For factory floors, offshore platforms, sovereign data centers, classified sites, and remote branch offices where cloud connectivity is unreliable, restricted, or prohibited, this is what makes on-premises AI inference actually&amp;nbsp;viable&amp;nbsp;in&amp;nbsp;production.&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Multi-node scheduling: more scenarios, more capacity&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Foundry Local on Azure Local now&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;supports&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;multiple nodes in your cluster.&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;The inference operator schedules and manages deployments across the GPU capacity available cluster-wide, so you can:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="6" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Use GPU capacity from any node in the cluster, not just a single node’s resources&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="7" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Place inference workloads where the hardware lives&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;, with the operator managing deployments across nodes&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;The same&amp;nbsp;&lt;CODE&gt;ModelDeployment&lt;/CODE&gt;&amp;nbsp;custom resource you already use defines the workload, and it is served through the standard OpenAI-compatible chat completions endpoint (&lt;CODE&gt;POST /v1/chat/completions&lt;/CODE&gt;), the API for conversational models in which clients send structured messages and receive model-generated responses. Existing applications work against multi-node deployments with zero code changes.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:2,&amp;quot;335559739&amp;quot;:0,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
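&lt;P&gt;As a minimal sketch of what a client call against this endpoint can look like, the Python snippet below builds a standard chat completions request using only the standard library. The base URL, API key, model name, and the use of an &lt;CODE&gt;Authorization: Bearer&lt;/CODE&gt; header are placeholders and assumptions, not values from this post; substitute whatever your own deployment exposes.&lt;/P&gt;

```python
import json
import urllib.request

# Hypothetical values: replace with your own endpoint, API key, and model name.
BASE_URL = "https://foundry.example.internal"
API_KEY = "replace-me"
MODEL = "example-model"


def build_chat_request(base_url, api_key, model, user_text):
    """Build (but do not send) a POST /v1/chat/completions request."""
    payload = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_text},
        ],
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: the key is passed as a Bearer token; check your deployment.
            "Authorization": "Bearer " + api_key,
        },
        method="POST",
    )


def send(request):
    """Send the request and return the assistant's reply text."""
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]


# Building the request needs no network access; sending it does.
request = build_chat_request(BASE_URL, API_KEY, MODEL, "Hello")
print(request.get_method(), request.full_url)
```

&lt;P&gt;The request shape does not depend on how many nodes serve the model, which is why the same client can be pointed at a single-node or a multi-node deployment. Call &lt;CODE&gt;send(request)&lt;/CODE&gt; once the placeholders point at a real endpoint.&lt;/P&gt;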
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;&amp;nbsp;runtime: high-throughput serving for production workloads&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Alongside ONNX-GenAI, Foundry Local now offers&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;vLLM&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt; as a first-class inference runtime. &lt;A class="lia-external-url" href="https://docs.vllm.ai/en/latest/" target="_blank" rel="noopener"&gt;vLLM&lt;/A&gt; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;is an open-source, high-throughput serving engine that has become the standard for&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;production&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;LLM inference in the cloud. Bringing it to Foundry Local on Azure Local means the same performance characteristics are available on your factory floor, in your sovereign data center, or at your remote site.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H4 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Why&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;&amp;nbsp;matters for edge and on-premises inference&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="width: 100%; height: 496.222px; border-width: 1px;"&gt;&lt;tbody&gt;&lt;tr style="height: 38.8889px;"&gt;&lt;td style="height: 38.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Capability&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 38.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;ONNX-GenAI&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 38.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;vLLM&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr style="height: 38.8889px;"&gt;&lt;td style="height: 38.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Hardware&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 38.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;CPU and GPU&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 38.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;GPU only&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr style="height: 66.8889px;"&gt;&lt;td style="height: 66.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Throughput&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 66.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Optimized&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;for single-user, low-latency serving&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 66.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Optimized&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;for high-throughput, multi-user concurrency&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr style="height: 94.8889px;"&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Memory management&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Standard allocation&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;PagedAttention&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;efficient KV-cache management reduces VRAM waste&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr style="height: 94.8889px;"&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Continuous batching&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Not supported&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Supported&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;incoming requests are batched dynamically for higher GPU&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;utilization&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr style="height: 94.8889px;"&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;FP8 KV cache&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Not supported&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 94.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Supported on compatible models and GPUs&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;roughly doubles&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;KV-cache token capacity compared with 16-bit storage&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr style="height: 66.8889px;"&gt;&lt;td style="height: 66.8889px;"&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Best for&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 66.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Compact models, CPU-only nodes, single-client scenarios&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td style="height: 66.8889px;"&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Larger models, multi-user workloads, GPU-equipped clusters&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 33.33%" /&gt;&lt;col style="width: 33.33%" /&gt;&lt;col style="width: 33.33%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
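&lt;P&gt;To see where the "roughly doubles" figure for the FP8 KV cache comes from, here is a back-of-the-envelope calculation in Python. It uses the usual sizing rule for a transformer KV cache (one key vector and one value vector per layer and KV head, for every token). The model shape and the 8 GiB cache budget are hypothetical numbers chosen for illustration, and the sketch ignores any small overhead for FP8 scaling factors.&lt;/P&gt;

```python
def kv_bytes_per_token(num_layers, num_kv_heads, head_dim, bytes_per_value):
    """Bytes of KV cache one token occupies across all layers."""
    # Factor 2: each layer stores a key vector and a value vector per KV head.
    return 2 * num_layers * num_kv_heads * head_dim * bytes_per_value


def tokens_that_fit(cache_budget_bytes, bytes_per_token):
    """How many tokens of KV cache fit in the given memory budget."""
    return cache_budget_bytes // bytes_per_token


# Hypothetical model shape and cache budget, for illustration only.
LAYERS, KV_HEADS, HEAD_DIM = 32, 8, 128
BUDGET = 8 * 1024**3  # memory left for the KV cache after weights are loaded

fp16_per_token = kv_bytes_per_token(LAYERS, KV_HEADS, HEAD_DIM, 2)  # 2 bytes per value
fp8_per_token = kv_bytes_per_token(LAYERS, KV_HEADS, HEAD_DIM, 1)   # 1 byte per value

fp16_tokens = tokens_that_fit(BUDGET, fp16_per_token)
fp8_tokens = tokens_that_fit(BUDGET, fp8_per_token)
print("FP16 KV cache tokens:", fp16_tokens)
print("FP8 KV cache tokens: ", fp8_tokens)
```

&lt;P&gt;Halving the bytes per cached value halves the per-token footprint, so the same memory budget holds about twice as many tokens, which can be spent on longer contexts or more concurrent requests.&lt;/P&gt;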
&lt;H4 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Automatic GPU inference tuning with the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;&amp;nbsp;planner&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;One of the operational challenges with&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;is configuration tuning&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;setting GPU memory&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;utilization&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;, context length, batch sizes, and other parameters for a given model on a given hardware profile.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Get it wrong and the pod either OOMs (runs out of memory) on startup&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;or wastes GPU capacity.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;Foundry Local addresses this with the&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;planner&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;, an automatic tuning&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;component&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;that inspects the available GPU resources, analyzes the target&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;model's&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;footprint, and generates a memory-safe, high-performance configuration before the model server starts. You declare&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;what&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;model you want to run; the planner figures out&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;how&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;to run it optimally on your hardware.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;Full configuration reference is in the&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;A class="lia-external-url" href="https://learn.microsoft.com/en-us/azure/azure-sovereign-clouds/private/foundry-local/concept-gpu-inference-planner" target="_blank" rel="noopener"&gt;vLLM planner docs&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
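&lt;P&gt;To make the trade-off concrete, here is a rough sketch of the kind of arithmetic involved in sizing a KV cache. It is not the planner's actual algorithm: the model shape (32 layers, 8 KV heads, head dimension 128), the 24 GiB GPU, the 14 GiB weight footprint, and the function names are all illustrative assumptions. It also shows why an FP8 KV cache roughly doubles token capacity compared with FP16.&lt;/P&gt;

```python
# Simplified KV-cache arithmetic. Every number here is an illustrative assumption.
GIB = 1024 ** 3


def kv_bytes_per_token(num_layers, num_kv_heads, head_dim, bytes_per_value):
    # Each token keeps one key and one value vector per layer and per KV head.
    return 2 * num_layers * num_kv_heads * head_dim * bytes_per_value


def token_capacity(gpu_gib, memory_utilization, weights_gib, bytes_per_value,
                   num_layers=32, num_kv_heads=8, head_dim=128):
    # Memory the server may claim, minus the model weights, is left for the KV cache.
    budget = max(0.0, gpu_gib * memory_utilization - weights_gib) * GIB
    per_token = kv_bytes_per_token(num_layers, num_kv_heads, head_dim, bytes_per_value)
    return int(budget // per_token)


# Hypothetical 24 GiB GPU, 90% utilization target, 14 GiB of model weights.
fp16_tokens = token_capacity(24, 0.9, 14, bytes_per_value=2)
fp8_tokens = token_capacity(24, 0.9, 14, bytes_per_value=1)
print("FP16 KV cache:", fp16_tokens, "tokens")
print("FP8 KV cache: ", fp8_tokens, "tokens")
```

&lt;P&gt;A real server also reserves memory for activations and other runtime overhead, so treat these figures as an upper bound. If the weights alone exceed the budget, the capacity comes out as zero, which corresponds to the out-of-memory-on-startup case described above.&lt;/P&gt;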
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Identity-based access for multi-user workloads&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Serving more concurrent users&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;isn't&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;only a throughput problem&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;it's&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;also an access-control problem. Foundry Local supports two authentication modes side by side on the same endpoint:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="5" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="8" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;API keys&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;primary and secondary keys per deployment, with zero-downtime rotation. Ideal for service-to-service traffic and automated pipelines.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="5" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="9" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Microsoft Entra ID with Azure RBAC&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;per-identity access using the&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-charstyle="Verbatim Char" data-ccp-charstyle-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;a31b87fc-7929-525d-a968-aa0f568a0536|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[201342446,&amp;quot;1&amp;quot;,201342447,&amp;quot;5&amp;quot;,201342448,&amp;quot;3&amp;quot;,201342449,&amp;quot;1&amp;quot;,469777841,&amp;quot;Consolas&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;游明朝&amp;quot;,469777844,&amp;quot;Consolas&amp;quot;,201341986,&amp;quot;1&amp;quot;,469769226,&amp;quot;Consolas&amp;quot;,268442635,&amp;quot;22&amp;quot;,469775450,&amp;quot;Verbatim Char&amp;quot;,201340122,&amp;quot;1&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;VerbatimChar&amp;quot;,335572020,&amp;quot;1&amp;quot;,134231262,&amp;quot;true&amp;quot;,469777929,&amp;quot;Source Code&amp;quot;,469778324,&amp;quot;Caption Char&amp;quot;]}" 
data-ccp-charstyle-linked-defn="{&amp;quot;ObjectId&amp;quot;:&amp;quot;94116779-72d8-55e3-836f-dfb3e4f6a90b|1&amp;quot;,&amp;quot;ClassId&amp;quot;:1073872969,&amp;quot;Properties&amp;quot;:[201342446,&amp;quot;1&amp;quot;,201342447,&amp;quot;5&amp;quot;,201342448,&amp;quot;3&amp;quot;,201342449,&amp;quot;1&amp;quot;,469777841,&amp;quot;Aptos&amp;quot;,469777842,&amp;quot;Arial&amp;quot;,469777843,&amp;quot;游明朝&amp;quot;,469777844,&amp;quot;Aptos&amp;quot;,201341986,&amp;quot;1&amp;quot;,469769226,&amp;quot;Aptos,Arial,游明朝&amp;quot;,268442635,&amp;quot;24&amp;quot;,469775450,&amp;quot;Source Code&amp;quot;,201340122,&amp;quot;2&amp;quot;,134233614,&amp;quot;true&amp;quot;,469778129,&amp;quot;SourceCode&amp;quot;,335572020,&amp;quot;1&amp;quot;,469777929,&amp;quot;Verbatim Char&amp;quot;,469778324,&amp;quot;Normal&amp;quot;]}"&gt;Cognitive Services OpenAI User&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;role (or any role granting the equivalent data-plane action). JWT validation runs inside the inference pod; authorization is enforced through the cluster's Arc-managed identity.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Enable both, and clients can present either credential type in the same&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-charstyle="Verbatim Char"&gt;Authorization: Bearer&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;header&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;the platform detects which one was sent and routes to the right validation path. API-key callers also keep working uninterrupted if external connectivity is briefly lost, giving you a natural degradation story for edge and disconnected sites.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;For a multi-user AI assistant on the factory floor or in a sovereign data center&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;, this is the difference between a shared service account and a per-user audit trail.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
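&lt;P&gt;As a client-side illustration, the sketch below builds the same request with either credential type in the &lt;CODE&gt;Authorization: Bearer&lt;/CODE&gt; header. The endpoint URL, the &lt;CODE&gt;/v1/chat/completions&lt;/CODE&gt; path (the usual OpenAI-compatible convention), the model name, and both helper functions are assumptions made for this example. The &lt;CODE&gt;looks_like_jwt&lt;/CODE&gt; check is only one plausible way to tell a token from an API key; the platform's actual detection logic isn't described here.&lt;/P&gt;

```python
import base64
import json
import urllib.request


def looks_like_jwt(credential):
    # A JWT has three dot-separated base64url segments; the first is a JSON header.
    parts = credential.split(".")
    if len(parts) != 3:
        return False
    try:
        padded = parts[0] + "=" * (-len(parts[0]) % 4)
        header = json.loads(base64.urlsafe_b64decode(padded))
    except ValueError:
        return False
    return isinstance(header, dict) and "alg" in header


def build_chat_request(base_url, credential, model, prompt):
    # The same header carries either an API key or an Entra ID access token.
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + "/v1/chat/completions",  # assumed path
        data=body,
        headers={
            "Authorization": "Bearer " + credential,
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical endpoint, key, and model name; nothing is sent over the network here.
request = build_chat_request("https://foundry.example.internal", "my-api-key", "phi-4", "Hello")
print(request.full_url)
print("JWT-shaped credential:", looks_like_jwt("my-api-key"))
```

&lt;P&gt;To send the request, pass it to &lt;CODE&gt;urllib.request.urlopen&lt;/CODE&gt;. Switching from an API key to an Entra ID token changes only the credential string, not the request shape.&lt;/P&gt;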
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Expanded model catalog: ONNX and&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;&amp;nbsp;side by side&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;The Foundry Local model catalog now includes models in both&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;ONNX&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;and&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;vLLM&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;formats. The same model can appear multiple times in the catalog&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;once per runtime/compute target&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;so you can&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;pick&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;the build that matches your hardware without leaving the platform. The operator selects the right container image automatically based on the entry you reference.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Broader open-model support&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;Beyond the Phi and gpt-oss families, the catalog now includes&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;additional&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;models across multiple open-source lineups that customers have requested for on-prem and sovereign deployments, including Mistral and NVIDIA&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;Nemotron&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;. Both are available as catalog entries, served by the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;runtime on GPU, and accessible through the same OpenAI-compatible endpoint you already use.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&lt;STRONG&gt;In collaboration with NVIDIA&lt;/STRONG&gt;, Foundry Local now supports the latest Nemotron models, optimized for enterprise performance on NVIDIA-powered Azure Local hardware, including the NVIDIA RTX Pro 6000. Nemotron models are tuned for reasoning, instruction-following, and agentic workflows, and run on the vLLM runtime with PagedAttention, continuous batching, and FP8 KV cache on compatible GPUs. The vLLM planner handles GPU memory utilization and context-length sizing automatically: you declare the catalog entry, and the platform sizes the deployment to your hardware.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H4 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Models available in ONNX and&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;&amp;nbsp;formats&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;P aria-level="3"&gt;&lt;EM&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;(see the model catalog docs for the full, regularly updated list)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/EM&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="border-width: 1px;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Model&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;ONNX&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;vLLM&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Notes&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Phi-4&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Microsoft's flagship SLM&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Phi-4-mini&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Compact, fast inference&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Phi-4-mini-reasoning&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Chain-of-thought reasoning&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Phi-4-reasoning&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;—&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-only, reasoning-focused&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;gpt-oss-20b&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Mid-range generative&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;gpt-oss-120b&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;—&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Large generative,&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-only&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Mistral-7B-v0.2&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Popular open-source LLM&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;DeepSeek-R1 (7b/14b)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;—&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Reasoning-focused&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Qwen2.5 (0.5b–14b)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;—&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Multilingual, coder variants&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Qwen3 (0.6b–14b)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;—&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Latest generation&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Whisper (multiple sizes)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;—&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Speech-to-text&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Nemotron&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&amp;nbsp;(CPU)&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;✓&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 25.00%" /&gt;&lt;col style="width: 25.00%" /&gt;&lt;col style="width: 25.00%" /&gt;&lt;col style="width: 25.00%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;The catalog now includes a growing list of models across both runtimes.&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;Models in&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;format are served using the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;engine with all its performance benefits&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;PagedAttention&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;, continuous batching, FP8 KV cache&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;while ONNX models continue to serve on CPU or GPU through the ONNX-GenAI runtime.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Bring-your-own&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;model (BYOM)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;When you need a model that isn’t in the catalog, bring-your-own&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;model&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;still works the same way:&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;package your&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;model&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;as an OCI artifact in any ORAS-compatible registry (Azure Container Registry, GitHub Container Registry, Docker Hub) and reference it from your&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-charstyle="Verbatim Char"&gt;ModelDeployment&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;. The operator caches it locally and reuses the cached copy on&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;subsequent&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;deployments.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Choosing the right runtime&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="6" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="10" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&lt;STRONG&gt;ONNX-GenAI&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;when&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;you're&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;running on CPU-only hardware, serving a single application with a compact model, or need the broadest model compatibility, including speech and predictive workloads.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="6" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="11" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;vLLM&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;when you have GPU hardware, need to serve concurrent users, want to run larger models, or need production-grade throughput from your inference endpoint.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Both runtimes expose the same OpenAI-compatible REST API&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;the choice is transparent to application code.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
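&lt;P&gt;The rules of thumb above can be sketched as a small decision helper. This is only an illustration of the guidance in this section, not part of Foundry Local: the &lt;CODE&gt;Workload&lt;/CODE&gt; fields, the &lt;CODE&gt;choose_runtime&lt;/CODE&gt; function, and the &lt;CODE&gt;"onnx-genai"&lt;/CODE&gt; label are hypothetical names chosen for the example.&lt;/P&gt;

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical description of a workload; field names are illustrative."""
    has_gpu: bool
    concurrent_users: int = 1
    large_model: bool = False
    speech_or_predictive: bool = False


def choose_runtime(w: Workload) -> str:
    """Return a runtime label following the rules of thumb in this section."""
    # CPU-only hardware and speech/predictive workloads point to ONNX-GenAI.
    if not w.has_gpu or w.speech_or_predictive:
        return "onnx-genai"
    # With GPU hardware, concurrent users or a larger model point to vLLM.
    if w.concurrent_users > 1 or w.large_model:
        return "vllm"
    # A single application with a compact model: ONNX-GenAI is sufficient.
    return "onnx-genai"


# Example: a GPU node serving ten inspection stations at once.
print(choose_runtime(Workload(has_gpu=True, concurrent_users=10)))
```

&lt;P&gt;Treat the thresholds as a starting point; real sizing depends on your models and hardware.&lt;/P&gt;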
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;A vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-charstyle="Verbatim Char"&gt;ModelDeployment&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Body Text"&gt;&amp;nbsp;is as simple as this:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Everything else&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;memory&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;utilization&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;, context length, batch sizing&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;is handled by the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;planner. See the&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-sovereign-clouds/private/foundry-local/concept-model-catalog" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;model catalog docs&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;for the BYO pattern and full configuration options.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;What &lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;hasn't&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;&amp;nbsp;changed&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Everything from the public preview&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;remains&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;fully supported:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="12" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Two installation paths&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;Azure Arc extension (recommended for fleet management) and Helm chart (for platform engineers who need full control)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="13" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;OpenAI-compatible REST endpoints&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-charstyle="Verbatim Char"&gt;POST /v1/chat/completions&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;and standard patterns&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="14" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;API key and Microsoft Entra ID authentication&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;secured with bearer tokens, with the per-identity RBAC model described above&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="15" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;TLS-enabled ingress&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;encrypted traffic in transit&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="16" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Disconnected operation&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;models cached on local&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;PersistentVolumes&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;continue serving when WAN connectivity drops&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="17" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Bring-your-own predictive models&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;deploy custom ONNX models from OCI registries&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="18" data-aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Multi-model orchestration&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;agent-style patterns coordinating multiple local models&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Your&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;existing&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-charstyle="Verbatim Char"&gt;ModelDeployment&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;manifests continue to work. Applications targeting the ONNX-GenAI runtime&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;don't&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;need any changes. The new capabilities are additive.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
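&lt;P&gt;Because both runtimes sit behind the same OpenAI-compatible endpoint, client code can stay the same whichever runtime serves the model. The following is a minimal sketch of building a &lt;CODE&gt;POST /v1/chat/completions&lt;/CODE&gt; request with a bearer token, using only the Python standard library. The base URL, token, and model name are placeholders, and the request body follows the general OpenAI chat format rather than anything specific to Foundry Local; check the documentation for the exact fields your deployment expects.&lt;/P&gt;

```python
import json
import urllib.request


def build_chat_request(base_url: str, token: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completion request."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder values: substitute your own endpoint, credential, and model name.
request = build_chat_request(
    "https://foundry.example.internal", "YOUR_TOKEN", "my-model", "Hello"
)
# To send it, pass the request to urllib.request.urlopen() and decode the JSON reply.
```

&lt;P&gt;Keeping request construction separate from sending makes the client easy to check without a live cluster.&lt;/P&gt;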
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;H3&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Real-world scenarios, now at scale&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Over the past few months,&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;we’ve&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;partnered with customers in early&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;preview&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;to build and&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;validate&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;real-world scenarios. A consistent theme across these engagements is the need to run AI where data&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;resides&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;—on-premises—while&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;maintaining&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;the governance and consistency enabled by Azure Arc.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;"In energy operations, AI needs to run where the work happens – at remote facilities, offshore platforms, and field locations where connectivity is often limited, and safety is paramount. Foundry Local gives us a path to bring AI-driven decision-making closer to our operational data, with the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;governance&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;our industry demands. The ability to deploy and run AI workloads consistently across edge and field environments, even when disconnected, is critical as we advance Chevron's vision for autonomous and intelligent operations."&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/EM&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;- Ed Moore, OT Strategist and Distinguished Engineer, Chevron&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;With multi-node and&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;, the scenarios from our&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;initial&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;preview scale to meet production demands:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H5 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Manufacturing: multi-user quality inspection&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H5&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;A quality-control system on a production line previously ran Phi-4-mini for single-station anomaly explanation.&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;With&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;vLLM's&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;continuous batching, the same Foundry Local endpoint now serves&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;10+ inspection stations concurrently&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;each sending defect images and sensor telemetry for real-time root-cause analysis&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;without response-time degradation.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H5 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Sovereign: identity-scoped document processing&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H5&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;A government agency processing sensitive casework needs production-grade throughput and a strict audit trail. Foundry Local serves the workload on-premises across multiple GPU nodes, with per-analyst access enforced through Entra ID and Azure RBAC, so every inference call is tied to a real identity&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;and no data leaves the cluster.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H5 aria-level="3"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Energy&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;: disconnected multi-user operations&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H5&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;An offshore platform runs Foundry Local on a multi-node Azure Local cluster. When WAN connectivity drops, the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-powered endpoint continues serving safety procedure lookups, maintenance guidance, and operational queries to&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;multiple crew members simultaneously&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;each accessing the inference endpoint from their local application. API-key auth keeps working through the outage; Entra ID resumes seamlessly when the WAN comes back.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Getting started&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;If&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;you're&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;already running Foundry Local on Azure Local in the public preview:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Once installed, the &lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Foundry Local extension is&lt;STRONG&gt; automatically kept up to date&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;, with multi-node and&amp;nbsp;vLLM&amp;nbsp;support included.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;201341983&amp;quot;:2,&amp;quot;335559739&amp;quot;:0,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Browse the updated catalog&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;to discover models available in&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;format&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Deploy a&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;model&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;by setting&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-charstyle="Verbatim Char"&gt;runtime: vllm&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;in your&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-charstyle="Verbatim Char"&gt;ModelDeployment&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;manifest&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Let the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;planner&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;optimize&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;override only the preferences you care about and let the planner handle the rest&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;If&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;you're&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;new to Foundry Local on Azure Local:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="9" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="19" data-aria-level="1"&gt;&lt;A href="https://aka.ms/foundry-local-model-catalog-blog" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Follow the get-started code-sample blog&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;to see the end-to-end flow&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="9" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="19" data-aria-level="1"&gt;&lt;A style="font-style: normal; font-weight: 400; background-color: rgb(255, 255, 255);" href="https://aka.ms/FoundryLocalAzure_PreviewRequest" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Request preview deployment access&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN style="color: rgb(30, 30, 30);" data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt; to get started&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="9" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="19" data-aria-level="1"&gt;&lt;A style="font-style: normal; font-weight: 400; background-color: rgb(255, 255, 255);" href="https://learn.microsoft.com/en-us/azure/azure-sovereign-clouds/private/foundry-local/what-is-foundry-local-on-azure-local" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Read the documentation&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN style="color: rgb(30, 30, 30);" data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;for an architecture overview and deployment guide&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN style="color: rgb(30, 30, 30);" data-ccp-props="{&amp;quot;335559685&amp;quot;:720,&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;What's&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;&amp;nbsp;next&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Multi-node and&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;vLLM&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;are just the beginning.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;We're&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;continuing to invest in:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="10" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="21" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Distributed LLM serving with LLM-D&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;KV-cache-aware routing and disaggregated serving for large models that span multiple nodes&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="10" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="22" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Autoscaling for inference workloads&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;dynamic capacity that follows demand&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="10" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="23" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Broader model catalog expansion&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;&amp;nbsp;more model families, more sizes, more task types&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="10" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="24" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Enhanced monitoring and observability for inference workloads&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="10" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="25" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Performance optimization for specific Azure Local hardware profiles&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="10" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="26" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;Expanded GPU hardware validation across the Azure Local catalog&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;We're&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;building Foundry Local to be the production AI inference platform for edge and sovereign environments. Your feedback is shaping every release&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;-&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;&amp;nbsp;keep it coming.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="First Paragraph"&gt;Learn more:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:180,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="11" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="27" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;&lt;STRONG&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Foundry Local models and multi-node inferencing&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;&lt;A class="lia-external-url" href="https://aka.ms/fl-catalog-multinode-demo" target="_blank" rel="noopener"&gt;demo&lt;/A&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="11" data-list-defn-props="{&amp;quot;335551671&amp;quot;:0,&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="28" data-aria-level="1"&gt;&lt;A href="https://devblogs.microsoft.com/foundry/foundry-local-ga/" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Foundry Local for devices (GA)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;SPAN data-ccp-parastyle="Compact"&gt;For more information, reach out to the team at&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;A href="mailto:FoundryLocalOnAzure@microsoft.com" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;FoundryLocalOnAzure@microsoft.com&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559738&amp;quot;:36,&amp;quot;335559739&amp;quot;:36}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 02 Jun 2026 19:18:28 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/scale-on-prem-ai-with-foundry-local-on-azure-local-multi-node/ba-p/4516692</guid>
      <dc:creator>liranlyabock_microsoft</dc:creator>
      <dc:date>2026-06-02T19:18:28Z</dc:date>
    </item>
    <item>
      <title>Introducing GitHub Enterprise Local (Preview): DevOps for Sovereign and Private Cloud Environments</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/introducing-github-enterprise-local-preview-devops-for-sovereign/ba-p/4523046</link>
      <description>&lt;P&gt;Across the world, many organizations, particularly in government, defense, financial services, and critical infrastructure, must operate within strict sovereign boundaries, often due to regulatory, security, or disconnected environment requirements. Microsoft’s Sovereign Private Cloud is a customer-operated cloud model designed for scenarios where sovereignty, operational control, and resiliency are non-negotiable. It enables organizations to operate securely and at scale, even in restricted or disconnected environments, while maintaining governance aligned with regulatory and national obligations.&lt;/P&gt;
&lt;P&gt;Azure Local is the foundation that makes this possible. With Azure Local, organizations can run critical workloads, including virtual machines, Kubernetes, virtual desktop infrastructure, and AI workloads, on infrastructure they own and control, while still benefiting from Azure-consistent management, governance, and lifecycle operations. We’re continuing to expand the set of workloads and capabilities supported on Azure Local to meet the needs of organizations operating in sovereign and highly regulated environments. With Microsoft 365 Local, Azure Local now extends beyond infrastructure to support communication and collaboration workloads, enabling productivity and resiliency even in disconnected or restricted conditions. And with Foundry Local, we are supporting modern AI workloads on Azure Local, bringing advanced AI capabilities to infrastructure customers own and operate.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;We are excited to announce the public preview of GitHub Enterprise Local,&lt;/STRONG&gt; which brings GitHub’s enterprise developer platform into sovereign and private cloud environments. GitHub Enterprise Local is fully hosted on customer-owned infrastructure, enabling organizations to modernize application development while keeping source code, build pipelines, and development artifacts entirely within their own operational boundaries.&lt;/P&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;What Is&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;GitHub Enterprise Local&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;?&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;GitHub Enterprise Local enables organizations to deploy GitHub Enterprise Server (GHES) entirely within customer&lt;/SPAN&gt;‑&lt;SPAN data-contrast="auto"&gt;owned infrastructure using Azure Local as the underlying private cloud platform. The solution is delivered as a &lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;prebuilt virtual machine image&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt; that runs on Azure Local and operates fully within the customer’s security and network perimeter.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;All repositories, metadata, CI/CD workflows, and artifacts remain on&lt;/SPAN&gt;‑&lt;SPAN data-contrast="auto"&gt;premises. GitHub Enterprise Local is designed to run without internet connectivity by default, making it suitable for both connected and fully disconnected or air&lt;/SPAN&gt;‑&lt;SPAN data-contrast="auto"&gt;gapped environments. At the same time, it preserves a GitHub&lt;/SPAN&gt;‑&lt;SPAN data-contrast="auto"&gt;consistent experience for developers, allowing teams to continue using familiar workflows for source control, collaboration, and automation.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4 aria-level="2"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Developer and Platform Capabilities&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P class="lia-align-justify"&gt;GitHub Enterprise Local provides a comprehensive set of enterprise developer platform capabilities. Teams can host private repositories, manage organizations, and collaborate through pull requests, branch protection rules, and structured code reviews. Issues, wikis, and project collaboration features are also available, enabling end‑to‑end development workflows within the same platform.&lt;/P&gt;
&lt;P class="lia-align-justify"&gt;GitHub Enterprise Local can run on either a single-node or multi-node Azure Local instance depending on customer needs. Single‑node Azure Local runs GHES as a standalone VM, ideal for preview, PoC, and low‑risk scenarios focused on simplicity and cost efficiency. For production-oriented deployments, the same single GHES VM can run on a multi‑node Azure Local cluster, where Azure Local provides VM‑level high availability and failover.&lt;/P&gt;
&lt;P class="lia-align-justify"&gt;For automation and delivery, GitHub Enterprise Local supports GitHub Actions using self‑hosted runners. This allows organizations to build and run CI/CD pipelines entirely within their own environments, with full control over execution context, dependencies, and network access. GitHub Packages can be used for artifact management, supporting common ecosystems such as npm, NuGet, Maven, and container images.&lt;/P&gt;
&lt;P class="lia-align-justify"&gt;GitHub Enterprise Local extends modern development workflows with AI-assisted experiences while keeping sensitive data within customer-controlled environments. Developers can use GitHub Copilot in several ways, including as a standalone experience, through Copilot CLI, and in VS Code. They can use GitHub-managed models by connecting to GitHub.com, or connect directly to model providers from Copilot CLI so that source code does not pass through GitHub Cloud. Foundry Local provides an on-premises inference layer that keeps prompts, code context, and model execution inside organizational boundaries. Together, these capabilities create a clear integration path across code automation and AI application development, enabling organizations to modernize the developer experience while preserving operational control, compliance, and auditability.&lt;/P&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4 aria-level="2"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Developer AI Workflow Architecture&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P&gt;This architecture demonstrates how GitHub Enterprise Local serves as the secure, customer-managed foundation for source control, collaboration, and workflow orchestration, enabling developers to layer AI-assisted capabilities through GitHub Copilot, GitHub CLI, and Foundry Local—while ensuring that code, data, and AI execution remain fully within organizational boundaries.&lt;/P&gt;
&lt;img&gt;Reference Architecture&lt;/img&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4 aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 1"&gt;Architecture Overview&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;GitHub Enterprise Local follows a layered architecture model.&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H5&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Infrastructure Layer&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H5&gt;
&lt;/DIV&gt;
&lt;P&gt;Azure Local forms the foundation, deployed on Azure Local–certified hardware. It provides:&lt;/P&gt;
&lt;UL class="lia-align-justify"&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;The virtualization platform for running GitHub Enterprise Local&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Infrastructure availability and update management&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Customer&lt;/SPAN&gt;‑&lt;SPAN data-contrast="auto"&gt;controlled networking, identity, and security policies&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Azure Arc&lt;/SPAN&gt;‑&lt;SPAN data-contrast="auto"&gt;enabled management for infrastructure lifecycle operations&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H5 aria-level="3"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;GitHub Enterprise Local &lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Appliance Layer&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:160,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H5&gt;
&lt;/DIV&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;GitHub Enterprise Server (GHES) is deployed as a prebuilt virtual machine image on Azure Local. This VM includes:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL class="lia-align-justify"&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="5" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;The GHES application stack&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="5" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Persistent data disks for repositories and metadata&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="5" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Support for replica&lt;/SPAN&gt;‑&lt;SPAN data-contrast="auto"&gt;based failover configurations, depending on customer requirements&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;All application data remains within customer infrastructure boundaries.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H5 aria-level="3"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 3"&gt;Operations Layer&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H5&gt;
&lt;/DIV&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;Operational responsibilities are clearly separated:&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL class="lia-align-justify"&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="6" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Azure Local administrators manage the Azure Local infrastructure through Azure&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="6" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;GitHub administrators manage GHES configuration, upgrades, user access, and ongoing maintenance through the GitHub Management Console and site admin dashboard&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;This separation aligns with common enterprise operational models.&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4 aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 1"&gt;Connectivity Modes and Deployment Scenarios&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;GHES is designed to operate fully offline, making it suitable for air&lt;/SPAN&gt;‑&lt;SPAN data-contrast="auto"&gt;gapped and restricted environments. Azure Local complements this capability by supporting both connected and fully disconnected operational modes. In connected environments, customers can take advantage of centralized management and monitoring of the GHES appliance. In disconnected environments, the entire solution can operate in complete isolation, ensuring compliance with strict sovereignty or security mandates.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;This flexibility allows organizations to adopt a deployment model that aligns with their regulatory, operational, and security requirements.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-align-justify"&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 1"&gt;Hardware and Capacity Planning&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;GitHub Enterprise Local virtual machine sizing depends on customer use cases, including:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL class="lia-align-justify"&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Number of developers&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Repository size and growth&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;CI/CD pipeline frequency&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="7" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;multilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;SPAN data-contrast="auto"&gt;Artifact storage requirements&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;Azure Local supports running GitHub Enterprise Local on both Integrated and Premier hardware solutions, provided sufficient capacity is available. Customers should plan compute, memory, storage, and network resources accordingly.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-align-justify"&gt;&lt;A href="https://docs.github.com/en/enterprise-server@3.15/admin/monitoring-and-managing-your-instance/updating-the-virtual-machine-and-physical-resources/increasing-storage-capacity#minimum-recommended-requirements" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Minimum recommended requirements&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4 aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 1"&gt;Billing Overview&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P&gt;GitHub Enterprise Local combines user-based application licensing, Azure Local infrastructure-based billing, and separate pricing for AI services such as Copilot and Foundry.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;GitHub Enterprise Local is billed per user seat. (&lt;SPAN data-contrast="auto"&gt;&lt;A href="https://docs.github.com/en/billing/concepts/enterprise-billing/billing-for-enterprises" target="_blank" rel="noopener"&gt;GitHub Enterprise license&lt;/A&gt;)&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;Azure Local is billed per physical CPU core. (&lt;A href="https://learn.microsoft.com/en-us/azure/azure-local/concepts/billing?view=azloc-2602" target="_blank" rel="noopener"&gt;Azure Local Billing&lt;/A&gt;)&lt;/LI&gt;
&lt;LI&gt;Copilot and Foundry have separate service-based pricing. (&lt;A href="https://github.com/features/copilot/plans" target="_blank" rel="noopener"&gt;GitHub Copilot Plans &amp;amp; pricing&lt;/A&gt;)&lt;/LI&gt;
&lt;/UL&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4 aria-level="1"&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 1"&gt;Public Preview Access&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:360,&amp;quot;335559739&amp;quot;:80}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;GitHub Enterprise Local on Azure Local is available today in &lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;public preview&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;. Customers can request access by completing the public preview registration form. Submissions are reviewed as part of the preview onboarding process.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-align-justify"&gt;&lt;SPAN data-contrast="auto"&gt;Participate in public preview:&lt;/SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;A class="lia-external-url" href="https://aka.ms/GitHubEnterpriseLocalSurvey" target="_blank" rel="noopener"&gt;GitHub Enterprise Local Preview Sign-Up&lt;/A&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="lia-align-justify"&gt;
&lt;H4&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Learn More&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/H4&gt;
&lt;/DIV&gt;
&lt;P&gt;&lt;A href="https://aka.ms/GHEL" target="_blank" rel="noopener"&gt;GitHub Enterprise Local &lt;SPAN data-teams="true"&gt;documentation&lt;/SPAN&gt;&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 02 Jun 2026 19:15:00 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/introducing-github-enterprise-local-preview-devops-for-sovereign/ba-p/4523046</guid>
      <dc:creator>sushil_jarwal</dc:creator>
      <dc:date>2026-06-02T19:15:00Z</dc:date>
    </item>
    <item>
      <title>Embed intelligence into physical systems with smaller form factor infrastructure (preview)</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/embed-intelligence-into-physical-systems-with-smaller-form/ba-p/4524876</link>
      <description>&lt;P&gt;&lt;EM&gt;Written by Cosmos Darwin, Azure Edge PM, and Michael MacKenzie, VP of Digital Operations&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;AI is transforming how we work, but so far it's mostly lived on your screen: agents and models assisting with &lt;EM&gt;information &lt;/EM&gt;work. How can that intelligence take on &lt;EM&gt;physical &lt;/EM&gt;work, too? Jobs that happen out in the world, like transporting goods, inspecting equipment, manufacturing products, and serving retail customers. This is already possible today, but developing autonomous robots remains highly complex and specialized. The real breakthrough will come when using AI in physical work is as simple and ubiquitous as it is on a screen. To get there, we need to go beyond software agents and embed intelligence directly into physical systems.&lt;/P&gt;
&lt;P&gt;Today at Microsoft Build 2026, we're announcing several new capabilities to help organizations everywhere get started. We're extending AI-ready Azure-managed infrastructure to smaller form factor hardware, bringing Foundry Local to it for running local AI agents and models, and adding support for Azure Kubernetes Service and Azure IoT Operations.&lt;/P&gt;
&lt;H3&gt;&lt;STRONG&gt;Demo: a simple robot that thinks for itself&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;Applied in combination, these capabilities can be surprisingly powerful. For Microsoft Build this week, we wanted to show you just how easy this can be. We put together a basic agentic robot using nothing but open-source AI models, commercial off-the-shelf sensors and robot hardware, and the new Azure previews we're announcing today. It's a playful example, but it illustrates what’s possible – check it out:&lt;/P&gt;
&lt;DIV class="lia-embeded-content" contenteditable="false"&gt;&lt;IFRAME src="https://www.youtube.com/embed/CUsJrodGnxw?si=Kc720HAjlTNXT9JO" width="960" height="540" title="YouTube video player" allowfullscreen="allowfullscreen" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" frameborder="0" sandbox="allow-scripts allow-same-origin allow-forms"&gt;&lt;/IFRAME&gt;&lt;/DIV&gt;
&lt;H3&gt;&lt;STRONG&gt;Lightweight deployments on smaller form factor hardware (preview)&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;First, we're extending Azure-based provisioning and management to smaller hardware form factors, using a lightweight, performance-oriented architecture built for AI workloads. Unlike hyperconverged and disaggregated deployments, this doesn’t rely on virtualization, and instead runs Linux (initially Azure Linux) directly on bare metal to host containers. You can choose whichever runtime tools you prefer, like Docker, open source k3s, or fully managed Azure Kubernetes Service.&lt;/P&gt;
&lt;P&gt;Each deployment is provisioned and managed from the cloud using a new type of resource called Provisioned Machine that looks and behaves a lot like an Azure VM – for example, you can see it in the Azure portal and govern access with Microsoft Entra ID. Over the coming months, we’ll be rolling out more features like update management, metrics, security configuration, and natively configurable child resources for network interfaces and disks.&lt;/P&gt;
&lt;img&gt;
&lt;P&gt;Screenshot of the new Provisioned Machine resource type in Azure portal.&lt;/P&gt;
&lt;/img&gt;
&lt;P&gt;Provisioned Machines support lifecycle operations centrally from the Azure portal and APIs. Effectively, you can treat physical machines like cloud resources, removing the need for separate on-site IT tools. This makes it much more practical to scale across many distributed locations. For an organization like Chevron, whose operations span field sites around the world, that’s significant:&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;&lt;EM&gt;"Chevron has a growing fleet of industrial edge devices that collect data in the field and increasingly perform local AI processing. Technologies like Azure Local on smaller form factors can help us manage these systems centrally and in a more automated way – reducing complexity compared to the customized OS environments and tools we use today."&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;— Ed Moore, OT Strategist and Distinguished Engineer, Chevron&lt;/P&gt;
&lt;H3&gt;&lt;STRONG&gt;Run agents and models locally with Foundry Local (preview)&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;To embed intelligence into physical systems, Foundry Local is now available as a lightweight container image for Linux infrastructure.&lt;/P&gt;
&lt;P&gt;Foundry Local provides a consistent way to deploy and run agents and models, including an inference server that runs alongside your app container and exposes an OpenAI-compatible REST endpoint. It also offers a trusted source for the latest open-source models with an extensive online catalog. Although it integrates closely with Microsoft Foundry, at run time everything stays local: there's no round-trip to the cloud. Data stays on the machine, responses start without waiting on a network round-trip, and inference continues even without connectivity. There are no per-token costs, either.&lt;/P&gt;
&lt;P&gt;Optimized for edge and industrial form factors, the new Foundry Local preview automatically detects and uses available accelerators like GPUs (and soon NPUs), lining up the full stack for you, from kernel drivers to user-mode libraries. For example, in our demo above, Foundry Local taps an Nvidia RTX 2000E GPU to deliver snappy inferences in real time.&lt;/P&gt;
&lt;img&gt;
&lt;P&gt;Diagram of the lightweight Linux architecture with container-based Azure services.&lt;/P&gt;
&lt;/img&gt;
&lt;H3&gt;&lt;STRONG&gt;More popular Azure services&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;In addition to Foundry Local, these popular Azure services are validated too:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Azure Kubernetes Service (AKS)&lt;/STRONG&gt;, the fully managed, enterprise-grade Kubernetes service, now runs directly on bare metal with small form factor deployments – no virtualization layer required. It's the same AKS already available in the cloud and on servers. Once deployed, the cluster looks and works exactly like AKS anywhere else – with Azure-based RBAC, networking, upgrades, monitoring, and even integrations like AKS Fleet Manager – so the controls and tooling you rely on in the cloud extend all the way to the industrial edge. &lt;A class="lia-external-url" href="https://aka.ms/aks-edge-baremetal" target="_blank" rel="noopener"&gt;Learn more and join the AKS preview&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Azure IoT Operations&lt;/STRONG&gt; provides a unified data and control plane for physical assets at the edge. It includes a variety of connectors and an industrial-grade MQTT broker where local agents and logic can run – even with intermittent connectivity – to shape operational data into AI-ready forms, act on it autonomously, and connect into broader cloud analytics and AI systems. It provides a no-code graphical interface to configure data flows and contextualize data before sending it to destinations like Microsoft Fabric for Real-Time Intelligence, and allows you to send messages back to the physical machines it’s connected to. It's already generally available, and as seen in our demo above, it now works on small form factor deployments too. &lt;A class="lia-external-url" href="https://learn.microsoft.com/azure/iot-operations/overview-iot-operations" target="_blank" rel="noopener"&gt;Learn more about Azure IoT Operations&lt;/A&gt;&lt;/P&gt;
&lt;H3&gt;&lt;STRONG&gt;Choose the hardware that fits your requirements&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;We're delighted to partner with leading makers of edge and industrial computers so you can deploy Azure-managed infrastructure on smaller form factor hardware that’s available to buy today – straight from your preferred vendor or distributor, with no special customization required.&lt;/P&gt;
&lt;img&gt;
&lt;P&gt;We’re partnering with leading makers of AI-ready edge and industrial computers.&lt;/P&gt;
&lt;/img&gt;
&lt;P&gt;The most compact and affordable options are the ASUS NUC 14 Pro and 15 Pro. At barely 4 inches square and under 2 pounds, they pack the latest Intel® Core™ Ultra processors into a remarkably trim package, well suited to space-constrained scenarios like retail. &lt;A class="lia-external-url" href="https://www.asus.com/us/displays-desktops/nucs/nuc-mini-pcs/asus-nuc-15-pro/" target="_blank" rel="noopener"&gt;Learn more about NUC 15 Pro&lt;/A&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;&lt;EM&gt;“With ASUS NUC 14 Pro and 15 Pro, organizations have a powerful yet compact platform for innovation at the edge. When paired with Azure Local, these devices make it easy to deploy, manage, and scale AI workloads at the edge – unlocking real-time intelligence for retail stores and manufacturing environments while maintaining seamless integration with the cloud.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;– (ASUS) KuoWei Chao, General Manager of ASUS NUC Business Unit&lt;/P&gt;
&lt;P&gt;For more flexibility, the industrial-grade Lenovo ThinkEdge SE100 offers expandable storage and networking, plus an optional Nvidia RTX A1000 (8GB) or 2000E (16GB) GPU to accelerate demanding edge AI inferencing. &lt;A class="lia-external-url" href="https://www.lenovo.com/us/en/p/servers-storage/servers/edge/thinkedge-se100/len21te0020" target="_blank" rel="noopener"&gt;Learn more about ThinkEdge SE100&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;For the toughest operational and regulatory constraints, the OnLogic Helix 521 offers a fan-less design with no moving parts. Designed, assembled, and supported entirely in the USA, it takes the uncertainty out of meeting stringent supply-chain requirements. &lt;A class="lia-external-url" href="https://www.onlogic.com/store/k521-azurelocal/" target="_blank" rel="noopener"&gt;Learn more about the Hx521&lt;/A&gt;&lt;/P&gt;
&lt;H3&gt;&lt;STRONG&gt;Get started today&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;We're excited to bring AI-ready infrastructure to where physical work happens, and we genuinely had a lot of fun making the agentic robot demo above. Now it's your turn. Small form factor deployments are available in public preview today, starting in the East US region. There is no charge during the preview. Once your hardware is ready, the Azure-based provisioning experience gets most previewers up and running in about an hour. &lt;A class="lia-external-url" href="https://learn.microsoft.com/azure/azure-local/small-form-factor/small-form-factor-overview" target="_blank" rel="noopener"&gt;Instructions to get started are on Microsoft Learn&lt;/A&gt;, and if you’d like to engage directly with our team, &lt;A class="lia-external-url" href="https://aka.ms/AzureSFFGetInTouch" target="_blank"&gt;get in touch&lt;/A&gt; here. (If you need to evaluate before committing to hardware, you can spin it up on a virtual machine, though it’s not quite the same as real hardware.)&lt;/P&gt;
&lt;P&gt;Whether you're bringing intelligence to a fleet of machines, standing up inference next to your data, or building something we haven't even imagined yet, we can't wait to see what you create!&lt;/P&gt;
&lt;P&gt;- Cosmos &amp;amp; Mike&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;on behalf of our global team in Redmond, Mountain View, Pittsburgh, and Bengaluru&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 03 Jun 2026 15:57:44 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/embed-intelligence-into-physical-systems-with-smaller-form/ba-p/4524876</guid>
      <dc:creator>Cosmos_Darwin</dc:creator>
      <dc:date>2026-06-03T15:57:44Z</dc:date>
    </item>
    <item>
      <title>Ansible + Azure Arc: Manage Azure Arc Extensions with New Ansible Modules</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/ansible-azure-arc-manage-azure-arc-extensions-with-new-ansible/ba-p/4521689</link>
      <description>&lt;P&gt;We’re excited to announce new modules in Ansible Galaxy that make it easier to manage Azure Arc machine extensions at scale. With the latest updates to the &lt;A href="https://galaxy.ansible.com/ui/repo/published/azure/azcollection/" target="_blank" rel="noopener"&gt;azure.azcollection&lt;/A&gt; on Ansible Galaxy, you can now deploy and manage Azure Arc extensions using familiar, declarative Ansible workflows.&lt;/P&gt;
&lt;P&gt;These new modules include:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;A class="lia-external-url" href="https://galaxy.ansible.com/ui/repo/published/azure/azcollection/content/module/azure_rm_arcmachineextensions/?keywords=arc" target="_blank" rel="noopener"&gt;Azure Arc machine extensions module&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A class="lia-external-url" href="https://galaxy.ansible.com/ui/repo/published/azure/azcollection/content/module/azure_rm_arcmachineextensions_info/?keywords=arc" target="_blank" rel="noopener"&gt;Azure Arc extensions info module&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Together, they enable infrastructure and platform teams to automate extension lifecycle management across their hybrid estate—bringing consistency, security, and efficiency to Azure Arc-enabled servers.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Why this matters&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Azure Arc machine extensions power critical scenarios such as security, monitoring, update management, configuration and compliance. Until now, managing these Azure Arc extensions across hybrid estates often required Azure CLI scripts, ARM templates, or manual operations. With these new Ansible modules, you can:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Integrate Azure Arc extension management into existing Ansible playbooks&lt;/LI&gt;
&lt;LI&gt;Enforce consistent configuration across hybrid servers&lt;/LI&gt;
&lt;LI&gt;Reduce operational overhead through declarative automation&lt;/LI&gt;
&lt;LI&gt;Align extension deployment with broader configuration management workflows&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;STRONG&gt;What’s included&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;azure_rm_arcmachineextensions&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;This module allows you to manage the full lifecycle of Azure Arc machine extensions, including:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Creating and deploying extensions&lt;/LI&gt;
&lt;LI&gt;Updating extension settings&lt;/LI&gt;
&lt;LI&gt;Removing extensions when no longer needed&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;You can define extension state declaratively, ensuring consistent enforcement across your Azure Arc-enabled servers.&lt;/P&gt;
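&lt;P&gt;As a minimal sketch of what declarative removal could look like: this assumes the module accepts &lt;CODE&gt;state: absent&lt;/CODE&gt; (the usual convention for modules in azure.azcollection) and identifies the extension by resource group, machine, and extension name. The resource names are placeholders, not values from a real environment.&lt;/P&gt;
&lt;LI-CODE lang="yaml"&gt;- name: Remove an extension from an Arc server
  hosts: localhost
  connection: local
  tasks:
    - name: Ensure the Entra SSH extension is absent
      # Assumption: state: absent removes the named extension if it is installed
      azure.azcollection.azure_rm_arcmachineextensions:
        resource_group: myResourceGroup   # placeholder
        machine_name: myArcServer         # placeholder
        name: AADSSHLoginForLinux
        state: absent&lt;/LI-CODE&gt;
&lt;P&gt;Because the task describes a desired end state rather than an action, rerunning the playbook against a machine that no longer has the extension should report no change.&lt;/P&gt;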
&lt;P&gt;&lt;STRONG&gt;azure_rm_arcmachineextensions_info&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;This module provides visibility into extension state by retrieving:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Installed extensions on Azure Arc-enabled machines&lt;/LI&gt;
&lt;LI&gt;Provisioning status and configuration details&lt;/LI&gt;
&lt;LI&gt;Extension metadata for reporting and validation&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;This is useful for compliance validation, auditing, and conditional automation in playbooks.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Scenario: Enforcing identity-based SSH access across a hybrid fleet&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Consider a regulated enterprise that must ensure all Linux servers—whether on-premises or in a multicloud environment—use Microsoft Entra ID for SSH access. The organization wants to:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Eliminate local SSH credentials&lt;/LI&gt;
&lt;LI&gt;Enforce centralized identity and access controls&lt;/LI&gt;
&lt;LI&gt;Audit access consistently across all environments&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;By combining Azure Arc with Ansible, the organization can deploy the&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/ssh-arc-overview?tabs=azure-cli" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Microsoft Entra SSH for Linux extension&lt;/STRONG&gt;&lt;/A&gt; across all Azure Arc-enabled servers as part of a standardized playbook, ensuring compliance and reducing operational overhead.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Example: Deploy Microsoft Entra SSH for Linux extension&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Below is an example of using Ansible to deploy the Microsoft Entra SSH extension to an Azure Arc-enabled server:&lt;/P&gt;
&lt;LI-CODE lang="yaml"&gt;- name: Deploy Entra SSH extension to Arc server
  hosts: localhost
  connection: local
  tasks:
    - name: Install Entra SSH extension for Linux
      azure_rm_arcmachineextensions:
        resource_group: myResourceGroup
        machine_name: myArcServer
        name: AADSSHLoginForLinux
        publisher: Microsoft.Azure.ActiveDirectory
        type: AADSSHLoginForLinux
        type_handler_version: "1.0"
        settings: {}
        state: present
&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
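&lt;P&gt;&lt;STRONG&gt;Example: Deploy the extension to several servers&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;To roll the same extension out across a fleet, one option is a standard Ansible &lt;CODE&gt;loop&lt;/CODE&gt; over a list of machine names. The sketch below is illustrative only: the &lt;CODE&gt;arc_machines&lt;/CODE&gt; variable and the machine names in it are hypothetical, and it assumes all machines live in the same resource group.&lt;/P&gt;
&lt;LI-CODE lang="yaml"&gt;- name: Deploy Entra SSH extension to several Arc servers
  hosts: localhost
  connection: local
  vars:
    # Hypothetical machine names; replace with your own Arc-enabled servers
    arc_machines:
      - myArcServer01
      - myArcServer02
  tasks:
    - name: Install Entra SSH extension for Linux on each machine
      azure.azcollection.azure_rm_arcmachineextensions:
        resource_group: myResourceGroup
        machine_name: "{{ item }}"      # current entry from arc_machines
        name: AADSSHLoginForLinux
        publisher: Microsoft.Azure.ActiveDirectory
        type: AADSSHLoginForLinux
        type_handler_version: "1.0"
        settings: {}
        state: present
      loop: "{{ arc_machines }}"&lt;/LI-CODE&gt;
&lt;P&gt;In practice the list could come from an inventory file or a group variable rather than being written inline.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;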
&lt;P&gt;&lt;STRONG&gt;Example: Retrieve extension information&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Below is an example of using Ansible to retrieve details about your Azure Arc extensions:&lt;/P&gt;
&lt;LI-CODE lang="yaml"&gt;- name: Get Arc machine extension details
  hosts: localhost
  connection: local
  tasks:
    - name: Fetch extensions
      azure_rm_arcmachineextensions_info:
        resource_group: myResourceGroup
        machine_name: myArcServer&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
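&lt;P&gt;&lt;STRONG&gt;Example: Capture and inspect the returned details&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;To use the info module for auditing or conditional logic, you would typically capture its result with &lt;CODE&gt;register&lt;/CODE&gt;. The sketch below prints the whole result instead of assuming specific field names, since the exact return structure should be checked against the module documentation. The variable name &lt;CODE&gt;arc_extensions&lt;/CODE&gt; is hypothetical.&lt;/P&gt;
&lt;LI-CODE lang="yaml"&gt;- name: Report Arc machine extension details
  hosts: localhost
  connection: local
  tasks:
    - name: Fetch extensions
      azure.azcollection.azure_rm_arcmachineextensions_info:
        resource_group: myResourceGroup
        machine_name: myArcServer
      register: arc_extensions    # hypothetical variable name

    - name: Print everything the module returned
      ansible.builtin.debug:
        var: arc_extensions&lt;/LI-CODE&gt;
&lt;P&gt;Once you know which fields the module returns, later tasks can reference them in &lt;CODE&gt;when&lt;/CODE&gt; conditions.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;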
&lt;P&gt;&lt;STRONG&gt;Integrating with existing Ansible workflows&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;If you’re already using Ansible for:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;OS configuration&lt;/LI&gt;
&lt;LI&gt;Patch and update management&lt;/LI&gt;
&lt;LI&gt;Application deployment&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;You can now extend those workflows to include Azure Arc extension management without introducing new tools or processes. This allows you to manage on-premises servers, edge infrastructure, and multicloud environments through a unified automation approach powered by Azure Arc and Ansible.&lt;/P&gt;
&lt;P&gt;Read more at &lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/manage-vm-extensions-ansible" target="_blank" rel="noopener"&gt;Enable VM Extensions Using Red Hat Ansible - Azure Arc | Microsoft Learn&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;What’s next&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;These modules are part of our continued investment in making Azure Arc a first-class platform for managing Windows and Linux machines in hybrid and multicloud infrastructure.&lt;/P&gt;
&lt;P&gt;By bringing extension lifecycle management into Ansible, we’re enabling teams to enforce security, compliance, and operational consistency at scale—using the tools they already trust.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Stay connected&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Join the Azure Arc Monthly Forum here: &lt;A href="https://aka.ms/ArcServerForumSignup" target="_blank"&gt;aka.ms/ArcServerForumSignup&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Let us know what you’d like to see next in the comments!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 28 May 2026 17:24:53 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/ansible-azure-arc-manage-azure-arc-extensions-with-new-ansible/ba-p/4521689</guid>
      <dc:creator>alinetran</dc:creator>
      <dc:date>2026-05-28T17:24:53Z</dc:date>
    </item>
    <item>
      <title>Azure Arc Server April 2026 Forum</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/azure-arc-server-april-2026-forum/ba-p/4521318</link>
      <description>&lt;P&gt;Please find the&amp;nbsp;&lt;A class="lia-external-url" href="https://www.youtube.com/watch?v=QjAH4c67dR8" target="_blank"&gt;recording for the monthly Azure Arc Server Forum on YouTube&lt;/A&gt;!&lt;/P&gt;
&lt;P&gt;During the April 2026 Azure Arc Server Forum, we discussed:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Public Preview of Essential Machine Management: learn more at &lt;A class="lia-external-url" href="https://aka.ms/EMM-blog" target="_blank"&gt;aka.ms/EMM-blog&lt;/A&gt; and sign up at &lt;A class="lia-external-url" href="https://aka.ms/EMM-feedback" target="_blank"&gt;aka.ms/EMM-feedback&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Engage with the product group on exploring AI on bring-your-own Kubernetes by signing up at &lt;A class="lia-external-url" href="https://aka.ms/arc-ai-survey" target="_blank"&gt;aka.ms/arc-ai-survey&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;The product group is investing in extending the Multi-cloud Connector to let customers connect their MECM environments to Azure for inventory, monitoring, and management&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;To sign up for the Azure Arc Server Forum and newsletter, please register with contact details at&amp;nbsp;&lt;A href="https://aka.ms/arcserverforumsignup/" target="_blank"&gt;https://aka.ms/arcserverforumsignup/&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;For the latest agent release notes, check out&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/agent-release-notes" target="_blank"&gt;What's new with Azure Connected Machine agent - Azure Arc | Microsoft Learn&lt;/A&gt;.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Our May 2026 forum will be held on Thursday, May 21 at 9:30 AM PT / 12:30 PM ET. We look forward to you joining us. Thank you!&lt;/P&gt;</description>
      <pubDate>Tue, 19 May 2026 18:58:36 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/azure-arc-server-april-2026-forum/ba-p/4521318</guid>
      <dc:creator>Aurnov_Chattopadhyay</dc:creator>
      <dc:date>2026-05-19T18:58:36Z</dc:date>
    </item>
    <item>
      <title>Simplified access to Hotpatching enabled by Azure Arc for Windows Server 2025</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/simplified-access-to-hotpatching-enabled-by-azure-arc-for/ba-p/4521251</link>
      <description>&lt;P&gt;With Windows Server 2025, we introduced &lt;A href="https://www.microsoft.com/en-us/windows-server/blog/2025/04/24/tired-of-all-the-restarts-get-hotpatching-for-windows-server/" target="_blank" rel="noopener"&gt;hotpatch enabled by Azure Arc&lt;/A&gt;, delivering security updates to Windows Server across hybrid and multicloud environments – minimizing downtime (no reboot), accelerating protection, and unifying patch management.&lt;/P&gt;
&lt;P&gt;We know that keeping your servers updated with the latest patches is one of the critical tasks that IT teams perform day-to-day. We want to make it simpler to install the latest operating system (OS) updates without rebooting machines after every installation.&lt;/P&gt;
&lt;P&gt;The resounding feedback we have received from you underscored how critical this feature is to the lifecycle management and security of your infrastructure. We are now taking it one step further to reduce the friction of deploying these critical updates: &lt;EM&gt;hotpatch enabled by Azure Arc is now available at &lt;STRONG&gt;no additional cost&lt;/STRONG&gt; for Windows Server 2025.&lt;/EM&gt;&lt;/P&gt;
&lt;DIV class="lia-embeded-content" contenteditable="false"&gt;&lt;IFRAME src="https://www.linkedin.com/embed/feed/update/urn:li:ugcPost:7462589752584622080?compact=1" width="504" height="399" title="Embedded post" allowfullscreen="allowfullscreen" frameborder="0" sandbox="allow-scripts allow-same-origin allow-forms"&gt;&lt;/IFRAME&gt;&lt;/DIV&gt;
&lt;H3&gt;Which machines are eligible for this offer?&lt;/H3&gt;
&lt;P&gt;To use hotpatch for Windows Server machines running on-premises or in multicloud environments, you must be running Windows Server 2025 Standard or Datacenter, and your server must be connected to Azure Arc. With this announcement, enabling and using the hotpatching service incurs no additional charge.&lt;/P&gt;
&lt;P&gt;Please take note that there are &lt;STRONG&gt;no charges&lt;/STRONG&gt; for customers running on Azure IaaS, or Azure Local, wherein hotpatching is available as part of the functionality of Windows Server Datacenter: Azure Edition. This feature is already included both with Windows Server 2022 Datacenter: Azure Edition and Windows Server 2025 Datacenter: Azure Edition.&lt;/P&gt;
&lt;H3&gt;How do I manage hotpatches enabled by Azure Arc for Windows Server 2025?&lt;/H3&gt;
&lt;P&gt;If your Windows Server 2025 machines aren't already connected to Azure Arc, &lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/onboard-portal" target="_blank" rel="noopener"&gt;install the Azure Connected Machine agent&lt;/A&gt; — it takes just a few minutes per server and supports at-scale rollout via &lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/onboard-group-policy-powershell" target="_blank" rel="noopener"&gt;Group Policy&lt;/A&gt;, &lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/onboard-service-principal" target="_blank" rel="noopener"&gt;service principal&lt;/A&gt;, or Terraform. Once connected, &lt;A href="https://learn.microsoft.com/en-us/windows-server/get-started/enable-hotpatch-azure-arc-enabled-servers" target="_blank" rel="noopener"&gt;enable Hotpatch&lt;/A&gt; from the Azure portal, Azure PowerShell, Azure CLI, or the &lt;A href="https://techcommunity.microsoft.com/blog/coreinfrastructureandsecurityblog/managing-hotpatching-for-azure-arc-connected-machines-using-apis/4458192" target="_blank" rel="noopener"&gt;REST API&lt;/A&gt; — just confirm &lt;A href="https://learn.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-vbs" target="_blank" rel="noopener"&gt;Virtualization-based security&lt;/A&gt;&amp;nbsp;(&lt;A href="https://learn.microsoft.com/en-us/windows-server/get-started/enable-hotpatch-azure-arc-enabled-servers#check-and-enable-virtual-secure-mode-if-necessary" target="_blank" rel="noopener"&gt;VBS is enabled&lt;/A&gt;) first. From there, use &lt;A href="https://learn.microsoft.com/en-us/azure/update-manager/manage-arc-enabled-servers" target="_blank" rel="noopener"&gt;Azure Update Manager&lt;/A&gt; to schedule and monitor rollouts at scale.&lt;/P&gt;
&lt;P&gt;For instructions on enabling hotpatch for Azure Arc-enabled machines using Group Policy or scripts, see &lt;A href="https://aka.ms/ws-hotpatch" target="_blank" rel="noopener"&gt;https://aka.ms/ws-hotpatch&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;For patch orchestration at scale, you can use Azure Update Manager to deliver hotpatches enabled by Azure Arc to&amp;nbsp;Windows Server 2025 machines. This provides&amp;nbsp;greater uptime with fewer reboots and faster deployment of updates through simple patch orchestration. Alternatively, you can use &lt;A href="https://techcommunity.microsoft.com/blog/coreinfrastructureandsecurityblog/managing-hotpatching-for-azure-arc-connected-machines-using-apis/4458192" target="_blank" rel="noopener"&gt;APIs&lt;/A&gt; or other management tools to manage hotpatches.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P class="lia-align-center"&gt;&lt;STRONG&gt;Centralized management of hotpatch updates across hybrid and multicloud environments enabled by Azure Arc&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Once your machines are connected to Azure Arc, you can also use cloud-native Azure services to manage your Windows machines running on-premises. Azure Arc enables you to standardize security and governance across a wide range of resources so you can easily organize, govern, and secure Windows, Linux, SQL servers, and Kubernetes clusters running across data centers, edge, and multi-cloud environments – using Azure services such as Azure Policy, Azure Monitor, Microsoft Defender, and more.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="width: 1050px; border-width: 1px;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td colspan="2"&gt;
&lt;P class="lia-align-center"&gt;&lt;STRONG&gt;At no additional cost for machines attached to Azure Arc&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Basic inventory across on-prem and multi-cloud&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Tag your resources, organize them into resource groups, subscriptions, and management groups, and query at scale with Azure Resource Graph to unify your environments.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Infra as Code (Bicep, Terraform)&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Infra as code for provisioning and management of resources.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;VM Self Service&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Perform lifecycle management operations (create, resize, update, and delete) and power cycle operations (start, stop, and restart) on VMware vCenter and System Center Virtual Machine Manager virtual machines.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Hotpatch for Windows Server 2025 &lt;/STRONG&gt;&lt;STRONG&gt;&lt;EM&gt;&lt;SUP&gt;NEW&lt;/SUP&gt;&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Windows Server hotpatching enables you to apply security updates&amp;nbsp;without rebooting, keeping systems secure while maintaining continuous uptime.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;VM Management&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Administer your servers anywhere using SSH for Azure Arc, Run Command, and Custom Script Extension.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td colspan="2"&gt;
&lt;P class="lia-align-center"&gt;&lt;STRONG&gt;Management services &lt;U&gt;included&lt;/U&gt; at no additional cost with Windows Server Software Assurance or Extended Security Updates&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Azure Update Manager&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Provides a unified, centralized service to monitor, orchestrate, and automate patching across Azure, on‑prem, and multi‑cloud environments ensuring security, compliance, and minimal downtime at scale.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Azure Machine Configuration (Policy)&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Policy‑driven auditing and enforcement of OS and application settings as code across Azure and hybrid machines, ensuring a consistent, compliant state at scale. This includes compliance policies such as &lt;STRONG&gt;CIS Benchmark&lt;/STRONG&gt; and &lt;STRONG&gt;WinRE&lt;/STRONG&gt;.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Change Tracking &amp;amp; Inventory&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Real‑time visibility into configuration changes and system state across your fleet enabling faster troubleshooting, improved security, and continuous compliance at scale.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;VM insights from Azure Monitor&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Delivers a unified, pre‑built observability experience that provides real‑time performance, health, and dependency visibility across VMs—enabling faster troubleshooting, optimization, and capacity planning at scale.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Windows Admin Center&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Unified, browser‑based management plane to securely manage Windows servers, VMs, and hybrid infrastructure from anywhere—simplifying operations and improving efficiency at scale.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Best Practices Assessment&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Continuously evaluates your server configurations against Microsoft-recommended standards to proactively identify risks and provide actionable remediation guidance, improving security, performance, and operational health at scale.&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 456px" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
&lt;H3&gt;Frequently Asked Questions&lt;/H3&gt;
&lt;P&gt;&lt;STRONG&gt;What are hotpatch updates?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Hotpatch updates are&amp;nbsp;&lt;A href="https://learn.microsoft.com/windows/deployment/update/release-cycle#monthly-security-update-release" target="_blank" rel="noopener"&gt;monthly security updates&lt;/A&gt;&amp;nbsp;that take effect without requiring you to restart the device. They contain a full set of security updates equivalent to the standard updates released the same day.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;What is the hotpatch update cycle?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;All eligible Windows Server 2025 machines enrolled in hotpatch are offered up to 8 monthly hotpatch updates in a calendar year in a quarterly cycle:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Baseline month: In January, April, July, and October, devices install the monthly cumulative security update and must restart for the update to take effect. This update includes the latest security fixes, cumulative new features, and enhancements since the last baseline.&lt;/LI&gt;
&lt;LI&gt;Subsequent two months: Devices receive hotpatch updates, which only include security updates and don't require a restart for the update to take effect. These devices will catch up on features and enhancements with the next cumulative baseline month (quarterly).&lt;/LI&gt;
&lt;/UL&gt;
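&lt;P&gt;The quarterly cycle described above can be sketched in a few lines of Python. This is only an illustration of the calendar arithmetic, not a Microsoft tool or API: the function names are hypothetical, and the only facts taken from this post are the four baseline months and the two hotpatch months that follow each of them.&lt;/P&gt;

```python
import calendar

# Baseline months as listed in the post: January, April, July, October.
BASELINE_MONTHS = frozenset({1, 4, 7, 10})


def update_type(month):
    """Return 'baseline' (restart required) or 'hotpatch' (no restart).

    Hypothetical helper for illustration; month is 1..12.
    """
    if month not in range(1, 13):
        raise ValueError("month must be in 1..12")
    return "baseline" if month in BASELINE_MONTHS else "hotpatch"


def yearly_schedule():
    """Map each month name to its planned update type for one calendar year."""
    return {calendar.month_name[m]: update_type(m) for m in range(1, 13)}


if __name__ == "__main__":
    schedule = yearly_schedule()
    for name, kind in schedule.items():
        print(name, kind)
    # Count the months that are planned as hotpatch months.
    hotpatch_months = sum(1 for kind in schedule.values() if kind == "hotpatch")
    print("hotpatch months per year:", hotpatch_months)
```

&lt;P&gt;Four baseline months leave eight months for hotpatch updates, which matches the "up to 8" figure quoted above.&lt;/P&gt;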
&lt;img /&gt;
&lt;P&gt;&lt;STRONG&gt;Will billing be stopped for existing enrolled machines?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Yes, as of 15&lt;SUP&gt;th&lt;/SUP&gt; May 2026 all billing for hotpatch has been stopped for all existing machines enrolled in hotpatch.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;What action do we need to take if we have machines enrolled in hotpatch already?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;There is no additional action needed for machines that are currently enrolled in hotpatch. These machines will remain enrolled in hotpatch and receive hotpatch updates when available.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;I want all my Windows Server 2025 machines to get hotpatches. How do I do it?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;If you have Windows Server 2025 machines on-premises or in a cloud other than Azure, you can enable hotpatch on them. First, ensure these machines have Virtualization-based security enabled and are connected to Azure Arc. Then use the Azure Arc portal, Azure Update Manager, or &lt;A href="https://techcommunity.microsoft.com/blog/coreinfrastructureandsecurityblog/managing-hotpatching-for-azure-arc-connected-machines-using-apis/4458192" target="_blank" rel="noopener"&gt;APIs&lt;/A&gt; to enable hotpatch. Learn more: &lt;A href="https://aka.ms/ws-hotpatch" target="_blank" rel="noopener"&gt;https://aka.ms/ws-hotpatch&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Is anything changing for Hotpatching on Azure?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Hotpatch continues to be available on Azure for your Windows Server 2022 and Windows Server 2025 VMs when using Azure Edition. There is no fee associated with Hotpatching on Azure. Learn more &lt;A href="https://learn.microsoft.com/en-us/windows-server/get-started/hotpatch" target="_blank" rel="noopener"&gt;here&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Is there a community forum for Arc?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Yes, you can join the Azure Arc Monthly Forum here: &lt;A href="https://aka.ms/ArcServerForumSignup" target="_blank" rel="noopener"&gt;aka.ms/ArcServerForumSignup&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 19 May 2026 21:18:34 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/simplified-access-to-hotpatching-enabled-by-azure-arc-for/ba-p/4521251</guid>
      <dc:creator>sharmajyoti</dc:creator>
      <dc:date>2026-05-19T21:18:34Z</dc:date>
    </item>
    <item>
      <title>Introducing cert-manager for Azure Arc-enabled Kubernetes: now in Public Preview</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/introducing-cert-manager-for-azure-arc-enabled-kubernetes-now-in/ba-p/4514549</link>
      <description>&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Today we’re releasing a public preview of cert-manager for Azure Arc-enabled Kubernetes. It’s an Arc extension that automates TLS certificate and trust bundle management for edge Kubernetes clusters.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:200}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;If you’re running Kubernetes at the edge (in factories, retail stores, or remote sites), you’ve probably hit the certificate problem already. Certificates expire. Each cluster has its own tooling. Nobody owns the renewal process until something breaks. We routinely hear from customers that certificate issues are a common source of unplanned outages and last-minute firefighting, especially as workload counts grow.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134233117&amp;quot;:false,&amp;quot;134233118&amp;quot;:false,&amp;quot;201341983&amp;quot;:0,&amp;quot;335551550&amp;quot;:1,&amp;quot;335551620&amp;quot;:1,&amp;quot;335559685&amp;quot;:0,&amp;quot;335559737&amp;quot;:0,&amp;quot;335559738&amp;quot;:0,&amp;quot;335559739&amp;quot;:160,&amp;quot;335559740&amp;quot;:279}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;This extension packages the open-source cert-manager and trust-manager into a managed Arc extension with Microsoft support. You get automated lifecycle management and trust distribution without having to run and maintain these tools yourself.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:200}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;What it does&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:280,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;The extension bundles two CNCF-graduated projects,&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://cert-manager.io/" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;cert-manager&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;and&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://cert-manager.io/docs/trust/trust-manager/" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;trust-manager&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;, into a single Arc-K8s extension that you install once per cluster. From there:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:200}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;&lt;SPAN data-contrast="auto"&gt;1. You can issue, renew, and rotate certificates automatically. You do not need to manage them manually.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;&lt;SPAN data-contrast="auto"&gt;2. You can distribute trusted CA certificates consistently across namespaces. No more per-workload trust configuration.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;&lt;SPAN data-contrast="auto"&gt;3. You choose the CA issuer: built-in self-signed for dev/test, or your enterprise PKI for production.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-30px"&gt;&lt;SPAN data-contrast="auto"&gt;4. The extension ships with enterprise support, regular security patches, and proactive maintenance from the Microsoft team.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Why we built it&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:280,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;We built Microsoft cert-manager for Azure Arc-enabled Kubernetes to address three recurring problems we saw in real hybrid and edge environments.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H4&gt;&lt;SPAN data-contrast="auto"&gt;Problem 1: Manual certificate&amp;nbsp;issuance.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Many organisations still issue, install, and renew certificates through manual steps across clusters and namespaces. That creates operational overhead, slows teams&amp;nbsp;down, and increases the risk of outages when certificates expire or are configured incorrectly.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;The answer is automation. With cert-manager running as an Arc-enabled extension, teams can automate certificate issuance, renewal, and rotation through Kubernetes-native workflows instead of relying on tickets, scripts, and manual intervention.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
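&lt;P&gt;To make "Kubernetes-native workflows" concrete, here is a minimal Python sketch that builds a Certificate manifest of the kind open-source cert-manager consumes. It assumes the Arc extension exposes the upstream cert-manager.io/v1 Certificate resource unchanged, which this post does not confirm. The helper name, resource names, namespace, DNS name, and issuer name are all hypothetical placeholders.&lt;/P&gt;

```python
import json


def certificate_manifest(name, namespace, dns_names, issuer_name):
    """Build a cert-manager Certificate manifest as a plain dict.

    Hypothetical helper for illustration. Field names follow the upstream
    cert-manager.io/v1 Certificate resource; the Arc extension is assumed
    (not confirmed by the post) to accept the same shape.
    """
    return {
        "apiVersion": "cert-manager.io/v1",
        "kind": "Certificate",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            # Secret where the issued key pair is stored.
            "secretName": name + "-tls",
            # Example values: a 90-day lifetime, renewed 15 days before expiry.
            "duration": "2160h",
            "renewBefore": "360h",
            "dnsNames": list(dns_names),
            "issuerRef": {
                "name": issuer_name,
                "kind": "ClusterIssuer",
                "group": "cert-manager.io",
            },
        },
    }


if __name__ == "__main__":
    manifest = certificate_manifest(
        name="web",
        namespace="edge-apps",
        dns_names=["web.example.internal"],
        issuer_name="enterprise-pki",
    )
    # kubectl accepts JSON as well as YAML, so the output can be applied as a file.
    print(json.dumps(manifest, indent=2))
```

&lt;P&gt;Once a resource like this exists in the cluster, renewal is driven by the declared duration and renewBefore values rather than by tickets or scripts.&lt;/P&gt;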
&lt;H4&gt;&lt;SPAN data-contrast="auto"&gt;Problem 2: Fragmented&amp;nbsp;approaches to automation.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Even when teams try to automate, they often end up with a mix of scripts, custom controllers, product-specific setups, and one-off operational patterns. That fragmentation makes certificate management harder to scale, harder to standardise, and harder to&amp;nbsp;operate&amp;nbsp;consistently across environments.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;The answer is to standardise on cert-manager. It provides a common, Kubernetes-native approach to certificate lifecycle management, helping teams reduce tool sprawl, align on a consistent operating model, and simplify how certificates are managed across clusters.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H4&gt;&lt;SPAN data-contrast="auto"&gt;Problem 3:&amp;nbsp;Maintenance and upgrade burden&amp;nbsp;for open-source&amp;nbsp;cert-manager.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H4&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;cert-manager is a powerful open-source project, but many organisations do not want the ongoing burden of packaging,&amp;nbsp;validating, patching, upgrading, and supporting it themselves as a production dependency. That can create operational risk, delay updates, and make long-term ownership unclear.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;The answer is a Microsoft-supported Arc-enabled extension. Microsoft cert-manager for Azure Arc-enabled Kubernetes gives customers a supported way to use cert-manager, with Microsoft handling packaging, delivery, and ongoing maintenance so teams can adopt the capability without taking on the full operational burden of managing the OSS&amp;nbsp;component&amp;nbsp;themselves.&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;What’s&lt;/SPAN&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;&amp;nbsp;in the public preview&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:280,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;Here’s&amp;nbsp;what you get:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:200}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Certificate lifecycle automation&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;with cert-manager: issuance, renewal, rotation, all handled for you.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Trust bundle distribution&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;with trust-manager: push trusted CA certs to every namespace that needs them.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Self-signed or external CA.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Start with the built-in CA, swap in your enterprise PKI when&amp;nbsp;you’re&amp;nbsp;ready.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Secure by default.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;We turned on the security settings&amp;nbsp;you’d&amp;nbsp;want enabled anyway: TLS enforcement, least-privilege RBAC, restricted pod security.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-contrast="auto"&gt;Tested at the edge.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Validated on AKS Edge Essentials, AKS on Azure Local, and&amp;nbsp;several&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/kubernetes/cert-manager-overview#validated-arc-enabled-kubernetes-distributions" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;third-party Kubernetes distros&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="auto"&gt;. Works offline.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Fits into your Arc stack&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:280,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;If&amp;nbsp;you’re&amp;nbsp;already running Azure IoT Operations or Azure Monitor on Arc-enabled clusters, the extension handles TLS between those services with minimal setup. No custom certificate plumbing&amp;nbsp;required: install the extension and the other Arc components pick it up.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:200}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3 aria-level="2"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-parastyle="heading 2"&gt;Get started&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134245418&amp;quot;:true,&amp;quot;134245529&amp;quot;:true,&amp;quot;335559738&amp;quot;:280,&amp;quot;335559739&amp;quot;:180}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;The extension is available now in public preview.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:200}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;👉&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/kubernetes/cert-manager-overview" target="_blank" rel="noopener"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Documentation and quickstart&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-ccp-props="{&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2026 03:37:38 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/introducing-cert-manager-for-azure-arc-enabled-kubernetes-now-in/ba-p/4514549</guid>
      <dc:creator>NicolasMondada</dc:creator>
      <dc:date>2026-05-13T03:37:38Z</dc:date>
    </item>
    <item>
      <title>Azure Local expands to sovereign-scale infrastructure with disaggregated deployments</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/azure-local-expands-to-sovereign-scale-infrastructure-with/ba-p/4514338</link>
      <description>&lt;P&gt;As organizations accelerate digital transformation across datacenters, sovereign environments, and edge locations, infrastructure architectures must evolve to meet new operational and regulatory demands.&lt;/P&gt;
&lt;P&gt;The &lt;STRONG&gt;first feature update of Azure Local in CY 2026 (version 2604)&lt;/STRONG&gt; marks a significant step forward—expanding Azure Local as a platform for sovereign private cloud infrastructure, introducing larger scale, disaggregated deployment architectures, expanded storage ecosystem partnerships, and simplified identity capabilities that unlock entirely new infrastructure scenarios from edge locations to enterprise-scale environments.&lt;/P&gt;
&lt;P&gt;This release is focused on enabling:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Sovereign private cloud deployments at scale from single node up to multi-rack infrastructure&lt;/LI&gt;
&lt;LI&gt;Infrastructure modernization through SAN reuse and disaggregated architectures&lt;/LI&gt;
&lt;LI&gt;Simplified edge deployment without Microsoft Active Directory dependencies&lt;/LI&gt;
&lt;LI&gt;Faster lifecycle operations across deployment and update workflows&lt;/LI&gt;
&lt;/UL&gt;
&lt;H2&gt;Introducing disaggregated larger scale deployments using SAN storage&lt;/H2&gt;
&lt;P&gt;Azure Local now supports a disaggregated infrastructure architecture, allowing customers to deploy compute and storage resources independently while continuing to benefit from an Azure-consistent management and operational experience. This enables organizations to scale infrastructure more flexibly, separating compute and storage to align with workload demands and long-term growth.&lt;/P&gt;
&lt;P&gt;This architecture enables:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Independent scaling of compute nodes and storage infrastructure&lt;/LI&gt;
&lt;LI&gt;SAN‑only and hybrid storage architectures for Azure Local infrastructure and workloads&lt;/LI&gt;
&lt;LI&gt;Fibre Channel (FC) connectivity support beginning with 2604 (iSCSI coming soon)&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;With disaggregated deployments and SAN storage, Azure Local clusters can now scale from a single node at the edge to multi-rack environments spanning beyond 16 nodes and up to thousands of nodes, addressing growing demand for large-scale deployments across sovereign, government, defense, and regulated environments.&lt;/P&gt;
&lt;P&gt;This unlocks a new class of Azure-consistent infrastructure deployments at sovereign scale. This new capability is &lt;STRONG&gt;generally available&lt;/STRONG&gt; with the release of Azure Local 2604.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;H2&gt;General Availability of SAN Support for Azure Local&amp;nbsp;&lt;/H2&gt;
&lt;P&gt;Support for attaching SAN storage to Azure Local was &lt;A class="lia-internal-link lia-internal-url lia-internal-url-content-type-blog" href="https://techcommunity.microsoft.com/blog/azurearcblog/what%E2%80%99s-new-in-azure-local-cloud-infrastructure-for-distributed-locations-enabled/4469773" target="_blank" rel="noopener" data-lia-auto-title="introduced" data-lia-auto-title-active="0"&gt;introduced&lt;/A&gt; as a public preview in November 2025. Today this brownfield expansion capability is &lt;STRONG&gt;generally available&lt;/STRONG&gt;. It allows external SAN devices to be introduced into already deployed Azure Local instances via Fibre Channel (FC), supporting virtual machines, Kubernetes environments, and Azure Virtual Desktop workloads without requiring disruptive infrastructure changes or a full system refresh.&lt;/P&gt;
&lt;P&gt;Azure Local instances now support the coexistence of Storage Spaces Direct volumes and external SAN volumes.&lt;/P&gt;
&lt;P&gt;Support for SAN-attached deployments allows organizations to:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Reuse existing enterprise SAN investments&lt;/LI&gt;
&lt;LI&gt;Modernize infrastructure without replacing existing storage estates&lt;/LI&gt;
&lt;LI&gt;Manage rising disk costs associated with hyperconverged architectures&lt;/LI&gt;
&lt;LI&gt;Enable workload scenarios that depend on massive storage requirements&lt;/LI&gt;
&lt;/UL&gt;
&lt;img /&gt;
&lt;P&gt;These capabilities for disaggregated deployments and SAN storage are backed by a strong ecosystem of hardware partners.&lt;/P&gt;
&lt;P&gt;DataON, Dell Technologies, Everpure, HPE, Hitachi Vantara, Lenovo and NetApp are working with Microsoft to deliver configurations, giving customers more flexibility in how they design and scale their infrastructure.&lt;/P&gt;
&lt;H2&gt;General Availability of Local Identity with Azure Key Vault&lt;/H2&gt;
&lt;P&gt;While disaggregated architectures primarily target sovereign and centralized datacenter deployments, Azure Local 2604 also introduces a major advancement for distributed and edge scenarios.&lt;/P&gt;
&lt;P&gt;With the &lt;STRONG&gt;General Availability of Local Identity with Key Vault&lt;/STRONG&gt;, Azure Local can now be provisioned without infrastructure dependencies on Microsoft Active Directory, enabling &lt;STRONG&gt;simplified deployment in disconnected, air-gapped, and regulated environments.&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;This simplifies deployment and adoption by removing the need for extra hardware running domain controllers and the complexity of firewall configurations when installing in isolated network environments.&lt;/P&gt;
&lt;P&gt;Azure Local 2604 adds support for deploying rack-aware clusters using Local Identity with Azure Key Vault. This combines reduced requirements with the high availability that customers demand across manufacturing, energy, and other industries. This capability removes one of the key barriers to deploying Azure-consistent infrastructure in sovereign and edge environments.&lt;/P&gt;
&lt;H2&gt;&lt;SPAN data-teams="true"&gt;Pricing Changes&lt;/SPAN&gt;&lt;/H2&gt;
&lt;P&gt;Pricing for multi-rack and sovereign-scale deployments is being introduced as part of this release. Customers should connect with their Microsoft account team to learn more about pricing, configuration options, and early access programs as these offerings continue to actively evolve.&lt;/P&gt;
&lt;H2&gt;Getting started&lt;/H2&gt;
&lt;P&gt;Release 2604 is available for both existing and new Azure Local instances.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Review the release notes for Azure Local 2604 &lt;A class="lia-external-url" href="https://learn.microsoft.com/en-us/azure/azure-local/whats-new?view=azloc-2604" target="_blank" rel="noopener"&gt;here&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Learn more about disaggregated deployments &lt;A class="lia-external-url" href="https://learn.microsoft.com/en-us/azure/azure-local/overview/disaggregated-overview?" target="_blank" rel="noopener"&gt;here&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Learn more about SAN attach &lt;A class="lia-external-url" href="https://learn.microsoft.com/en-us/azure/azure-local/deploy/enable-external-storage?" target="_blank" rel="noopener"&gt;here&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Learn more about Local Identity with Azure Key Vault &lt;A class="lia-external-url" href="https://learn.microsoft.com/en-us/azure/azure-local/deploy/deployment-local-identity-with-key-vault?view=azloc-2603" target="_blank" rel="noopener"&gt;here&lt;/A&gt;.&lt;/LI&gt;
&lt;LI&gt;Learn more about hardware configurations that support disaggregated deployments using the &lt;A class="lia-external-url" href="https://aka.ms/azurelocalcatalog" target="_blank" rel="noopener"&gt;solutions catalog&lt;/A&gt; or learn directly from our partners:&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;o&amp;nbsp;&amp;nbsp; DataON: &lt;EM&gt;“&lt;A class="lia-external-url" href="https://aka.ms/alsan-dataon" target="_blank" rel="noopener"&gt;DataON Premier Solutions for Azure Local&lt;/A&gt; provide a premium Azure Local experience that includes deployment, integration, training, and white glove service &amp;amp; support. Our goal is to not only get you up and running quickly but also to help your team to be confident in managing Azure Local.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;o&amp;nbsp;&amp;nbsp; Dell Technologies: &lt;EM&gt;“&lt;A class="lia-external-url" href="https://aka.ms/alcompute-dell" target="_blank" rel="noopener"&gt;Coming Soon, Dell Private Cloud&lt;/A&gt;–Microsoft enables a modern disaggregated architecture, simplifying operations across Dell PowerEdge compute, Dell PowerStore storage, and Azure Local.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;&lt;EM&gt;“&lt;A class="lia-external-url" href="https://aka.ms/alsan-dell" target="_blank" rel="noopener"&gt;Available now, Dell PowerStore&lt;/A&gt; delivers high-performance, scalable, and resilient storage for Azure Local, with support for Dell Private Cloud coming soon to make it easier to streamline operations for storage, compute, and your Azure Local license.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;o&amp;nbsp;&amp;nbsp; Everpure: &lt;EM&gt;“Azure Local now supports external storage with &lt;A class="lia-external-url" href="https://aka.ms/alsan-everpure" target="_blank" rel="noopener"&gt;Everpure FlashArray&lt;/A&gt;, offering Azure Local customers unprecedented levels of scale, performance and efficiency with the added benefit of seamless hybrid cloud integration with Everpure Cloud in Azure.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;o&amp;nbsp;&amp;nbsp; Hitachi Vantara: &lt;EM&gt;“&lt;A class="lia-external-url" href="https://aka.ms/alsan-hitachi" target="_blank" rel="noopener"&gt;Hitachi Vantara VSP and VSP One Block&lt;/A&gt;, fully validated to meet Microsoft's Azure Local storage requirements, deliver enterprise SAN reliability for Azure Local.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;o&amp;nbsp;&amp;nbsp; HPE: &lt;EM&gt;“&lt;A class="lia-external-url" href="https://aka.ms/alcompute-hpe" target="_blank" rel="noopener"&gt;HPE ProLiant Compute Premier Solutions for Azure Local&lt;/A&gt; enable customers to gain full control over data residency, and accelerate innovation with industry-leading performance, security, and management automation.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;&lt;EM&gt;“&lt;A class="lia-external-url" href="https://aka.ms/alsan-hpe" target="_blank" rel="noopener"&gt;HPE Alletra Storage MP B10000 integrated with Azure Local&lt;/A&gt; delivers a unified, Azure managed experience with the simplicity of Azure Local plus the advanced data services of a modern enterprise storage platform.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;o&amp;nbsp;&amp;nbsp; Lenovo: &lt;EM&gt;“Lenovo is expanding its Azure Local portfolio to support disaggregated infrastructure designs that deliver greater choice across compute and storage. The &lt;A class="lia-external-url" href="https://aka.ms/alsan-lenovo" target="_blank" rel="noopener"&gt;ThinkAgile Disaggregated Solution for Microsoft Azure Local&lt;/A&gt; with new compute-only configurations on ThinkAgile MX Series enables customers to integrate ThinkSystem DM, DS, and DG Series storage arrays or bring their own Azure Local validated third party SAN arrays into new or existing Azure Local environments, allowing fully disaggregated, independent scaling using enterprise class Lenovo solutions for sovereign private cloud deployments and emerging AI workloads.”&lt;/EM&gt;&lt;/P&gt;
&lt;P class="lia-indent-padding-left-60px"&gt;o&amp;nbsp;&amp;nbsp; NetApp: &lt;EM&gt;“With Azure Local, NetApp delivers support across &lt;A class="lia-external-url" href="https://aka.ms/alsan-netapp" target="_blank" rel="noopener"&gt;NetApp® AFF, ASA, and FAS systems.&lt;/A&gt;”&lt;/EM&gt;&lt;/P&gt;
&lt;H2&gt;Thank you!&lt;/H2&gt;
&lt;P&gt;This first feature release of 2026 is packed with innovation for Azure Local, and we can’t wait for you to try it and share feedback. We are committed to listening to your feedback and delivering the next wave of capabilities in a continuously evolving world. Thank you to all our customers who trust Azure Local to run their business—and to our engineering partners for the incredible collaboration in building solutions together.&lt;/P&gt;</description>
      <pubDate>Mon, 27 Apr 2026 15:45:00 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/azure-local-expands-to-sovereign-scale-infrastructure-with/ba-p/4514338</guid>
      <dc:creator>troettinger</dc:creator>
      <dc:date>2026-04-27T15:45:00Z</dc:date>
    </item>
    <item>
      <title>SQL Server enabled by Azure Arc Overview</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/sql-server-enabled-by-azure-arc-overview/ba-p/4496399</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Table of Contents&lt;/STRONG&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;What is Azure Arc-enabled SQL Server?&lt;/LI&gt;
&lt;LI&gt;Connecting SQL Server to Azure Arc (4-step onboarding)&lt;/LI&gt;
&lt;LI&gt;Your SQL Server is Now in Azure (unified management)&lt;/LI&gt;
&lt;LI&gt;SQL Best Practices Assessment&lt;/LI&gt;
&lt;LI&gt;Monitoring and Governance&lt;/LI&gt;
&lt;LI&gt;Troubleshooting Guide&lt;/LI&gt;
&lt;LI&gt;Azure Arc Demo&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;What You Can Learn from This Article&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;This article walks you through the end-to-end journey of bringing external SQL Servers (on-prem, AWS, GCP, edge) under Azure management using Azure Arc. Specifically, you'll learn how to onboard SQL Server instances via the Arc agent and PowerShell script, navigate the unified Azure Portal experience for hybrid SQL estates, enable and interpret SQL Best Practices Assessments with Log Analytics, apply Azure Policy and performance monitoring across all environments, leverage Azure Hybrid Benefit for cost savings, and troubleshoot common issues like assessment upload failures, Wire Server 403 errors, and IMDS connectivity problems, with a real case study distinguishing Azure VM vs. Arc-enabled server scenarios.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H1&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897156"&gt;&lt;/A&gt;1. What is Azure Arc-enabled SQL Server?&lt;/H1&gt;
&lt;P&gt;&lt;STRONG&gt;Azure Arc helps you connect your SQL Server to Azure wherever it runs.&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Whether your SQL Server is running on-premises in your datacenter, on AWS EC2, on Google Cloud, or at an edge location, Azure Arc brings it under Azure management. This means you get the same governance, security, and monitoring capabilities as native Azure resources. It also helps you streamline your migration journey to Azure, manage your SQL estate at scale, and strengthen your security and governance posture.&lt;/P&gt;
&lt;P&gt;Cloud innovation. Anywhere.&lt;/P&gt;
&lt;P&gt;SQL Server migration in Azure Arc includes an end-to-end migration journey with the following capabilities:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Continuous database migration assessments with Azure SQL target recommendations and cost estimates.&lt;/LI&gt;
&lt;LI&gt;Seamless provisioning of Azure SQL Managed Instance as the destination target, with an option for a free instance evaluation.&lt;/LI&gt;
&lt;LI&gt;Option to choose between two built-in migration methods: real-time database replication using Distributed Availability Groups (powered by the Managed Instance link feature), or log shipping via backup and restore (powered by Log Replay Service feature).&lt;/LI&gt;
&lt;LI&gt;Unified interface that eliminates the need to use multiple tools or to jump between various places in Azure portal.&lt;/LI&gt;
&lt;LI&gt;Microsoft Copilot is integrated to assist you at select points during the migration journey.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Learn more in&amp;nbsp;&lt;A href="https://techcommunity.microsoft.com/blog/MicrosoftDataMigration/sql-server-migration-in-azure-arc-%E2%80%93-generally-available/4471020" target="_blank" rel="noopener"&gt;SQL Server migration in Azure Arc – Generally Available | Microsoft Community Hub&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897157"&gt;&lt;/A&gt;1.1 The Problem Azure Arc Solves&lt;/P&gt;
&lt;P&gt;Organizations typically have SQL Servers scattered across multiple environments:&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="border-width: 1px;"&gt;&lt;thead&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Location&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Challenge Without Azure Arc&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/thead&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;On-premises datacenter&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Separate management tools, no unified view&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;AWS EC2 instances&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Multi-cloud complexity, different monitoring&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;Google Cloud VMs&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Inconsistent governance and policies&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;Edge / Branch offices&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Limited visibility, manual compliance&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;VMware / Hyper-V&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;No cloud-native management features&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 50.00%" /&gt;&lt;col style="width: 50.00%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Azure Arc solves this by extending a single Azure control plane to ALL your SQL Servers,&amp;nbsp;regardless of where they physically run.&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Azure Arc Overview&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/overview" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/overview" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/azure/azure-arc/overview&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Architecture Reference — Administer SQL Server with Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/architecture/hybrid/azure-arc-sql-server" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/architecture/hybrid/azure-arc-sql-server" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/azure/architecture/hybrid/azure-arc-sql-server&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Documentation Index — SQL Server enabled by Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/?view=sql-server-ver17" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/?view=sql-server-ver17&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;SQL Server migration in Azure Arc (Community Hub):&amp;nbsp;&lt;A href="https://techcommunity.microsoft.com/blog/azuresqlblog/sql-server-migration-in-azure-arc-generally-available/4339783" target="_blank" rel="noopener"&gt;https://techcommunity.microsoft.com/blog/azuresqlblog/sql-server-migration-in-azure-arc-generally-av...&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H1&gt;2. Connecting SQL Server to Azure Arc&lt;/H1&gt;
&lt;P&gt;This section shows how to onboard your SQL Server to Azure Arc. Once connected, your SQL Server appears in Azure Portal alongside your other Azure resources.&lt;/P&gt;
&lt;P&gt;2.1 Step 1: Access Azure Arc Portal&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Navigation:&amp;nbsp;&lt;/STRONG&gt;Azure Portal → Azure Arc → Machines&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 1:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Azure Arc | Machines, Starting Point for Onboarding&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;The Azure Arc Machines blade is your entry point for connecting servers outside Azure. Click the 'Onboard/Create' dropdown and select 'Onboard existing machines' to begin. The left menu shows Azure Arc capabilities: Machines, Kubernetes clusters, Data services, Licenses, etc. This is where ALL your Azure Arc-enabled servers will appear after onboarding.&lt;/P&gt;
&lt;P&gt;2.2 Step 2: Configure Onboarding Options&lt;/P&gt;
&lt;P&gt;Select your operating system, enable SQL Server auto-discovery, and choose a connectivity method:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 2:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Onboarding Configuration, Enable SQL Server Auto-Discovery&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Key settings: (1) Operating system: select Windows or Linux. (2) SQL Server checkbox: 'Automatically connect any SQL Server instances to Azure Arc' enables auto-discovery of SQL instances on the server. (3) Connectivity method: 'Public endpoint' for direct internet access or 'Private endpoint' for VPN/ExpressRoute. The SQL Server checkbox is crucial because it installs the SQL Server extension automatically.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;💡 Important:&amp;nbsp;&lt;/STRONG&gt;Check the 'Connect SQL Server' option! This ensures SQL Server instances are automatically discovered and connected to Azure Arc.&lt;/P&gt;
&lt;P&gt;2.3 Step 3: Download the Onboarding Script&lt;/P&gt;
&lt;P&gt;Azure generates a customized PowerShell script containing your subscription details and configuration:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 3:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Generated Onboarding Script, Ready to Download&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;The portal generates a PowerShell script customized for your environment. Key components: (1) Agent download from Azure CDN, (2) Installation commands, (3) Pre-configured connection parameters (subscription, resource group, location). Click 'Download' to save the script. Requirements note: Server needs HTTPS (port 443) access to Azure endpoints.&lt;/P&gt;
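&lt;P&gt;The HTTPS (port 443) requirement can be pre-checked from the server before you run the script. The following is a minimal sketch, not part of the generated onboarding script: the helper names &lt;CODE&gt;can_reach&lt;/CODE&gt; and &lt;CODE&gt;check_endpoints&lt;/CODE&gt; are hypothetical, and the two hosts are only a sample. Consult the Azure Arc network requirements for the full endpoint list that applies to your region and configuration.&lt;/P&gt;

```python
import socket

# Hypothetical sample of endpoints to probe; the complete list required by
# the agent depends on your region and configuration.
EXAMPLE_ENDPOINTS = ["management.azure.com", "login.microsoftonline.com"]


def can_reach(host, port=443, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS resolution failures.
        return False


def check_endpoints(hosts, port=443, timeout=5.0):
    """Map each host to whether the outbound TCP connection succeeded."""
    return {host: can_reach(host, port, timeout) for host in hosts}
```

&lt;P&gt;Calling &lt;CODE&gt;check_endpoints(EXAMPLE_ENDPOINTS)&lt;/CODE&gt; on the server returns one True/False entry per host. A successful TCP connection only shows that the port is open; it does not validate proxy settings or TLS inspection along the path.&lt;/P&gt;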
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897163"&gt;&lt;/A&gt;2.4 Step 4: Run the Script on Your Server&lt;/P&gt;
&lt;P&gt;Copy the script to your SQL Server and execute it in PowerShell as Administrator:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 4:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Executing OnboardingScript.ps1 on the SQL Server&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;PowerShell console showing script execution from D:\Azure Arch directory. The script (OnboardingScript.ps1, 3214 bytes) installs the Azure Connected Machine Agent and registers the server with Azure Arc. During execution, a browser window opens for Azure authentication. After completion, the server appears in Azure Arc within minutes.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;What happens during onboarding:&lt;/STRONG&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Azure Connected Machine Agent is downloaded and installed&lt;/LI&gt;
&lt;LI&gt;Agent establishes secure connection to Azure&lt;/LI&gt;
&lt;LI&gt;Server is registered as an Azure Arc resource&lt;/LI&gt;
&lt;LI&gt;SQL Server extension is installed (if checkbox was enabled)&lt;/LI&gt;
&lt;LI&gt;SQL Server instance appears in Azure Arc → SQL Server&lt;/LI&gt;
&lt;/OL&gt;
&lt;UL&gt;
&lt;LI&gt;Connect Your SQL Server to Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/connect?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/connect?view=sql-server-ver17" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/connect?view=sql-server-ver17&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Prerequisites — SQL Server enabled by Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/prerequisites?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/prerequisites?view=sql-server-ver17" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/prerequisites?view=sql-server-ver17&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Manage Automatic Connection — SQL Server enabled by Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-autodeploy?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-autodeploy?view=sql-server-ver17" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-autodeploy?view=sql-server-ver17&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H1&gt;3. Your SQL Server is Now Visible in the Azure Control Plane&lt;/H1&gt;
&lt;P&gt;Once connected via Azure Arc, your SQL Server is projected as a resource in the Azure Portal, right alongside your native Azure SQL resources. This is the power of Azure Arc: your SQL Server remains where it runs (on-premises, in AWS, or anywhere else), but Azure's management plane now extends to it. You can govern, monitor, and secure it with the same tools you use for Azure-native resources, without migrating the workload.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897165"&gt;&lt;/A&gt;3.1 Unified View in Azure Portal&lt;/P&gt;
&lt;P&gt;After onboarding, you can see your Azure Arc-enabled SQL Server through two paths:&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="border-width: 1px;"&gt;&lt;thead&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Navigation Path&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;What You See&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/thead&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;Azure Arc → SQL Server&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;All Azure Arc-enabled SQL instances&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;Azure Arc → Machines&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;The host server with extensions&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 50.00%" /&gt;&lt;col style="width: 50.00%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897166"&gt;&lt;/A&gt;3.2 Management Experience Similar to SQL Server on Azure VM&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;The management capabilities for Azure Arc-enabled SQL Server are very similar to SQL Server on Azure VM. The screenshots below show the SQL Server on Azure VM experience; Azure Arc-enabled SQL Server provides nearly identical functionality.&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;Whether your SQL Server runs natively on an Azure VM or is connected from outside Azure via Azure Arc, you get access to a consistent management experience including:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 5:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;SQL Server Management Overview — Consistent Experience&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;This shows the management experience for SQL Server in Azure. Whether connected via Azure Arc or running on Azure VM, you see: SQL Server version and edition, VM details, License type configuration, Storage configuration, and feature status. Azure Arc-enabled SQL Server provides a nearly identical dashboard experience, extending this unified view to your on-premises and multi-cloud servers.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897167"&gt;&lt;/A&gt;3.3 Azure Hybrid Benefit&amp;nbsp;- Use Your Existing Licenses&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;One key cost-saving advantage is that you can apply&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Azure Hybrid Benefit (AHB)&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;to&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Azure SQL Database&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;and&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Azure SQL Managed Instance&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;, saving&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;up to 30% or more&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;on licensing costs by&amp;nbsp;leveraging&amp;nbsp;your existing Software Assurance-enabled SQL Server licenses.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;&lt;STRONG&gt;Note&lt;/STRONG&gt;:&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;Azure Hybrid Benefit applies to Azure SQL Database and SQL Managed Instance. For SQL Server running on-premises or in other clouds managed via Azure Arc, AHB does not apply directly.&amp;nbsp;&amp;nbsp;However, Arc-enabled SQL Server provides other benefits such as centralized management, Azure-integrated security, and access to Extended Security Updates (ESUs).&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 6:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Azure Hybrid Benefit Configuration&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt; License configuration for SQL Server on Azure VM, showing three options: Pay As You Go, Azure Hybrid Benefit (selected), and HA/DR. With Azure Hybrid Benefit, organizations with existing SQL Server licenses and active Software Assurance can save up to 30% or more on SQL Server licensing costs running on Azure VMs (as reflected in the Azure portal configuration blade). Free SQL Server licenses for High Availability and Disaster Recovery are also available for Standard and Enterprise editions.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Configure SQL Server enabled by Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-configuration?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-configuration?view=sql-server-ver17" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-configuration?view=sql-server-ver1...&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Manage Licensing and Billing — SQL Server enabled by Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-license-billing?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-license-billing?view=sql-server-ver17" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-license-billing?view=sql-server-ve...&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H1&gt;4. SQL Best Practices Assessment&lt;/H1&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;One of the most valuable features available to Azure Arc-enabled SQL Server is the Best Practices Assessment, which automatically evaluates your SQL Server configuration against Microsoft's recommendations.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897169"&gt;&lt;/A&gt;4.1 Prerequisites: Log Analytics Workspace&lt;/P&gt;
&lt;P&gt;Before enabling assessment, you need a Log Analytics Workspace to store the results:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 7:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Create Log Analytics Workspace&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Log Analytics workspace creation form. Fill in: Subscription, Resource Group, Name (green checkmark indicates valid name), and Region (choose same region as your resources). This workspace stores assessment results, performance metrics, and logs from ALL your SQL Servers, both Azure Arc-enabled and Azure VMs.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 8:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Log Analytics Workspace Ready for Use&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Workspace overview showing: Status (Active), Pricing tier (Pay-as-you-go), and Operational issues (OK). The 'Get Started' section guides you through: (1) Connect a data source, (2) Configure monitoring solutions, (3) Monitor workspace health. This workspace becomes the central repository for all your SQL Server insights.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897170"&gt;&lt;/A&gt;4.2 Enable SQL Best Practices Assessment&lt;/P&gt;
&lt;P&gt;Navigate to your SQL Server (Azure Arc-enabled or Azure VM) and enable the assessment:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 9:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;SQL Best Practices Assessment, Enable Feature&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Assessment landing page explaining the feature: evaluates indexes, deprecated features, trace flags, statistics, etc. Results are uploaded via Azure Monitor Agent (AMA). Click 'Enable SQL best practices assessments' to begin configuration. This feature is available for BOTH Azure Arc-enabled SQL Server and Azure SQL VMs.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 10:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Assessment Configuration, Select Log Analytics Workspace&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Configuration panel requiring: (1) Enable checkbox, (2) Log Analytics workspace selection, (3) Resource group for AMA. The warning 'No Log Analytics workspace is found' appears if you haven't created one yet; see Section 4.1. Once configured, assessments run on schedule and upload results to your workspace.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897171"&gt;&lt;/A&gt;4.3 Run and Review Assessment&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 11:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Run Assessment Button&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;After configuration, click 'Run assessment' to start evaluation. Assessment duration varies: 5-10 minutes for small environments, 30-60 minutes for large ones. The 'View latest successful assessment' button (disabled until first run completes) opens the results workbook.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 12:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Assessment Results History&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Assessment history showing multiple runs with different statuses: 'Scheduled' (pending), 'Completed' (results available), 'Failed - result expired' (data retention exceeded). Regular assessments help catch configuration drift over time. If you see 'Failed - upload failed', see the Troubleshooting section.&lt;/P&gt;
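&lt;P&gt;If you review assessment history across many servers, the statuses above can be turned into a simple lookup. This is a minimal sketch: the status strings come from the description above, while the follow-up text and the helper name &lt;CODE&gt;next_step&lt;/CODE&gt; are illustrative assumptions, not portal output or an Azure API.&lt;/P&gt;

```python
# Status strings as they appear in the assessment history described above.
# The suggested follow-ups are an illustrative summary, not portal output.
NEXT_STEPS = {
    "Scheduled": "wait for the run to start",
    "Completed": "open the results workbook",
    "Failed - result expired": "run a new assessment",
    "Failed - upload failed": "see the Troubleshooting section",
}


def next_step(status):
    """Return a suggested follow-up for an assessment run status."""
    return NEXT_STEPS.get(status.strip(), "unrecognized status; check the portal")


for status in NEXT_STEPS:
    print(f"{status}: {next_step(status)}")
```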
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 13:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Assessment Recommendations, Actionable Insights&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Best practices workbook showing three panels: (1) Recommendation Summary with severity (High, Medium) and categories (DBConfiguration, Performance, Index, Backup), (2) Recommendation Details with target and name, (3) Details panel showing selected item — example: 'Enable instant file initialization' for performance improvement. High severity items should be addressed immediately.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Severity Levels:&lt;/STRONG&gt;&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="border-width: 1px;"&gt;&lt;thead&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Severity&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Description&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Action Timeline&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/thead&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;🔴 High&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Critical issues affecting performance or security&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Address immediately&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;🟡 Medium&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Important optimizations recommended&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Within 30 days&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;🟢 Low&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Nice-to-have improvements&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;As time permits&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;ℹ️ Info&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Informational findings&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Review and acknowledge&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 33.33%" /&gt;&lt;col style="width: 33.33%" /&gt;&lt;col style="width: 33.33%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
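&lt;P&gt;The severity table can be applied as a triage order once you have a list of findings. The sketch below assumes findings are available as simple (name, severity) pairs; the function name &lt;CODE&gt;triage&lt;/CODE&gt;, the sample findings, and the severities assigned to them are hypothetical and are not taken from a real assessment.&lt;/P&gt;

```python
# Severity order and action timelines taken from the table above.
SEVERITY_ORDER = ["High", "Medium", "Low", "Info"]
ACTION_TIMELINE = {
    "High": "Address immediately",
    "Medium": "Within 30 days",
    "Low": "As time permits",
    "Info": "Review and acknowledge",
}


def triage(recommendations):
    """Sort (name, severity) pairs so the most severe items come first."""
    rank = {level: i for i, level in enumerate(SEVERITY_ORDER)}
    ordered = sorted(recommendations, key=lambda item: rank[item[1]])
    return [(name, sev, ACTION_TIMELINE[sev]) for name, sev in ordered]


# Hypothetical findings and severities, for illustration only.
sample = [
    ("Update statistics", "Low"),
    ("Enable instant file initialization", "High"),
    ("Review backup schedule", "Medium"),
]
for name, sev, action in triage(sample):
    print(f"{sev:6} {name}: {action}")
```

&lt;P&gt;Because the sort is stable, findings with the same severity keep the order in which they were reported.&lt;/P&gt;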
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Configure Best Practices Assessment — SQL Server enabled by Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/assess?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/assess?view=sql-server-ver17" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/assess?view=sql-server-ver17&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Troubleshoot Best Practices Assessment — SQL Server enabled by Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/troubleshoot-assessment?view=sql-server-ver16" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/troubleshoot-assessment?view=sql-server-ver16" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/troubleshoot-assessment?view=sql-server-v...&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Assess Migration Readiness — SQL Server enabled by Azure Arc&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/migration-assessment?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/migration-assessment?view=sql-server-ver17" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/migration-assessment?view=sql-server-ver1...&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Log Analytics Workspace creation:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-monitor/logs/quick-create-workspace" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/azure/azure-monitor/logs/quick-create-workspace&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H1&gt;5. Monitoring and Governance&lt;/H1&gt;
&lt;P&gt;With your SQL Servers connected to Azure (through Azure Arc or as native Azure VMs), you gain access to Azure's full monitoring and governance capabilities.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897173"&gt;&lt;/A&gt;5.1 Azure Policy Compliance&lt;/P&gt;
&lt;P&gt;Apply consistent governance policies across ALL your SQL Servers — regardless of where they run:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 14:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Azure Policy Compliance Dashboard&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Compliance dashboard showing 28% overall compliance (5 of 18 resources) and a pie chart split into Compliant (green), Exempt, and Non-compliant (red). The table lists non-compliant resources of type microsoft.hybridcompute, that is, Azure Arc-enabled servers. Use this to ensure that all SQL Servers (on-premises, cloud, and edge) meet your organization's standards.&lt;/P&gt;
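&lt;P&gt;As a quick check of the headline figure, the percentage can be reproduced from the two counts in the description. This is a minimal sketch: the function name &lt;CODE&gt;compliance_percent&lt;/CODE&gt; is hypothetical, and it assumes the 5 is the numerator the dashboard uses (in Azure Policy that count can include exempt resources, not only compliant ones).&lt;/P&gt;

```python
def compliance_percent(counted, total):
    """Return the share of counted resources as a rounded whole percentage."""
    if total == 0:
        raise ValueError("total must not be zero")
    return round(100 * counted / total)


# Figures from the dashboard description: 5 of 18 resources.
print(compliance_percent(5, 18))  # 28
```

&lt;P&gt;The exact ratio is about 27.8%, which the dashboard shows rounded to 28%.&lt;/P&gt;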
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897174"&gt;&lt;/A&gt;5.2 Performance Monitoring&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 15:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Performance Monitoring: Unified Dashboard&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Performance dashboard showing: Logical Disk Performance (C: drive 30% used), CPU Utilization (1.75% average, 5.73% 95th percentile), Available Memory (3.1GB average). This same dashboard works for Azure Arc-enabled servers, giving you consistent visibility across your entire SQL Server estate.&lt;/P&gt;
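&lt;P&gt;The dashboard reports both an average and a 95th percentile for CPU, and the gap between them is what reveals short spikes. The sketch below shows how the two statistics differ on a small set of samples. The sample values are hypothetical (not taken from the dashboard), and the nearest-rank percentile method is an assumption; Azure Monitor may compute percentiles differently.&lt;/P&gt;

```python
import math
import statistics


def nearest_rank_percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample with at least pct percent of values at or below it."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


# Hypothetical CPU utilization samples, in percent.
cpu_samples = [1.0, 1.2, 0.8, 1.5, 2.0, 1.1, 0.9, 1.3, 6.0, 1.7]

average = statistics.mean(cpu_samples)
p95 = nearest_rank_percentile(cpu_samples, 95)
print(round(average, 2), p95)
```

&lt;P&gt;A single spike barely moves the average but dominates the 95th percentile, which is why a low average alone is not enough for sizing decisions.&lt;/P&gt;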
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897175"&gt;&lt;/A&gt;5.3 Service Dependency Mapping&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Figure 16:&amp;nbsp;&lt;/STRONG&gt;&lt;EM&gt;Service Map: Visualize Dependencies&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Description:&amp;nbsp;&lt;/STRONG&gt;Map view showing server FNPSVR01 with 17 processes connecting to Port 443 (7 servers) and Port 53 (1 server). Machine Summary shows FQDN, OS (Windows Server 2016), and IP address. Use this to understand application dependencies before maintenance or migration. It is available for both Azure Arc-enabled and Azure-native servers.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H1&gt;6. Troubleshooting Guide&lt;/H1&gt;
&lt;P&gt;This section covers common issues encountered when working with Azure Arc-enabled SQL Server and Azure SQL VMs.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897177"&gt;&lt;/A&gt;6.1 Common Issues Overview&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="border-width: 1px;"&gt;&lt;thead&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Issue&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Symptoms&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Azure Arc-enabled&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Azure VM&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/thead&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;Assessment Upload Failed&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Status: 'Failed - upload failed'&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;✅ Applies&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;✅ Applies&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;Wire Server 403&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Agent cannot connect&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;❌ N/A&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;✅ Applies&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;IMDS Disabled&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Cannot obtain token&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;❌ N/A&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;✅ Applies&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;Azure Arc Agent Connectivity&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Server not appearing&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;✅ Applies&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;❌ N/A&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;SQL Login Failed&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Machine account denied&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;✅ Applies&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;✅ Applies&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 25.00%" /&gt;&lt;col style="width: 25.00%" /&gt;&lt;col style="width: 25.00%" /&gt;&lt;col style="width: 25.00%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897178"&gt;&lt;/A&gt;6.2 Real Case Study: Assessment Upload Failed on Azure VM&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Note: This case study is from an Azure VM (not Azure Arc-enabled). The Wire Server and IMDS issues are specific to Azure VMs. Azure Arc-enabled servers use different connectivity mechanisms.&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Symptoms observed:&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Assessment status: 'Failed - upload failed'&lt;/LI&gt;
&lt;LI&gt;Local data collected successfully (415 issues)&lt;/LI&gt;
&lt;LI&gt;Data not appearing in Log Analytics workspace&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Root causes identified from logs:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Error 1 (ExtensionLog):&lt;/STRONG&gt;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;[ERROR] Customer disable the IMDS service, cannot obtain IMDS token.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&lt;STRONG&gt;Error 2 (WaAppAgent.log):&lt;/STRONG&gt;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;[WARN] GetMachineGoalState() failed: 403 (Forbidden) to 168.63.129.16&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&lt;STRONG&gt;Resolution for Azure VMs&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Fix Wire Server (168.63.129.16) connectivity:&lt;/STRONG&gt;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;# Test connectivity&lt;BR /&gt;Test-NetConnection -ComputerName 168.63.129.16 -Port 80&lt;BR /&gt;&lt;BR /&gt;# Add route if missing&lt;BR /&gt;route add 168.63.129.16 mask 255.255.255.255 &amp;lt;gateway&amp;gt; -p&lt;BR /&gt;&lt;BR /&gt;# Add firewall rule if needed&lt;BR /&gt;New-NetFirewallRule -DisplayName "Allow Azure Wire Server" -Direction Outbound -RemoteAddress 168.63.129.16 -Action Allow&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&lt;STRONG&gt;Fix IMDS (169.254.169.254) connectivity:&lt;/STRONG&gt;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;# Test IMDS&lt;BR /&gt;Invoke-RestMethod -Uri "http://169.254.169.254/metadata/instance?api-version=2021-02-01" -Headers @{Metadata="true"}&lt;BR /&gt;&lt;BR /&gt;# Add firewall rule if blocked&lt;BR /&gt;New-NetFirewallRule -DisplayName "Allow Azure IMDS" -Direction Outbound -RemoteAddress 169.254.169.254 -Action Allow&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&lt;STRONG&gt;Test Azure Arc agent connectivity:&lt;/STRONG&gt;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;# Check Arc agent status&lt;BR /&gt;&amp;amp; "$env:ProgramW6432\AzureConnectedMachineAgent\azcmagent.exe" show&lt;BR /&gt;&lt;BR /&gt;# Test connectivity to Azure endpoints&lt;BR /&gt;&amp;amp; "$env:ProgramW6432\AzureConnectedMachineAgent\azcmagent.exe" check&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
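&lt;P&gt;The two log signatures in this case study each point to a different connectivity check, so a first triage pass can be automated. The following is a minimal sketch, not part of any Azure tooling: the signature strings are copied from the log excerpts above, while the &lt;CODE&gt;SIGNATURES&lt;/CODE&gt; mapping and the &lt;CODE&gt;triage&lt;/CODE&gt; function are hypothetical names chosen for illustration.&lt;/P&gt;

```python
# Signatures copied from the two log excerpts above; the mapping to checks is illustrative.
SIGNATURES = {
    "cannot obtain IMDS token": "Check IMDS (169.254.169.254) connectivity",
    "403 (Forbidden) to 168.63.129.16": "Check Wire Server (168.63.129.16) connectivity",
}


def triage(log_lines):
    """Return the suggested checks whose signature appears in any log line."""
    suggestions = []
    for signature, suggestion in SIGNATURES.items():
        if any(signature in line for line in log_lines):
            suggestions.append(suggestion)
    return suggestions


sample = [
    "[ERROR] Customer disable the IMDS service, cannot obtain IMDS token.",
    "[WARN] GetMachineGoalState() failed: 403 (Forbidden) to 168.63.129.16",
]
for item in triage(sample):
    print(item)
```

&lt;P&gt;Plain substring matching is deliberate here: the goal is only to route an engineer to the right fix, and log formats may vary between agent versions.&lt;/P&gt;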
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222897180"&gt;&lt;/A&gt;6.3 Azure Arc-enabled SQL Server Connectivity Issues&lt;/P&gt;
&lt;P&gt;For Azure Arc-enabled servers (not Azure VMs), connectivity issues are different:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Required Azure endpoints for Azure Arc agent:&lt;/STRONG&gt;&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="border-width: 1px;"&gt;&lt;thead&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Endpoint&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Port&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Purpose&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/thead&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;management.azure.com&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;443&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Azure Resource Manager&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;login.microsoftonline.com&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;443&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Microsoft Entra ID (formerly Azure AD) authentication&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;*.his.arc.azure.com&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;443&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Azure Arc Hybrid Identity&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;*.guestconfiguration.azure.com&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;443&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Guest configuration&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 33.33%" /&gt;&lt;col style="width: 33.33%" /&gt;&lt;col style="width: 33.33%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
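&lt;P&gt;For a quick scripted pre-check of the endpoints in the table, a plain TCP probe on port 443 is often enough to spot a blocked firewall path. This is a rough sketch under stated assumptions: &lt;CODE&gt;can_connect&lt;/CODE&gt; and &lt;CODE&gt;check_endpoints&lt;/CODE&gt; are hypothetical helper names, the wildcard entries are skipped because their concrete host names are not given here, and a successful TCP connection does not prove that TLS or a proxy is configured correctly. The agent's own &lt;CODE&gt;azcmagent check&lt;/CODE&gt; command remains the authoritative test.&lt;/P&gt;

```python
import socket

# Endpoints from the table above. Wildcard entries are listed for completeness
# but cannot be probed without a concrete host name.
ENDPOINTS = [
    "management.azure.com",
    "login.microsoftonline.com",
    "*.his.arc.azure.com",
    "*.guestconfiguration.azure.com",
]


def can_connect(host, port=443, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_endpoints(endpoints, probe=can_connect):
    """Map each endpoint to True or False, or to None for skipped wildcard patterns."""
    results = {}
    for endpoint in endpoints:
        results[endpoint] = None if endpoint.startswith("*.") else probe(endpoint)
    return results
```

&lt;P&gt;Calling &lt;CODE&gt;check_endpoints(ENDPOINTS)&lt;/CODE&gt; on the affected server returns a dictionary keyed by endpoint. The &lt;CODE&gt;probe&lt;/CODE&gt; parameter exists so the function can be exercised without network access.&lt;/P&gt;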
&lt;UL&gt;
&lt;LI&gt;Troubleshoot Best Practices Assessment&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/troubleshoot-assessment?view=sql-server-ver16" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/troubleshoot-assessment?view=sql-server-ver16" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/troubleshoot-assessment?view=sql-server-v...&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;What is IP Address 168.63.129.16 (Wire Server)&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/virtual-network/what-is-ip-address-168-63-129-16" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/virtual-network/what-is-ip-address-168-63-129-16" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/azure/virtual-network/what-is-ip-address-168-63-129-16&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Azure Instance Metadata Service (IMDS)&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/virtual-machines/instance-metadata-service" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/virtual-machines/instance-metadata-service" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/azure/virtual-machines/instance-metadata-service&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Troubleshoot IMDS Connection Issues on Windows VMs&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/troubleshoot/azure/virtual-machines/windows/windows-vm-imds-connection" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/troubleshoot/azure/virtual-machines/windows/windows-vm-imds-connection" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/troubleshoot/azure/virtual-machines/windows/windows-vm-imds-connec...&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Troubleshoot Azure Windows VM Agent Issues&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/troubleshoot/azure/virtual-machines/windows/windows-azure-guest-agent" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/troubleshoot/azure/virtual-machines/windows/windows-azure-guest-agent" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/troubleshoot/azure/virtual-machines/windows/windows-azure-guest-ag...&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H1&gt;7. Additional Resources&lt;/H1&gt;
&lt;P&gt;&lt;A href="https://microsoft.seismic.com/apps/doccenter/a5266a70-9230-4c1e-a553-c5bddcb7a896/doc/%252Fdde0caec0e-9236-f21b-2991-5868e63d3984%252FdfYTZjNDRiZDMtMzEwZS1kNWZkLTNjOGEtNjliYWJjMjhmMmUw%252CPT0%253D%252CUHJvZHVjdCBEZW1v%252Flf062d7968-3275-47e7-8ed3-1c84ead67f6a/grid/" target="_blank" rel="noopener"&gt;Demo Deck: Azure Arc for Windows Server and SQL Server&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;More resources:&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Learn more about the&amp;nbsp;&lt;A href="https://learn.microsoft.com/sql/sql-server/azure-arc/migrate-to-azure-sql-managed-instance" target="_blank" rel="noopener"&gt;new migration capability in Azure Arc&lt;/A&gt;&amp;nbsp;on Microsoft Learn.&lt;/LI&gt;
&lt;LI&gt;Onboard your&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/connect?view=sql-server-ver17&amp;amp;tabs=windows" target="_blank" rel="noopener"&gt;SQL Server to Azure Arc&lt;/A&gt;&amp;nbsp;today.&lt;/LI&gt;
&lt;LI&gt;Learn more about&amp;nbsp;&lt;A href="https://techcommunity.microsoft.com/blog/microsoftdatamigration/general-availability-continuous-migration-assessment-for-sql-server-enabled-by-a/4430603" target="_blank" rel="noopener"&gt;continuous migration assessment&lt;/A&gt;&amp;nbsp;from SQL Server enabled by Azure Arc.&lt;/LI&gt;
&lt;LI&gt;Download resources on&amp;nbsp;&lt;A href="http://github.com/microsoft/sql-server-samples" target="_blank" rel="noopener"&gt;github.com/microsoft/sql-server-samples&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H1&gt;&lt;A class="lia-anchor" target="_blank" name="_Toc222737912"&gt;&lt;/A&gt;&lt;/H1&gt;</description>
      <pubDate>Thu, 16 Apr 2026 15:00:00 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/sql-server-enabled-by-azure-arc-overview/ba-p/4496399</guid>
      <dc:creator>NaufalPrawironegoro</dc:creator>
      <dc:date>2026-04-16T15:00:00Z</dc:date>
    </item>
    <item>
      <title>From fragmented sites to consistent governance: Azure Arc patterns for adaptive cloud strategy.</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/from-fragmented-sites-to-consistent-governance-azure-arc/ba-p/4510110</link>
      <description>&lt;P&gt;&lt;BR /&gt;In manufacturing companies, hybrid architectures aren’t transitional; they’re persistent. Most large manufacturers operate across remote plants, branch sites, private datacenters, and Azure. The main challenge isn’t adopting cloud services; it is preventing long‑term operational fragmentation: multiple teams, multiple tools, inconsistent security controls, and uneven governance as the estate grows.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;img&gt;Figure 1: &lt;EM&gt;When manufacturing IT grows organically, systems end up scattered across factories, edge, and cloud—creating fragmentation instead of flow.&lt;/EM&gt;&lt;/img&gt;
&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Azure Arc addresses this as an architectural control‑plane pattern: it extends Azure management to infrastructure and Kubernetes outside Azure by projecting them into Azure Resource Manager (ARM) so they can be governed using Azure-native primitives such as policy, RBAC, and monitoring.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;This article describes three architecture patterns that consistently emerge in manufacturing and edge scenarios. Each pattern addresses a distinct set of constraints—ranging from centralized governance across hybrid estates, to plant‑adjacent platforms, to fully disconnected environments—and illustrates how Azure services can be composed to support these realities in a scalable, well‑governed way.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Typical&amp;nbsp;manufacturing environments&amp;nbsp;must&amp;nbsp;contend with some&amp;nbsp;or&amp;nbsp;many of&amp;nbsp;the following&amp;nbsp;components:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="8" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;&lt;STRONG&gt;Latency &amp;amp; determinism&lt;/STRONG&gt;:&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;plant-floor systems often require local execution&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="8" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;&lt;STRONG&gt;Distributed footprint&lt;/STRONG&gt;:&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;dozens/hundreds of sites with varying maturity&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="8" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;&lt;STRONG&gt;Connectivity variability&lt;/STRONG&gt;:&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;some sites are intermittently connected&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="8" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;&lt;STRONG&gt;Regulatory &amp;amp; data constraints&lt;/STRONG&gt;:&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;some workloads must remain on premises&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="8" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="5" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;&lt;STRONG&gt;Cloud&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;:&amp;nbsp;Native cloud applications including&amp;nbsp;the AI&amp;nbsp;based research applications, SAP&amp;nbsp;systems,&amp;nbsp;etc.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;As a result, the estate becomes a mix of Azure + non‑Azure infrastructure. The failure mode&amp;nbsp;isn’t&amp;nbsp;performance—it’s&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;inconsistent operations&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;: different patching methods, different monitoring stacks, and uneven security baselines. Azure Arc is positioned specifically to&amp;nbsp;create unity across that&amp;nbsp;operational model by bringing hybrid resources into the Azure control plane&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;A helpful way to&amp;nbsp;think&amp;nbsp;about&amp;nbsp;Arc in manufacturing&amp;nbsp;scenario&amp;nbsp;is to separate&amp;nbsp;the&amp;nbsp;control plane and the&amp;nbsp;data plane:&lt;/SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;SPAN data-contrast="none"&gt;&lt;BR /&gt;Arc&amp;nbsp;enables&amp;nbsp;a centralized&amp;nbsp;control&amp;nbsp;plane by&amp;nbsp;projecting resources, like the ones&amp;nbsp;below,&amp;nbsp;into ARM:&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Azure Resource Manager (resource inventory, tags, RBAC, Policy)&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Security posture &amp;amp; compliance (Defender for Cloud, policy initiatives)&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Observability and operations workflows (Azure Monitor, Update Manager, etc.)&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;BR /&gt;&lt;SPAN data-contrast="auto"&gt;Whereas&amp;nbsp;the&amp;nbsp;data plane&amp;nbsp;remains&amp;nbsp;at&amp;nbsp;distributed&amp;nbsp;locations&amp;nbsp;meaning:&lt;/SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Workload execution&amp;nbsp;remains&amp;nbsp;at plants, private DCs, or edge sites&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Kubernetes API endpoints, runtime traffic, OT systems&amp;nbsp;remain&amp;nbsp;local&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;This separation is&amp;nbsp;an architectural&amp;nbsp;lever&amp;nbsp;allowing&amp;nbsp;organizations&amp;nbsp;to&amp;nbsp;standardize governance without forcing workload relocation.&lt;/SPAN&gt;&amp;nbsp;&lt;BR /&gt;&amp;nbsp;&lt;BR /&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;A high-level design decision matrix&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;Constraint&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335551550&amp;quot;:2,&amp;quot;335551620&amp;quot;:2,&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;Recommended starting pattern&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335551550&amp;quot;:2,&amp;quot;335551620&amp;quot;:2,&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="none"&gt;Why&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335551550&amp;quot;:2,&amp;quot;335551620&amp;quot;:2,&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Many sites + inconsistent tooling&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Arc as distributed control plane&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Standardizes governance and inventory via ARM projection&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Plant workloads require local platform&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Azure Local + Arc&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Uses Azure Local baseline + Arc integration for operations&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Connectivity cannot be assumed&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Disconnected/intermittent design&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Forces control-plane boundary design + local autonomy&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;/DIV&gt;
&lt;H2&gt;Pattern 1 — Azure Arc as the distributed control plane (for VMs, SQL Servers, and Kubernetes)&lt;/H2&gt;
&lt;H3&gt;When this pattern fits&lt;/H3&gt;
&lt;P&gt;Use this pattern when:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;You need consistent governance across plants, datacenters, and multicloud&lt;/LI&gt;
&lt;LI&gt;You can maintain &lt;STRONG&gt;at least periodic connectivity&lt;/STRONG&gt; for control-plane sync&lt;/LI&gt;
&lt;LI&gt;You want Azure policy/security/monitoring to apply uniformly&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3&gt;Architecture intent&lt;/H3&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/overview" target="_blank"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Azure Arc&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;projects existing bare metal, VM, and Kubernetes infrastructure resources into Azure to handle operations with Azure management and security tools. Azure Arc simplifies governance and management by delivering a consistent&amp;nbsp;multicloud&amp;nbsp;and on-premises management platform&amp;nbsp;experience&amp;nbsp;for Azure services. Once projected, you can&amp;nbsp;operate&amp;nbsp;hybrid resources using Azure-native constructs (inventory, compliance reporting, policy scope) and apply standardized guardrails.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;img&gt;Figure 2 - Azure Arc integrates external resources into Azure landing zones via ARM, enabling a unified control plane and consistent governance across cloud, on‑premises, and edge environments.&lt;/img&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;From an architectural standpoint, Azure Arc establishes a centralized control plane in Azure (ARM, RBAC, Policy, Resource Graph) and decentralized data plane remaining at plants, datacenters, or edge sites. This separation enables organizations to apply management‑group–scoped policies, standardized tagging, and Defender for Cloud controls consistently across environments, while preserving local execution and latency characteristics required by manufacturing workloads.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;&lt;STRONG&gt;Why this pattern matters:&amp;nbsp;&lt;/STRONG&gt;&lt;BR /&gt;It &lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;moves&amp;nbsp;organizations from managing individual sites to governing the entire estate as one. It minimizes operational drift as environments expand across plants and edge locations. Centralized control simplifies enforcement of standards without slowing local operations. The pattern creates predictability at scale in highly distributed environments. It&amp;nbsp;establishes&amp;nbsp;a stable foundation for future modernization initiatives.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134233117&amp;quot;:false,&amp;quot;134233118&amp;quot;:false,&amp;quot;201341983&amp;quot;:0,&amp;quot;335551550&amp;quot;:0,&amp;quot;335551620&amp;quot;:0,&amp;quot;335559738&amp;quot;:210,&amp;quot;335559739&amp;quot;:210,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H2&gt;Pattern 2 — Azure Local + Azure Arc (plant-adjacent platform pattern)&lt;/H2&gt;
&lt;H3&gt;When this pattern fits&lt;/H3&gt;
&lt;P&gt;Use this pattern when:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Workloads must run on premises for &lt;STRONG&gt;latency, sovereignty, or operational control&lt;/STRONG&gt;&lt;/LI&gt;
&lt;LI&gt;You want cloud-consistent operations without creating a separate tooling island&lt;/LI&gt;
&lt;LI&gt;You need a standardized platform for virtualized + containerized workloads at sites&lt;/LI&gt;
&lt;LI&gt;You need local AI inferencing where data must be processed at the source or plant site&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3&gt;Architecture intent&lt;/H3&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-local/overview?view=azloc-2602" target="_blank"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Azure Local&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;Microsoft’s&amp;nbsp;distributed infrastructure solution that extends Azure capabilities to customer-owned environments. It&amp;nbsp;facilitates&amp;nbsp;the local deployment of both modern and legacy applications across distributed or sovereign locations. Azure Local accelerates cloud and AI innovation by seamlessly delivering new applications, workloads, and services from cloud to edge, using Azure Arc as the unifying control plane.&lt;/SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;img&gt;Figure 3: Azure Local integration with select Azure services&lt;/img&gt;
&lt;P&gt;&lt;BR /&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;From an architectural perspective, Azure Local serves as the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;local data plane&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;for applications—supporting&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;general‑purpose virtual machines, managed Kubernetes (AKS), and selected Azure services&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;—while Azure Arc extends the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;Azure control plane&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;to that environment for inventory, policy, monitoring, and security integration. This separation allows workloads to run close to manufacturing systems without creating a parallel or disconnected operational model.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Azure Local supports a broad spectrum of workload types on the same platform foundation, including:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Traditional line‑of‑business applications on virtual machines&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Modern containerized workloads using AKS on Azure Local&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Azure‑consistent platform services that can be deployed locally, such as Azure Virtual Desktop and SQL Managed Instance&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;GPU‑accelerated workloads for AI inferencing and computer vision scenarios&lt;/SPAN&gt;&amp;nbsp;&lt;BR /&gt;&amp;nbsp;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;&lt;STRONG&gt;Why this pattern matters&lt;/STRONG&gt;:&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;Without a platform like Azure Local integrated through Azure Arc, on‑premises manufacturing workloads tend to evolve into bespoke environments with inconsistent security, monitoring, and lifecycle management—making long‑term scale and governance increasingly difficult.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;H2&gt;Pattern 3 — Disconnected edge workloads (connectivity-constrained design)&lt;/H2&gt;
&lt;H3&gt;When this pattern fits&lt;/H3&gt;
&lt;P&gt;Use this pattern when:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Sites cannot assume continuous connectivity&lt;/LI&gt;
&lt;LI&gt;Local autonomy is required for safety or production continuity&lt;/LI&gt;
&lt;LI&gt;You still want centralized governance &lt;EM&gt;when connected&lt;/EM&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3&gt;Architecture intent&lt;/H3&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;In manufacturing and edge scenarios, some environments must&amp;nbsp;operate&amp;nbsp;without continuous internet connectivity due to regulatory constraints, physical isolation, or operational risk tolerance. In these cases, architectures must assume that cloud control‑plane access is intermittent or unavailable, while local execution must continue without disruption. Disconnected architectures shift the primary design concern from&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;availability of services&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;to&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;autonomy of execution&lt;/SPAN&gt;&lt;SPAN data-contrast="none"&gt;. This pattern applies to environments that are fully offline, intermittently connected, or explicitly restricted from sending data to public cloud endpoints.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;img&gt;Figure 4: Azure Local disconnected architecture&lt;/img&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Azure supports this model through&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/ai-services/containers/disconnected-containers" target="_blank"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Disconnected-containers&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="none"&gt;, where containerized services are deployed and&amp;nbsp;operated&amp;nbsp;fully offline. Once provisioned, these containers run entirely on local infrastructure with no runtime dependency on Azure endpoints, enabling uninterrupted execution even during extended disconnection periods.&amp;nbsp;&amp;nbsp;Disconnected containers are offered through commitment&amp;nbsp;tier&amp;nbsp;pricing, each offering a discounted rate compared to the Standard pricing model.&amp;nbsp;Learn more about&amp;nbsp;pricing here:&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/foundry/concepts/manage-costs" target="_blank"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Plan and Manage Costs - Microsoft Foundry | Microsoft Learn&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;Before&amp;nbsp;attempting&amp;nbsp;to run a&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/ai-services/containers/disconnected-containers#container-image-and-license-updates" target="_blank"&gt;&lt;SPAN data-contrast="none"&gt;&lt;SPAN data-ccp-charstyle="Hyperlink"&gt;Docker container in an offline environment&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN data-contrast="none"&gt;, make sure you know the steps to successfully download and use the container. For example:&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="1" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;Host computer requirements and recommendations.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="2" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;The Docker pull command you use to download the container.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="3" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;How to&amp;nbsp;validate&amp;nbsp;that a container is&amp;nbsp;running.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI aria-setsize="-1" data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{&amp;quot;335552541&amp;quot;:1,&amp;quot;335559685&amp;quot;:720,&amp;quot;335559991&amp;quot;:360,&amp;quot;469769226&amp;quot;:&amp;quot;Symbol&amp;quot;,&amp;quot;469769242&amp;quot;:[8226],&amp;quot;469777803&amp;quot;:&amp;quot;left&amp;quot;,&amp;quot;469777804&amp;quot;:&amp;quot;&amp;quot;,&amp;quot;469777815&amp;quot;:&amp;quot;hybridMultilevel&amp;quot;}" data-aria-posinset="4" data-aria-level="1"&gt;&lt;SPAN data-contrast="none"&gt;How to send queries to the container's&amp;nbsp;endpoint&amp;nbsp;once&amp;nbsp;it's&amp;nbsp;running.&lt;/SPAN&gt;&amp;nbsp;&lt;BR /&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
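&lt;P&gt;The last two checks in the list above can be sketched roughly as follows. This is only an illustration: it assumes Docker is installed on the host, that the container was started with its HTTP endpoint published on host port 5000 (an assumption that depends on your own &lt;CODE&gt;docker run&lt;/CODE&gt; port mapping), and that the container exposes the &lt;CODE&gt;/ready&lt;/CODE&gt; endpoint described in the Azure AI containers documentation.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Sketch: check a locally running container (port 5000 is an assumed host port mapping)
$baseUri = "http://localhost:5000"

# List running containers to confirm the container started
docker ps --format "table {{.Names}}\t{{.Image}}\t{{.Status}}"

# Probe the readiness endpoint; a failure usually means the container is not ready to accept queries
try {
    Invoke-RestMethod -Uri "$baseUri/ready" -Method Get -TimeoutSec 10
} catch {
    Write-Warning "Container not reachable at ${baseUri}: $($_.Exception.Message)"
}&lt;/LI-CODE&gt;
&lt;P&gt;Both checks run entirely against the local host, so they also work while the site is disconnected.&lt;/P&gt;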
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN data-contrast="auto"&gt;Why this pattern matters:&lt;/SPAN&gt;&amp;nbsp;&lt;/STRONG&gt;&lt;BR /&gt;&lt;SPAN data-contrast="auto"&gt;This pattern matters because not all environments can rely on continuous connectivity. It enables critical workloads to&amp;nbsp;operate&amp;nbsp;independently at the edge while&amp;nbsp;remaining&amp;nbsp;aligned to central governance when connectivity is available. The pattern prioritizes local autonomy without sacrificing architectural discipline. It reduces operational risk in constrained or disconnected sites. This approach ensures resilience and continuity in environments where connectivity cannot be assumed.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;134233117&amp;quot;:false,&amp;quot;134233118&amp;quot;:false,&amp;quot;201341983&amp;quot;:0,&amp;quot;335551550&amp;quot;:0,&amp;quot;335551620&amp;quot;:0,&amp;quot;335559738&amp;quot;:0,&amp;quot;335559739&amp;quot;:0,&amp;quot;335559740&amp;quot;:300}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="none"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:150}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;&lt;SPAN data-contrast="none"&gt;Manufacturing IT will remain distributed by design. The risk is not hybrid complexity, but fragmented operations. By centralizing the control plane while keeping execution local, Arc enables consistent security, compliance, and operations across cloud, datacenter, and edge.&lt;/SPAN&gt;&lt;SPAN data-ccp-props="{&amp;quot;335557856&amp;quot;:16777215,&amp;quot;335559739&amp;quot;:0}"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 15 Apr 2026 17:11:28 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/from-fragmented-sites-to-consistent-governance-azure-arc/ba-p/4510110</guid>
      <dc:creator>sakshimalhotra</dc:creator>
      <dc:date>2026-04-15T17:11:28Z</dc:date>
    </item>
    <item>
      <title>Azure Arc Server Mar 2026 Forum Recap</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/azure-arc-server-mar-2026-forum-recap/ba-p/4511299</link>
      <description>&lt;P&gt;Please find the &lt;A class="lia-external-url" href="https://youtu.be/90SNRrI64sg?si=zHrxUx2FQCCaE8sn" target="_blank"&gt;recording for the monthly Azure Arc Server Forum on YouTube&lt;/A&gt;!&lt;/P&gt;
&lt;P&gt;During the March 2026 Azure Arc Server Forum, we discussed:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Deploying Ansible Playbooks through Machine Configuration as Azure Policy (Learn more: &lt;A href="https://techcommunity.microsoft.com/blog/azurearcblog/announcing-private-preview-deploy-ansible-playbooks-using-azure-policy-via-machi/4507848" target="_blank"&gt;Announcing Private Preview: Deploy Ansible Playbooks using Azure Policy via Machine Configuration&lt;/A&gt;) and sign up at &lt;A href="https://aka.ms/ansible-arc-signup" target="_blank"&gt;https://aka.ms/ansible-arc-signup&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;New MECM (SCCM) connector supporting Cloud Native Server Management, sign up for Private Preview at &lt;A href="https://aka.ms/arc-mecm/preview" target="_blank"&gt;aka.ms/arc-mecm/preview&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Automatic Agent Upgrade at Scale Enablement (Learn more: &lt;A href="https://techcommunity.microsoft.com/blog/azurearcblog/run-the-latest-azure-arc-agent-with-automatic-agent-upgrade-public-preview/4508535" target="_blank"&gt;Run the latest Azure Arc agent with Automatic Agent Upgrade (Public Preview)&lt;/A&gt;)&lt;/LI&gt;
&lt;LI&gt;TPM-backed Identity for Secure Onboarding, sign up for Private Preview at&amp;nbsp;&lt;A class="lia-external-url" href="https://aka.ms/arc-tpm-backed-identity/preview/" target="_blank"&gt;https://aka.ms/arc-tpm-backed-identity/preview/&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;To sign up for the Azure Arc Server Forum and newsletter, please register with contact details at&amp;nbsp;&lt;A href="https://aka.ms/arcserverforumsignup/" target="_blank"&gt;https://aka.ms/arcserverforumsignup/&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;For the latest agent release notes, check out&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/agent-release-notes" target="_blank"&gt;What's new with Azure Connected Machine agent - Azure Arc | Microsoft Learn&lt;/A&gt;.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Our April 2026 forum will be held on Thursday, April 16 at 9:30 AM PT / 12:30 PM ET. We look forward to you joining us. Thank you!&lt;/P&gt;</description>
      <pubDate>Tue, 14 Apr 2026 17:54:18 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/azure-arc-server-mar-2026-forum-recap/ba-p/4511299</guid>
      <dc:creator>Aurnov_Chattopadhyay</dc:creator>
      <dc:date>2026-04-14T17:54:18Z</dc:date>
    </item>
    <item>
      <title>Automating Arc-enabled SQL Server license type configuration with Azure Policy</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/automating-arc-enabled-sql-server-license-type-configuration/ba-p/4500326</link>
      <description>&lt;P&gt;Azure Arc enables customers to onboard &lt;SPAN data-contrast="auto"&gt;SQL Server instances&lt;/SPAN&gt; - hosted on Linux or Windows - into Azure, regardless of where they are hosted: on‑premises, in multicloud environments, or at the edge. Once onboarded, these resources can be managed through the Azure Portal using services like Azure Monitor, Azure Policy, and Microsoft Defender for Cloud.&lt;/P&gt;
&lt;P&gt;An important part of this onboarding is configuring the license type on each Arc-enabled resource &lt;SPAN data-contrast="auto"&gt;to match&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;your&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;licensing agreement&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;with Microsoft&lt;/SPAN&gt;. For SQL Server, the LicenseType property on the Arc extension determines how the instance is licensed: &lt;STRONG&gt;Paid&lt;/STRONG&gt; (&lt;SPAN data-contrast="auto"&gt;you have a SQL Server license with&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;Software Assurance&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;or a SQL Server subscription&lt;/SPAN&gt;),&amp;nbsp;&lt;STRONG&gt;PAYG&lt;/STRONG&gt; (y&lt;SPAN data-contrast="auto"&gt;ou are paying for SQL Server software&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;on a&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;pay-as-you-go&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;&amp;nbsp;basis&lt;/SPAN&gt;), or&amp;nbsp;&lt;STRONG&gt;LicenseOnly &lt;/STRONG&gt;(&lt;SPAN data-contrast="auto"&gt;you have a perpetual SQL Server license)&lt;/SPAN&gt;. Setting this correctly matters for two reasons:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Unlocking additional benefits:&lt;/STRONG&gt; customers with Paid or PAYG license type gain access to some Azure services at no extra cost - such as Azure Update Manager and Machine Configuration - as well as exclusive capabilities like Best Practices Assessment and Remote Support&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Enabling pay-as-you-go billing:&lt;/STRONG&gt; customers who do not have Software Assurance can &lt;SPAN data-contrast="auto"&gt;pay for SQL Server software&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;only&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;w&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;hen&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;they use it&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;via their Azure subscription&lt;/SPAN&gt; by setting the license type to PAYG&lt;/LI&gt;
&lt;/UL&gt;
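&lt;P&gt;To see where you stand before changing anything, a query along these lines can summarize the current license types across your Arc-enabled SQL Server extensions. It is a sketch that assumes the &lt;CODE&gt;Az.ResourceGraph&lt;/CODE&gt; module is installed, that you are signed in with &lt;CODE&gt;Connect-AzAccount&lt;/CODE&gt;, and that the license type is stored under &lt;CODE&gt;properties.settings.LicenseType&lt;/CODE&gt; on the extension resource. The &lt;CODE&gt;Unspecified&lt;/CODE&gt; label for empty values is our own choice for readability.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Sketch: count Arc-enabled SQL Server extensions per current license type
$query = @"
resources
| where type =~ 'microsoft.hybridcompute/machines/extensions'
| where properties.type in ('WindowsAgent.SqlServer', 'LinuxAgent.SqlServer')
| extend licenseType = iff(isempty(tostring(properties.settings.LicenseType)), 'Unspecified', tostring(properties.settings.LicenseType))
| summarize extensions = count() by licenseType
"@

Search-AzGraph -Query $query&lt;/LI-CODE&gt;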
&lt;H2&gt;Configure the license types at scale using Azure Policy&lt;/H2&gt;
&lt;P&gt;Configuring the license type on each Arc-enabled SQL Server instance can be done manually in the Azure Portal, but for large-scale operations, automation is essential.&lt;/P&gt;
&lt;P&gt;&lt;SPAN data-contrast="auto"&gt;One&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt;way to implement automation is &lt;/SPAN&gt;via PowerShell, as explained here:&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-configuration?view=sql-server-ver17&amp;amp;tabs=azure%2Cazure-portal#modify-sql-server-configuration" target="_blank" rel="noopener"&gt;Configure SQL Server - SQL Server enabled by Azure Arc | Microsoft Learn&lt;/A&gt;. &lt;SPAN data-contrast="auto"&gt;But here&lt;/SPAN&gt;&lt;SPAN data-contrast="auto"&gt; we will focus on how this can be automated using Azure Policy. &lt;/SPAN&gt;An existing &lt;A class="lia-internal-link lia-internal-url lia-internal-url-content-type-blog" href="https://techcommunity.microsoft.com/blog/coreinfrastructureandsecurityblog/automating-windows-server-licensing-benefits-with-azure-arc-policy/4469345" target="_blank" rel="noopener" data-lia-auto-title="article" data-lia-auto-title-active="0"&gt;article&lt;/A&gt;, written by Jeff Pigott, &lt;SPAN style="color: rgb(30, 30, 30);"&gt;explains this process for Windows Server, which inspired extending the same approach to SQL Server.&lt;/SPAN&gt;&lt;/P&gt;
&lt;H3&gt;How to deploy the policy?&lt;/H3&gt;
&lt;P&gt;Deployment has two steps:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Create/update the Azure Policy definition and assignment&lt;/LI&gt;
&lt;LI&gt;Start a remediation task so existing Arc-enabled SQL Server extensions are brought into compliance&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;You can deploy Azure Policy in multiple ways. In this article, we use PowerShell. See also: &lt;A style="font-style: normal; font-weight: 400; background-color: rgb(255, 255, 255);" href="https://learn.microsoft.com/en-us/azure/governance/policy/tutorials/create-and-manage#implement-a-new-custom-policy" target="_blank" rel="noopener"&gt;Tutorial: Build policies to enforce compliance - Azure Policy | Microsoft Learn&lt;/A&gt;&lt;SPAN style="color: rgb(30, 30, 30);"&gt;.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Source code&lt;/STRONG&gt;: &lt;A href="https://github.com/microsoft/sql-server-samples/tree/master/samples/manage/azure-arc-enabled-sql-server/compliance/arc-sql-license-type-compliance" target="_blank" rel="noopener"&gt;microsoft/sql-server-samples/.../arc-sql-license-type-compliance&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Personal repository&lt;/STRONG&gt;: &lt;A href="https://github.com/claestom/sql-arc-policy-license-config" target="_blank" rel="noopener"&gt;claestom/sql-arc-policy-license-config&lt;/A&gt;.&lt;/P&gt;
&lt;H4 data-line="21"&gt;&lt;STRONG&gt;Definition and assignment creation&lt;/STRONG&gt;&lt;/H4&gt;
&lt;P data-line="21"&gt;Download the required files:&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Optional: create and enter a local working directory
mkdir sql-arc-lt-compliance
cd sql-arc-lt-compliance

$baseUrl = "https://raw.githubusercontent.com/microsoft/sql-server-samples/master/samples/manage/azure-arc-enabled-sql-server/compliance/arc-sql-license-type-compliance"

New-Item -ItemType Directory -Path policy, scripts -Force | Out-Null

curl -sLo policy/azurepolicy.json "$baseUrl/policy/azurepolicy.json"
curl -sLo scripts/deployment.ps1 "$baseUrl/scripts/deployment.ps1"
curl -sLo scripts/start-remediation.ps1 "$baseUrl/scripts/start-remediation.ps1"&lt;/LI-CODE&gt;
&lt;P&gt;&lt;STRONG&gt;Note:&lt;/STRONG&gt; On Windows PowerShell 5.1, &lt;CODE&gt;curl&lt;/CODE&gt; is an alias for &lt;CODE&gt;Invoke-WebRequest&lt;/CODE&gt;. Use &lt;CODE&gt;curl.exe&lt;/CODE&gt; instead, or run the commands in PowerShell 7+.&lt;/P&gt;
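&lt;P&gt;If you prefer not to depend on &lt;CODE&gt;curl&lt;/CODE&gt; at all, the same three files can be fetched with &lt;CODE&gt;Invoke-WebRequest&lt;/CODE&gt;. The loop below is a minimal sketch of that alternative. The file list mirrors the download commands above, and the &lt;CODE&gt;$files&lt;/CODE&gt; variable is introduced here only for illustration.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Sketch: PowerShell-native download that behaves the same in Windows PowerShell 5.1 and PowerShell 7+
$baseUrl = "https://raw.githubusercontent.com/microsoft/sql-server-samples/master/samples/manage/azure-arc-enabled-sql-server/compliance/arc-sql-license-type-compliance"
$files = "policy/azurepolicy.json", "scripts/deployment.ps1", "scripts/start-remediation.ps1"

foreach ($file in $files) {
    # Make sure the target folder (policy or scripts) exists before writing the file
    New-Item -ItemType Directory -Path (Split-Path $file -Parent) -Force | Out-Null
    Invoke-WebRequest -Uri "$baseUrl/$file" -OutFile $file -UseBasicParsing
}&lt;/LI-CODE&gt;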
&lt;P&gt;Authenticate to Azure:&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;Connect-AzAccount&lt;/LI-CODE&gt;
&lt;P&gt;Set your variables. Only &lt;CODE&gt;TargetLicenseType&lt;/CODE&gt; is required - all others are optional:&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Required 
$TargetLicenseType    = "PAYG"                                      # "Paid" or "PAYG"

# Optional (uncomment to override defaults) 
# $ManagementGroupId      = "&amp;lt;management-group-id&amp;gt;"                 # Default: tenant root management group
# $SubscriptionId         = "&amp;lt;subscription-id&amp;gt;"                     # Default: policy assigned at management group scope
# $ExtensionType          = "Both"                                  # "Windows", "Linux", or "Both" (default)
# $LicenseTypesToOverwrite = @("Unspecified","Paid","PAYG","LicenseOnly")  # Default: all&lt;/LI-CODE&gt;
&lt;P&gt;Run the deployment script:&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Minimal: uses defaults for management group, platform, and overwrite targets
.\scripts\deployment.ps1 -TargetLicenseType $TargetLicenseType

# With subscription scope
.\scripts\deployment.ps1 -TargetLicenseType $TargetLicenseType -SubscriptionId $SubscriptionId

# With all options
.\scripts\deployment.ps1 `
  -ManagementGroupId $ManagementGroupId `
  -SubscriptionId $SubscriptionId `
  -ExtensionType $ExtensionType `
  -TargetLicenseType $TargetLicenseType `
  -LicenseTypesToOverwrite $LicenseTypesToOverwrite&lt;/LI-CODE&gt;
&lt;P&gt;&lt;STRONG&gt;Parameter notes:&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;ManagementGroupId&lt;/STRONG&gt; (optional): management group where the policy definition is created. Defaults to the tenant root management group when not specified&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;ExtensionType&lt;/STRONG&gt; (optional, default &lt;CODE&gt;Both&lt;/CODE&gt;): &lt;CODE&gt;Windows&lt;/CODE&gt;, &lt;CODE&gt;Linux&lt;/CODE&gt;, or &lt;CODE&gt;Both&lt;/CODE&gt;. When &lt;CODE&gt;Both&lt;/CODE&gt;, a single policy definition and assignment covers both platforms&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;SubscriptionId&lt;/STRONG&gt; (optional): if provided, assignment scope is subscription (otherwise management group scope)&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;TargetLicenseType&lt;/STRONG&gt; (required): &lt;CODE&gt;Paid&lt;/CODE&gt; or &lt;CODE&gt;PAYG&lt;/CODE&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;LicenseTypesToOverwrite&lt;/STRONG&gt; (optional, default all): controls which current states are eligible for update
&lt;UL&gt;
&lt;LI&gt;&lt;CODE&gt;Unspecified&lt;/CODE&gt; = no current LicenseType&lt;/LI&gt;
&lt;LI&gt;&lt;CODE&gt;Paid&lt;/CODE&gt;, &lt;CODE&gt;PAYG&lt;/CODE&gt;, &lt;CODE&gt;LicenseOnly&lt;/CODE&gt; = explicit current values&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;The script also creates a system-assigned managed identity on the policy assignment and assigns required roles automatically. Role assignments include retry logic (5 attempts, 10-second delay) to handle managed identity replication delays, which helps prevent common &lt;CODE&gt;PolicyAuthorizationFailed&lt;/CODE&gt; errors.&lt;/P&gt;
&lt;H4&gt;&lt;STRONG&gt;Remediation task creation&lt;/STRONG&gt;&lt;/H4&gt;
&lt;P&gt;After deployment, allow a few minutes for Azure Policy to run a compliance scan for the selected scope.&lt;BR /&gt;You can monitor this in &lt;STRONG&gt;Azure Policy → Compliance.&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;More info: &lt;A href="https://learn.microsoft.com/en-us/azure/governance/policy/how-to/get-compliance-data#portal" target="_blank" rel="noopener"&gt;Get policy compliance data - Azure Policy | Microsoft Learn&lt;/A&gt;.&lt;/P&gt;
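&lt;P&gt;If you prefer to check compliance from PowerShell instead of the portal, the following is a minimal sketch. It assumes the &lt;CODE&gt;Az.PolicyInsights&lt;/CODE&gt; module is installed and that your current Az context is a subscription inside the assignment scope. It is not part of the sample scripts.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Assumption: Az.PolicyInsights is installed and Connect-AzAccount has already been run.
# Trigger an on-demand compliance scan for the subscription in the current context.
# Without -AsJob the cmdlet blocks until the scan completes, which can take a while.
Start-AzPolicyComplianceScan

# List the latest policy states that are non-compliant in the current subscription.
Get-AzPolicyState -Filter "ComplianceState eq 'NonCompliant'" |
  Select-Object ResourceId, PolicyAssignmentName, ComplianceState&lt;/LI-CODE&gt;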
&lt;P&gt;Set your variables. &lt;CODE&gt;TargetLicenseType&lt;/CODE&gt; is required and must match the value used during deployment:&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Required 
$TargetLicenseType    = "PAYG"                                      # Must match the deployment target

# Optional (uncomment to override defaults)
# $ManagementGroupId      = "&amp;lt;management-group-id&amp;gt;"                 # Default: tenant root management group
# $SubscriptionId         = "&amp;lt;subscription-id&amp;gt;"                     # Default: remediation runs at management group scope
# $ExtensionType          = "Both"                                  # Must match the platform used for deployment&lt;/LI-CODE&gt;
&lt;P&gt;Then start remediation:&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Minimal: uses defaults for management group and platform
.\scripts\start-remediation.ps1 -TargetLicenseType $TargetLicenseType -GrantMissingPermissions

# With subscription scope
.\scripts\start-remediation.ps1 -TargetLicenseType $TargetLicenseType -SubscriptionId $SubscriptionId -GrantMissingPermissions

# With all options
.\scripts\start-remediation.ps1 `
  -ManagementGroupId $ManagementGroupId `
  -ExtensionType $ExtensionType `
  -SubscriptionId $SubscriptionId `
  -TargetLicenseType $TargetLicenseType `
  -GrantMissingPermissions&lt;/LI-CODE&gt;
&lt;P&gt;&lt;STRONG&gt;Parameter notes:&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;ManagementGroupId&lt;/STRONG&gt; (optional): defaults to tenant root management group&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;ExtensionType&lt;/STRONG&gt; (optional, default &lt;CODE&gt;Both&lt;/CODE&gt;): must match the platform used for the assignment&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;SubscriptionId&lt;/STRONG&gt; (optional): run remediation at subscription scope&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;TargetLicenseType&lt;/STRONG&gt; (required): must match the assignment target&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;GrantMissingPermissions&lt;/STRONG&gt; (optional switch): checks and assigns missing required roles before remediation starts&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;You can track remediation progress in Azure Policy → Remediation → Remediation tasks. It can take a few minutes to complete, depending on scope and resource count.&lt;/P&gt;
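&lt;P&gt;The same information can likely be read from PowerShell. The sketch below assumes the &lt;CODE&gt;Az.PolicyInsights&lt;/CODE&gt; module and a remediation started at subscription scope; the subscription ID is a placeholder.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Assumption: remediation was started at subscription scope; replace the placeholder.
$scope = "/subscriptions/&amp;lt;subscription-id&amp;gt;"

# List remediation tasks at that scope with their state and deployment counts.
Get-AzPolicyRemediation -Scope $scope |
  Select-Object Name, ProvisioningState, DeploymentSummary

# For a management group scope, use -ManagementGroupName instead of -Scope:
# Get-AzPolicyRemediation -ManagementGroupName "&amp;lt;management-group-id&amp;gt;"&lt;/LI-CODE&gt;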
&lt;H4&gt;&lt;STRONG&gt;Recurring Billing Consent (PAYG)&lt;/STRONG&gt;&lt;/H4&gt;
&lt;P&gt;When &lt;CODE&gt;TargetLicenseType&lt;/CODE&gt; is set to &lt;CODE&gt;PAYG&lt;/CODE&gt;, the policy automatically includes &lt;CODE&gt;ConsentToRecurringPAYG&lt;/CODE&gt; in the extension settings with &lt;CODE&gt;Consented: true&lt;/CODE&gt; and a UTC timestamp. For details on this requirement, see &lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-pay-as-you-go-transition?view=sql-server-ver17" target="_blank" rel="noopener"&gt;Move SQL Server license agreement to pay-as-you-go subscription - SQL Server enabled by Azure Arc | Microsoft Learn&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;The policy also checks for &lt;CODE&gt;ConsentToRecurringPAYG&lt;/CODE&gt; in its compliance evaluation - resources with &lt;CODE&gt;LicenseType: PAYG&lt;/CODE&gt; but missing the consent property are flagged as non-compliant and remediated. This applies both when transitioning to PAYG and for existing PAYG extensions that predate the consent requirement (backward compatibility).&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Note:&lt;/STRONG&gt; Once &lt;CODE&gt;ConsentToRecurringPAYG&lt;/CODE&gt; is set on an extension, it cannot be removed - this is enforced by the Azure resource provider. When transitioning away from PAYG, the policy changes &lt;CODE&gt;LicenseType&lt;/CODE&gt; but leaves the consent property in place.&lt;/P&gt;
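&lt;P&gt;To see which PAYG extensions are still missing the consent property before remediation runs, a Resource Graph query along these lines may help. This is a sketch: it assumes the &lt;CODE&gt;Az.ResourceGraph&lt;/CODE&gt; module, and the property path &lt;CODE&gt;properties.settings.ConsentToRecurringPAYG&lt;/CODE&gt; is inferred from the description above, so verify it against one of your own extensions.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Assumption: Az.ResourceGraph is installed. The ConsentToRecurringPAYG path
# under properties.settings is inferred from the text, not confirmed here.
$query = @'
resources
| where type == "microsoft.hybridcompute/machines/extensions"
| where properties.type in ("WindowsAgent.SqlServer","LinuxAgent.SqlServer")
| where tostring(properties.settings.LicenseType) == "PAYG"
| where isnull(properties.settings.ConsentToRecurringPAYG)
| project id, resourceGroup, subscriptionId
'@

# Returns PAYG extensions that have no consent property set.
Search-AzGraph -Query $query&lt;/LI-CODE&gt;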
&lt;H4&gt;&lt;STRONG&gt;RBAC&lt;/STRONG&gt;&lt;/H4&gt;
&lt;P&gt;When &lt;CODE&gt;.\scripts\deployment.ps1&lt;/CODE&gt; creates the policy assignment, it uses &lt;CODE&gt;-IdentityType SystemAssigned&lt;/CODE&gt;. Azure then creates a managed identity for that assignment.&lt;/P&gt;
&lt;P&gt;The assignment identity needs these roles at assignment scope (or inherited scope):&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Azure Extension for SQL Server Deployment&lt;/STRONG&gt;: allows updating Arc SQL extension settings, including LicenseType&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Reader&lt;/STRONG&gt;: allows reading resource and extension state for policy evaluation&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Resource Policy Contributor&lt;/STRONG&gt;: allows policy-driven template deployments required by DeployIfNotExists&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;This identity is used whenever DeployIfNotExists applies changes, both during regular compliance evaluation and during remediation runs.&lt;/P&gt;
&lt;P&gt;By default, the deployment script assigns these roles automatically with built-in retry logic to handle managed identity replication delays, which helps prevent common &lt;CODE&gt;PolicyAuthorizationFailed&lt;/CODE&gt; errors.&lt;/P&gt;
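&lt;P&gt;If remediation fails with authorization errors, it can be useful to confirm that the three roles are actually present. The sketch below assumes you have already looked up the object (principal) ID of the assignment's managed identity; both placeholders are hypothetical and must be replaced.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Assumption: both values are placeholders you fill in yourself.
$principalId = "&amp;lt;assignment-identity-object-id&amp;gt;"
$scope       = "/subscriptions/&amp;lt;subscription-id&amp;gt;"

# Roles the assignment identity needs, as listed above.
$required = @(
  "Azure Extension for SQL Server Deployment",
  "Reader",
  "Resource Policy Contributor"
)

# Role names currently assigned to the identity at the given scope.
$assigned = (Get-AzRoleAssignment -ObjectId $principalId -Scope $scope).RoleDefinitionName

# Report, per required role, whether it was found.
$required | ForEach-Object {
  [pscustomobject]@{ Role = $_; Assigned = ($assigned -contains $_) }
}&lt;/LI-CODE&gt;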
&lt;H4&gt;&lt;STRONG&gt;Brownfield and Greenfield Scenarios&lt;/STRONG&gt;&lt;/H4&gt;
&lt;P&gt;This policy is useful in both brownfield and greenfield Azure Arc environments.&lt;/P&gt;
&lt;H5&gt;&lt;STRONG&gt;Brownfield: existing Arc SQL inventory&lt;/STRONG&gt;&lt;/H5&gt;
&lt;P&gt;In a brownfield environment, you already have Arc-enabled SQL Server resources in inventory and the current LicenseType values might be mixed, incorrect, or missing. This is where Azure Policy is especially useful, because it gives you a controlled way to remediate the current estate at scale.&lt;/P&gt;
&lt;P&gt;Depending on how you configure targetLicenseType and licenseTypesToOverwrite, you can use the policy to:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;standardize all in-scope resources on a single value&lt;/LI&gt;
&lt;LI&gt;set LicenseType only when it is missing&lt;/LI&gt;
&lt;LI&gt;migrate a specific subset, such as Paid to PAYG&lt;/LI&gt;
&lt;LI&gt;preserve selected states while correcting only the resources that need attention&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Examples:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Standardize everything to Paid&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;targetLicenseType: Paid&lt;/LI&gt;
&lt;LI&gt;licenseTypesToOverwrite: ['Unspecified','Paid','PAYG','LicenseOnly']&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Result: every in-scope Arc SQL extension is converged to LicenseType == Paid.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Backfill only missing values&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;targetLicenseType: Paid&lt;/LI&gt;
&lt;LI&gt;licenseTypesToOverwrite: ['Unspecified']&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Result: only resources without a configured LicenseType are updated; existing Paid, PAYG, and LicenseOnly values remain unchanged.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Migrate only Paid to PAYG&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;targetLicenseType: PAYG&lt;/LI&gt;
&lt;LI&gt;licenseTypesToOverwrite: ['Paid']&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Result: only resources currently set to Paid are updated to PAYG; resources with a missing LicenseType, PAYG, or LicenseOnly remain unchanged. When transitioning to PAYG, the policy also automatically sets ConsentToRecurringPAYG with Consented: true and a UTC timestamp, as required for recurring pay-as-you-go billing.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Protect existing PAYG, fix only missing or LicenseOnly&lt;/STRONG&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;targetLicenseType: Paid&lt;/LI&gt;
&lt;LI&gt;licenseTypesToOverwrite: ['Unspecified','LicenseOnly']&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Result: resources with no LicenseType or with LicenseOnly are updated to Paid, while existing PAYG stays untouched.&lt;/P&gt;
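&lt;P&gt;Expressed with the deployment script's parameters, the four examples above would look roughly like this. The calls use only the &lt;CODE&gt;-TargetLicenseType&lt;/CODE&gt; and &lt;CODE&gt;-LicenseTypesToOverwrite&lt;/CODE&gt; parameters described earlier and rely on the defaults for scope and platform. Run only the one that matches your scenario.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Standardize everything to Paid
.\scripts\deployment.ps1 -TargetLicenseType "Paid" `
  -LicenseTypesToOverwrite @("Unspecified","Paid","PAYG","LicenseOnly")

# Backfill only missing values
.\scripts\deployment.ps1 -TargetLicenseType "Paid" `
  -LicenseTypesToOverwrite @("Unspecified")

# Migrate only Paid to PAYG
.\scripts\deployment.ps1 -TargetLicenseType "PAYG" `
  -LicenseTypesToOverwrite @("Paid")

# Protect existing PAYG, fix only missing or LicenseOnly
.\scripts\deployment.ps1 -TargetLicenseType "Paid" `
  -LicenseTypesToOverwrite @("Unspecified","LicenseOnly")&lt;/LI-CODE&gt;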
&lt;H5&gt;&lt;STRONG&gt;Greenfield: newly onboarded SQL Servers&lt;/STRONG&gt;&lt;/H5&gt;
&lt;P&gt;In a greenfield scenario, the main value of Azure Policy is ongoing enforcement. Once new SQL Servers are onboarded to Azure Arc and fall within the assignment scope, the policy can act as a governance control to keep LicenseType aligned with your business model.&lt;/P&gt;
&lt;P&gt;This means Azure Policy is not only a remediation mechanism for existing inventory, but also a way to continuously enforce the intended license configuration for future Arc-enabled SQL Server resources.&lt;/P&gt;
&lt;H6&gt;&lt;EM&gt;Azure Policy vs tagging&lt;/EM&gt;&lt;/H6&gt;
&lt;P&gt;By default, Microsoft manages automatic deployment of the Azure extension for SQL Server. It includes an option to enforce the LicenseType setting via tags. See &lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-autodeploy?view=sql-server-ver17&amp;amp;tabs=azure-portal" target="_blank" rel="noopener"&gt;Manage Automatic Connection - SQL Server enabled by Azure Arc | Microsoft Learn&lt;/A&gt; for details. This way, all newly onboarded SQL Server instances are set to the desired LicenseType from day one. Azure Policy is still important to ensure that later changes to the extension properties, or ad hoc additions of SQL Server instances, stay compliant with your business model.&lt;/P&gt;
&lt;P&gt;A practical way to think about it:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Tagging ensures the initial compliance of newly connected Arc-enabled SQL Servers&lt;/LI&gt;
&lt;LI&gt;Azure Policy enforces ongoing compliance of existing Arc-enabled SQL Servers&lt;/LI&gt;
&lt;/UL&gt;
&lt;H2&gt;Tools&lt;/H2&gt;
&lt;P&gt;Want better visibility into &lt;STRONG&gt;LicenseType configurations&lt;/STRONG&gt; across your estate? The KQL query and the workbook below help you track compliance.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;KQL Query&lt;/STRONG&gt;&lt;/P&gt;
&lt;LI-CODE lang="sql"&gt;resources
| where type == "microsoft.hybridcompute/machines"
| where properties.detectedProperties.mssqldiscovered == "true"
| extend machineIdHasSQLServerDiscovered = id
| project name, machineIdHasSQLServerDiscovered, resourceGroup, subscriptionId
| join kind= leftouter (
    resources
    | where type == "microsoft.hybridcompute/machines/extensions"
    | where properties.type in ("WindowsAgent.SqlServer","LinuxAgent.SqlServer")
    | extend machineIdHasSQLServerExtensionInstalled = iff(id contains "/extensions/WindowsAgent.SqlServer" or id contains "/extensions/LinuxAgent.SqlServer", substring(id, 0, indexof(id, "/extensions/")), "")
    | project License_Type = properties.settings.LicenseType,
    machineIdHasSQLServerExtensionInstalled) on $left.machineIdHasSQLServerDiscovered == $right.machineIdHasSQLServerExtensionInstalled
| where isnotempty(machineIdHasSQLServerExtensionInstalled)
| project-away machineIdHasSQLServerDiscovered, machineIdHasSQLServerExtensionInstalled&lt;/LI-CODE&gt;
&lt;P&gt;Source: &lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-configuration?view=sql-server-ver17&amp;amp;tabs=azure%2Cazure-portal#list-configuration-details-for-each-sql-server-instance" target="_blank" rel="noopener"&gt;Configure SQL Server - SQL Server enabled by Azure Arc | Microsoft Learn&lt;/A&gt;.&lt;/P&gt;
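&lt;P&gt;For a quick count per license type rather than a per-machine list, a simplified variant can be run from PowerShell. This sketch assumes the &lt;CODE&gt;Az.ResourceGraph&lt;/CODE&gt; module is installed; it queries the extension resources directly and is not taken from the linked documentation.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Assumption: Az.ResourceGraph is installed and Connect-AzAccount has been run.
$query = @'
resources
| where type == "microsoft.hybridcompute/machines/extensions"
| where properties.type in ("WindowsAgent.SqlServer","LinuxAgent.SqlServer")
| extend License_Type = tostring(properties.settings.LicenseType)
| summarize Count = count() by License_Type
'@

# One row per LicenseType value; an empty License_Type means no value is set.
Search-AzGraph -Query $query&lt;/LI-CODE&gt;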
&lt;P&gt;&lt;STRONG&gt;Azure Workbook&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://github.com/claestom/azure-arc-sa-workbook" target="_blank" rel="noopener"&gt;claestom/azure-arc-sa-workbook: Azure Workbook for monitoring Software Assurance compliance across Arc-enabled servers and SQL Server instances&lt;/A&gt;.&lt;/P&gt;
&lt;H2&gt;Resources&lt;/H2&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-configuration?view=sql-server-ver17&amp;amp;tabs=azure%2Cazure-portal#modify-sql-server-configuration" target="_blank" rel="noopener"&gt;Configure SQL Server - SQL Server enabled by Azure Arc | Microsoft Learn&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/governance/policy/" target="_blank" rel="noopener"&gt;Azure Policy documentation | Microsoft Learn&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://techcommunity.microsoft.com/blog/coreinfrastructureandsecurityblog/automating-windows-server-licensing-benefits-with-azure-arc-policy/4469345" target="_blank" rel="noopener"&gt;Automating Windows Server Licensing Benefits with Azure Arc Policy | Microsoft Community Hub&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://learn.microsoft.com/en-us/sql/sql-server/azure-arc/manage-pay-as-you-go-transition?view=sql-server-ver17#recurring-billing-consent" target="_blank" rel="noopener"&gt;Recurring billing consent - SQL Server enabled by Azure Arc | Microsoft Learn&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://github.com/claestom/azure-arc-sa-workbook" target="_blank" rel="noopener"&gt;claestom/azure-arc-sa-workbook: Azure Workbook for monitoring Software Assurance compliance across Arc-enabled servers and SQL Server instances&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://github.com/microsoft/sql-server-samples/tree/master/samples/manage/azure-arc-enabled-sql-server/compliance/arc-sql-license-type-compliance" target="_blank" rel="noopener"&gt;microsoft/sql-server-samples/.../arc-sql-license-type-compliance&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://github.com/claestom/sql-arc-policy-license-config" target="_blank" rel="noopener"&gt;claestom/sql-arc-policy-license-config&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Mon, 13 Apr 2026 07:06:25 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/automating-arc-enabled-sql-server-license-type-configuration/ba-p/4500326</guid>
      <dc:creator>TomClaes</dc:creator>
      <dc:date>2026-04-13T07:06:25Z</dc:date>
    </item>
    <item>
      <title>Simplify Azure Arc Server Onboarding with Ansible and the New Onboarding Role</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/simplify-azure-arc-server-onboarding-with-ansible-and-the-new/ba-p/4509481</link>
      <description>&lt;P&gt;If you’re already using Ansible to manage your infrastructure, there’s now a simpler—and more secure—way to bring machines under Azure Arc management. We’ve introduced a &lt;STRONG&gt;new Azure Arc onboarding role&lt;/STRONG&gt; designed specifically for automated scenarios like Ansible playbooks. This role follows the principle of least privilege, giving your automation exactly what it needs to onboard servers—nothing more.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;A better way to onboard at scale&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Many customers want to standardize Azure Arc onboarding across hybrid and multicloud environments, but run into common challenges:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Over‑privileged service principals&lt;/LI&gt;
&lt;LI&gt;Manual steps that don’t scale&lt;/LI&gt;
&lt;LI&gt;Inconsistent onboarding across environments&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;By combining &lt;STRONG&gt;Ansible&lt;/STRONG&gt; with the &lt;STRONG&gt;Azure Arc onboarding role&lt;/STRONG&gt;, you can:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Automate server onboarding end‑to‑end&lt;/STRONG&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Reduce permissions risk&lt;/STRONG&gt; with a purpose‑built role&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Scale confidently&lt;/STRONG&gt; across thousands of machines&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Integrate Arc onboarding into existing Ansible workflows&lt;/STRONG&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;STRONG&gt;Built for automation, designed for security&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;The new onboarding role removes the need to assign broader Azure roles just to connect servers to Azure Arc. Instead, your Ansible automation can authenticate using a tightly scoped identity that’s purpose‑built for Arc onboarding—making security teams happier without slowing down operations.&lt;/P&gt;
&lt;P&gt;Whether you’re modernizing existing datacenters or managing servers across multiple clouds, this new approach makes Azure Arc onboarding simpler, safer, and repeatable.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Get started in minutes&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Our Microsoft Learn documentation provides guidance to help you get started quickly:&lt;BR /&gt;&lt;A href="https://docs.azure.cn/en-us/azure-arc/servers/onboard-ansible-playbooks" target="_blank"&gt;Connect machines to Azure Arc at scale with Ansible&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Check out the Arc onboarding role, part of the Azure collection in Ansible Galaxy:&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-external-url" href="https://galaxy.ansible.com/ui/repo/published/azure/azcollection/content/role/azure_arc/" target="_blank"&gt;Ansible Galaxy - azure.azcollection - Arc onboarding role&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Anything else you’d like to see with Azure Arc + Linux? Drop us a comment!&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 08 Apr 2026 06:40:42 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/simplify-azure-arc-server-onboarding-with-ansible-and-the-new/ba-p/4509481</guid>
      <dc:creator>alinetran</dc:creator>
      <dc:date>2026-04-08T06:40:42Z</dc:date>
    </item>
    <item>
      <title>Run the latest Azure Arc agent with Automatic Agent Upgrade (Public Preview)</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/run-the-latest-azure-arc-agent-with-automatic-agent-upgrade/ba-p/4508535</link>
      <description>&lt;P&gt;Customers managing large fleets of Azure Arc servers need a scalable way to ensure the Azure Arc agent stays up to date without manual intervention. Per-server configuration does not scale, and gaps in upgrade coverage can lead to operational drift, missed features, and delayed security updates.&lt;/P&gt;
&lt;P&gt;To address this, we’re introducing two new options to help customers enable Automatic Agent Upgrade at scale: a built-in Azure Policy and a new onboarding CLI flag.&lt;/P&gt;
&lt;P&gt;The built-in policy makes it easy to check whether Automatic Agent Upgrade is enabled across a given scope and automatically remediates servers that are not compliant.&lt;/P&gt;
&lt;P&gt;For servers being newly onboarded, customers can enable the feature at onboarding by adding the &lt;STRONG&gt;--enable-automatic-upgrade&lt;/STRONG&gt; flag to the &lt;STRONG&gt;azcmagent connect&lt;/STRONG&gt; command, ensuring the agent is configured correctly from the start.&lt;/P&gt;
&lt;H1&gt;What is Automatic Agent Upgrade?&lt;/H1&gt;
&lt;P&gt;Automatic Agent Upgrade is a feature, in public preview, that automatically keeps the Azure Connected Machine agent (Arc agent) up to date. Updates are managed by Microsoft, so once enabled, customers no longer need to manually manage agent upgrades.&lt;/P&gt;
&lt;P&gt;By always running the latest agent version, customers receive all the newest capabilities, security updates, and bug fixes as soon as they’re released. Learn more: &lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/agent-release-notes" target="_blank"&gt;What's new with Azure Connected Machine agent - Azure Arc | Microsoft Learn&lt;/A&gt;.&lt;/P&gt;
&lt;H1&gt;Getting Started&lt;/H1&gt;
&lt;H2&gt;Apply automatic agent upgrade policy&lt;/H2&gt;
&lt;OL&gt;
&lt;LI&gt;Navigate to the ‘Policy’ blade in the &lt;A href="https://portal.azure.com/" target="_blank"&gt;Azure Portal&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Navigate to the ‘Compliance’ section and click ‘Assign Policy’&lt;/LI&gt;
&lt;LI&gt;Fill out the required sections
&lt;UL&gt;
&lt;LI&gt;Scope: Subscription and resource group (optional) that policy will apply to&lt;/LI&gt;
&lt;LI&gt;Policy definition: &lt;STRONG&gt;Configure Azure Arc-enabled Servers to enable automatic upgrades&lt;/STRONG&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;LI&gt;Navigate to the ‘Remediation’ tab and check the box next to ‘Create a remediation task’&lt;/LI&gt;
&lt;LI&gt;Navigate to the ‘Review + create’ tab and press ‘Create’. The Policy has been successfully applied to the scope.&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;For more information on this process, please visit this article &lt;A href="https://learn.microsoft.com/en-us/azure/governance/policy/assign-policy-portal#create-a-policy-assignment" target="_blank"&gt;Quickstart: Create policy assignment using Azure portal - Azure Policy | Microsoft Learn&lt;/A&gt;.&lt;/P&gt;
&lt;H2&gt;Apply automatic agent upgrade CLI Flag&lt;/H2&gt;
&lt;P&gt;Adding the following flag to the &lt;STRONG&gt;azcmagent connect&lt;/STRONG&gt; command enables automatic agent upgrade during onboarding:&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;--enable-automatic-upgrade&lt;/LI-CODE&gt;
&lt;P&gt;While this flag can be used on a single server, it can also be applied at scale by adding it to one of the existing Azure Arc at-scale onboarding methods. See &lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/onboard-service-principal?tabs=portal" target="_blank"&gt;Connect hybrid machines to Azure at scale - Azure Arc | Microsoft Learn&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;Here is an at-scale onboarding sample using a basic script.&lt;/P&gt;
&lt;LI-CODE lang="bash"&gt;azcmagent connect --resource-group {rg} --location {location} --subscription-id {subid} --service-principal-id {service principal id} --service-principal-secret {service principal secret} --tenant-id {tenant id} --enable-automatic-upgrade&lt;/LI-CODE&gt;
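&lt;P&gt;On Windows servers, the same command could be wrapped in PowerShell so that the values come from variables instead of being typed inline. This is only a sketch: it assumes &lt;CODE&gt;azcmagent&lt;/CODE&gt; is installed and on the PATH, all values in angle brackets are placeholders, and the environment variable name &lt;CODE&gt;ARC_SP_SECRET&lt;/CODE&gt; is hypothetical.&lt;/P&gt;
&lt;LI-CODE lang="powershell"&gt;# Assumption: azcmagent is installed and on PATH; replace every placeholder.
# ARC_SP_SECRET is a hypothetical environment variable holding the service principal secret.
$connectArgs = @(
  "connect",
  "--resource-group", "&amp;lt;rg&amp;gt;",
  "--location", "&amp;lt;location&amp;gt;",
  "--subscription-id", "&amp;lt;subid&amp;gt;",
  "--tenant-id", "&amp;lt;tenant-id&amp;gt;",
  "--service-principal-id", "&amp;lt;service-principal-id&amp;gt;",
  "--service-principal-secret", $env:ARC_SP_SECRET,
  "--enable-automatic-upgrade"
)

# Pass the array elements as individual arguments to the native command.
azcmagent @connectArgs

# Show the agent's status after onboarding.
azcmagent show&lt;/LI-CODE&gt;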
&lt;P&gt;To get started with this feature or learn more, please refer to this article &lt;A href="https://learn.microsoft.com/en-us/azure/azure-arc/servers/manage-agent?tabs=windows#automatic-agent-upgrade-preview" target="_blank"&gt;Manage and maintain the Azure Connected Machine agent - Azure Arc | Microsoft Learn&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Fri, 03 Apr 2026 19:30:06 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/run-the-latest-azure-arc-agent-with-automatic-agent-upgrade/ba-p/4508535</guid>
      <dc:creator>yunishussein</dc:creator>
      <dc:date>2026-04-03T19:30:06Z</dc:date>
    </item>
    <item>
      <title>Announcing Private Preview: Deploy Ansible Playbooks using Azure Policy via Machine Configuration</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/announcing-private-preview-deploy-ansible-playbooks-using-azure/ba-p/4507848</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Azure Arc&lt;/STRONG&gt; is on a mission to &lt;STRONG&gt;unify security, compliance, and management for Windows and Linux machines, anywhere&lt;/STRONG&gt;. By extending Azure’s control plane beyond the cloud, Azure Arc enables organizations to unify governance, compliance, security and management of servers across on‑premises, edge, and multicloud environments using a consistent set of Azure tools and policies.&lt;/P&gt;
&lt;P&gt;Building on this mission, we’re excited to announce the &lt;STRONG&gt;private preview of deploying Ansible playbooks through Azure Policy using Machine Configuration&lt;/STRONG&gt;, bringing Ansible‑driven automation into Azure Arc’s policy‑based governance model for Azure and Arc‑enabled Linux machines.&lt;/P&gt;
&lt;P&gt;This new capability enables you to &lt;STRONG&gt;orchestrate Ansible playbook execution directly from Azure Policy &lt;/STRONG&gt;(via Machine Configuration) without requiring an Ansible control node, while benefiting from built‑in compliance reporting and remediation.&lt;/P&gt;
&lt;H2&gt;Why this matters&lt;/H2&gt;
&lt;P&gt;As organizations manage increasingly diverse server estates, they often rely on different tools for Windows and Linux, cloud, on-premises, or at the edge—creating fragmented security, compliance, and operational workflows. Many organizations rely on Ansible for OS configuration and application setup, but struggle with:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Enforcing consistent configuration across distributed environments&lt;/LI&gt;
&lt;LI&gt;Detecting and correcting drift over time&lt;/LI&gt;
&lt;LI&gt;Integrating Ansible automation with centralized governance and compliance workflows&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;With this private preview, Azure Policy becomes the &lt;STRONG&gt;single control plane&lt;/STRONG&gt; for applying and monitoring &lt;STRONG&gt;Ansible‑based configuration&lt;/STRONG&gt;, bringing Linux automation into the same governance model already used for Windows. Configuration is treated as policy—&lt;STRONG&gt;declarative, auditable, and continuously enforced&lt;/STRONG&gt;—with compliance results surfaced in familiar Azure dashboards.&lt;/P&gt;
&lt;H2&gt;What’s included in the private preview&lt;/H2&gt;
&lt;P&gt;In this preview, you can:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Use &lt;STRONG&gt;Azure Policy&lt;/STRONG&gt; to trigger &lt;STRONG&gt;Ansible playbook execution&lt;/STRONG&gt; on &lt;STRONG&gt;Azure and Azure Arc–enabled Linux machines&lt;/STRONG&gt;&lt;/LI&gt;
&lt;LI&gt;Execute &lt;STRONG&gt;playbooks locally&lt;/STRONG&gt; on each target machine, &lt;STRONG&gt;triggered by policy&lt;/STRONG&gt;&lt;/LI&gt;
&lt;LI&gt;Enable &lt;STRONG&gt;drift detection and automatic remediation&lt;/STRONG&gt; by default&lt;/LI&gt;
&lt;LI&gt;View &lt;STRONG&gt;playbook execution status and compliance results&lt;/STRONG&gt; directly in the Azure Policy compliance dashboard, alongside your other policies&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;This provides a &lt;STRONG&gt;unified security, compliance and management experience&lt;/STRONG&gt; across Windows and Linux machines—whether they’re running in Azure or connected through Azure Arc—while using your existing Ansible investments.&lt;/P&gt;
&lt;H2&gt;Join the private preview&lt;/H2&gt;
&lt;P&gt;If you’re interested in helping shape the future of Ansible‑based configuration management in Azure Arc, we’d love to partner with you. We’re especially interested in hearing your stories around usability, compliance reporting, and real‑world operational workflows.&lt;/P&gt;
&lt;P&gt;👉 &lt;A class="lia-external-url" href="https://forms.office.com/r/QpDW96P5wC" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Sign up for the private preview&lt;/STRONG&gt;&lt;/A&gt; and we'll reach out to you.&lt;/P&gt;
&lt;P&gt;We’ll continue investing in deeper Linux parity, broader scenarios, and tighter integration across Azure Arc’s security, governance and compliance experiences.&lt;/P&gt;
&lt;P&gt;We look forward to enhancing your unified Azure Arc experience for deploying, governing, and remediating configuration with Ansible—bringing consistent security, compliance, and management to Windows and Linux machines not only in Azure, but also across on‑premises and other public clouds.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Apr 2026 16:25:52 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/announcing-private-preview-deploy-ansible-playbooks-using-azure/ba-p/4507848</guid>
      <dc:creator>alinetran</dc:creator>
      <dc:date>2026-04-09T16:25:52Z</dc:date>
    </item>
    <item>
      <title>Announcing Public Preview of Argo CD extension on AKS and Azure Arc enabled Kubernetes clusters</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/announcing-public-preview-of-argo-cd-extension-on-aks-and-azure/ba-p/4504497</link>
      <description>&lt;P data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIAhAA" data-complete="true" data-processed="true"&gt;We are excited to announce public preview of the Argo CD extension for Azure Kubernetes Service (AKS) and Azure Arc-enabled Kubernetes clusters.&lt;/P&gt;
&lt;P data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIAxAA" data-complete="true" data-processed="true"&gt;As GitOps becomes the standard for deploying and operating applications at scale, enterprises need a way to implement GitOps while staying compliant with best practices for security and identity management. The Argo CD extension delivers on this need across three pillars:&lt;/P&gt;
&lt;P aria-level="3" data-animation-nesting="" data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-complete="true" data-processed="true" data-sae=""&gt;&lt;STRONG data-sfc-root="c" data-sfc-cb="" data-complete="true"&gt;Trusted Identity and Secure Access&lt;/STRONG&gt;&lt;/P&gt;
&lt;P data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIBRAA" data-complete="true" data-processed="true"&gt;The Argo CD extension integrates with Microsoft Entra ID to provide a secure, enterprise-ready experience for:&lt;/P&gt;
&lt;UL data-sfc-root="c" data-sfc-cb="" data-complete="true" data-processed="true"&gt;
&lt;LI data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIBhAB" data-complete="true" data-sae=""&gt;&lt;SPAN data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-complete="true"&gt;Secure authentication using &lt;STRONG&gt;Workload Identity federation &lt;/STRONG&gt;to Azure Container Registry (ACR) and Azure DevOps. This removes the need for long-lived credentials or hard-coded secrets in Git Repos, moving your CD pipelines closer to a true zero-trust architecture.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;SPAN data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-complete="true"&gt;Single Sign-On (SSO) using &lt;STRONG&gt;existing Azure identities.&lt;/STRONG&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P aria-level="3" data-animation-nesting="" data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-complete="true" data-processed="true" data-sae=""&gt;&lt;STRONG data-sfc-root="c" data-sfc-cb="" data-complete="true"&gt;Enterprise-Grade Hardening and Security&lt;/STRONG&gt;&lt;/P&gt;
&lt;P data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIDBAA" data-complete="true" data-processed="true"&gt;This preview introduces several enhancements to improve your security posture:&lt;/P&gt;
&lt;UL data-sfc-root="c" data-sfc-cb="" data-complete="true" data-processed="true"&gt;
&lt;LI data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIDRAA" data-complete="true" data-sae=""&gt;&lt;SPAN data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-complete="true"&gt;To minimize the attack surface, the extension’s images are built on&amp;nbsp;Azure Linux, specifically engineered for&lt;STRONG&gt; reduced CVEs &lt;/STRONG&gt;and &lt;STRONG&gt;improved baseline security.&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIDRAB" data-complete="true" data-sae=""&gt;&lt;SPAN data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-complete="true"&gt;Opt in to &lt;STRONG&gt;automatic patch releases&lt;/STRONG&gt; to stay current on security fixes while maintaining full control over your change management processes.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P aria-level="3" data-animation-nesting="" data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-complete="true" data-processed="true" data-sae=""&gt;&lt;STRONG data-sfc-root="c" data-sfc-cb="" data-complete="true"&gt;Parity with upstream Argo CD&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;The Argo CD extension is designed to remain fully aligned with the upstream Argo CD open‑source project, so teams can keep using Argo CD as they do today, with support for:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;High availability (HA)&lt;/STRONG&gt; configuration for production‑grade deployments of critical workloads.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Hub‑and‑spoke&lt;/STRONG&gt; architecture for multi‑cluster GitOps scenarios.&lt;/LI&gt;
&lt;LI&gt;Application and ApplicationSet resources, enabling automated and scalable application delivery across large fleets of clusters.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P aria-level="3" data-animation-nesting="" data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-complete="true" data-processed="true" data-sae=""&gt;&lt;STRONG data-sfc-root="c" data-sfc-cb="" data-complete="true"&gt;Getting Started&lt;/STRONG&gt;&lt;/P&gt;
&lt;P data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIDxAA" data-complete="true" data-processed="true"&gt;We invite you to explore the Argo CD extension and provide feedback as we continue to evolve GitOps capabilities for Kubernetes.&lt;/P&gt;
&lt;P data-sfc-cp="" data-sfc-root="c" data-sfc-cb="" data-hveid="CAEIEBAA" data-complete="true" data-processed="true" aria-owns="action-menu-parent-container"&gt;To get started today, you can &lt;A class="lia-external-url" href="https://learn.microsoft.com/en-us/azure/azure-arc/kubernetes/tutorial-use-gitops-argocd" target="_blank" rel="noopener"&gt;enable the extension&lt;/A&gt; on your clusters using the Azure CLI. Management of the Argo CD extension through the Azure portal will be available in a few weeks.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Mar 2026 08:00:00 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/announcing-public-preview-of-argo-cd-extension-on-aks-and-azure/ba-p/4504497</guid>
      <dc:creator>Poornima99</dc:creator>
      <dc:date>2026-03-24T08:00:00Z</dc:date>
    </item>
    <item>
      <title>Building Microsoft’s Sovereign AI on Azure Local with NVIDIA RTX PRO and Next Gen NVIDIA Rubin</title>
      <link>https://techcommunity.microsoft.com/t5/azure-arc-blog/building-microsoft-s-sovereign-ai-on-azure-local-with-nvidia-rtx/ba-p/4502383</link>
      <description>&lt;P&gt;Across governments, regulated industries, and critical infrastructure, organizations are moving quickly to adopt advanced AI but with a clear requirement: AI systems should be built and operated within Sovereign boundaries. The question is no longer whether to use advanced models, but where those models should run, who controls them, and how they can evolve over time without compromising sovereignty, resilience, or trust.&lt;/P&gt;
&lt;P&gt;Many of these customers are building &lt;STRONG&gt;Sovereign Private Clouds&lt;/STRONG&gt;: environments designed to run their most sensitive and mission-critical workloads entirely within their own operational boundaries. These environments often span on-premises datacenters, edge locations, and, in some cases, fully disconnected deployments. Customers want the power of modern AI, but available on infrastructure they control.&lt;/P&gt;
&lt;P&gt;This is where Azure Local comes in. &lt;STRONG&gt;Azure Local&lt;/STRONG&gt; helps bring Azure-consistent infrastructure, lifecycle management, and governance into customer-controlled environments. As AI workloads evolve rapidly, customers are planning for the next generation of AI acceleration and need confidence that their local infrastructure can be ready – without re-architecting their environment.&lt;/P&gt;
&lt;H3&gt;&lt;STRONG&gt;NVIDIA Rubin: Powering the Next Generation of Sovereign AI on Azure Local&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;Sovereign AI requirements span a wide range of deployment models – from enterprise inference and development today to large-scale, sustained AI systems in the near future. Azure Local is designed to help support this evolution by providing a consistent deployment, management, and governance model across current NVIDIA Blackwell-based systems and future NVIDIA Rubin platforms.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;The NVIDIA Rubin platform &lt;/STRONG&gt;represents the next wave of accelerated computing, targeting large-scale workloads that go beyond basic inference. These include frontier model inference, agentic and reasoning‑based systems, and high‑throughput AI pipelines where performance, efficiency, and interconnect matter at scale.&lt;/P&gt;
&lt;P&gt;These are the workloads Sovereign Private Cloud customers are planning for:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;National and sector-specific AI models trained and operated within sovereign boundaries&lt;/LI&gt;
&lt;LI&gt;Advanced reasoning and agentic systems used to help customers with defense, intelligence, and critical decision-making&lt;/LI&gt;
&lt;LI&gt;High-performance inference pipelines that operate close to sensitive data, with low latency and strict compliance requirements&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;STRONG&gt;Foundry Local &lt;/STRONG&gt;is built to help host these workloads directly inside customer environments, providing a modern AI platform for deploying, operating, and scaling advanced models under full customer control while remaining aligned with Azure APIs, tooling, and governance on Azure Local.&lt;/P&gt;
&lt;P&gt;That’s why &lt;STRONG&gt;Azure Local will be adding support for the NVIDIA Rubin platform&lt;/STRONG&gt;, working closely with NVIDIA to enable customers to adopt this next generation of accelerated computing as part of their Sovereign Private Cloud strategy.&lt;/P&gt;
&lt;H3&gt;&lt;STRONG&gt;Enterprise AI Acceleration Available Today on Azure Local&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;While Rubin represents the next wave of large-scale AI acceleration, Azure Local also supports enterprise-grade GPUs available today to help customers begin building sovereign AI systems now.&lt;/P&gt;
&lt;P&gt;Built on the NVIDIA Blackwell architecture, &lt;STRONG&gt;NVIDIA RTX PRO™ 6000 Blackwell Server Edition&lt;/STRONG&gt; delivers powerful AI and visual computing performance for on-premises and sovereign environments. Azure Local harnesses the power of the RTX PRO 6000 for scenarios such as AI inferencing, code development, and data-adjacent workloads, whether environments are connected to Azure or fully disconnected.&lt;/P&gt;
&lt;P&gt;Support for the NVIDIA RTX PRO 6000 Blackwell Server Edition is generally available starting with the Azure Local 2603 release. Azure Local will also soon add support for upcoming models in the Blackwell series, including the NVIDIA RTX PRO™ 4500 Blackwell Server Edition. This allows customers to start building and operating sovereign AI workloads today, while preparing for next-generation platforms such as NVIDIA Rubin as they become available.&lt;/P&gt;
&lt;H3&gt;&lt;STRONG&gt;From Infrastructure to Models: Bringing AI Closer to the Data&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;With &lt;STRONG&gt;Foundry Local services on Azure Local&lt;/STRONG&gt;, customers can deploy and operate advanced AI models directly inside their sovereign environments. This includes access to modern models and APIs from the Foundry catalog, alongside NVIDIA-optimized models and frameworks, enabling high-performance AI execution close to sensitive data.&lt;/P&gt;
&lt;P&gt;Using &lt;STRONG&gt;Azure Kubernetes Service (AKS) on Azure Local&lt;/STRONG&gt;, customers can containerize and deploy AI workloads consistently across public and sovereign environments. This supports scenarios where models are built in the cloud and operated locally, or built and run entirely within customer-controlled boundaries, while maintaining a consistent application and operations model.&lt;/P&gt;
&lt;P&gt;Together, this enables customers to continuously evolve AI solutions over time without sacrificing control, compliance, or operational consistency.&lt;/P&gt;
&lt;H3&gt;&lt;STRONG&gt;A Platform That Evolves With Customers&lt;/STRONG&gt;&lt;/H3&gt;
&lt;P&gt;Azure Local is more than infrastructure. Through Azure Arc, it helps deliver consistent management, policy enforcement, and lifecycle operations across sovereign environments, with integrated security and monitoring capabilities.&lt;/P&gt;
&lt;P&gt;This platform approach allows Sovereign Private Cloud customers to adopt new GPU architectures, AI frameworks, and models over time, without re-architecting their environments, while maintaining a strong security posture and compliance with regulatory requirements.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Azure Local and NVIDIA will work closely with OEMs including Dell, HPE, and Lenovo to enable validated hardware platforms and GPU support over time&lt;/STRONG&gt;. This helps ensure customers can adopt new AI infrastructure with confidence as part of their Sovereign Private Cloud strategy.&lt;/P&gt;
&lt;P&gt;As AI workloads continue to advance, Azure Local remains focused on one simple goal: enabling customers to run demanding AI workloads securely, compliantly, and under their control, today and into the future.&lt;/P&gt;
&lt;P&gt;To learn more: &lt;A href="https://aka.ms/azurelocal" target="_blank" rel="noopener"&gt;https://aka.ms/azurelocal&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 16 Mar 2026 20:08:55 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/azure-arc-blog/building-microsoft-s-sovereign-ai-on-azure-local-with-nvidia-rtx/ba-p/4502383</guid>
      <dc:creator>meenagowdar</dc:creator>
      <dc:date>2026-03-16T20:08:55Z</dc:date>
    </item>
  </channel>
</rss>

