AI Infrastructure
56 Topics
Optimizing Language Model Inference on Azure
Poorly optimized inference can lead to skyrocketing costs for customers, making it crucial to establish clear performance benchmarks. This blog sets out the expected performance of the new Azure ND H200 v5-series, helping customers make informed decisions that maximize efficiency and minimize expenses.
Azure NV V710 v5: Empowering Real-Time AI/ML Inferencing and Advanced Visualization in the Cloud
Today, we’re excited to introduce the Azure NV V710 v5, the latest VM tailored for small-to-medium AI/ML inferencing, Virtual Desktop Infrastructure (VDI), visualization, and cloud gaming workloads.
Comprehensive Nvidia GPU Monitoring for Azure N-Series VMs Using Telegraf with Azure Monitor
Unlocking Nvidia GPU Monitoring for Azure N-Series VMs with Telegraf and Azure Monitor. In the world of AI and HPC, optimizing GPU performance is critical for avoiding inefficiencies that can bottleneck workflows and drive up costs. While Azure Monitor tracks key resources like CPU and memory, it falls short in native GPU monitoring for Azure N-series VMs. Enter Telegraf, a powerful tool that integrates seamlessly with Azure Monitor to bridge this gap. In this blog, discover how to harness Telegraf for comprehensive GPU monitoring and ensure your GPUs perform at peak efficiency in the cloud.
Running tightly coupled HPC/AI workloads with InfiniBand using NVIDIA Network Operator on AKS
Dr. Kai Neuffer - Principal Program Manager, Industry and Partner Sales - Energy Industry
Dr. Wolfgang De Salvador - Senior Product Manager - Azure Storage
Paul Edwards - Principal Technical Program Manager - Azure Core HPC & AI
Announcing the General Availability of Azure CycleCloud Workspace for Slurm
We are excited to announce the general availability of Azure CycleCloud Workspace for Slurm (https://aka.ms/ccw4slurm), a groundbreaking solution template available on the Azure Marketplace. This template allows users to create, configure, and deploy pre-defined Slurm clusters with CycleCloud on Azure without requiring any prior knowledge of Azure or Slurm.