AI Infrastructure
Azure announces new AI optimized VM series featuring AMD’s flagship MI300X GPU
In our relentless pursuit of pushing the boundaries of artificial intelligence, we understand that cutting-edge infrastructure and expertise are needed to harness the full potential of advanced AI. At Microsoft, we've amassed a decade of experience in supercomputing and have consistently supported the most demanding AI training and generative inferencing workloads. Today, we're excited to announce the latest milestone in our journey: a virtual machine (VM) with an unprecedented 1.5 TB of high-bandwidth memory (HBM) that leverages the power of AMD’s flagship MI300X GPU. Azure VMs powered by the MI300X GPU give customers even more choice in AI-optimized VMs.

Performance considerations for large-scale deep learning training on Azure NDv4 (A100) series
Modern deep learning training jobs require large clusters of multi-GPU machines with high floating-point performance, connected by high-bandwidth, low-latency networks. The Azure NDv4 VM series is designed specifically for these types of workloads. We focus on HPC+AI clusters built with the ND96asr_v4 virtual machine type and provide specific optimization recommendations to get the best performance.

Introducing Azure NC H100 v5 VMs for mid-range AI and HPC workloads
Today at Ignite, Microsoft is announcing the public preview of the NC H100 v5 Virtual Machine Series, the latest addition to our portfolio of purpose-built infrastructure for High Performance Computing (HPC) and Artificial Intelligence (AI) workloads.

Running GPU accelerated workloads with NVIDIA GPU Operator on AKS
The focus of this article is on getting NVIDIA GPUs managed and configured optimally on Azure Kubernetes Service using the NVIDIA GPU Operator, for HPC/AI workloads that require a high degree of customization and granular control over the compute-resource configuration.

Exploring CPU vs GPU Speed in AI Training: A Demonstration with TensorFlow
In the ever-evolving landscape of artificial intelligence, the speed of model training is a crucial factor that can significantly impact the development and deployment of AI applications. Central Processing Units (CPUs) and Graphics Processing Units (GPUs) are two types of processors commonly used for this purpose. In this blog post, we will delve into a practical demonstration using TensorFlow to showcase the speed differences between CPU and GPU when training a deep learning model.
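The kind of comparison this post describes can be sketched with a few lines of TensorFlow (this is a minimal illustration, not the post's own benchmark): it times repeated large matrix multiplications, the core operation in deep-learning training, on the CPU and, when one is available, on the GPU. The matrix size and iteration count are arbitrary choices for illustration.

```python
import time
import tensorflow as tf

def time_matmul(device, size=1024, iters=5):
    """Time `iters` matrix multiplications of size x size on the given device."""
    with tf.device(device):
        a = tf.random.uniform((size, size))
        b = tf.random.uniform((size, size))
        tf.matmul(a, b)  # warm-up run so one-time kernel setup isn't counted
        start = time.perf_counter()
        for _ in range(iters):
            c = tf.matmul(a, b)
        _ = c.numpy()  # pull the result back to host to force execution to finish
        return time.perf_counter() - start

cpu_time = time_matmul("/CPU:0")
print(f"CPU: {cpu_time:.3f}s")

# Only attempt the GPU run if TensorFlow actually sees a GPU.
if tf.config.list_physical_devices("GPU"):
    gpu_time = time_matmul("/GPU:0")
    print(f"GPU: {gpu_time:.3f}s (speedup ~{cpu_time / gpu_time:.1f}x)")
```

On a machine with a supported GPU, the second timing is typically an order of magnitude or more faster; the exact ratio depends heavily on the hardware and matrix size.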