Recent Blogs
3 MIN READ
by Mark Gitau (Software Engineer)
Introduction
For the MLPerf Inference v5.1 submission, Azure shared performance results on the new ND GB200 v6 virtual machines. A single ND GB200 v6 VM on Azur...
Sep 09, 2025 | 105 views | 0 likes | 0 comments
Introduction
The DeepSeek R1 model represents a new frontier in large-scale reasoning for AI applications. Designed to tackle complex inference tasks, R1 pushes the boundaries of what’s possible—bu...
Aug 28, 2025 | 249 views | 0 likes | 0 comments
5 MIN READ
Introduction:
Many customers run multiple Teamcenter-SPDM solutions across the enterprise, mixing multiple instances, multiple ISV vendors, and hybrid cloud/on-prem implementations. This fragmentat...
Aug 28, 2025 | 106 views | 0 likes | 0 comments
Introduction
Following our previous evaluation of Llama 3.1 8B inference performance on Azure’s ND-H100-v5 infrastructure using vLLM, this report broadens the scope to compare inference performance...
Aug 26, 2025 | 244 views | 0 likes | 0 comments
Introduction
The pace of development in large language models (LLMs) has continued to accelerate as the global AI community races toward the goal of artificial general intelligence (AGI). Today’s m...
Aug 26, 2025 | 224 views | 0 likes | 0 comments
6 MIN READ
Small performance gaps on a single virtual machine lead to large and costly performance losses at scale. Running small-scale pretraining jobs enables single-VM validation and allows for fine-grained ...
Aug 18, 2025 | 434 views | 0 likes | 0 comments
Architecture
Ansys Minerva baseline architecture has four distributed tiers (client, web, enterprise, and resource) in a single Azure availability zone. Each tier aligns to function and communicati...
Jul 30, 2025 | 190 views | 0 likes | 0 comments
High Performance Computing (HPC) environments are essential for research, engineering, and data-intensive workloads. To efficiently manage compute resources and job submissions, organizations rely on...
Jul 21, 2025 | 353 views | 0 likes | 0 comments
Microsoft Azure’s high-performance computing (HPC) & AI infrastructure is designed from the ground up to support the world’s most demanding workloads. High-performance AI workloads are bandwidth-hung...
Jun 25, 2025 | 1.6K views | 3 likes | 1 comment
Overview
Semiconductor (or Electronic Design Automation [EDA]) companies prioritize reducing time to market (TTM), which depends on how quickly tasks such as chip design validation and pre-foundry ...
Jun 24, 2025 | 339 views | 0 likes | 0 comments
Tags
- hpc (235 topics)
- ai infrastructure (90 topics)
- virtual machines (65 topics)
- benchmarking (51 topics)
- updates (18 topics)
- storage (17 topics)
- events (15 topics)
- ramp up with me (12 topics)
- Microsoft Ignite 2023 (1 topic)
- Microsoft Build 2024 (1 topic)