AKS and NVIDIA A100 GPU support with Azure NDasrv4 Series

Copper Contributor

Hi there,

 

We are using Azure's Standard_ND96asr_v4 instance types for our ML workloads and would love to use AKS images, instead of custom VM images to make it work.

 

We ran into issues migrating from V100 to A100 GPUs, which could be addressed by installing the drivers and fabric managers mentioned in this help page: https://docs.microsoft.com/en-us/azure/machine-learning/data-science-virtual-machine/reference-known...

 

Is there any plan to fix the AKS VM images with those packages so we don't have to maintain a separate image?

0 Replies