Forum Discussion

nqnielsen
Copper Contributor
Sep 23, 2021

AKS and NVIDIA A100 GPU support with Azure NDasrv4 Series

Hi there,

 

We are using Azure's Standard_ND96asr_v4 instance type for our ML workloads and would love to use the AKS images, rather than custom VM images, to make it work.

 

We ran into issues migrating from V100 to A100 GPUs, which we were able to address by installing the drivers and fabric manager mentioned on this help page: https://docs.microsoft.com/en-us/azure/machine-learning/data-science-virtual-machine/reference-known-issues#fix-gpu-on-nvidia-a100-gpu-chip---azure-ndasrv4-series
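
For anyone who wants to check whether a node already has both pieces in place, here is a minimal sketch in Python. It is not taken from the help page. It assumes that `nvidia-smi` is on the PATH once the driver is installed, and that the fabric manager runs as a systemd unit named `nvidia-fabricmanager` (an assumption; adjust the name if your install differs). The helper names `command_ok` and `a100_node_status` are hypothetical.

```python
import shutil
import subprocess


def command_ok(argv):
    """Return True if the command exists and exits with status 0."""
    if shutil.which(argv[0]) is None:
        return False
    try:
        result = subprocess.run(argv, capture_output=True, text=True, timeout=30)
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0


def a100_node_status(check=command_ok):
    """Report whether the driver and the fabric manager service look healthy."""
    return {
        # nvidia-smi exits non-zero when it cannot talk to the driver.
        "driver": check(["nvidia-smi"]),
        # Assumed unit name; change it if the service is named differently.
        "fabric_manager": check(
            ["systemctl", "is-active", "--quiet", "nvidia-fabricmanager"]
        ),
    }


if __name__ == "__main__":
    for component, healthy in a100_node_status().items():
        print(f"{component}: {'ok' if healthy else 'missing or not running'}")
```

The `check` parameter is injectable so the logic can be exercised on a machine without a GPU. On a real node, a failing `fabric_manager` entry alongside a passing `driver` entry would suggest the driver is present but the fabric manager is not installed or not started.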

 

Are there any plans to include those packages in the AKS VM images so we don't have to maintain a separate image?
