Getting started with the NCsv3 series and NCas_T4_v3 series
Published Jul 08 2022 08:18 PM 5,972 Views
Microsoft

By Hugo Affaticati, Program Manager

 

Useful resources

Information on the Microsoft NCsv3-series  

Information on the Microsoft NCas_T4_v3-series

 

Pre-requisites

Deploy a virtual machine on Microsoft Azure Portal.

 

Key values:

  • Size: NC6s v3 (also available NC12s v3 and NC24s v3) or NC16as_T4_v3 (also available NC4as_T4_v3, NC8as_T4_v3, and NC64as_T4_v3)
  • Image: Ubuntu HPC 18.04 (recommended, also available Ubuntu HPC 20.04)
  • Availability: no redundancy required for benchmarking
  • Disk: 2048 GB data disk

 

Step 1: NVIDIA driver and CUDA

Verify the NVIDIA driver version

cd /mnt
nvidia-smi

 

If the driver version is less than 510, update both the driver and the CUDA versions

sudo wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-ubuntu1804.pin 

sudo mv cuda-ubuntu1804.pin /etc/apt/preferences.d/cuda-repository-pin-600

sudo wget https://developer.download.nvidia.com/compute/cuda/11.6.1/local_installers/cuda-repo-ubuntu1804-11-6...

sudo dpkg -i cuda-repo-ubuntu1804-11-6-local_11.6.1-510.47.03-1_amd64.deb

sudo apt-key add /var/cuda-repo-ubuntu1804-11-6-local/7fa2af80.pub

sudo apt-get update

sudo apt-get -y install cuda

 

Restart the machine

sudo reboot

 

Step 2: Docker

The next step is to update Docker to the latest version.

cd /mnt

sudo apt update

sudo apt-get install apt-transport-https ca-certificates curl software-properties-common

curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -

sudo add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu  $(lsb_release -cs)  stable"

sudo apt update

sudo apt-get install docker-ce

 

Update the Docker root directory in the docker daemon configuration file

sudo vi /etc/docker/daemon.json

 

Add the line after the first curly bracket

"data-root": "/mnt/data",

 

Verify the previous steps and enable docker

docker --version

sudo systemctl restart docker

sudo systemctl enable docker

 

Register your user for Docker

sudo groupadd docker

sudo usermod -aG docker $USER

newgrp docker

 

You should not have any permission issues when running

docker info

 

Step 3: Mount the data disk

Find the managed disk you deployed with the deployment of the virtual machine

lsblk -o NAME,HCTL,SIZE,MOUNTPOINT | grep -i "sd"

 

Get the name (sda, sdb, or sdc…) corresponding to the 2TB disk from the previous line.

Replace “sdc” in the following lines 2 and 3 by the accurate name.

sudo mkdir /mnt/resource_mdisk

sudo mkfs.ext4 /dev/sdc && sudo fsck /dev/sdc

sudo mount /dev/sdc /mnt/resource_mdisk

sudo chown -R azureuser:users resource_mdisk/

 

 

Co-Authors
Version history
Last update:
‎Oct 25 2022 12:54 PM
Updated by: