Installing NVIDIA Docker component and Python packages

After docker has been installed, install the NVIDIA docker component and the Python packages required to run the check service script.

Procedure

Install nvidia-docker

Follow the instructions below for your computer architecture.

For x86_64:

Remove nvidia-docker 1.0 and all existing GPU containers, if it's already installed:

# docker volume ls -q -f driver=nvidia-docker | xargs -r -I{} -n1 docker ps -q -a -f volume={} | xargs -r docker rm -f
# sudo yum remove nvidia-docker

Add the package repositories

# distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
# curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.repo | \
tee /etc/yum.repos.d/nvidia-docker.repo

Install nvidia-docker2 and reload the Docker daemon configuration:

# sudo yum install -y nvidia-docker2
# sudo pkill -SIGHUP dockerd

Set up the container runtime and tell Docker to use it as the default. Type the following, hitting enter after each line or simply copy the contents starting with { into /etc/docker/daemon.json:
```
# cat /etc/docker/daemon.json << EOF
{
"default-runtime": "nvidia",
"runtimes": {
"nvidia": {
    "path": "/usr/bin/nvidia-container-runtime",
    "runtimeArgs": []
}
}
}
```
Restart Docker:
```
# systemctl restart docker
```

Test that it worked:

# docker run --rm nvidia/cuda:10.0-runtime-ubuntu18.04 nvidia-smi

For POWER9:

Install the nvidia-docker and nvidia-container-runtime repositories. Type the following, hitting enter after each line or simply copy the contents starting with [nvidia-docker] into /etc/yum.repos.d/nvidia-docker.repo:

# cat > /etc/yum.repos.d/nvidia-docker.repo << EOF
[nvidia-docker]
name=nvidia-docker
baseurl=https://nvidia.github.io/nvidia-docker/centos7/ppc64le
repo_gpgcheck=1
gpgcheck=0
enabled=1
gpgkey=https://nvidia.github.io/nvidia-docker/gpgkey
sslverify=1
sslcacert=/etc/pki/tls/certs/ca-bundle.crt

[nvidia-container-runtime]
name=nvidia-container-runtime
baseurl=https://nvidia.github.io/nvidia-container-runtime/centos7/$basearch
repo_gpgcheck=1
gpgcheck=0
enabled=1
gpgkey=https://nvidia.github.io/nvidia-container-runtime/gpgkey
sslverify=1
sslcacert=/etc/pki/tls/certs/ca-bundle.crt

Install the container runtime and runtime hook:

# yum install -y nvidia-container-runtime-hook
# yum install -y nvidia-container-runtime
# mkdir -p /usr/libexec/oci/hooks.d
# echo -e '#!/bin/sh\nPATH="/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin" exec nvidiacontainer-runtime-hook "$@"' | sudo tee /usr/libexec/oci/hooks.d/nvidia
# chmod +x /usr/libexec/oci/hooks.d/nvidia

Set up the container runtime and tell Docker to use it as the default:

# cat /etc/docker/daemon.json << EOF
{
"default-runtime": "nvidia",
"runtimes": {
"nvidia": {
    "path": "/usr/bin/nvidia-container-runtime",
    "runtimeArgs": []
}
}
}

Restart Docker:
```
# systemctl restart docker
```

Test that it worked:

# docker run --rm nvidia/cuda-ppc64le:10.0-runtime-ubuntu18.04 nvidia-smi

You should see output like you would see when you run nvidia-smi on the host.

Install Python packages for the check service script
```
# pip install PyYAML requests colorama
```

Parent topic: Installing the Deep Learning Engine (DLE) component
Next topic:
Running the DLE installer