在 K8S 上簡單實現 Nvidia GPU Time-Slicing

Posted on 2025-03-092025-12-18 by 檸檬爸

Post Views: 2,284

Nvidia 的 GPU 目前是市場上使用的主流，在雲的世界裡面，由於大部分的使用場景是按需 (On Demand)，因此 K8S 慢慢地也是雲端管理資源的一個利器，如何在 Kubernetes 上調用 GPU 的資源相對地也越來越普遍，本篇整理了目前網路上可以看到 Nvidia GPU 於操作方法，並且介紹一種簡單實現 GPU Time-Slicing 的設定。

在 Kubernetes 上使用 GPU 的方法

由於檸檬爸使用的雲端環境主要是 Azure，所以從 Use GPUs for compute-intensive workloads on Azure Kubernetes Service (AKS) 的文件出發，可以總結在 K8S 上面使用 GPU 有以下兩種做法：

複雜且消耗資源的做法：Nvidia GPU Operator
簡單但比較受限的做法：直接部署 Nvidia Device Plugin

Nvidia GPU Operator

Nvidia GPU Operator 是一個比較全面部署 GPU 相關軟件在 K8S 上面的管理套件，除了 Nvidia Device Plugin 之外，GPU Operator 還可以依照需求按照以下順序幫忙叢集安裝以下程式：

Nvidia Driver Installer
Nvidia Container Toolkit Installer
Nvidia Device Plugin
DCGM Exporter

根據網站的介紹，安裝完成以後，應該要可以看到以下的 Pods 列表。

root@test:~# kubectl -n gpu-operator get pods
NAME                                                           READY   STATUS      RESTARTS      AGE
gpu-feature-discovery-jdqpb                                    1/1     Running     0             35d
gpu-operator-67f8b59c9b-k989m                                  1/1     Running     6 (35d ago)   35d
nfd-node-feature-discovery-gc-5644575d55-957rp                 1/1     Running     6 (35d ago)   35d
nfd-node-feature-discovery-master-5bd568cf5c-c6t9s             1/1     Running     6 (35d ago)   35d
nfd-node-feature-discovery-worker-sqb7x                        1/1     Running     6 (35d ago)   35d
nvidia-container-toolkit-daemonset-rqgtv                       1/1     Running     0             35d
nvidia-cuda-validator-9kqnf                                    0/1     Completed   0             35d
nvidia-dcgm-exporter-8mb6v                                     1/1     Running     0             35d
nvidia-device-plugin-daemonset-7nkjw                           1/1     Running     0             35d
nvidia-driver-daemonset-5.15.0-105-generic-ubuntu22.04-g5dgx   1/1     Running     5 (35d ago)   35d
nvidia-operator-validator-6mqlm                                1/1     Running     0             35d

只安裝 Nvidia Device Plugin

在 AKS 上，可以選擇只安裝 Nvidia Device Plugin，以下官方提供的 nvidia-device-plugin.yaml，透過 kubectl apply -f nvidia-device-plugin.yaml 指令安裝就可以部署 Device Plugin 的 DaemonSet，在

apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: nvidia-device-plugin-daemonset
  namespace: kube-system
spec:
  selector:
    matchLabels:
      name: nvidia-device-plugin-ds
  updateStrategy:
    type: RollingUpdate
  template:
    metadata:
      labels:
        name: nvidia-device-plugin-ds
    spec:
      tolerations:
      - key: "sku"
        operator: "Equal"
        value: "gpu"
        effect: "NoSchedule"
      # Mark this pod as a critical add-on; when enabled, the critical add-on
      # scheduler reserves resources for critical add-on pods so that they can
      # be rescheduled after a failure.
      # See https://kubernetes.io/docs/tasks/administer-cluster/guaranteed-scheduling-critical-addon-pods/
      priorityClassName: "system-node-critical"
      containers:
      - image: nvcr.io/nvidia/k8s-device-plugin:v0.15.0
        name: nvidia-device-plugin-ctr
        env:
          - name: FAIL_ON_INIT_ERROR
            value: "false"
        securityContext:
          allowPrivilegeEscalation: false
          capabilities:
            drop: ["ALL"]
        volumeMounts:
        - name: device-plugin
          mountPath: /var/lib/kubelet/device-plugins
      volumes:
      - name: device-plugin
        hostPath:
          path: /var/lib/kubelet/device-plugins

另外如果想要設定 Time-Slicing 則可以利用以下的 time-slicing.yaml 檔先創建一個 ConfigMap，在參考連結之後，稍微修改以上的 nvidia-device-plugin.yaml 得到新的 Daemonsets 部署設定。

apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: nvidia-device-plugin-daemonset
  namespace: kube-system
spec:
  selector:
    matchLabels:
      name: nvidia-device-plugin-ds
  updateStrategy:
    type: RollingUpdate
  template:
    metadata:
      labels:
        name: nvidia-device-plugin-ds
    spec:
      tolerations:
      - key: nvidia.com/gpu
        operator: Exists
        effect: NoSchedule
      # Mark this pod as a critical add-on; when enabled, the critical add-on
      # scheduler reserves resources for critical add-on pods so that they can
      # be rescheduled after a failure.
      # See https://kubernetes.io/docs/tasks/administer-cluster/guaranteed-scheduling-critical-addon-pods/
      priorityClassName: "system-node-critical"
      containers:
      - image: nvcr.io/nvidia/k8s-device-plugin:v0.14.0
        name: nvidia-device-plugin-ctr
        env:
          - name: CONFIG_FILE
            value: "/opt/config/config.yaml"
        securityContext:
          privileged: true
        volumeMounts:
        - name: device-plugin
          mountPath: /var/lib/kubelet/device-plugins
        - name: config
          mountPath: "/opt/config"
      volumes:
      - name: device-plugin
        hostPath:
          path: /var/lib/kubelet/device-plugins
      - name: config
        configMap:
          name: nvidia-config
---
apiVersion: v1
kind: ConfigMap
metadata:
  name: nvidia-config
  namespace: kube-system
  labels:
    app: nvidia
data:
  config.yaml: |-
    version: v1
    flags:
      migStrategy: "none"
      failOnInitError: false
      nvidiaDriverRoot: "/"
      plugin:
        passDeviceSpecs: true
    sharing:
      timeSlicing:
        resources:
        - name: nvidia.com/gpu
          replicas: 10

備註：可以只部署一個 Nvidia Device Plugin 的原因主要是因為 GPU Driver 已經被預先安裝好了，在 AKS 的 agent pool 裡面有一個參數 –gpu-driver 預設是 Install，如果想要自己透過 GPU Operator 來部署 GPU 相關的環境的話，這一個參數在部署 Agent pool 的時候需要預先關閉！

CUDA driver version is insufficient for CUDA runtime version

文件中有另外說明一件事情，如果遇到 CUDA runtime 不支援的問題可以參考。

Error: failed to create containerd task: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running prestart hook #0: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: requirement error: unsatisfied condition: cuda>=129.0, please update your driver to a newer version, or use an earlier cuda container: unknown
  Warning  BackOff  10s (x2 over 11s)  kubelet  Back-off restarting failed container interactive-server in pod run-bo79klsfugnrjny-0(67da1e70-e8f7-4c7e-b2ae-300955d8782a)

在 K8S 上簡單實現 Nvidia GPU Time-Slicing

在 Kubernetes 上使用 GPU 的方法

Nvidia GPU Operator

只安裝 Nvidia Device Plugin

CUDA driver version is insufficient for CUDA runtime version

One thought on “在 K8S 上簡單實現 Nvidia GPU Time-Slicing”

Leave a Reply Cancel reply

Most Viewed Posts

Categories

Recent Posts

Archives

Facebook Page Widget

Contact Us

檸檬媽

檸檬爸