Gol*_*ian 7 nvidia docker nvidia-docker
I installed nvidia-docker2 following the instructions here. When I run the following command, I get the expected output shown below.
sudo docker run --rm --gpus all nvidia/cuda:11.0.3-base-ubuntu20.04 nvidia-smi
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 495.29.05 Driver Version: 495.29.05 CUDA Version: 11.5 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... On | 00000000:0B:00.0 On | N/A |
| 24% 31C P8 13W / 250W | 222MiB / 11011MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
+-----------------------------------------------------------------------------+
However, running the same command without `sudo` fails with the following error:
$ docker run --rm --gpus all nvidia/cuda:11.0.3-base-ubuntu20.04 nvidia-smi
docker: Error response from daemon: failed to create shim task: OCI runtime create
failed: runc create failed: unable to start container process: error during container
init: error running hook #0: error running hook: exit status 1, stdout: , stderr:
nvidia-container-cli: initialization error: load library failed: libnvidia-ml.so.1:
cannot open shared object file: no such file or directory: unknown.
Can anyone help me resolve this?
Add your user to the docker group:
sudo usermod -aG docker your_user
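A group change made with `usermod` only applies to new login sessions, so it may be worth confirming that the current shell actually sees the `docker` group before retrying. The following is a minimal sketch; the helper name `has_group` is made up for illustration, and it only reports membership, it does not diagnose the `libnvidia-ml.so.1` error itself.

```shell
# has_group LIST NAME (hypothetical helper): succeed if NAME appears as a
# whole entry in the space-separated LIST of group names.
has_group() {
    printf '%s\n' "$1" | tr ' ' '\n' | grep -qxF -- "$2"
}

# `id -nG` with no user argument lists the groups of the current session,
# which is what matters right after running usermod.
if has_group "$(id -nG)" docker; then
    echo "current session is in the docker group"
else
    echo "current session is not in the docker group; log out and back in, or run: newgrp docker"
fi
```

If the second message is printed after running `usermod`, starting a new login session (or `newgrp docker`) should make the membership visible.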
Update:
See https://github.com/NVIDIA/nvidia-docker/issues/539
Some of the comments there may help.