Fixing nvidia-smi Execution Failures
1. The nvidia-smi failure message
nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
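A script can recognize this particular failure by matching the error text. The following is a minimal sketch that assumes the message wording shown above; the function name is_driver_comm_failure is hypothetical and not part of any NVIDIA tool.

```shell
# Hypothetical helper: exits 0 when the text on stdin contains the
# "couldn't communicate with the NVIDIA driver" error shown above,
# and exits non-zero otherwise.
is_driver_comm_failure() {
    grep -q "couldn't communicate with the NVIDIA driver"
}
```

Possible usage: nvidia-smi 2>&1 | is_driver_comm_failure && echo "driver not reachable"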
2. Solution
(1) Remove the existing NVIDIA driver
su - root
apt update && apt upgrade
apt install build-essential dkms
apt-get remove --purge '^nvidia-.*'
apt-get remove --purge '^libnvidia-.*'
(2) Check the graphics card and any remaining NVIDIA packages
lshw -c display
dpkg --list | grep nvidia
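To check the result of the purge from a script, the package list can be filtered by install status and package name instead of a plain text match. This is a sketch only; installed_nvidia_packages is a hypothetical name, and it assumes the usual dpkg --list column order (status, name, version, ...).

```shell
# Hypothetical helper: reads `dpkg --list` output on stdin and prints the
# names of packages that are installed (status "ii") and whose *name*
# contains "nvidia". Lines that only mention nvidia in the description
# are ignored.
installed_nvidia_packages() {
    awk '$1 == "ii" && $2 ~ /nvidia/ { print $2 }'
}
```

Possible usage: dpkg --list | installed_nvidia_packages
An empty result suggests that no NVIDIA packages matched by name are still installed.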
(3) Find the available NVIDIA drivers
apt install ubuntu-drivers-common
ubuntu-drivers devices
== /sys/devices/pci0000:00/0000:00:02.0/0000:02:00.0 ==
modalias : pci:v000010DEd00001DB5sv000010DEsd00001249bc03sc02i00
vendor : NVIDIA Corporation
model : GV100GL [Tesla V100 SXM2 32GB]
driver : nvidia-driver-535 - distro non-free
driver : nvidia-driver-470 - distro non-free
driver : nvidia-driver-575 - distro non-free recommended
driver : nvidia-driver-418-server - distro non-free
driver : nvidia-driver-450-server - distro non-free
driver : nvidia-driver-545 - distro non-free
driver : nvidia-driver-570 - distro non-free
driver : nvidia-driver-535-server - distro non-free
driver : nvidia-driver-550 - distro non-free
driver : nvidia-driver-390 - distro non-free
driver : nvidia-driver-470-server - distro non-free
driver : nvidia-driver-570-server - distro non-free
driver : nvidia-driver-575-server - distro non-free
driver : xserver-xorg-video-nouveau - distro free builtin
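The recommended entry can be picked out of this listing automatically. The sketch below assumes the line layout shown above ("driver : <package> - ... recommended"); recommended_driver is a hypothetical helper name.

```shell
# Hypothetical helper: reads `ubuntu-drivers devices` output on stdin and
# prints the package name from the first "driver" line that ends with the
# word "recommended".
recommended_driver() {
    awk '$1 == "driver" && $NF == "recommended" { print $3; exit }'
}
```

Possible usage: ubuntu-drivers devices | recommended_driver
For the listing above this would print nvidia-driver-575, which is useful as a reference point even when a different version is installed deliberately.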
(4) Install the NVIDIA driver
# ubuntu-drivers autoinstall   # would automatically install the recommended driver, which is the highest version (575) here
apt install nvidia-driver-550
(5) Reboot
reboot   # a reboot is required
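After the reboot, one quick check is whether the nvidia kernel module is actually loaded. The sketch below parses lsmod output; nvidia_module_loaded is a hypothetical helper name.

```shell
# Hypothetical helper: reads `lsmod` output on stdin and exits 0 only if a
# module named exactly "nvidia" is listed (nvidia_uvm, nvidia_drm and
# similar companion modules do not count on their own).
nvidia_module_loaded() {
    awk '$1 == "nvidia" { found = 1 } END { exit !found }'
}
```

Possible usage: lsmod | nvidia_module_loaded || echo "nvidia module not loaded"
If the module is missing, the output of dkms status may help show whether the module was built for the running kernel.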
(6) Run nvidia-smi again
nvidia-smi
Fri Jul 25 14:28:17 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.163.01             Driver Version: 550.163.01     CUDA Version: 12.4     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  Tesla V100-SXM2-32GB           Off |   00000000:02:00.0 Off |                    0 |
| N/A   38C    P0             25W /  300W |       1MiB /  32768MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                             |
+-----------------------------------------------------------------------------------------+
ai@ai-X99:~$
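For scripted checks, nvidia-smi can also print selected fields as CSV through its --query-gpu and --format options, which is easier to parse than the table. The sketch below extracts the driver's major version from such a line; driver_major_from_csv is a hypothetical helper, and it assumes the fields are requested in the order name,driver_version,memory.total.

```shell
# Hypothetical helper: reads one CSV line such as
#   Tesla V100-SXM2-32GB, 550.163.01, 32768 MiB
# on stdin and prints the major driver version (the part of the second
# field before the first dot).
driver_major_from_csv() {
    awk -F', ' '{ split($2, v, "."); print v[1]; exit }'
}
```

Possible usage: nvidia-smi --query-gpu=name,driver_version,memory.total --format=csv,noheader | driver_major_from_csv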
(7) Check the installed NVIDIA packages
ai@ai-X99:~$ dpkg --list | grep nvidia
ii libnvidia-cfg1-550:amd64 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA binary OpenGL/GLX configuration library
ii libnvidia-common-550 550.163.01-0ubuntu0.22.04.1 all Shared files used by the NVIDIA libraries
ii libnvidia-compute-550:amd64 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA libcompute package
ii libnvidia-compute-550:i386 550.163.01-0ubuntu0.22.04.1 i386 NVIDIA libcompute package
ii libnvidia-decode-550:amd64 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA Video Decoding runtime libraries
ii libnvidia-decode-550:i386 550.163.01-0ubuntu0.22.04.1 i386 NVIDIA Video Decoding runtime libraries
ii libnvidia-egl-wayland1:amd64 1:1.1.9-1.1ubuntu0.1 amd64 Wayland EGL External Platform library -- shared library
ii libnvidia-egl-wayland1:i386 1:1.1.9-1.1ubuntu0.1 i386 Wayland EGL External Platform library -- shared library
ii libnvidia-encode-550:amd64 550.163.01-0ubuntu0.22.04.1 amd64 NVENC Video Encoding runtime library
ii libnvidia-encode-550:i386 550.163.01-0ubuntu0.22.04.1 i386 NVENC Video Encoding runtime library
ii libnvidia-extra-550:amd64 550.163.01-0ubuntu0.22.04.1 amd64 Extra libraries for the NVIDIA driver
ii libnvidia-fbc1-550:amd64 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-fbc1-550:i386 550.163.01-0ubuntu0.22.04.1 i386 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-gl-550:amd64 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii libnvidia-gl-550:i386 550.163.01-0ubuntu0.22.04.1 i386 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii nvidia-compute-utils-550 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA compute utilities
ii nvidia-dkms-550 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA DKMS package
ii nvidia-driver-550 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA driver metapackage
ii nvidia-firmware-550-550.163.01 550.163.01-0ubuntu0.22.04.1 amd64 Firmware files used by the kernel module
ii nvidia-kernel-common-550 550.163.01-0ubuntu0.22.04.1 amd64 Shared files used with the kernel module
ii nvidia-kernel-source-550 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA kernel source package
ii nvidia-prime 0.8.17.1 all Tools to enable NVIDIA's Prime
ii nvidia-settings 510.47.03-0ubuntu1 amd64 Tool for configuring the NVIDIA graphics driver
ii nvidia-utils-550 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA driver support binaries
ii screen-resolution-extra 0.18.2 all Extension for the nvidia-settings control panel
ii xserver-xorg-video-nvidia-550 550.163.01-0ubuntu0.22.04.1 amd64 NVIDIA binary Xorg driver
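In the listing above, every package tied to the 550 branch carries the same version string, while nvidia-settings and nvidia-prime are versioned independently. A script could confirm that consistency as sketched below; driver_package_versions is a hypothetical helper, and the rule "package name contains nvidia and a three-digit branch suffix" is an assumption based on the names shown here.

```shell
# Hypothetical helper: reads `dpkg --list` output on stdin and prints the
# distinct version strings of installed packages whose name contains
# "nvidia" and a three-digit driver branch such as "-550".
# A single output line suggests all driver packages are on the same version.
driver_package_versions() {
    awk '$1 == "ii" && $2 ~ /nvidia/ && $2 ~ /-[0-9][0-9][0-9]/ { print $3 }' | sort -u
}
```

Possible usage: dpkg --list | driver_package_versions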
