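1. Install gcc and make (the .run driver installer builds its kernel module from source)
2. Install pkg-config and libglvnd-dev (the driver installer uses them to locate the libglvnd configuration paths)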
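A minimal sketch for steps 1 and 2, assuming a POSIX shell on Ubuntu. The helper name `missing_tools` is hypothetical and only reports which commands are absent from PATH. The actual install line is left as a comment because it changes the system.

```shell
# Print each named command that is not found on PATH, one per line.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Example use (package names are the ones listed in steps 1 and 2):
# [ -z "$(missing_tools gcc make pkg-config)" ] || \
#     sudo apt install gcc make pkg-config libglvnd-dev
```

libglvnd-dev ships no command of its own, so the sketch only probes gcc, make and pkg-config and lets apt handle the library package.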
3. Remove every NVIDIA driver package that ships with Ubuntu, then disable nouveau
sudo apt purge '*nvidia*'
sudo reboot
sudo nano /etc/modprobe.d/blacklist-nouveau.conf
Add the following two lines to the file, then save it:
blacklist nouveau
options nouveau modeset=0
sudo update-initramfs -u
sudo reboot
lsmod | grep nouveau
(No output means nouveau is no longer loaded.)
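The nouveau steps above can also be scripted without an editor. This is a sketch only: the function names `nouveau_blacklist_conf` and `nouveau_loaded` are hypothetical, and the lines that modify the system are left as comments.

```shell
# Print the two blacklist lines from step 3.
nouveau_blacklist_conf() {
    printf '%s\n' 'blacklist nouveau' 'options nouveau modeset=0'
}

# Succeed if lsmod-style text on stdin lists a module named exactly "nouveau".
nouveau_loaded() {
    awk '$1 == "nouveau" { found = 1 } END { exit !found }'
}

# Example use (not run here because it changes the system):
# nouveau_blacklist_conf | sudo tee /etc/modprobe.d/blacklist-nouveau.conf
# sudo update-initramfs -u && sudo reboot
# lsmod | nouveau_loaded && echo "nouveau is still loaded"
```

Matching on the first column avoids false hits from other modules that merely list nouveau as a dependency.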
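4. Use inxi to check which driver the graphics cards are using (driver: N/A in the output below means no driver is loaded yet):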
sudo apt install inxi
inxi -G
Graphics:
  Device-1: NVIDIA GM107 [GeForce GTX 750] driver: N/A
  Device-2: NVIDIA GP100GL [Tesla P100 PCIe 16GB] driver: N/A
  Display: server: X.org v: 1.21.1.13 with: Xwayland v: 24.1.2 driver: N/A tty: 166x28
  API: EGL v: 1.5 drivers: swrast platforms: surfaceless,device
  API: OpenGL v: 4.5 vendor: mesa v: 24.2.8-1ubuntu1~24.10.1 note: console (EGL sourced)
    renderer: llvmpipe (LLVM 19.1.1 256 bits)
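To pull just the per-device driver field out of output like the above, a small filter can help. This is a sketch that assumes the `Device-N: ... driver: VALUE` layout shown here. The function name `device_drivers` is hypothetical, and other inxi versions may format the line differently.

```shell
# Read `inxi -G` text on stdin and print "<Device-N:> <driver>" for each device line.
device_drivers() {
    awk '/Device-[0-9]+:/ {
        for (i = 1; i < NF; i++) {
            if ($i ~ /^Device-[0-9]+:$/) dev = $i
            if ($i == "driver:") print dev, $(i + 1)
        }
    }'
}

# Example use:
# inxi -G | device_drivers
```

Before the driver is installed each device should report N/A; afterwards the same filter should show the driver name instead.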
5. Stop the display manager (gdm3 or lightdm)
sudo systemctl stop gdm3
(On a system that uses lightdm: sudo systemctl stop lightdm)
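If it is unclear which display manager is in use, the following sketch reads it from a file. It assumes the Debian/Ubuntu convention that /etc/X11/default-display-manager holds the path of the display manager binary, and that the service name matches the binary name; both are assumptions to verify on the target machine. The function name `display_manager_name` is hypothetical.

```shell
# Print the base name of the display manager binary recorded in FILE
# (default: /etc/X11/default-display-manager). Fails if FILE is unreadable.
display_manager_name() {
    file=${1:-/etc/X11/default-display-manager}
    [ -r "$file" ] || return 1
    basename "$(head -n 1 "$file")"
}

# Example use (not run here because it stops the graphical session):
# sudo systemctl stop "$(display_manager_name)"
```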
6. Install the driver from the standalone .run file (keep a backup of the X server configuration)
sudo ./NVIDIA-Linux-x86_64-570.86.15.run
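One way to keep the X configuration backup mentioned in step 6 is a timestamped copy made before running the installer. This is a sketch: the helper name `backup_file` is hypothetical, and /etc/X11/xorg.conf is only the conventional location, which may not exist at all on a fresh install.

```shell
# Copy FILE to FILE.bak-<timestamp> and print the backup path.
# Does nothing (and prints nothing) if FILE does not exist.
backup_file() {
    src=$1
    [ -f "$src" ] || return 0
    dest="$src.bak-$(date +%Y%m%d-%H%M%S)"
    cp -p "$src" "$dest" && printf '%s\n' "$dest"
}

# Example use, from a root shell because /etc/X11 is not user-writable:
# backup_file /etc/X11/xorg.conf
```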
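7. Install CUDA from the standalone .run installer
sudo ./cuda_12.8.0_570.86.10_linux.run
Deselect the Driver component in the installer menu, because the driver was already installed in step 6.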
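8. Set the environment variables
nano ~/.bashrc
Append these two lines (the ${VAR:+...} form avoids a leading colon, and therefore an empty search entry, when LD_LIBRARY_PATH is unset):
export PATH="$PATH:/usr/local/cuda-12.8/bin"
export LD_LIBRARY_PATH="${LD_LIBRARY_PATH:+$LD_LIBRARY_PATH:}/usr/local/cuda-12.8/lib64"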
source ~/.bashrc
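Because ~/.bashrc is sourced again in every new shell, the same directory can end up in the variables more than once. The sketch below appends a directory only when it is missing. The helper name `path_append` is hypothetical; the CUDA paths are the ones used in step 8.

```shell
# Print LIST with DIR appended, unless DIR is already one of its
# colon-separated entries. An empty LIST yields just DIR (no leading colon).
path_append() {
    list=$1
    dir=$2
    case ":$list:" in
        *":$dir:"*) printf '%s\n' "$list" ;;
        *) printf '%s\n' "${list:+$list:}$dir" ;;
    esac
}

# Example use in ~/.bashrc:
# PATH=$(path_append "$PATH" /usr/local/cuda-12.8/bin); export PATH
# LD_LIBRARY_PATH=$(path_append "$LD_LIBRARY_PATH" /usr/local/cuda-12.8/lib64); export LD_LIBRARY_PATH
```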
9. Check that the CUDA toolkit is installed correctly:
nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Wed_Jan_15_19:20:09_PST_2025
Cuda compilation tools, release 12.8, V12.8.61
Build cuda_12.8.r12.8/compiler.35404655_0
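For a scripted version of this check, the release number can be extracted from the `release X.Y,` field shown above. This is a sketch that assumes the output keeps that wording; the function name `nvcc_release` is hypothetical.

```shell
# Read `nvcc --version` text on stdin and print the toolkit release, e.g. 12.8.
nvcc_release() {
    sed -n 's/.*release \([0-9][0-9.]*\),.*/\1/p'
}

# Example use:
# [ "$(nvcc --version | nvcc_release)" = "12.8" ] && echo "CUDA 12.8 toolkit found"
```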
10. Check that the driver is installed correctly:
nvidia-smi
Fri Feb 28 18:19:01 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 570.86.15              Driver Version: 570.86.15      CUDA Version: 12.8     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce GTX 750         Off |   00000000:06:00.0 Off |                  N/A |
| 33%   33C    P8              1W /   38W |      10MiB /   1024MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
|   1  Tesla P100-PCIE-16GB           Off |   00000000:07:00.0 Off |                    0 |
| N/A   32C    P0             26W /  250W |       0MiB /  16384MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI              PID   Type   Process name                        GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                             |
+-----------------------------------------------------------------------------------------+
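The table is meant for reading, not parsing. For a scripted check, nvidia-smi can emit CSV via `--query-gpu` and `--format`. The sketch below reduces that CSV to the distinct driver versions; the function name `driver_versions` is hypothetical, and the expected version is the one installed in step 6.

```shell
# Read "name, driver_version" CSV lines on stdin and print each distinct
# driver version once.
driver_versions() {
    awk -F', ' 'NF >= 2 { print $NF }' | sort -u
}

# Example use:
# nvidia-smi --query-gpu=name,driver_version --format=csv,noheader | driver_versions
```

A single line of output equal to 570.86.15 would indicate that both cards are bound to the newly installed driver.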
11. To uninstall the driver, run the same installer used in step 6 with --uninstall:
sudo ./NVIDIA-Linux-x86_64-570.86.15.run --uninstall