We have a Lambda Labs workstation running Ubuntu 18.04.6 LTS (Bionic Beaver). The NVIDIA driver stopped loading after a user upgraded GCC a few days ago. I have tried uninstalling and reinstalling Lambda Stack, which is supposed to cover the NVIDIA drivers along with the deep learning packages such as CUDA, TensorFlow, and PyTorch.
I have been researching this issue for a couple of days, and any help would be greatly appreciated. I have checked that Secure Boot is disabled and that nvidia is not blacklisted in modprobe.d. I have also tried other suggestions I found in other posts, but nothing has worked so far. As far as I can tell, the compiled driver is version 515.65.01 and GCC is version 9.4.0.
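For reference, here is a minimal sketch of how those checks can be collected in one pass. It only reads state and changes nothing. The `gcc_major` helper is a hypothetical name introduced for illustration, and it assumes the compiler is reported in the form "gcc version X.Y.Z" (newer kernels may format /proc/version differently, in which case it prints nothing). Comparing the compiler that built the kernel with the current one is only a guess at what might be relevant after a GCC upgrade, not a confirmed cause.

```shell
#!/bin/sh
# Diagnostic sketch: read-only checks, nothing here modifies the system.

# Hypothetical helper: extract the major version from a "gcc version X.Y.Z" string.
gcc_major() {
    printf '%s\n' "$1" | sed -n 's/.*gcc version \([0-9][0-9]*\)\..*/\1/p' | head -n 1
}

# Compiler that built the running kernel, as recorded in /proc/version.
kernel_gcc=$(gcc_major "$(cat /proc/version 2>/dev/null)")
# Compiler currently on PATH, which a module rebuild would pick up.
system_gcc=$(gcc_major "$(gcc -v 2>&1)")
echo "kernel built with GCC major: ${kernel_gcc:-unknown}"
echo "current GCC major:           ${system_gcc:-unknown}"

# Secure Boot state and DKMS build status, if the tools are installed.
command -v mokutil >/dev/null 2>&1 && mokutil --sb-state || true
command -v dkms >/dev/null 2>&1 && dkms status || true

# Blacklist entries and recent kernel messages that mention nvidia.
grep -ri nvidia /etc/modprobe.d/ 2>/dev/null || true
dmesg 2>/dev/null | grep -i nvidia | tail -n 20
```

If the two major versions differ, that mismatch would be worth mentioning when asking for help, since it narrows down where a module rebuild might be failing.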
I found a similar case on the NVIDIA Developer forum (https://forums.developer.nvidia.com/t/unable-to-load-nvidia-driver-for-ubuntu-20-04-lts/231138/3), where the solution was to recompile the driver against the standard kernel headers with:
sudo apt install --install-recommends linux-generic-hwe-20.04
I imagine that I would need to change 20.04 to 18.04 to match my Ubuntu version (that is, linux-generic-hwe-18.04). Does that look like a possible solution?
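To make the question concrete, this is a rough sketch of how the change could be previewed before committing to it. The `hwe_package_for` helper is a hypothetical name used only for illustration, and the script assumes the package naming pattern from the command above carries over to other LTS releases. The `-s` option makes apt-get simulate the install, so nothing is actually changed.

```shell
#!/bin/sh
# Sketch: preview the HWE kernel install for this release without changing anything.

# Hypothetical helper: build the HWE meta-package name for a release such as "18.04".
hwe_package_for() {
    printf 'linux-generic-hwe-%s\n' "$1"
}

# VERSION_ID is defined in /etc/os-release (for example "18.04" on Bionic).
[ -r /etc/os-release ] && . /etc/os-release
# Fall back to 18.04 if the release could not be detected.
pkg=$(hwe_package_for "${VERSION_ID:-18.04}")
echo "candidate package: $pkg"

# Show the running kernel, then simulate the install (-s makes no changes).
uname -r
command -v apt-get >/dev/null 2>&1 && apt-get -s install --install-recommends "$pkg" || true
```

The simulated output lists which kernel and header packages would be pulled in, which should show whether the 18.04 variant resolves at all on this machine.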
Thanks,