Hi, i’ve been sucessfully using 1xH100 instances in the last few days, and every day i would launch/terminate instance.
Today i have a problem, it seems that no gpu is detected on system:
nvidia-smi
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
I tried:
apt update && apt install
separately:
sudo apt install nvidia-driver-545 (doesn’t work)
sudo apt install nvidia-driver-535 (doesn’t work)
rebooting in between steps as well
nothing helped