Deleted Lambda Stack as it wasn’t working with PyTorch or TensorFlow.
Reinstalled a fresh Ubuntu 22.04, new NVIDIA drivers, etc. PyTorch and TensorFlow worked for ~3 weeks.
Last Thursday training suddenly started running very slowly, and nvidia-smi reported:
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
Reinstalled the 515 drivers through Software & Updates, restarted, and got nothing but a flashing cursor.
- Uninstalled / reinstalled Ubuntu 22.04, drivers, etc, same issue
- Tried Ubuntu 20.04, same issue
Interestingly, the NVIDIA GPU line in the output of lspci | grep VGA is "VGA compatible controller: NVIDIA Corporation Device 2460 (rev a1)", when this should be something like NVIDIA GeForce 3080 Ti or similar.
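For anyone comparing notes: as far as I know, lspci prints a bare "Device xxxx" whenever the device ID is missing from the local pci.ids database, so that line alone may not mean the card is misdetected. Below is a minimal sketch of that check. The helper name has_unresolved_name is hypothetical (not part of any tool), and the commented commands at the end assume pciutils is installed.

```shell
#!/bin/sh
# Hypothetical helper (not from any package): succeeds when an lspci line
# shows a bare "Device xxxx" placeholder instead of a product name.
has_unresolved_name() {
  printf '%s\n' "$1" | grep -Eq 'Device [0-9a-f]{4}( |$)'
}

# The line quoted in this post.
line='VGA compatible controller: NVIDIA Corporation Device 2460 (rev a1)'

if has_unresolved_name "$line"; then
  echo "name not resolved; the local pci.ids database may be older than the GPU"
else
  echo "name resolved"
fi

# On the affected machine (requires pciutils; not run by this sketch):
#   lspci -nn | grep -Ei 'vga|3d'    # also print numeric [vendor:device] IDs
#   sudo update-pciids               # refresh the local pci.ids database
#   lspci -k | grep -EiA3 'vga|3d'   # show which kernel driver is bound
```

If the name resolves after refreshing the database, the lspci output was only cosmetic, and the kernel driver shown by lspci -k is the more useful thing to look at.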
If I change GPU Mode to Discrete GPU Only, I can get to the login screen, but the trackpad and mouse don’t work.
I can’t get the recovery ISO image off the website as the link doesn’t work.
So what’s the next move here? Calling Lambda tomorrow, but wondering if anyone else has experienced something similar?