This is on a Lambda desktop. I just reinstalled the OS and put Lambda Stack on to get PyTorch running again. No joy. Before I go and start messing with drivers again, I was hoping someone might be able to tell me why a fresh install is having issues.
The installed CUDA version seems to be ahead of the one the latest PyTorch build supports.
This is a fresh OS and Lambda Stack install. I thought this would work out of the box.
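For reference, this is roughly the sanity check I was running (nothing Lambda-specific, just stock PyTorch calls) to see which CUDA build torch ships with and whether it can use the driver at all:

```python
import torch

# PyTorch version and the CUDA toolkit it was built against
print("PyTorch:", torch.__version__)
print("Built for CUDA:", torch.version.cuda)

# Whether this build can actually talk to the installed driver/GPUs
print("CUDA available:", torch.cuda.is_available())
```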
Solution:
OK, I figured it out. I was looking over the Additional Drivers section and saw that it thought it had installed drivers for a P5000 instead of an RTX 6000. WTF! I had also been wondering why my standard setup worked on another system with near-identical specs.
Turns out this box had been used by the deployment folks to test different graphics cards. So I opened it up and, sure enough, tucked beneath the two RTX 6000s was a P5000 in slot 3.
I pulled that card and it worked. I think it comes down to the RTX and Pxxxx series being different internal architectures, so the driver setup picked for the P5000 didn't work for the RTX 6000s.
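For anyone else who hits this, a quick way to spot a mixed box without opening the case (assuming the driver loads for the cards at all) is to enumerate what PyTorch sees and print each card's compute capability; a Pascal/Turing mix shows up right away, e.g. an RTX 6000 reports 7.5 while a P5000 reports 6.1:

```python
import torch

# List every GPU PyTorch can see, with its name and compute capability
for i in range(torch.cuda.device_count()):
    name = torch.cuda.get_device_name(i)
    major, minor = torch.cuda.get_device_capability(i)
    print(f"GPU {i}: {name} (compute capability {major}.{minor})")
```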
Final verdict: don't mix GPUs. Thanks for the help.