Lambda Stack - CUDNN 8 Upgrade Query

subhrajeitbhowmick · November 6, 2020, 7:34am

Hi,

I use a Tensorbook and need to leverage on Tensorflow GPU support for CUDA 11. Though the latest Lambda Stack upgrade switched my previous CUDA 10.2 to 11.1, the CUDNN version still remains 7.6.

Do we know of a timeline by when we can expect Lambda Stack to upgrade its CUDNN 7.6 to CUDNN 8.x?

Alternatively, is there a suggestion how to upgrade it manually without breaking the Lambda Stack - ensuring no compatibility issues in future upgrades of the Lambda Stack?

Many thanks in advance!

Regards.

sabalaba · November 13, 2020, 5:45pm

Lambda Stack with CuDNN 8 is coming shortly.

In the mean time, you should be able to link to CUDNN with something like modifying your LD_LIBRARY_PATH to include a path to the libcudnn.so.8 files.

subhrajeitbhowmick · November 14, 2020, 9:54pm

Many thanks @sabalaba for your kind response!

I downloaded cuDNN 8 for Ubuntu 20.04 and added the LD_LIBRARY_PATH as per your suggestion.

However, while testing for Tensorflow’s GPU association, I received an error stating, “Could not load dynamic library ‘libcusolver.so.10’; dlerror: libcusolver.so.10: cannot open shared object file: No such file or directory”.

This might be resolved by completely removing all installed CUDA files and a fresh install of CUDA, but I fear, this would lead to an incongruity in the existing Lambda Stack.

Note: I even tried with Tensorflow 2.4.0-rc1 whose pip packages are now built with CUDA11 and cuDNN 8.0.2.

Would you kindly be able to suggest a solution here?

perreiradasilva-m · November 18, 2020, 11:53am

Hello to all,

Just to add more details @sabalaba (because I faced the same problem), it seems there is however a problem with the current lambdastack version. On a fresh install, any pytorch call to code that uses cudnn throws an error :
“Could not load library libcudnn_cnn_train.so.8. Error: libcudnn_ops_train.so.8: cannot open shared object file: No such file or directory
Please make sure libcudnn_cnn_train.so.8 is in your library path!”

The only way to get things working is to set LD_LIBRARY_PATH accordingly:
export LD_LIBRARY_PATH=/usr/lib/python3/dist-packages/torch/lib/

But is seem not to be that lambda stack way.

I understand that cudnn 8 is not yet supported but in that case why does a fresh install try to use cudnn 8 by default ?

sabalaba · December 1, 2020, 6:12pm

Is there a reason that you’re using a pip installed version of pytorch instead of the default Lambda Stack pytorch?

The default Lambda stack pytorch has cudnn built in and doesn’t throw that error.

>>> import torch
>>> torch.__path__
['/usr/lib/python3/dist-packages/torch']
>>> torch.__version__
'1.6.0'

perreiradasilva-m · December 2, 2020, 7:23am

`Hello,
No, not using any pip installed version of pytorch. Just Ubuntu 20.04 and a fresh install of lambda stack (nothing else added appart from jupyterhub). In that configuration I must export torch location in LD_LIBRARY_PATH. Otherwise I’ve got the reported error about libcudnn_cnn_train.so.8 not found :-(.
Pytorch it self works fine, just the cudnn related part that fails. Reported torch version is however 1.7.0.
>>> import torch
>>> torch.path
[‘/usr/lib/python3/dist-packages/torch’]
>>> torch.version
‘1.7.0’

willkaes · December 6, 2020, 7:47am

Good information thanks for sharing
vmware

mikey · December 30, 2020, 8:26pm

I solved this problem by running this in python:

import torch
torch.__path__
>>> ['/some/path/to/torch']

Then in terminal:

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/some/path/to/torch

suvrat · April 23, 2021, 6:45am

Any updates on when it will be supported by default?

Topic		Replies	Views
Updated Lambda Stack and now have a PyTorch CuDNN version mismatch Error [resolved] Technical Help	5	3769	December 26, 2018
Fresh 20.04.1 LTS + Lambda Stack - libcudnn_adv_train.so.8 Technical Help	3	2782	April 23, 2021
Could not load library libcudnn_adv_train.so.8 error on lambda workstation Technical Help	5	7018	September 15, 2022
CUDA libraries path with Lambda Stack on Tensorbook	2	2361	December 5, 2020
Updating Stack on GPU Cloud to latest versions Technical Help	1	1102	April 30, 2023

Lambda Stack - CUDNN 8 Upgrade Query

Related topics