I’m attempting to run the imitation learning library Atari-Reset from OpenAI on GitHub.
This repo requires Horovod with GPU support and CUDA-aware Open MPI. Would it be possible for you to write an installation guide for this stack? It gets confusing when the Open MPI build asks for the locations of the ucx-cuda and CUDA installations (see ‘Building CUDA-aware Open MPI’).
I have been able to install the libraries with CPU support, but I want to use the GPU compute power of my Lambda Quad.
That would be an even better outcome! These tools are particularly valuable to your customers who are looking at clusters of Lambda machines for reinforcement / imitation learning tasks. I look forward to future updates to the Lambda Stack.