How to run Llama-2-13B on a single GPU?

seb_wilkes · August 26, 2023, 1:18pm

Hi everyone, I’m a real novice to using LLMs.

I know that Lambda Labs has provided a script to run Llama with multiple GPUs. For context, this is because the models larger than 7B specify a model-parallel (MP) value greater than 1. However, from what I have read online, it seems that a single A6000 has enough memory to run the 13B model.

Is there any advice on getting a 13B model to work on a single GPU, rather than spreading it across several GPUs?
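
For reference, here is a rough back-of-the-envelope check of the memory claim. It is only a sketch under stated assumptions: roughly 13 billion parameters, a nominal 48 GB on the A6000, and weights only (activations, the KV cache, and framework overhead are ignored, so real usage will be higher). The function name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(n_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the model weights alone, in decimal GB."""
    return n_params * bytes_per_param / 1e9


# Assumptions (not from the original post): ~13e9 parameters, 48 GB card.
N_PARAMS = 13e9
GPU_MEMORY_GB = 48

for name, nbytes in [("fp32", 4), ("fp16", 2), ("int8", 1)]:
    need = weight_memory_gb(N_PARAMS, nbytes)
    verdict = "fits" if need < GPU_MEMORY_GB else "does not fit"
    print(f"{name}: {need:.0f} GB of weights, {verdict} in {GPU_MEMORY_GB} GB")
```

Under these assumptions the weights need about 26 GB in 16-bit precision, which leaves headroom on a 48 GB card, while 32-bit weights (about 52 GB) would not fit.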
