Have been using the inference APIs a fair bit and seem to be getting 524 status code responses more and more often. A quick Google search suggests this might be a Cloudflare response (the gateway timing out waiting on the origin).
Is this just me, or is it more common?
Edit: Should have added that I'm mostly using the llama3.3-70b-instruct-fp8
endpoint. If I try to run multiple (approx. 5) concurrent queries, that seems to trigger it, or at least increases the chances of it occurring. Haven't had a chance to check whether different models have the same issue.
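Not a fix, but in case it helps others hitting this: assuming the 524s really are gateway timeouts under concurrent load, a workaround is to cap in-flight requests below the level that triggers them and retry with backoff. This is just a sketch; `send_query` is a stand-in for whatever HTTP call you make to the endpoint, and the concurrency cap of 2 is a guess given that ~5 concurrent queries seemed to be the trigger.

```python
import concurrent.futures
import time

MAX_CONCURRENCY = 2  # guess: stay well below the ~5 that seemed to trigger 524s


def send_query(prompt):
    # Stand-in for the real HTTP call to the inference endpoint;
    # simulates a short round trip so the sketch is runnable as-is.
    time.sleep(0.01)
    return f"response to: {prompt}"


def run_with_retries(prompt, retries=3, backoff=1.0):
    """Retry with exponential backoff; TimeoutError stands in for a 524."""
    for attempt in range(retries):
        try:
            return send_query(prompt)
        except TimeoutError:
            time.sleep(backoff * 2 ** attempt)
    raise RuntimeError("giving up after repeated 524s")


prompts = [f"query {i}" for i in range(5)]
# The thread pool caps how many requests are in flight at once.
with concurrent.futures.ThreadPoolExecutor(max_workers=MAX_CONCURRENCY) as pool:
    results = list(pool.map(run_with_retries, prompts))
```

With a real client you'd raise `TimeoutError` (or check the status code) when the response comes back 524, but the queueing and backoff structure stays the same.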