429 · Any provider · Rate limits

429 Too Many Requests — what it means & how to fix

Quick answer

HTTP 429 means the API is rejecting requests because you’ve exceeded a limit: usually requests per minute or tokens per minute, sometimes a usage quota. The fix is to back off before retrying and to reduce burst load.

What causes it

How to fix it

  1. Implement exponential backoff with jitter; honor Retry-After.
  2. Cap concurrency and queue non-urgent calls.
  3. Use separate keys per app so one workload can’t starve another.
  4. Upgrade your tier for higher limits.
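
Steps 1 and 2 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not any provider's official client: the names `call_with_backoff`, `call_limited`, and `backoff_delay` are hypothetical, `send` stands for whatever function performs one request and returns a `(status, headers, body)` tuple, and the defaults (5 retries, 1 second base, 60 second cap) are placeholders to tune against your provider's documented limits.

```python
import random
import threading
import time
from typing import Callable, Mapping, Tuple

# Hypothetical response shape for this sketch: (status, headers, body).
Response = Tuple[int, Mapping[str, str], str]


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 60.0) -> float:
    """Full jitter: pick a random delay in [0, min(cap, base * 2**attempt)]."""
    return random.uniform(0.0, min(cap, base * (2 ** attempt)))


def call_with_backoff(
    send: Callable[[], Response],
    max_retries: int = 5,
    base: float = 1.0,
    cap: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call send(); on 429, wait and retry up to max_retries times."""
    response = send()
    for attempt in range(max_retries):
        status, headers, _body = response
        if status != 429:
            break
        # Prefer the server's hint when it is a plain number of seconds.
        # Assumes header keys are spelled exactly "Retry-After".
        hint = headers.get("Retry-After", "").strip()
        delay = float(hint) if hint.isdigit() else backoff_delay(attempt, base, cap)
        sleep(delay)
        response = send()
    return response


def call_limited(send, slots: threading.BoundedSemaphore, **kwargs) -> Response:
    """Run call_with_backoff while holding one of a fixed number of slots."""
    with slots:
        return call_with_backoff(send, **kwargs)
```

Sharing one `threading.BoundedSemaphore(n)` across worker threads caps in-flight calls at `n`. Because a call keeps its slot while it sleeps, retrying requests still count against the cap, which keeps a burst of retries from piling onto the endpoint at once.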

The permanent fix

Stop hitting this entirely

Standard Compute doesn’t throw 429 for load — it degrades gracefully (slows and batches) instead of failing the request, so a bursty client keeps working.


FAQ

Should I just retry on 429?

Retry with exponential backoff, not immediately — hammering the endpoint makes it worse. Respect the Retry-After header when the provider sends one.
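
Respecting Retry-After takes a little parsing, because the HTTP specification allows the header to carry either a number of seconds or an HTTP date. The helper below is a rough sketch of one way to turn either form into a wait time using only the Python standard library; the function name `retry_after_seconds` and the choice to return `None` for a missing or unparseable value are assumptions made for illustration.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime
from typing import Optional


def retry_after_seconds(
    value: Optional[str], now: Optional[datetime] = None
) -> Optional[float]:
    """Convert a Retry-After header value into seconds to wait.

    Returns None when the header is missing or cannot be parsed, so the
    caller can fall back to its own exponential backoff.
    """
    if value is None:
        return None
    value = value.strip()
    if value.isdigit():
        # Delay-seconds form, e.g. "120".
        return float(value)
    try:
        # HTTP-date form, e.g. "Wed, 21 Oct 2015 07:28:00 GMT".
        when = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        return None
    if when.tzinfo is None:
        # Treat a date without an offset as UTC.
        when = when.replace(tzinfo=timezone.utc)
    current = now or datetime.now(timezone.utc)
    # A date in the past means "retry now", never a negative wait.
    return max(0.0, (when - current).total_seconds())
```

When this returns `None`, falling back to a jittered exponential delay keeps the client from retrying immediately.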

Related errors

OpenAI “Rate limit reached for requests” (429) · OpenAI · 429
Anthropic / Claude rate limit exceeded (429) · Anthropic (Claude) · 429
Why your AI agent keeps getting rate limited (and how to stop it) · Any agent