429Any provider· Rate limits

429 Too Many Requests — what it means & how to fix

429 Too Many Requests

Quick answer

HTTP 429 means the API is rejecting requests because you’ve exceeded a limit — usually requests-per-minute or tokens-per-minute, sometimes quota. The fix is to back off, retry, and reduce burst load.

What causes it

Too many requests or tokens per minute for your tier.
Parallelism or retry storms multiplying your call rate.
Shared keys: multiple apps or teammates using the same key at once.

How to fix it

Implement exponential backoff with jitter; honor Retry-After.
Cap concurrency and queue non-urgent calls.
Use separate keys per app so one workload can’t starve another.
Upgrade your tier for higher limits.

The permanent fix

Stop hitting this entirely

Standard Compute doesn’t throw 429 for load — it degrades gracefully (slows and batches) instead of failing the request, so a bursty client keeps working.

Get a free API key →How it connects →

FAQ

Should I just retry on 429?

Retry with exponential backoff, not immediately — hammering the endpoint makes it worse. Respect the Retry-After header when the provider sends one.

429 Too Many Requests — what it means & how to fix

What causes it

How to fix it

Stop hitting this entirely

FAQ

Related errors