Rate limit reached for requests
You’re sending more requests per minute (RPM) than your OpenAI tier allows. OpenAI returns HTTP 429 and rejects the request until the per-minute window resets. It’s a throughput cap, not a billing problem.
Agents are the classic cause: heartbeats, retries, and parallel tool calls each count as a request, so they add up fast and can burn the RPM budget before you’ve sent a single message yourself.
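One way to keep an agent under the cap is to throttle on the client side. Here is a minimal sketch of a sliding-window limiter, assuming a single-threaded agent loop. The class name, the `max_requests` value, and the injectable `clock` and `sleep` parameters are all hypothetical; set the limit to whatever your tier actually allows.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Hypothetical client-side throttle: at most max_requests per window.

    Not thread-safe; intended for a single-threaded agent loop.
    """

    def __init__(self, max_requests, window_seconds=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_requests = max_requests
        self.window = window_seconds
        self.clock = clock
        self.sleep = sleep
        self.sent = deque()  # timestamps of requests inside the current window

    def acquire(self):
        """Block until one more request fits in the window, then record it."""
        while True:
            now = self.clock()
            # Forget requests that have aged out of the window.
            while self.sent and now - self.sent[0] >= self.window:
                self.sent.popleft()
            if len(self.sent) < self.max_requests:
                self.sent.append(now)
                return
            # Window is full: wait until the oldest request expires.
            self.sleep(self.window - (now - self.sent[0]))
```

Call `acquire()` before every outgoing request, including heartbeats and retries, so that all of them draw from the same budget.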
Standard Compute has no per-minute request limit. Under sustained heavy load it gracefully slows and batches instead of returning 429s, so an agent that bursts requests keeps running rather than erroring out.
This is not a quota or billing problem. It’s a per-minute throughput cap (RPM), so you can have plenty of credit and still hit it. The billing error is the one that reads “You exceeded your current quota”.
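If your code needs to tell the two cases apart, a rough sketch is to match on the error message text. The function name and return labels below are hypothetical, and the check relies only on the two message phrases quoted on this page. Exact wording can change, so treat this as a heuristic rather than a stable contract.

```python
def classify_429(message: str) -> str:
    """Heuristically label a 429 error message (hypothetical helper).

    Returns "quota" for the billing error, "rate_limit" for the
    per-minute throughput cap, and "unknown" for anything else.
    """
    text = message.lower()
    if "exceeded your current quota" in text:
        return "quota"       # billing problem: retrying will not help
    if "rate limit reached" in text:
        return "rate_limit"  # throughput cap: back off and retry
    return "unknown"
```

Only the "rate_limit" case is worth retrying; a "quota" result needs a billing fix instead.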
The per-minute window resets continuously, so a 429 usually clears within seconds to a minute. Back off and retry rather than hammering the endpoint.
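A minimal sketch of backing off and retrying, using exponential backoff with jitter. `RateLimitError` here is a hypothetical stand-in for whatever exception your HTTP client raises on a 429, `send` is any zero-argument function that makes the request, and the delay values are illustrative assumptions rather than recommended settings.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for the exception your client raises on HTTP 429."""


def call_with_backoff(send, max_attempts=5, base_delay=1.0,
                      max_delay=30.0, sleep=time.sleep):
    """Call send(); on RateLimitError, wait with exponential backoff and retry."""
    for attempt in range(max_attempts):
        try:
            return send()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            # The upper bound doubles each attempt (1s, 2s, 4s, ...) up to max_delay.
            cap = min(max_delay, base_delay * 2 ** attempt)
            # Random wait in [0, cap] so parallel workers do not retry in lockstep.
            sleep(random.uniform(0, cap))
```

The jitter matters for agents in particular: if several parallel tool calls hit the cap at once, a fixed delay would have them all retry at the same moment and trip the limit again.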