OpenAI API 429 Too Many Requests (rate limit / quota)
Is there an outage right now?
OpenAI API official status is operational. If this error persists, it is likely an implementation or environment issue on your side.
View OpenAI API status →What this error means
429 means too many requests: you exceeded RPM/TPM limits, or ran out of balance/usage quota.
Common causes
- •Too many requests in a short time (RPM/TPM exceeded)
- •Out of credit balance or hit usage limits
- •Sudden burst / too much concurrency
- •Low rate limits on your tier
- •Temporary demand spike on OpenAI side
How to fix
- 1.Retry with exponential backoff (honor the Retry-After header)
- 2.Lower request rate and concurrency
- 3.Use the Batch API / caching to cut call volume
- 4.Raise balance / usage limits in Billing
- 5.Check your tier and model rate limits
- 6.Check the OpenAI-side status at the top of this page
When waiting helps (and when it won't)
If it is OpenAI-side demand, it may clear in minutes. If it is your own quota, waiting will not help—reduce rate or raise limits.
Alternatives when OpenAI API is down
Related errors
FAQ
What is 429 Too Many Requests?
429 means too many requests: you exceeded RPM/TPM limits, or ran out of balance/usage quota.
How do I fix 429 Too Many Requests?
Retry with exponential backoff (honor the Retry-After header) / Lower request rate and concurrency / Use the Batch API / caching to cut call volume / Raise balance / usage limits in Billing / Check your tier and model rate limits / Check the OpenAI-side status at the top of this page