Rate Limit Overview
API requests are rate limited to ensure fair usage and system stability. Limits are enforced at two levels: per IP address and per user account.
Current Limits
| Dimension | Limit |
|---|---|
| Per IP address | 60 requests / minute |
| Per user account | 600 requests / minute |
When either limit is exceeded, the API returns HTTP status 429 Too Many Requests. Limits may vary based on your plan; contact support for higher limits.
Rate Limit Exceeded Response
When a rate limit is exceeded, the API responds with HTTP status 429 Too Many Requests.
Handling Rate Limits
Implement Exponential Backoff
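The following is a minimal sketch of exponential backoff with "full jitter", not the API's official client code. `RateLimitError` and the `request` callable are hypothetical stand-ins for whatever your HTTP layer raises and calls when it sees a 429 response. The `base`, `cap`, and `max_retries` values are illustrative assumptions, not values documented by the API.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical error raised by the caller's HTTP code on a 429 response."""


def backoff_delay(attempt, base=1.0, cap=60.0, rng=random.random):
    """Return a 'full jitter' delay in seconds for a zero-based retry attempt."""
    # Exponential ceiling: base, 2*base, 4*base, ... capped at `cap`.
    ceiling = min(cap, base * (2 ** attempt))
    # Full jitter: scale the ceiling by a random factor in [0, 1) so that
    # many clients do not retry in lockstep.
    return rng() * ceiling


def call_with_backoff(request, max_retries=5, sleep=time.sleep, **delay_kwargs):
    """Call `request()` and retry on RateLimitError with exponential backoff."""
    for attempt in range(max_retries + 1):
        try:
            return request()
        except RateLimitError:
            if attempt == max_retries:
                raise  # retries exhausted; surface the error to the caller
            sleep(backoff_delay(attempt, **delay_kwargs))
```

Wrap each API call in a zero-argument function that raises `RateLimitError` on status 429 and pass it to `call_with_backoff`. If the 429 response turns out to carry a standard `Retry-After` header (this page does not confirm that it does), waiting for that interval is usually preferable to a computed delay.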
Best Practices
- Implement exponential backoff with jitter for retries
- Cache responses when possible to reduce redundant requests
- Batch operations where supported (e.g. batch document removal)
- Implement client-side request queuing to smooth out traffic spikes
- Monitor request volume and log 429 responses for observability
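Client-side request queuing can be sketched with a token bucket, one of several possible approaches. The class name `RequestThrottle` and its parameters are hypothetical. The defaults (50 requests per minute plus a burst of 10) are an assumption chosen so that no 60-second window exceeds the per-IP limit of 60 requests listed above.

```python
import threading
import time


class RequestThrottle:
    """Token bucket that blocks callers so requests stay under a per-minute limit."""

    def __init__(self, per_minute=50, burst=10, clock=time.monotonic, sleep=time.sleep):
        self.rate = per_minute / 60.0   # tokens refilled per second
        self.capacity = float(burst)    # largest burst allowed at once
        self.tokens = float(burst)
        self.clock = clock
        self.sleep = sleep
        self.last = clock()
        self.lock = threading.Lock()

    def acquire(self):
        """Block until one request token is available, then consume it."""
        # The lock is held while sleeping, so concurrent callers queue up
        # behind each other instead of all waking at once.
        with self.lock:
            now = self.clock()
            # Refill in proportion to elapsed time, never above capacity.
            self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
            self.last = now
            if self.tokens < 1.0:
                # Wait just long enough for the missing fraction of a token.
                self.sleep((1.0 - self.tokens) / self.rate)
                self.last = self.clock()
                self.tokens = 1.0
            self.tokens -= 1.0
```

Call `throttle.acquire()` immediately before each API request. A throttle like this reduces 429 responses but does not replace retry handling, since other clients behind the same IP address or on the same account also count toward the limits.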
Storage Quotas
| Resource | Limit |
|---|---|
| Total storage | 100 GB (varies by plan) |
| Max file size | 500 MB |
| Projects | Unlimited |
| Memories per project | Unlimited |
| API keys per project | 10 |
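A client can check a file against the maximum file size before uploading, which avoids spending a request on an upload that may be rejected. The sketch below assumes that "500 MB" means 500 × 10^6 bytes; the table does not say whether decimal or binary megabytes apply, so adjust the constant if needed. The function names are hypothetical.

```python
import os

# Assumption: decimal megabytes. Use 500 * 1024 * 1024 if the API counts in MiB.
MAX_FILE_SIZE_BYTES = 500 * 1000 * 1000


def exceeds_max_file_size(size_bytes, limit=MAX_FILE_SIZE_BYTES):
    """Return True if a file of `size_bytes` is larger than the limit."""
    return size_bytes > limit


def check_upload(path, limit=MAX_FILE_SIZE_BYTES):
    """Raise ValueError if the file at `path` is too large; otherwise return its size."""
    size = os.path.getsize(path)
    if exceeds_max_file_size(size, limit):
        raise ValueError(f"{path} is {size} bytes, over the {limit}-byte limit")
    return size
```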
Requesting Higher Limits
To request higher rate limits, contact support with:
- Current usage patterns and peak request volume
- Expected future usage
- Use case description
Next Steps
- API Overview: return to the API overview
- Error Handling: handle rate limit errors