Rate Limiting and Throtling
Understand and manage rate limiting controls to ensure optimal integration with the iQ Suite Platform.
Overview
To maintain optimal system performance and ensure fair usage, the iQ Suite Platform enforces rate limiting on various SDK function calls. Rate limiting controls the number of requests your application can make within a specific time frame, preventing server overload and ensuring consistent service quality for all users.
Small update from us!
We love that you're supporting us in our journey to become the developer-first platform for AI. We're constantly improving our platform and documentation to make your experience better. And thence with further growth, we'll be able to expand these limitations and make them more flexible.
If you have any feedback or suggestions, please feel free to reach out to us at iQ Support.
Rate Limiting Details
Enforced Rate Limits per Endpoint
Endpoint | Rate Limit |
---|---|
create_index_with_polling | 10 requests per minute |
add_document_with_polling | 10 requests per minute |
retrieve | 50 requests per minute |
search | 50 requests per minute |
webhook_create | 5 requests per minute |
webhook_list | 20 requests per minute |
webhook_update | 10 requests per minute |
webhook_delete | 10 requests per minute |
Handling Rate Limits
When your application exceeds the allowed rate limits, the API will respond with a 429 Too Many Requests
status code. To handle this gracefully and ensure uninterrupted service, implement retry logic with exponential backoff.