Rate limits help ensure fair usage and optimal performance for all users. Each subscription plan has specific limits for queued API requests.
Request Queue Limits
Our API uses request queue limits to manage server load and ensure optimal performance. The limits vary based on your subscription plan:Pay as you go plan
5 queued API requestsPerfect for individual developers and small projects getting started with our APIs.
Standard plan
10 queued API requestsIdeal for growing businesses and applications with moderate usage requirements.
Unlimited Premium Plan
15 queued API requestsDesigned for enterprise applications and high-volume usage scenarios.
How Request Queuing Works
Request queuing ensures that API calls are processed sequentially in a controlled manner. Here’s what you need to know:Understanding the Limits
- Sequential Processing: Requests are processed one after another in queue order
- Queue Management: New requests are added to the queue and processed when previous ones complete
- Per Account: Limits apply to your entire account, not per API endpoint
- Real-time: The limit is enforced in real-time as requests come in
Example Scenarios
Pay as you go (5 queued requests)
Pay as you go (5 queued requests)
If you have 5 requests in the queue and try to make a 6th request, it will be rejected until one of the queued requests completes and is processed.
Standard plan (10 queued requests)
Standard plan (10 queued requests)
You can have up to 10 requests in the queue. Any additional requests will be rejected until queue space becomes available.
Premium plan (15 queued requests)
Premium plan (15 queued requests)
Request Queuing
When you reach your queue limit:- Queue Full: Additional requests are rejected with a rate limit error
- Sequential Processing: Requests are processed one after another in queue order
- FIFO Order: Requests are processed in First-In-First-Out order
- Automatic Processing: Queued requests are automatically processed as previous ones complete
Best Practices
Optimizing for Rate Limits
1
Monitor Your Queue
Keep track of your queued requests to avoid hitting limits unexpectedly.
2
Implement Retry Logic
Add exponential backoff retry logic for better handling of rate limit scenarios.
3
Batch Operations
When possible, combine multiple operations into single requests to reduce queue usage.
4
Upgrade When Needed
Consider upgrading your plan if you consistently hit queue limits.
Error Handling
When you hit rate limits, you’ll receive an HTTP 429 status code with details about the limit:json
Upgrading Your Plan
If you need higher queue limits:- Log in to your ModelsLab account
- Navigate to the billing section
- Select a higher tier plan
- New limits take effect immediately
Need more than 15 queued requests? Contact our enterprise team for custom solutions tailored to your specific needs.
Frequently Asked Questions
What happens if I exceed my queue limit?
What happens if I exceed my queue limit?
Additional requests are rejected with a rate limit error. You’ll need to wait for queued requests to complete before making new ones.
Do rate limits reset daily?
Do rate limits reset daily?
No, queue limits are enforced in real-time, not on a daily basis. They’re based on current queue size, not total requests per day.
Are there different limits for different APIs?
Are there different limits for different APIs?
No, the queue limit applies to your entire account across all API endpoints.
Support
Need help with rate limits or want to discuss custom solutions?- Documentation: Check our API Reference for detailed endpoint information
- Support: Contact us at support@modelslab.com
- Discord: Join our Discord community for real-time help