Are there any limits to the Langfuse API?
While the Langfuse API is extremely open and flexible, there are some limits to ensure the stability and performance of the platform.
Langfuse Cloud
If you run into unexpected limits or need to increase your limits, please reach out to us via the chat widget.
Payloads
5MB per request and 5MB per response
Rate Limits
Resource | Hobby/Pro | Team |
---|---|---|
Tracing. Batched /ingestion endpoint used by SDKs to ingest traces. | 1000 batches/min | 5000 batches/min |
Tracing (legacy). Deprecated POST APIs used by V1 SDKs. Support for these APIs will be removed in Langfuse V3. | 400 req/min | 400 req/min |
Prompts. GET APIs used to fetch prompts for use in applications. | No limit | No limit |
Metrics. GET APIs used to fetch analytics data, e.g. daily metrics api. | 10 req/min | 10 req/min |
API. All other APIs. | 1000 req/min | 1000 req/min |
Rate limit response
- HTTP status code 429
Retry-After
response header with the number of seconds to wait before retrying
Self-hosted instances
No hard limits, depends on your infrastructure capacity and load-balancer configuration.