Error Codes

HTTP status codes

Code  Meaning
200   Success
400   Bad request: validation error, missing version, or malformed payload
401   Unauthorized: missing or invalid API key
403   Forbidden: API key lacks required scope
404   Not found: job ID or model not found
413   Payload too large: body exceeds 1 MB
429   Too many requests: rate limit exceeded
500   Internal server error: inference failure or timeout
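
A client usually branches on these codes to decide whether a request is worth retrying. The sketch below shows one way to do that in Python. The retry policy (retry 429 and 500, treat every other listed code as final) and the names STATUS_MEANINGS, RETRYABLE, should_retry, and describe are illustrative assumptions, not part of the API.

```python
# Hypothetical client-side helper. The retry policy is an assumption,
# not documented API behavior.
STATUS_MEANINGS = {
    200: "Success",
    400: "Bad request",
    401: "Unauthorized",
    403: "Forbidden",
    404: "Not found",
    413: "Payload too large",
    429: "Too many requests",
    500: "Internal server error",
}

# Assumed policy: rate limiting and server-side failures may succeed later;
# the remaining 4xx codes need the caller to change the request first.
RETRYABLE = frozenset({429, 500})


def should_retry(status_code: int) -> bool:
    """Return True if a request that got this status may be retried as-is."""
    return status_code in RETRYABLE


def describe(status_code: int) -> str:
    """Return the documented meaning, or a fallback for codes not listed."""
    return STATUS_MEANINGS.get(status_code, "Undocumented status")
```

In practice a retry on 429 would also wait before the next attempt; how long to wait depends on the rate-limit configuration, which is not covered here.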

Inference error types

These values appear as the error_type label in inference_errors_total{error_type=...} and in the error_types field of jobs.

Error type       Cause
model_not_found  No pipeline registered for (model, version)
timeout          Executor timed out (see SLA config)
inference_error  Any other exception during pipeline execution
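
The three types read as a classification rule: two specific failures, then a catch-all. The following is a minimal sketch of that rule, not the service's actual code. ModelNotFoundError is a hypothetical exception class, and Python's built-in TimeoutError stands in for whatever the executor raises on timeout.

```python
class ModelNotFoundError(Exception):
    """Hypothetical exception: no pipeline registered for (model, version)."""


def classify_error(exc: Exception) -> str:
    """Map an exception to one of the three documented error_type values."""
    if isinstance(exc, ModelNotFoundError):
        return "model_not_found"
    if isinstance(exc, TimeoutError):
        # Stand-in for the executor timeout; the real exception type is assumed.
        return "timeout"
    # Catch-all: any other exception during pipeline execution.
    return "inference_error"
```

Checking the specific cases first keeps the catch-all from swallowing them, so the order of the checks matters.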

Job statuses

Status     Meaning
created    Record created
pending    Queued, not yet picked up
running    Executing
succeeded  Complete: result populated
failed     Error: error_message populated
cancelled  Cancelled before execution
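
When polling a job, it helps to know which statuses are final. The sketch below treats succeeded, failed, and cancelled as terminal. That reading follows from the table but is an assumption, and JobStatus, TERMINAL, and is_terminal are hypothetical names.

```python
from enum import Enum


class JobStatus(str, Enum):
    """The six documented job statuses."""
    CREATED = "created"
    PENDING = "pending"
    RUNNING = "running"
    SUCCEEDED = "succeeded"
    FAILED = "failed"
    CANCELLED = "cancelled"


# Assumption: a job never leaves these statuses once it reaches one.
TERMINAL = frozenset(
    {JobStatus.SUCCEEDED, JobStatus.FAILED, JobStatus.CANCELLED}
)


def is_terminal(status: str) -> bool:
    """Return True if the status is final; raise ValueError if unknown."""
    return JobStatus(status) in TERMINAL
```

A polling loop would stop once is_terminal returns True, then read the result for succeeded or error_message for failed.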