CLI Reference

Commands

Command                              Description
inference-engine deploy <artifact>   Deploy a trained artifact
inference-engine fix <model-dir>     Fix a broken pipeline definition

deploy

inference-engine deploy <artifact> [options]

Flag            Default                 Description
--name          derived from filename   Model name
--version       auto-incremented        Version string
--device        cpu                     cpu or gpu
--routing       static                  static, canary, or ab
--sample-input  prompted                Sample input for validation
--dry-run       off                     Validate but write nothing
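
For scripted deployments, the flags above can be assembled programmatically. The following is a minimal sketch, not part of inference-engine itself: build_deploy_command is a hypothetical helper, model.pkl and churn are placeholder values, and it assumes that each flag takes its value as a separate argument and that --dry-run is a plain switch with no value. Only the flag names and the allowed values for --device and --routing come from the table.

```python
import shlex

# Allowed values, taken from the flag table above.
DEVICES = {"cpu", "gpu"}
ROUTINGS = {"static", "canary", "ab"}


def build_deploy_command(artifact, name=None, version=None, device=None,
                         routing=None, sample_input=None, dry_run=False):
    """Return the argument list for `inference-engine deploy`.

    Hypothetical helper. Options left as None are omitted so that the CLI
    applies its own documented defaults.
    """
    if device is not None and device not in DEVICES:
        raise ValueError(f"--device must be one of {sorted(DEVICES)}, got {device!r}")
    if routing is not None and routing not in ROUTINGS:
        raise ValueError(f"--routing must be one of {sorted(ROUTINGS)}, got {routing!r}")

    argv = ["inference-engine", "deploy", artifact]
    options = (
        ("--name", name),
        ("--version", version),
        ("--device", device),
        ("--routing", routing),
        ("--sample-input", sample_input),
    )
    for flag, value in options:
        if value is not None:
            # Assumption: the value follows the flag as its own argument.
            argv += [flag, str(value)]
    if dry_run:
        # Assumption: --dry-run is a boolean switch.
        argv.append("--dry-run")
    return argv


# Placeholder artifact and model name, for illustration only.
cmd = build_deploy_command("model.pkl", name="churn", device="gpu",
                           routing="canary", dry_run=True)
print(shlex.join(cmd))
# inference-engine deploy model.pkl --name churn --device gpu --routing canary --dry-run
```

The resulting list can be passed to subprocess.run(cmd, check=True). Starting with dry_run=True is a reasonable first step, since the table describes --dry-run as validating without writing anything.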

fix

inference-engine fix <model-dir>

Interactive only. Prompts for sample input and confirmation before writing.


Environment variables

Variable                    Required  Description
GROQ_API_KEY                Yes       Groq API key for LLM generation
INFERENCE_ENGINE_LLM_MODEL  No        Override LLM model (default: llama-3.3-70b-versatile)
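
A wrapper script can check these variables before invoking the CLI. The sketch below mirrors the documented rules (GROQ_API_KEY required, INFERENCE_ENGINE_LLM_MODEL optional with the stated default). It is not the tool's actual implementation: load_llm_settings is a hypothetical name, and treating an empty value as unset is an assumption.

```python
import os

# Default model name as documented in the table above.
DEFAULT_LLM_MODEL = "llama-3.3-70b-versatile"


def load_llm_settings(env=None):
    """Resolve LLM settings from environment variables.

    Hypothetical pre-flight check. `env` defaults to os.environ and can be
    replaced with a plain dict for testing.
    """
    env = os.environ if env is None else env

    # Assumption: a blank or whitespace-only key counts as missing.
    api_key = env.get("GROQ_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("GROQ_API_KEY is required but not set")

    # Assumption: an empty override falls back to the documented default.
    model = env.get("INFERENCE_ENGINE_LLM_MODEL") or DEFAULT_LLM_MODEL
    return {"api_key": api_key, "model": model}


# Example with an explicit mapping instead of the real environment.
settings = load_llm_settings({"GROQ_API_KEY": "example-key"})
print(settings["model"])  # llama-3.3-70b-versatile
```

Calling load_llm_settings() with no argument reads the real environment and fails early with a clear message if the key is missing.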