Skip to content

CLI Flags Reference

deploy flags

Flag Default Description
--name derived from filename Model name
--version auto-incremented Version string (e.g. v1, v2)
--device cpu cpu or gpu
--routing static static, canary, or ab
--sample-input prompted interactively Sample input for pipeline validation
--dry-run off Run full flow including validation but write nothing

When all flags are provided, all interactive prompts are skipped (CI-safe).


fix flags

inference-engine fix <model-dir> takes no flags. It is interactive-only and requires a TTY.


Environment variables

Variable Required Description
GROQ_API_KEY Yes API key for LLM code generation
INFERENCE_ENGINE_LLM_MODEL No Override default model (llama-3.3-70b-versatile)

Variables are loaded automatically from .env in the project root.