--base-url | API base URL (default http://localhost:8080) |
--query | Required. User question |
--namespace | Text namespace for RAG; default "" (direct LLM) |
--top-k | Number of chunks to retrieve |
--temperature | LLM temperature (0.0–2.0) |
--ai-model | Override configured LLM model |
--header-prompt | System instruction |
--footer-prompt | Trailing instruction |
--threshold | Relevance threshold (with --kiosk-mode) |
--kiosk-mode | Filter chunks below threshold |
--chat-history-json | JSON array of prior turns |
--structured | Enable default structured JSON schema |