Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.moorcheh.ai/llms.txt

Use this file to discover all available pages before exploring further.

Moorcheh On-Prem Changelog

Track releases of the self-hosted Moorcheh stack: moorcheh-client on PyPI, the moorcheh/server Docker image, REST API, moorcheh CLI, and MoorchehApiClient.
Version 0.1.0
Released May 22, 2026

Initial release

First public release of Moorcheh On-Prem - semantic search and vector storage that runs locally with Docker and Ollama. No cloud API keys required.

Runtime

  • moorcheh up / down / status - Start and stop the Moorcheh API server (moorcheh/server:latest) via Docker Compose
  • Ollama integration - Host Ollama or bundled moorcheh-ollama container for embeddings (default model: nomic-embed-text)
  • Local data - Persistent storage under ~/.moorcheh/data

REST API

  • Health - Server status, embedding model, and global item quota (GET /health)
  • Namespaces - Create, list, and async delete text or vector namespaces
  • Data - Async upload of documents (text) and precomputed vectors, with job polling
  • Items - Get and delete items by id (up to 100 ids per request)
  • Search - Semantic search with text or vector queries across one or more namespaces

CLI and Python client

  • moorcheh-client on PyPI - Installs the moorcheh CLI and MoorchehApiClient in one package
  • Full parity - CLI and Python SDK cover the same API surface (namespaces, uploads, items, search)

Limits

  • Unlimited namespaces - no cloud-tier namespace count limit; only the global item cap applies
  • 100,000 items global cap across all namespaces
  • Item ids unique per namespace
Getting started: See Introduction, Prerequisites, and Quickstart.