Overview

Text namespaces need an embedding provider to turn documents and search queries into vectors. Moorcheh On-Prem supports three providers, configured once and stored in ~/.moorcheh/config.json.
Provider | API key | Ollama / Docker | Best for
Ollama | Not required | Optional host install or bundled moorcheh-ollama | Fully local, no cloud
OpenAI | Required | Server container only | Hosted embeddings, familiar models
Cohere | Required | Server container only | Multilingual and v4 models
Vector namespaces (you upload precomputed vectors) do not use these providers for ingest or vector search. You only need a provider if you use text upload or text search.

Configure once

moorcheh configure
Or on first run:
moorcheh up
If no config exists, moorcheh up runs the same interactive wizard before starting Docker. Saved settings:
File | Contents
~/.moorcheh/config.json | provider, model, api_key (cloud only), base_url
~/.moorcheh/data/ | Namespaces and items (unchanged)
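As a rough illustration of the saved settings, the sketch below reads a config file and reports which of the documented fields are absent. The function name load_embedding_config is hypothetical, and the flat JSON object layout is an assumption inferred from the field list above, not a confirmed schema.

```python
import json
from pathlib import Path

# Field names come from the saved-settings table; a flat JSON object is assumed.
EXPECTED_FIELDS = ("provider", "model", "api_key", "base_url")


def load_embedding_config(path=Path.home() / ".moorcheh" / "config.json"):
    """Return (config, missing_fields) for a Moorcheh-style config file."""
    config = json.loads(Path(path).read_text(encoding="utf-8"))
    missing = [name for name in EXPECTED_FIELDS if name not in config]
    # api_key is documented as cloud only, so a missing key is expected for Ollama.
    if config.get("provider") == "ollama" and "api_key" in missing:
        missing.remove("api_key")
    return config, missing
```

Passing an explicit path keeps the helper usable against a copy of the file, which avoids touching the live config while experimenting.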
Re-run setup:
moorcheh configure --force
See moorcheh configure and moorcheh up.

Supported models and dimensions

Use the same model for all text namespaces on one server. When creating a vector namespace, set vector_dimension to the output dimension of the model that produced your vectors. Text namespaces infer their dimension from the first embedding.

Ollama (local)

Model ID | Dimensions | Notes
nomic-embed-text | 768 | Recommended default
mxbai-embed-large | 1024 | Higher quality, larger download
all-minilm | 384 | Smaller, faster
Moorcheh pulls the selected model automatically on moorcheh up when Ollama is reachable (host or bundled). You do not need a separate ollama pull unless you skipped the pull with --skip-ollama-model-pull.

OpenAI

Model ID | Dimensions | Notes
text-embedding-3-small | 1536 (default) | Recommended
text-embedding-3-large | 3072 (default) | Higher quality
text-embedding-ada-002 | 1536 | Legacy
API key is stored in config.json (file mode 600 on Unix). The server calls OpenAI from inside the container; outbound HTTPS is required.
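Because the file holds an API key, it can be worth confirming the permission bits yourself. The following is a minimal sketch for Unix systems; the helper name is_owner_only is hypothetical, and the check only compares against mode 600 as stated above.

```python
import os
import stat


def is_owner_only(path):
    """True if the file's permission bits are exactly 0o600 (read/write for owner only)."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    return mode == 0o600
```

Calling is_owner_only(os.path.expanduser("~/.moorcheh/config.json")) would report whether the stored key is readable by other users on the machine.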

Cohere

Model ID | Dimensions | Notes
embed-v4.0 | 1536 (fixed in Moorcheh) | Recommended; multimodal-capable API
embed-english-v3.0 | 1024 | English-optimized
embed-multilingual-v3.0 | 1024 | 100+ languages
API key required. Base URL default: https://api.cohere.com/v2.
Do not mix embedding models across text namespaces on the same instance. Existing namespaces were built with one vector size; changing the model without re-uploading will hurt search quality and fail dimension checks.
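To make the dimension constraint concrete, here is a small client-side sketch that checks precomputed vectors against the dimensions listed in the tables above before upload. The names MODEL_DIMENSIONS and check_vectors are hypothetical, and this is not a description of how the Moorcheh server performs its own checks.

```python
# Dimensions copied from the Ollama, OpenAI, and Cohere tables in this section.
MODEL_DIMENSIONS = {
    "nomic-embed-text": 768,
    "mxbai-embed-large": 1024,
    "all-minilm": 384,
    "text-embedding-3-small": 1536,
    "text-embedding-3-large": 3072,
    "text-embedding-ada-002": 1536,
    "embed-v4.0": 1536,
    "embed-english-v3.0": 1024,
    "embed-multilingual-v3.0": 1024,
}


def check_vectors(model, vectors):
    """Return the model's dimension, raising ValueError on any mismatched vector."""
    expected = MODEL_DIMENSIONS[model]
    for index, vector in enumerate(vectors):
        if len(vector) != expected:
            raise ValueError(
                f"vector {index} has {len(vector)} dimensions, "
                f"expected {expected} for {model}"
            )
    return expected
```

The returned value is a natural candidate for vector_dimension when creating a vector namespace for vectors produced by that model.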

What moorcheh up does by provider

Provider | Containers started | Embedding model pull
openai / cohere | moorcheh-onprem-server | N/A (cloud)
ollama (host running) | Server only | Auto-pull if missing
ollama (host down) | Server + moorcheh-ollama | Auto-pull after Ollama is ready
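The table can be read as a simple decision rule, sketched here. The function name containers_for is hypothetical, and the sketch assumes that "Server" in the table refers to the moorcheh-onprem-server container.

```python
def containers_for(provider, host_ollama_running=False):
    """Return the containers the table says `moorcheh up` starts for a provider."""
    if provider in ("openai", "cohere"):
        # Cloud providers need only the server container.
        return ["moorcheh-onprem-server"]
    if provider == "ollama":
        if host_ollama_running:
            # A reachable host Ollama is reused, so no bundled container.
            return ["moorcheh-onprem-server"]
        return ["moorcheh-onprem-server", "moorcheh-ollama"]
    raise ValueError(f"unknown provider: {provider}")
```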
Override flags: --embedding-provider, --embedding-model, --embedding-api-key, --bundled-ollama, --use-host-ollama. See moorcheh up.
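For scripted setups, the override flags can be assembled into an argument list, as in this sketch. Only the flag names come from this page. The "--flag value" syntax, the treatment of --bundled-ollama and --use-host-ollama as valueless switches, and their mutual exclusivity are all assumptions, and build_up_command is a hypothetical helper; check moorcheh up for the actual syntax.

```python
import shlex


def build_up_command(provider=None, model=None, api_key=None,
                     bundled_ollama=False, use_host_ollama=False):
    """Build an argv list for `moorcheh up` with optional override flags."""
    # Assumption: the two Ollama switches select opposite modes and cannot be combined.
    if bundled_ollama and use_host_ollama:
        raise ValueError("choose either bundled_ollama or use_host_ollama, not both")
    args = ["moorcheh", "up"]
    # Assumption: value flags take their value as the next argument.
    if provider:
        args += ["--embedding-provider", provider]
    if model:
        args += ["--embedding-model", model]
    if api_key:
        args += ["--embedding-api-key", api_key]
    if bundled_ollama:
        args.append("--bundled-ollama")
    if use_host_ollama:
        args.append("--use-host-ollama")
    return args


def as_shell(args):
    """Render the argv list as a single shell-quoted string for logging."""
    return shlex.join(args)
```

Returning a list rather than a string lets the result be passed directly to subprocess.run without shell quoting concerns. Note that an API key passed on the command line may be visible in process listings.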

Health check

GET /health and moorcheh status return:
{
  "status": "ok",
  "embedding_provider": "ollama",
  "model": "nomic-embed-text",
  "items": 0,
  "max_items": 100000,
  "remaining": 100000
}
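A monitoring script might consume this payload along the lines of the sketch below. The helper names are hypothetical, the server's base URL is not given on this page and must be supplied by the caller, and the field names are taken from the sample response above.

```python
import json
from urllib.request import urlopen


def summarize_health(payload):
    """Condense a /health payload into a one-line status string."""
    used = payload["items"]
    limit = payload["max_items"]
    return (
        f"{payload['status']}: {payload['embedding_provider']}/{payload['model']}, "
        f"{used}/{limit} items used, {payload['remaining']} remaining"
    )


def fetch_health(base_url, timeout=5.0):
    """GET <base_url>/health and return the decoded JSON body."""
    with urlopen(f"{base_url.rstrip('/')}/health", timeout=timeout) as response:
        return json.load(response)
```

Keeping the formatting separate from the network call means summarize_health can be exercised against a saved payload without a running server.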