What is LlamaIndex Integration?

The Moorcheh Vector Store integration with LlamaIndex provides a powerful way to build advanced retrieval systems, combining Moorcheh's lightning-fast semantic search with LlamaIndex's sophisticated document loading and processing capabilities.

Key Features

  • Document Processing: Advanced document loading, chunking, and processing with LlamaIndex
  • Vector Storage: Efficient vector storage and retrieval using Moorcheh's MIB technology
  • Query Engine: Powerful query engine with metadata filtering and advanced search
  • AI Integration: Seamless integration with various AI models for generative answers

Installation

Install the required packages:
pip install llama-index moorcheh-sdk

The examples below also assume LlamaIndex's Moorcheh vector store integration is available; it is commonly distributed as a separate package (llama-index-vector-stores-moorcheh).

Quick Start

Import Required Libraries

import logging
import os
import sys

from IPython.display import Markdown, display
from moorcheh_sdk import MoorchehClient

from llama_index.core import (
    VectorStoreIndex,
    SimpleDirectoryReader,
    StorageContext,
    Settings,
)
from llama_index.core.vector_stores.types import (
    MetadataFilter,
    MetadataFilters,
    FilterOperator,
    FilterCondition,
)

# MoorchehVectorStore is assumed to come from LlamaIndex's Moorcheh
# integration (llama-index-vector-stores-moorcheh)
from llama_index.vector_stores.moorcheh import MoorchehVectorStore

Configure Logging

# Logging setup (basicConfig already attaches a stdout handler, so no
# extra handler is needed)
logging.basicConfig(stream=sys.stdout, level=logging.INFO)

Load Moorcheh API Key

# Read the Moorcheh API key from Colab secrets
# (or set MOORCHEH_API_KEY in your shell before running)
from google.colab import userdata

os.environ["MOORCHEH_API_KEY"] = userdata.get("MOORCHEH_API_KEY")

api_key = os.environ.get("MOORCHEH_API_KEY")
if not api_key:
    raise EnvironmentError("Environment variable MOORCHEH_API_KEY is not set")

Document Processing

Load and Chunk Documents

# Load Documents
documents = SimpleDirectoryReader("./documents").load_data()

# Set chunk size and overlap
Settings.chunk_size = 1024
Settings.chunk_overlap = 20

Adjust chunk_size and chunk_overlap based on your document type and requirements. Larger chunks provide more context per retrieved node but may reduce retrieval precision.
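
To preview how a given setting will split your corpus before indexing, you can run LlamaIndex's default sentence splitter directly:

from llama_index.core.node_parser import SentenceSplitter

# Preview the chunking that the current settings would produce
splitter = SentenceSplitter(
    chunk_size=Settings.chunk_size,
    chunk_overlap=Settings.chunk_overlap,
)
nodes = splitter.get_nodes_from_documents(documents)
print(f"{len(documents)} documents -> {len(nodes)} chunks")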

Vector Store Setup

Initialize Vector Store

# Initialize the Moorcheh Vector Store
vector_store = MoorchehVectorStore(
    api_key=api_key,
    namespace="llamaindex_moorcheh",
    namespace_type="text",
    vector_dimension=None,
    add_sparse_vector=False,
    batch_size=100,
)

Configuration Parameters

  • namespace: Unique identifier for your vector store
  • namespace_type: “text” for text-based or “vector” for vector-based namespaces
  • vector_dimension: Dimension of vectors (None for text-based namespaces)
  • batch_size: Number of documents to process in each batch
  • add_sparse_vector: Whether to include sparse vectors for hybrid search
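
For example, a vector-based namespace requires an explicit dimension; a minimal sketch (the namespace name and dimension below are illustrative):

# Hypothetical vector-based namespace where you supply precomputed embeddings
vector_store = MoorchehVectorStore(
    api_key=api_key,
    namespace="my_vector_namespace",
    namespace_type="vector",
    vector_dimension=1536,  # must match your embedding model's output size
)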

Create Vector Store Index

# Create a Vector Store Index using the Vector Store and given Documents
storage_context = StorageContext.from_defaults(vector_store=vector_store)
index = VectorStoreIndex.from_documents(
    documents, storage_context=storage_context
)
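
If the namespace has already been populated, you can attach an index to the existing store instead of re-ingesting, following the standard LlamaIndex pattern:

# Reattach to an already-populated namespace without re-indexing documents
index = VectorStoreIndex.from_vector_store(vector_store)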

Querying the Vector Store

Basic Query

# Generate a response through the LlamaIndex query engine
query_engine = index.as_query_engine()
response = query_engine.query(
    "Which company has had the highest revenue in 2025 and why?"
)

display(Markdown(f"<b>{response}</b>"))
print(f"\n\n================================\n\n{response}\n\n================================\n\n")

AI-Powered Query

# Get generative answer using AI model
moorcheh_response = vector_store.get_generative_answer(
    query="Which company has had the highest revenue in 2025 and why?",
    ai_model="anthropic.claude-sonnet-4-20250514-v1:0",
)

print(f"\n\n================================\n\n{moorcheh_response}\n\n================================\n\n")

Advanced Features

Metadata Filtering

Filter documents based on metadata:
# Filter on metadata ("filters" avoids shadowing Python's built-in filter)
filters = MetadataFilters(
    filters=[
        MetadataFilter(
            key="file_path",
            value="insert the file path to the document here",
            operator=FilterOperator.EQ,
        )
    ],
    condition=FilterCondition.AND,
)

Supported Filter Operators

  • EQ: Equal to
  • NEQ: Not equal to
  • GT: Greater than
  • GTE: Greater than or equal
  • LT: Less than
  • LTE: Less than or equal
  • IN: In list
  • NIN: Not in list

Multiple Filters

# Multiple metadata filters
filters = MetadataFilters(
    filters=[
        MetadataFilter(
            key="file_type",
            value="pdf",
            operator=FilterOperator.EQ,
        ),
        MetadataFilter(
            key="date",
            value="2024-01-01",
            operator=FilterOperator.GTE,
        )
    ],
    condition=FilterCondition.AND,
)
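
Filters take effect at query time. Under the standard LlamaIndex pattern, pass them to the query engine (the question below is illustrative):

# Retrieval is restricted to nodes whose metadata matches the filters
query_engine = index.as_query_engine(filters=filters)
response = query_engine.query("What do the 2024 PDF reports conclude?")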

Configuration Options

Chunking Strategy

Customize how documents are chunked:
# Custom chunking settings
Settings.chunk_size = 512  # Smaller chunks for more precise retrieval
Settings.chunk_overlap = 50  # Higher overlap for better context preservation

Batch Processing

Optimize batch size for your use case:
vector_store = MoorchehVectorStore(
    api_key=api_key,
    namespace="llamaindex_moorcheh",
    namespace_type="text",
    batch_size=50,  # Smaller batches for memory-constrained environments
)

Sparse Vectors

Enable hybrid search with sparse vectors:
vector_store = MoorchehVectorStore(
    api_key=api_key,
    namespace="llamaindex_moorcheh",
    namespace_type="text",
    add_sparse_vector=True,  # Enable sparse vectors for hybrid search
)
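
With sparse vectors stored, hybrid retrieval can then be requested per query; a sketch assuming the store honors LlamaIndex's hybrid query mode:

# Blend dense (semantic) and sparse (keyword) scores at query time
query_engine = index.as_query_engine(vector_store_query_mode="hybrid")
response = query_engine.query("Your question here")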

AI Models

Supported Models

Model ID | Name | Provider | Description
anthropic.claude-sonnet-4-20250514-v1:0 | Claude Sonnet 4 | Anthropic | Hybrid reasoning, extended thinking, efficient code generation
anthropic.claude-sonnet-4-5-20250929-v1:0 | Claude Sonnet 4.5 | Anthropic | Latest Claude model with enhanced capabilities and agentic search
meta.llama4-maverick-17b-instruct-v1:0 | Llama 4 Maverick 17B | Meta | 1M token context, fine tuning, text summarization, function calling
meta.llama3-3-70b-instruct-v1:0 | Llama 3.3 70B | Meta | Advanced reasoning and decision making capabilities
amazon.nova-pro-v1:0 | Amazon Nova Pro | Amazon | 300K context, chat optimized, complex reasoning, math
deepseek.r1-v1:0 | DeepSeek R1 | DeepSeek | Advanced reasoning and code generation
openai.gpt-oss-120b-1:0 | OpenAI GPT OSS 120B | OpenAI | Hybrid reasoning, extended thinking, efficient research
qwen.qwen3-32b-v1:0 | Qwen 3 32B | Qwen | Text generation and code generation

Model Selection

Choose the best model for your use case:
# For general purpose queries
response = vector_store.get_generative_answer(
    query="Your question here",
    ai_model="anthropic.claude-sonnet-4-20250514-v1:0",
)

# For advanced reasoning and agentic search
response = vector_store.get_generative_answer(
    query="Complex analytical question",
    ai_model="anthropic.claude-sonnet-4-5-20250929-v1:0",
)

Best Practices

Document Preparation

1. Organize Documents: Place documents in a structured directory for easy loading.
2. Set Appropriate Chunk Size: Balance between context and precision (512-1024 characters).
3. Add Rich Metadata: Include file paths, dates, and other relevant metadata (see the sketch after this list).
4. Test Different Configurations: Experiment with different chunk sizes and overlap values.
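
One way to attach rich metadata at load time is SimpleDirectoryReader's file_metadata hook; the fields below are illustrative:

from llama_index.core import SimpleDirectoryReader

# Build a metadata dict for each file as it is loaded
documents = SimpleDirectoryReader(
    "./documents",
    file_metadata=lambda path: {
        "file_path": path,
        "file_type": path.rsplit(".", 1)[-1],  # e.g. "pdf"
    },
).load_data()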

Performance Optimization

  • Use appropriate batch sizes based on your memory constraints
  • Enable sparse vectors for hybrid search when needed
  • Monitor API usage and rate limits
  • Cache frequently accessed results (a simple approach is sketched below)
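
For the last point, a minimal in-process cache works when identical questions should reuse a previous answer (the helper name is hypothetical):

from functools import lru_cache

@lru_cache(maxsize=128)
def cached_answer(question: str) -> str:
    # Identical questions hit the cache instead of the API
    return vector_store.get_generative_answer(
        query=question,
        ai_model="anthropic.claude-sonnet-4-20250514-v1:0",
    )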

Error Handling

try:
    response = vector_store.get_generative_answer(
        query="Your question",
        ai_model="anthropic.claude-sonnet-4-20250514-v1:0",
    )
    print(response)
except Exception as e:
    print(f"Error generating answer: {e}")

Troubleshooting

Common Issues

API key errors
  • Ensure MOORCHEH_API_KEY is set in environment variables
  • Check API key permissions and validity
  • Verify the key has access to the specified namespace

Document loading errors
  • Check document file paths and formats
  • Ensure documents are readable and not corrupted
  • Verify directory structure for SimpleDirectoryReader

Poor search results
  • Adjust chunk size and overlap settings
  • Use metadata filters to narrow down results
  • Consider enabling sparse vectors for better search

Memory issues
  • Reduce batch size for large document collections
  • Process documents in smaller batches
  • Monitor memory usage during indexing

Debug Mode

Enable detailed logging for troubleshooting:
# Enable debug logging (force=True replaces the INFO-level
# configuration set up earlier)
logging.basicConfig(stream=sys.stdout, level=logging.DEBUG, force=True)

Advanced Usage

Custom Embeddings

from llama_index.embeddings.openai import OpenAIEmbedding

# Use a custom embedding model (requires the
# llama-index-embeddings-openai package)
Settings.embed_model = OpenAIEmbedding()

Custom Vector Store

class CustomMoorchehVectorStore(MoorchehVectorStore):
    def __init__(self, **kwargs):
        super().__init__(**kwargs)
        # Add custom initialization logic here

Hybrid Search

# Enable hybrid search with sparse vectors
vector_store = MoorchehVectorStore(
    api_key=api_key,
    namespace="hybrid_search",
    namespace_type="text",
    add_sparse_vector=True,
)

Examples

Financial Document Analysis

# Load financial documents
documents = SimpleDirectoryReader("./financial_reports").load_data()

# Configure for financial analysis
Settings.chunk_size = 1024
Settings.chunk_overlap = 100

# Create vector store
vector_store = MoorchehVectorStore(
    api_key=api_key,
    namespace="financial_analysis",
    namespace_type="text",
)

# Query financial data
response = vector_store.get_generative_answer(
    query="What were the key financial metrics for Q4 2024?",
    ai_model="anthropic.claude-sonnet-4-20250514-v1:0",
)

Legal Document Review

# Load legal documents
documents = SimpleDirectoryReader("./legal_docs").load_data()

# Configure for legal documents
Settings.chunk_size = 2048  # Larger chunks for legal context
Settings.chunk_overlap = 200

# Create vector store with metadata filtering
vector_store = MoorchehVectorStore(
    api_key=api_key,
    namespace="legal_documents",
    namespace_type="text",
)

# Query with metadata filters
filters = MetadataFilters(
    filters=[
        MetadataFilter(
            key="document_type",
            value="contract",
            operator=FilterOperator.EQ,
        )
    ]
)
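
To apply these filters, build the index and pass them at query time, as in the earlier sections (the question is illustrative):

# Index the legal documents into the vector store
storage_context = StorageContext.from_defaults(vector_store=vector_store)
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)

# Only nodes whose document_type is "contract" are retrieved
query_engine = index.as_query_engine(filters=filters)
response = query_engine.query("Summarize the termination clauses in these contracts.")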

Support

Need help with the LlamaIndex integration? Contact our support team for assistance.