Chroma

Chroma

Embeddings, vector search, document storage, and full-text search with the open-source AI application database

425
Stars
82
Forks
7
Releases

Overview

Chroma MCP Server is a self-hosted Model Context Protocol (MCP) server built on top of Chroma that provides data retrieval capabilities for LLM-powered applications. It enables AI models to create collections over generated data and user inputs, and to retrieve that data using vector search, full-text search, and metadata filtering. The server supports multiple client types: Ephemeral (in-memory) for testing, Persistent (file-based) storage, HTTP for self-hosted Chroma instances, and Cloud for Chroma Cloud integration. Collection management covers create, modify, delete, and listing with pagination; you can view collection information and statistics, configure HNSW parameters for vector search, and select an embedding function that persists with the collection. Document operations include adding documents with optional metadata and IDs, semantic search, advanced filtering by metadata or content, and retrieval by IDs or filters. Embedding functions available include default, cohere, openai, jina, voyageai, and roboflow, with per-collection persistence (not available for older versions). The server also supports environment-variable configuration, dotenv files, and CLI tooling, plus Claude Desktop integration.

Details

Owner
chroma-core
Language
Python
License
Apache License 2.0
Updated
2025-12-07

Features

Flexible Client Types

Ephemeral (in-memory), Persistent (file-based), HTTP for self-hosted Chroma, and Cloud client for Chroma Cloud connectivity.

Collection Management

Create/modify/delete collections; list with pagination; view collection info/statistics; configure HNSW parameters; select per-collection embedding function.

Document Operations

Add documents with optional metadata and IDs; semantic search; metadata/content filtering; retrieve by IDs or filters; full-text search.

Embedding Function Support and Persistence

Supports default, cohere, openai, jina, voyageai, and roboflow embedding functions; the chosen function persists with the collection. Embedding function persistence was added in v1.0.0; not supported for collections created with versions <= 0.6.3.

Environment Variables and Configuration

Configure clients and embedding API keys via environment variables; dotenv-path support; CLI args take precedence over environment variables.

Cloud and Self-Hosted Connectivity

Supports Chroma Cloud and self-hosted HTTP deployments with secure connections and easy setup.

Claude Desktop Integration

Usage examples for connecting via Claude Desktop, with configurations for ephemeral, persistent, cloud, and HTTP clients.

Supported Tools and Endpoints

A suite of MCP tools for managing operations (e.g., chroma_list_collections, chroma_create_collection, chroma_query_documents, etc.).

Audience

LLM DevelopersBuild memory-enabled AI apps with a self-hosted MCP server using vector and full-text search.
Data EngineersManage collections of generated data and metadata via a Chroma-backed MCP server.
AI Product TeamsIntegrate with Claude Desktop or Chroma Cloud for embedding storage and retrieval.

Tags

MCPChromaembeddingvector searchfull-text searchcollectionsHNSWdocumentsenvironment variablesClaude DesktopChroma Cloudself-hostedpersistentephemeralHTTP clientembedding functions