py-mcp-qdrant-rag

py-mcp-qdrant-rag

MCP server for Retrieval-Augmented Generation using Qdrant with Ollama/OpenAI embeddings.

2
Stars
2
Forks
0
Releases

Overview

This MCP server implements Retrieval-Augmented Generation (RAG) by storing embeddings in a Qdrant vector database and enabling semantic search over indexed documents. It supports ingestion from diverse sources and formats (PDF, DOCX, TXT, MD, HTML content from URLs), along with web scraping and bulk directory imports. Embeddings can be generated locally through Ollama or remotely via OpenAI, giving flexible deployment options for offline or cloud-based workflows. The server exposes a Claude Desktop integration, allowing users to add documentation, index sources, and perform semantic queries directly from Claude. The architecture includes a run.py entry point, mcp_server, rag_engine, and modular embedding providers, all designed to run on macOS, Linux, and Windows with a Conda-based environment. Prerequisites include Python 3.11+, Conda, Qdrant, Ollama (or OpenAI), and Claude Desktop. Configuration is done via environment variables (QDRANT_URL, EMBEDDING_PROVIDER, OLLAMA_URL, OPENAI_API_KEY) and optional Qdrant Cloud settings. The project exposes API functions such as add_documentation, add_directory, search_documentation, and list_sources. The focus is on fast retrieval, flexible chunking, and secure handling of API keys.

Details

Owner
amornpan
Language
Python
License
Updated
2025-12-07

Features

Semantic Search

Search through stored documents using advanced semantic similarity.

Multi-Format Support

Process various document formats including PDF, TXT, MD, DOCX, and more.

Web Scraping

Add documentation directly from URLs.

Bulk Import

Import entire directories of documents at once.

Flexible Embeddings

Choose between Ollama (local) or OpenAI embeddings.

Vector Storage

Efficient storage and retrieval using Qdrant vector database.

MCP Integration

Seamless integration with Claude Desktop application.

Fast Retrieval

Optimized vector search for quick information retrieval.

Audience

AI developersBuild and test MCP-based RAG workflows with Qdrant, embedding providers, and Claude Desktop integration.
Knowledge managersIndex and semantically search internal documents across formats via Claude Desktop.

Tags

MCPRAGQdrantvector-dbembeddingsOllamaOpenAIsemantic-searchdocument-retrievalClaude Desktopweb-scrapingmulti-format