Overview
Features
Endpoint-centric OpenAPI processing
Indexes individual endpoints (path + method) as unique units with parameter-aware embeddings and response context, enabling precise retrieval of endpoint docs as chunks.
In-memory FAISS-based semantic search
Uses FAISS to perform fast nearest-neighbor search over in-memory embeddings for instant endpoint discovery from natural language queries.
Remote OpenAPI source and in-memory indexing
Consumes a remote OpenAPI JSON document via OPENAPI_JSON_DOCS_URL; no local file system access and no frequent API changes required.
Optimized spec handling and chunking
Handles large specs (100KB+; up to tens of MB) with endpoint-centric chunking, lazy loading, and parallel parsing to maintain performance and context.
FastAPI-based asynchronous server
Built on an async FastAPI server with MCP protocol support and tool registration/invocation handling.
Configurable tool namespace and prompt
Supports MCP_API_PREFIX to customize tool namespace and an optional GLOBAL_TOOL_PROMPT to steer tool descriptions for NLP agents.
Claude Desktop integration and multi-instance support
Designed for Claude Desktop workflows with multi-instance deployment capabilities to run multiple MCP servers with separate settings.
Who Is This For?
- Claude Desktop users:Discover and call endpoints from private OpenAPI specs via semantic search and tools in Claude Desktop.
- API developers:Integrate semantic endpoint discovery for large OpenAPI docs; expose endpoints as MCP tools for programmatic calls.




