OpenCV

OpenCV

Python MCP server exposing OpenCV image and video processing via MCP for AI assistants.

81
Stars
12
Forks
0
Releases

Overview

OpenCV MCP Server is a Python package that provides OpenCV's image and video processing capabilities through the Model Context Protocol (MCP). It enables AI assistants and language models to access powerful computer vision tools for tasks ranging from basic image manipulation to advanced object detection and tracking. The server exposes a broad set of CV functionalities, including image I/O, color space conversion, resizing and cropping, edge detection, filtering, face detection, and video frame analysis. It supports pre-trained models for face and object detection (e.g., YOLO and Haar-based cascades) and offers high-level tools for features, matching, and tracking in both images and video. Configuration is simple via environment variables, and tools are organized into image basics, image processing, computer vision, and video processing categories. The MCP integration enables chained tool usage, enabling workflows that pass outputs between tools. The project includes examples for Python usage and Claude Desktop integration, and provides guidance on model files and dependencies. This setup makes OpenCV available to AI systems as a scalable, model-agnostic CV backend.

Details

Owner
GongRzhe
Language
Python
License
MIT License
Updated
2025-12-07

Features

Basic image handling and manipulation

Read, save, and convert images using straightforward image I/O operations.

Image processing and enhancement

Resize, crop, and apply filters to enhance images.

Edge detection and contour analysis

Detect edges and analyze contours to extract shapes and boundaries.

Advanced computer vision capabilities

Includes feature detection and object detection for high-level CV tasks.

Face detection and recognition

Detect faces and optionally recognize them using configured models.

Video processing and analysis

Frame extraction and motion analysis within video streams.

Object tracking in videos

Track objects across frames for trajectories and activity analysis.

Camera integration for real-time object detection

Integrates camera feeds to enable real-time computer vision tasks.

Audience

AI assistantsAccess OpenCV CV tools via MCP to perform image and video processing tasks.
Language modelsLeverage OpenCV capabilities through MCP for visual understanding and analysis.
DevelopersIntegrate the MCP server into CV-enabled applications and workflows.

Tags

OpenCVMCPComputer VisionImage ProcessingVideo ProcessingObject DetectionFace DetectionCamera IntegrationPythonAI AssistantsLanguage Models