MCP-Airflow-API

MCP-Airflow-API

An MCP server that exposes Airflow REST APIs as natural language tools.

42
Stars
10
Forks
0
Releases

Overview

MCP-Airflow-API is an MCP server that exposes Apache Airflow REST API operations as natural-language tools. It supports two runtime modes and dynamic version selection via AIRFLOW_API_VERSION, allowing a single server to work with Airflow v1 (2.x) and v2 (3.0+) REST APIs. The server loads a core set of 43 tools for v1 compatibility and, when operating in v2, adds two asset-management tools for data-aware scheduling (total 45 tools). This enables natural-language control over DAG management, task monitoring, pools, variables, connections, configuration queries, and event logs, with full coverage across API versions. It emphasizes high-scale environments (1000+ DAGs) via smart pagination and advanced filtering, and supports multiple Airflow clusters via environment-configured endpoints. The architecture supports two transport modes: stdio for local MCP clients and streamable-http for Docker/distributed deployments, and can secure remote access with Bearer token authentication in streamable-http mode. Quickstart includes a Docker Compose-based demo with an OpenWebUI and API docs, and there are clear examples for multi-cluster setups across v1 and v2.

Details

Owner
call518
Language
Python
License
MIT License
Updated
2025-12-07

Features

Natural Language Queries

No need to learn complex API syntax; users interact with Airflow via natural-language commands such as asking for running DAGs or failed tasks.

Comprehensive Monitoring Capabilities

Real-time cluster status monitoring including health, DAG status, performance analysis, task logs, and XCom data management.

Dynamic API Version Support

A single MCP server adapts to Airflow API versions (v1 with 43 tools; v2 with 43 shared tools plus 2 asset-management tools), switchable via AIRFLOW_API_VERSION.

Comprehensive Tool Coverage

Extensive Airflow functionality coverage: DAG management, task instance monitoring, pools and variables, connections, configuration queries, and event logs.

Large Environment Optimization

Efficient handling of 1000+ DAGs with smart pagination, advanced filtering, and batch processing capabilities.

Audience

Data EngineersReduce debugging time, improve productivity, and simplify Airflow management via natural language.
DevOps EngineersAutomate monitoring and incident response for Airflow deployments.
System AdministratorsUser-friendly administration with real-time cluster status monitoring and configuration access.

Tags

Apache-AirflowMCPModelContextProtocolDataEngineeringDevOpsWorkflowAutomationNaturalLanguageOpenSourcePythonDockerAI-Integration