Firecrawl

Extract web data with Firecrawl

Stars: 4,978
Forks: 544
Releases: 16

Overview

A Model Context Protocol (MCP) server that integrates with Firecrawl to provide web scraping capabilities. It exposes Firecrawl's scraping, crawling, discovery, search, and data extraction features to MCP clients, enabling automated, scriptable web data workflows. Supported tools include scrape, batch_scrape, map, search, crawl, and extract, which cover single-page extraction, multi-page batch processing, URL discovery, and structured data extraction.

The server retries transient errors automatically with exponential backoff and applies built-in rate limiting to stay within API constraints. It works with the Firecrawl cloud API (which requires a FIRECRAWL_API_KEY) and with self-hosted deployments (configured via FIRECRAWL_API_URL). It can be used from MCP clients such as Cursor and Windsurf, or run in Streamable HTTP Local Mode. Configuration options cover retry policies, rate limiting controls, and credit usage monitoring, and SSE streaming is supported for real-time updates.

Together these features support deep scraping workflows, batch research, and scalable processing on both cloud and self-hosted setups. Example configurations and integration notes are provided to help Composer Agents and other MCP clients build advanced web-scraping workflows.

Details

Owner: firecrawl
Language: JavaScript
License: MIT License
Updated: 2025-12-07

Features

Web scraping, crawling, and discovery

Enables scraping, crawling, and site discovery to locate and index pages.

Search and content extraction

Supports web search and extraction of content from pages.

Deep research and batch scraping

Supports deep research workflows and batch scraping of multiple URLs.

Automatic retries and rate limiting

Retries transient errors with exponential backoff and applies built-in rate limiting to respect API constraints.
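
To illustrate the general idea of retrying with exponential backoff, here is a minimal JavaScript sketch. It is not taken from the server's source: the names backoffDelay and withRetries, and the defaults (1000 ms initial delay, factor 2, 10000 ms cap, 3 attempts), are assumptions chosen for the example.

```javascript
// Hypothetical helpers for illustration only; these names and defaults
// are assumptions, not the Firecrawl MCP server's actual implementation.

// Delay before the next retry: initialDelayMs * factor^attempt, capped at maxDelayMs.
// `attempt` is 0-based (0 = delay after the first failure).
function backoffDelay(
  attempt,
  { initialDelayMs = 1000, factor = 2, maxDelayMs = 10000 } = {}
) {
  return Math.min(initialDelayMs * factor ** attempt, maxDelayMs);
}

// Runs `operation` up to `maxAttempts` times, sleeping between failures.
// `sleep` is injectable so callers (and tests) can avoid real waiting.
async function withRetries(
  operation,
  {
    maxAttempts = 3,
    sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms)),
    ...delayOptions
  } = {}
) {
  let lastError;
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await operation(attempt);
    } catch (error) {
      lastError = error;
      // Only wait if another attempt will follow.
      if (attempt < maxAttempts - 1) {
        await sleep(backoffDelay(attempt, delayOptions));
      }
    }
  }
  throw lastError;
}
```

Capping the delay keeps a long run of failures from producing very long waits. A production version would typically also add random jitter and retry only on errors known to be transient.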

Cloud and self-hosted support

Operates with both cloud API and self-hosted instances.
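
As a rough sketch of how the two deployment modes differ in configuration, the JavaScript helper below builds the environment variables to pass when launching the server. Only the variable names FIRECRAWL_API_KEY and FIRECRAWL_API_URL come from the overview above. The function buildFirecrawlEnv is hypothetical, and the sketch assumes that an API key is optional when a self-hosted URL is set.

```javascript
// Hypothetical helper for illustration; not part of the Firecrawl MCP server.
// Assumption: a self-hosted instance (FIRECRAWL_API_URL) may run without a key,
// while the cloud API always needs FIRECRAWL_API_KEY.
function buildFirecrawlEnv({ apiKey, apiUrl } = {}) {
  if (!apiKey && !apiUrl) {
    throw new Error(
      "Set FIRECRAWL_API_KEY for the cloud API or FIRECRAWL_API_URL for a self-hosted instance"
    );
  }
  const env = {};
  if (apiUrl) env.FIRECRAWL_API_URL = apiUrl; // self-hosted endpoint
  if (apiKey) env.FIRECRAWL_API_KEY = apiKey; // cloud (or authenticated self-hosted) key
  return env;
}

// Cloud usage: only the key is set (placeholder value).
const cloudEnv = buildFirecrawlEnv({ apiKey: "YOUR_API_KEY" });

// Self-hosted usage: the URL below is a placeholder, not a real endpoint.
const selfHostedEnv = buildFirecrawlEnv({ apiUrl: "https://firecrawl.example.internal" });
```

The resulting object can be placed in the env section of an MCP client's server configuration. The exact shape of that configuration depends on the client.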

SSE support

Provides Server-Sent Events (SSE) for streaming updates.
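
For readers unfamiliar with the SSE wire format, the following JavaScript sketch parses a complete block of SSE text into events. It follows the standard field syntax (event:, data:, comment lines starting with a colon, blank line as separator), but it is a simplification: parseSseText is a hypothetical name, it assumes the full text is already buffered, and the event names and payloads in the usage are made up rather than taken from the server.

```javascript
// Hypothetical parser for illustration; it handles only the `event` and `data`
// fields and assumes the whole stream text is available at once.
function parseSseText(text) {
  const events = [];
  // Events are separated by a blank line.
  for (const block of text.split(/\r?\n\r?\n/)) {
    let eventName = "message"; // default event type in the SSE format
    const dataLines = [];
    for (const line of block.split(/\r?\n/)) {
      if (line === "" || line.startsWith(":")) continue; // skip blanks and comments
      const colon = line.indexOf(":");
      const field = colon === -1 ? line : line.slice(0, colon);
      let value = colon === -1 ? "" : line.slice(colon + 1);
      if (value.startsWith(" ")) value = value.slice(1); // one optional leading space
      if (field === "event") eventName = value;
      else if (field === "data") dataLines.push(value);
    }
    // Blocks without any data field do not produce an event.
    if (dataLines.length > 0) {
      events.push({ event: eventName, data: dataLines.join("\n") });
    }
  }
  return events;
}

// Made-up sample stream: one named event, one keep-alive comment, one multi-line event.
const sampleStream =
  'event: progress\ndata: {"done":1}\n\n: keep-alive\n\ndata: first\ndata: second\n\n';
const parsedEvents = parseSseText(sampleStream);
```

In a browser, the built-in EventSource API performs this parsing automatically. A manual parser like this is mainly useful when reading the response body directly.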

Tags

web scraping, web crawling, data extraction, MCP, cloud, self-hosted, SSE, rate limiting, retries, batch processing