Overview
Features
Fast startup with lazy loading
Fast startup using the official MCP SDK with lazy loading for optimal performance.
Mozilla Readability extraction
Content extraction using Mozilla Readability (same as Firefox Reader View) to capture the main article content.
HTML to Markdown conversion
HTML to Markdown conversion with Turndown and GitHub Flavored Markdown support.
Smart caching
Caching with SHA-256 hashed URLs for efficient repeated fetches.
Polite crawling
Robots.txt support with rate limiting to crawl sites responsibly.
Concurrent fetching
Configurable depth and concurrency for parallel page retrieval.
Stream-first design
Low memory usage achieved through stream-first processing.
Link preservation
Preserves links to support knowledge graphs and downstream linking.
Who Is This For?
- Claude Code:Integrates with Claude Code to fetch pages and convert to clean Markdown for prompts.
- IDEs:Used in IDEs to generate token-efficient Markdown content for AI workflows.
- LLM pipelines:Supports LLM pipelines and agents with clean Markdown representations for training and reasoning.
- Knowledge graphs:Facilitates knowledge graph construction with preserved links and Markdown data.




