Oxylabs

Scrape websites with Oxylabs Web API, supporting dynamic rendering and parsing for structured data extraction.

Stars: 71
Forks: 20
Releases: 16

Overview

The Oxylabs MCP server bridges AI models and the web. It lets them scrape any URL, render JavaScript-heavy pages, extract and format content for AI use, bypass anti-scraping measures, and access geo-restricted web data from 195+ countries.

The server offers two tool families:
Oxylabs Web Scraper API Tools: universal_scraper, google_search_scraper, amazon_search_scraper, amazon_product_scraper
Oxylabs AI Studio Tools: ai_scraper, ai_crawler, ai_browser_agent, ai_search

Credentials determine which tools are exposed: Web Scraper API credentials expose the Scraper tools, AI Studio credentials expose the AI Studio tools, and supplying both exposes all tools. Authentication is configured through environment variables, and the server can be deployed via uvx, uv, Smithery OAuth2, or direct URLs. Logging reports job status and errors in detail.

Features include JS rendering options, HTML-to-Markdown conversion, automated parsers for popular targets, anti-bot bypass, proxy IP rotation across 195+ countries, and cross-platform compatibility (macOS, Windows, Linux).
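The credential rule described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the server's actual code: the tool names come from the overview, while the environment variable names (OXYLABS_USERNAME, OXYLABS_PASSWORD, OXYLABS_AI_STUDIO_API_KEY) and the exposed_tools helper are assumptions that should be checked against the project's README.

```python
import os
from typing import Mapping

# Tool names as listed in the overview.
SCRAPER_TOOLS = [
    "universal_scraper",
    "google_search_scraper",
    "amazon_search_scraper",
    "amazon_product_scraper",
]
AI_STUDIO_TOOLS = ["ai_scraper", "ai_crawler", "ai_browser_agent", "ai_search"]


def exposed_tools(env: Mapping[str, str]) -> list[str]:
    """Return the tool names that would be exposed for the given environment."""
    tools: list[str] = []
    # Assumed variable names for Web Scraper API credentials.
    if env.get("OXYLABS_USERNAME") and env.get("OXYLABS_PASSWORD"):
        tools += SCRAPER_TOOLS
    # Assumed variable name for the AI Studio key.
    if env.get("OXYLABS_AI_STUDIO_API_KEY"):
        tools += AI_STUDIO_TOOLS
    return tools


if __name__ == "__main__":
    # Show which tools the current shell environment would unlock.
    print(exposed_tools(os.environ))
```

With neither credential set the list is empty, and with both set it contains all eight tools.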

Details

Owner
oxylabs
Language
Python
License
MIT License
Updated
2025-12-07

Features

Scrape content from any site

Extract data from URLs, render dynamic sites, and choose rendering modes (full JS, HTML-only, or none) while emulating mobile or desktop viewports.
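As a rough illustration of how a client might validate these choices before calling a scraping tool, the sketch below models the three rendering modes and two viewports as a small Python dataclass. The ScrapeOptions class, its field names, and the mode labels ("full_js", "html_only", "none") are hypothetical and do not reflect the server's real tool schema.

```python
from dataclasses import dataclass

# Hypothetical labels for the options named in the feature description.
RENDER_MODES = {"full_js", "html_only", "none"}
VIEWPORTS = {"desktop", "mobile"}


@dataclass(frozen=True)
class ScrapeOptions:
    url: str
    render: str = "none"
    viewport: str = "desktop"

    def __post_init__(self) -> None:
        # Reject obviously invalid input before any request is made.
        if not self.url.startswith(("http://", "https://")):
            raise ValueError(f"unsupported URL: {self.url!r}")
        if self.render not in RENDER_MODES:
            raise ValueError(f"unknown render mode: {self.render!r}")
        if self.viewport not in VIEWPORTS:
            raise ValueError(f"unknown viewport: {self.viewport!r}")

    def to_arguments(self) -> dict[str, str]:
        """Flatten the options into a plain dict of tool arguments."""
        return {"url": self.url, "render": self.render, "viewport": self.viewport}
```

For example, ScrapeOptions("https://example.com", render="full_js", viewport="mobile").to_arguments() yields a dict that could be mapped onto whatever argument names the actual tool expects.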

Automatically get AI-ready data

Convert HTML to clean Markdown and apply automated parsers for common targets such as Google, Amazon, and others.

Bypass blocks & geo-restrictions

Overcome anti-bot protections and access geo-restricted data using rotating IPs from the Oxylabs proxy pool across 195+ countries.

Flexible setup & cross-platform support

Configure rendering and parsing as needed and run on macOS, Windows, and Linux; integrate data into AI models or analytics tools.

Built-in error handling and request management

Robust error handling and reporting, smart rate limiting, and reliable request management.
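The listing does not say how the server implements request management. As a generic illustration of one common technique, the sketch below retries a failing call with exponential backoff. The call_with_retries helper and its parameters are hypothetical and are not part of the Oxylabs server.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retries(
    fn: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on any exception with exponentially growing delays."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return fn()
        except Exception:
            # Re-raise the original error once the retry budget is spent.
            if attempt == attempts - 1:
                raise
            # Delay doubles each time: base_delay, 2*base_delay, 4*base_delay, ...
            sleep(base_delay * 2 ** attempt)
    raise AssertionError("unreachable")
```

The sleep parameter is injectable so the backoff schedule can be checked in tests without actually waiting.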

Audience

AI developers: Integrate MCP to fetch, render, and structure web data for prompts and AI training.
LLM integrators: Embed web data access into LLM workflows with AI-ready, parsed outputs.
Data engineers: Automate data collection and formatting for analytics dashboards and data-powered insights.

Tags

web-scraping, JS-rendering, headless-browser, HTML-to-Markdown, structured-data, AI-ready-data, parsers, anti-bot-bypass, geo-restrictions, proxy-network, cross-platform, mcp, oxylabs