Operative WebEvalAgent

Operative WebEvalAgent

An MCP server to test, debug, and fix web applications autonomously.

1,214
Stars
103
Forks
0
Releases

Overview

The MCP server, built for operatives, provides an in-editor, browser-driven autonomous agent workflow to test, debug, and fix web applications. It exposes tools like web_eval_agent, an automated UX evaluator that drives a real browser to navigate, test flows, capture screenshots, and collect console and network logs, producing a comprehensive UX report. It also includes setup_browser_state, which opens a non-headless browser session for sign-in and authentication, reusing cookies and local storage across runs. Users interact with the MCP server through chat-style prompts in their IDE, for example evaluating an app at http://localhost:3000 with a task description such as run through the signup flow and note UX issues. The system integrates with Playwright, uv, and Cursor to enable end-to-end validation: the Cursor agent calls the web QA agent to verify that code changes work as expected. Outputs include a structured report, console logs, network activity, and a timeline of events, helping developers identify UX issues quickly and iteratively improve software.

Details

Owner
Operative-Sh
Language
Python
License
Apache License 2.0
Updated
2025-12-07

Features

web_eval_agent

🤖 Automated UX evaluator that drives the browser, captures screenshots, console & network logs, and returns a rich UX report.

setup_browser_state

🔒 Opens an interactive (non-headless) browser so you can sign in once; the saved cookies/local-storage are reused by subsequent web_eval_agent runs.

Navigate webapp with BrowserUse

Navigate your webapp using BrowserUse to drive test flows—2x faster with the operative backend.

Capture network traffic

Requests are intelligently filtered and returned into the context window.

Collect console errors

Captures logs & errors from the browser.

Autonomous debugging

Cursor agent coordinates with the web QA agent to test if the code works end-to-end.

IDE chat integration

Trigger these tools directly from your IDE chat, e.g., Evaluate my app … using web-eval-agent.

Easy setup with API key and Cursor integration

Get your API key, install Cursor via a deeplink, and prefill commands for instant integration.

Audience

DeveloperAutomate browser-based testing and debugging of web apps directly from your IDE.
QA EngineerEasily capture network and console data to validate UX flows.
Automation EngineerIntegrate into automation pipelines to run end-to-end UI tests seamlessly.
Frontend DeveloperDebug frontend workflows in isolation within your editor without breaking live apps.

Tags

web-eval-agentBrowserUseautonomous-debuggingUX-evaluationnetwork-traffic-captureconsole-logsPlaywrightIDE-integrationMCP-server