Browser Screenshot Diff

Skill Verified Active

Visual + DOM diff between two recorded sessions at matching trajectory step ids; used for visual regression and replay verification

Purpose

To provide precise visual and DOM difference analysis between recorded browser sessions, enabling accurate visual regression testing and validation of replay fidelity.

Features

Visual diffing of screenshots between session steps
DOM diffing of accessibility snapshots between session steps
Reporting of divergence scores and largest diff clusters
Handling of unmatched steps between sessions
Support for pixel and DOM diff modes

Use Cases

Performing visual regression testing after UI changes
Verifying that a `browser-replay` run matches a parent session
Comparing A/B variants of a form flow for visual consistency

Non-Goals

Handling of dynamic content like clocks or ads without configuration
Directly operating an MCP server for live browser evaluation

Installation

First, add the marketplace

/plugin marketplace add ruvnet/ruflo

/plugin install ruflo-browser@ruflo

Quality Score

Verified

98 /100

Analyzed about 17 hours ago

Trust Signals

Last commitabout 18 hours ago

GitHub owner ruvnet

Stars50.2k

Downloads 68.3k

LicenseMIT

Websitecognitum.one

Status

View Source

Similar Extensions

Agent Browser

100

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.

Skill

shanraisshan

AutoBrowse

100

Self-improving browser automation via the auto-research loop. Iteratively runs a browsing task, reads the trace, and improves the navigation skill (strategy.md) until it reliably passes. Supports parallel runs across multiple tasks using sub-agents. Use when you want to build or improve browser automation skills for specific website tasks.

Skill

browserbase

Chatgpt Search

100

Search ChatGPT and extract the full response + hydration JSON that powers the UI. Attaches to a running Chrome instance (port 9222 by default), opens ChatGPT, submits a query, waits for the streamed response, and returns structured data: messages, product cards, hydration JSON, and API calls. Use when asked to "search chatgpt", "ask chatgpt", "chatgpt search", "get chatgpt response", or "scrape chatgpt".

Skill

SeifBenayed

Cli Anything Safari

100

Safari browser automation CLI on macOS via safari-mcp. Controls real Safari (native, keeps logins) by wrapping the safari-mcp MCP server. Every one of the 84 MCP tools is exposed 1:1 with schema-accurate arguments — guaranteed parity, no manual drift.

Skill

hkuds

Sentry React SDK

100

Full Sentry SDK setup for React. Use when asked to "add Sentry to React", "install @sentry/react", or configure error monitoring, tracing, session replay, profiling, or logging for React applications. Supports React 16+, React Router v5-v7 non-framework mode, TanStack Router, Redux, Vite, and webpack.

Skill

getsentry

Test A2a Interop

100

Test A2A interoperability between agents by validating Agent Card conformance, exercising all task lifecycle states, and verifying streaming and error handling. Use when verifying a new A2A server implementation before deployment, validating interoperability between two or more A2A agents, running conformance tests in CI/CD for A2A services, debugging failures in multi-agent A2A workflows, or certifying that an agent meets A2A protocol requirements for a registry.

Skill

pjt222