
Browser Screenshot Diff

Skill Verified Active

Visual and DOM diff between two recorded sessions at matching trajectory step IDs, used for visual regression testing and replay verification.

Purpose

Analyzes visual and DOM differences between recorded browser sessions, step by step, to support visual regression testing and to check how faithfully a replay reproduces its source session.

Features

  • Visual diffing of screenshots between session steps
  • DOM diffing of accessibility snapshots between session steps
  • Reporting of divergence scores and largest diff clusters
  • Handling of unmatched steps between sessions
  • Support for pixel and DOM diff modes
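
The features above can be illustrated with a minimal sketch. It is not the skill's actual implementation: the `Step` record, the function names, and the in-memory data layout (screenshots as rows of RGB tuples, accessibility snapshots as lists of text lines) are all assumptions made for illustration. It pairs steps by ID, computes a pixel divergence score and a DOM divergence score for each matched pair, and lists the steps that appear in only one session. Diff clustering is not covered.

```python
from dataclasses import dataclass
import difflib

Pixel = tuple[int, int, int]


@dataclass
class Step:
    """Hypothetical record for one recorded trajectory step."""
    step_id: str
    pixels: list[list[Pixel]]  # screenshot as rows of RGB tuples (assumed layout)
    dom_lines: list[str]       # accessibility snapshot, one node per line (assumed)


def pixel_divergence(a, b, tolerance=0):
    """Fraction of pixels with any channel differing by more than `tolerance`."""
    same_shape = len(a) == len(b) and all(
        len(ra) == len(rb) for ra, rb in zip(a, b)
    )
    if not same_shape:
        return 1.0  # assumption: a size mismatch counts as fully divergent
    total = sum(len(row) for row in a)
    if total == 0:
        return 0.0
    changed = sum(
        1
        for ra, rb in zip(a, b)
        for pa, pb in zip(ra, rb)
        if any(abs(ca - cb) > tolerance for ca, cb in zip(pa, pb))
    )
    return changed / total


def dom_divergence(a, b):
    """1 minus the difflib similarity ratio of the two snapshot line lists."""
    return 1.0 - difflib.SequenceMatcher(None, a, b).ratio()


def diff_sessions(parent, replay):
    """Compare two sessions step by step, keyed on step_id."""
    parent_by_id = {s.step_id: s for s in parent}
    replay_by_id = {s.step_id: s for s in replay}
    matched = {
        sid: {
            "pixel": pixel_divergence(p.pixels, replay_by_id[sid].pixels),
            "dom": dom_divergence(p.dom_lines, replay_by_id[sid].dom_lines),
        }
        for sid, p in parent_by_id.items()
        if sid in replay_by_id
    }
    return {
        "matched": matched,
        "only_in_parent": [s for s in parent_by_id if s not in replay_by_id],
        "only_in_replay": [s for s in replay_by_id if s not in parent_by_id],
    }
```

Keeping unmatched steps in separate lists, rather than scoring them as fully divergent, lets a report distinguish "this step looks different" from "this step never happened".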

Use Cases

  • Performing visual regression testing after UI changes
  • Verifying that a `browser-replay` run matches a parent session
  • Comparing A/B variants of a form flow for visual consistency

Non-Goals

  • Handling of dynamic content like clocks or ads without configuration
  • Directly operating an MCP server for live browser evaluation
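
As an illustration of the kind of configuration the first non-goal implies, the sketch below blanks out rectangular regions (for example a clock or an ad slot) in both screenshots before they are compared. The mask format `(left, top, width, height)` and the `apply_masks` helper are hypothetical. The listing does not say how the skill itself is configured.

```python
Pixel = tuple[int, int, int]
Rect = tuple[int, int, int, int]  # (left, top, width, height): hypothetical format


def apply_masks(pixels, masks, fill=(0, 0, 0)):
    """Return a copy of the screenshot with every masked rectangle filled.

    Rectangles are clipped to the image bounds; the input is not modified.
    """
    out = [list(row) for row in pixels]
    for left, top, width, height in masks:
        for y in range(max(top, 0), min(top + height, len(out))):
            row = out[y]
            for x in range(max(left, 0), min(left + width, len(row))):
                row[x] = fill
    return out


# Two captures that differ only inside the masked region compare equal.
before = [[(1, 1, 1), (2, 2, 2)], [(3, 3, 3), (4, 4, 4)]]
after = [[(1, 1, 1), (9, 9, 9)], [(3, 3, 3), (4, 4, 4)]]
clock_region = [(1, 0, 1, 1)]
identical_after_masking = (
    apply_masks(before, clock_region) == apply_masks(after, clock_region)
)
```

Applying the same masks to both images means that any remaining difference comes from content outside the regions declared dynamic.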

Installation

First, add the marketplace, then install the plugin:

/plugin marketplace add ruvnet/ruflo
/plugin install ruflo-browser@ruflo

Quality Score

Verified
98/100
Analyzed about 17 hours ago

Trust Signals

Last commit: about 18 hours ago
Stars: 50.2k
License: MIT

Similar Extensions

Agent Browser

100

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.

Skill
shanraisshan

AutoBrowse

100

Self-improving browser automation via the auto-research loop. Iteratively runs a browsing task, reads the trace, and improves the navigation skill (strategy.md) until it reliably passes. Supports parallel runs across multiple tasks using sub-agents. Use when you want to build or improve browser automation skills for specific website tasks.

Skill
browserbase

Chatgpt Search

100

Search ChatGPT and extract the full response + hydration JSON that powers the UI. Attaches to a running Chrome instance (port 9222 by default), opens ChatGPT, submits a query, waits for the streamed response, and returns structured data: messages, product cards, hydration JSON, and API calls. Use when asked to "search chatgpt", "ask chatgpt", "chatgpt search", "get chatgpt response", or "scrape chatgpt".

Skill
SeifBenayed

Cli Anything Safari

100

Safari browser automation CLI on macOS via safari-mcp. Controls real Safari (native, keeps logins) by wrapping the safari-mcp MCP server. Every one of the 84 MCP tools is exposed 1:1 with schema-accurate arguments — guaranteed parity, no manual drift.

Skill
hkuds

Sentry React SDK

100

Full Sentry SDK setup for React. Use when asked to "add Sentry to React", "install @sentry/react", or configure error monitoring, tracing, session replay, profiling, or logging for React applications. Supports React 16+, React Router v5-v7 non-framework mode, TanStack Router, Redux, Vite, and webpack.

Skill
getsentry

Test A2a Interop

100

Test A2A interoperability between agents by validating Agent Card conformance, exercising all task lifecycle states, and verifying streaming and error handling. Use when verifying a new A2A server implementation before deployment, validating interoperability between two or more A2A agents, running conformance tests in CI/CD for A2A services, debugging failures in multi-agent A2A workflows, or certifying that an agent meets A2A protocol requirements for a registry.

Skill
pjt222

© 2025 SkillRepo · Find the right skill, skip the noise.