AutoBrowse

Skill Verified Active
Part of: Browse

Self-improving browser automation via the auto-research loop. Iteratively runs a browsing task, reads the trace, and improves the navigation skill (strategy.md) until it reliably passes. Supports parallel runs across multiple tasks using sub-agents. Use when you want to build or improve browser automation skills for specific website tasks.

Purpose

To build and refine reliable, production-ready browser automation skills for any website through an iterative, agent-driven process, reducing LLM exploration costs and making outcomes more deterministic.

Features

  • Iterative skill improvement via auto-research loop
  • Parallel task execution using sub-agents
  • Support for local and remote browser environments
  • Generation of self-contained, graduated skills
  • Clear documentation and examples for usage and setup
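
As a rough illustration of the parallel-runs feature, the sketch below runs several tasks concurrently and collects one status per task. It is not the skill's actual implementation: `runTasksInParallel`, the `runLoop` callback, and the status labels are hypothetical names chosen for this example, and plain promises stand in for sub-agents.

```javascript
// Hypothetical sketch: run several tasks concurrently and summarize each one.
// runLoop is a caller-supplied stand-in for one task's improvement loop and
// is assumed to resolve to an object with a boolean `graduated` field.
async function runTasksInParallel(taskNames, runLoop) {
  // allSettled keeps one failing task from hiding the results of the others.
  const settled = await Promise.allSettled(taskNames.map((name) => runLoop(name)));

  return settled.map((outcome, index) => {
    let status = "error";
    if (outcome.status === "fulfilled") {
      status = outcome.value.graduated ? "graduated" : "needs-work";
    }
    return { task: taskNames[index], status };
  });
}
```

`Promise.allSettled` is used instead of `Promise.all` so that a crash in one task still leaves a complete per-task report.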

Use Cases

  • Building automated navigation for specific website tasks
  • Improving existing brittle browser automation scripts
  • Creating site-specific playbooks for agents
  • Reducing LLM exploration costs for repetitive web tasks

Non-Goals

  • Executing arbitrary shell commands
  • Replacing a full testing framework
  • Managing the user's local browser installation
  • Performing destructive operations outside the browser context

Workflow

  1. Parse arguments and orient task mode (single/multi-task, iterations, env).
  2. Set up workspace (tasks/, traces/) and scaffold task.md/strategy.md if needed.
  3. Run the inner agent via `evaluate.mjs` for the specified task and environment.
  4. Read the trace (summary.md) to identify failures or areas for improvement.
  5. Formulate a single hypothesis and update strategy.md.
  6. Repeat the loop until the task passes consistently or max iterations are reached.
  7. If ready, graduate the task by writing a self-contained SKILL.md to ~/.claude/skills/.
  8. Generate a final report for multi-task sessions.
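
Steps 3 to 6 above can be sketched as a small loop. This is a minimal illustration under stated assumptions, not the skill's code: `autoResearchLoop`, `runTask`, `proposeFix`, and `requiredPasses` are hypothetical names, and because the listing does not define "passes consistently", the sketch assumes it means a fixed number of consecutive passing runs. The call to `evaluate.mjs` is left to the caller-supplied `runTask`, since its arguments are not documented here.

```javascript
// Hypothetical sketch of the auto-research loop; not the skill's actual code.
// runTask(strategy) is assumed to resolve to { passed, summary };
// proposeFix(summary) is assumed to return one line of strategy text.
async function autoResearchLoop({
  runTask,
  proposeFix,
  strategy,
  maxIterations = 5,
  requiredPasses = 2, // assumption: "consistently" = N consecutive passes
}) {
  const history = [];
  let consecutivePasses = 0;

  for (let iteration = 1; iteration <= maxIterations; iteration++) {
    // Steps 3-4: run the inner agent, then read its trace summary.
    const { passed, summary } = await runTask(strategy);
    history.push({ iteration, passed });

    if (passed) {
      consecutivePasses += 1;
      // Step 6: stop once the task has passed enough times in a row.
      if (consecutivePasses >= requiredPasses) {
        return { graduated: true, strategy, history };
      }
      continue;
    }

    // Step 5: a single hypothesis per failed run, appended to the strategy.
    consecutivePasses = 0;
    strategy = `${strategy}\n- ${proposeFix(summary)}`;
  }

  // Max iterations reached without a consistent pass.
  return { graduated: false, strategy, history };
}
```

Changing only one thing per failed iteration keeps each run attributable to a single hypothesis, which is why the sketch appends exactly one line to the strategy before retrying.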

Practices

  • Iterative refinement
  • Browser automation
  • Agent development

Prerequisites

  • Node.js 18+
  • browse CLI
  • ANTHROPIC_API_KEY
  • BROWSERBASE_API_KEY (for remote env)
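
A preflight check for these prerequisites might look like the sketch below. The function name `missingPrerequisites` and the `"local"` / `"remote"` values for `targetEnv` are assumptions made for illustration; the listing only says that the Browserbase key is needed for the remote environment. The check for the browse CLI is omitted because its executable name and install location are not given here.

```javascript
// Hypothetical preflight check for the prerequisites listed above.
// Returns the names of any requirements that are not satisfied.
function missingPrerequisites({ nodeVersion, env, targetEnv }) {
  const missing = [];

  // process.version looks like "v18.19.0"; compare the major version only.
  const major = Number.parseInt(nodeVersion.replace(/^v/, "").split(".")[0], 10);
  if (!(major >= 18)) missing.push("Node.js 18+");

  if (!env.ANTHROPIC_API_KEY) missing.push("ANTHROPIC_API_KEY");

  // Assumption: only the remote environment needs a Browserbase key.
  if (targetEnv === "remote" && !env.BROWSERBASE_API_KEY) {
    missing.push("BROWSERBASE_API_KEY");
  }
  return missing;
}

// Usage: check the current process for a local run.
console.log(
  missingPrerequisites({ nodeVersion: process.version, env: process.env, targetEnv: "local" })
);
```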

Trust

  • Issues (attention): With 2 issues opened and 0 closed in the last 90 days, maintainer responsiveness on issues is currently not demonstrable.

Installation

First, add the marketplace, then install the plugin:

/plugin marketplace add browserbase/skills
/plugin install skills@browserbase

Quality Score

Verified: 100/100
Analyzed about 20 hours ago

Trust Signals

Last commit: 1 day ago
Stars: 3.2k
License: MIT

Similar Extensions

Agent Browser

100

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.

Skill
shanraisshan

Chatgpt Search

100

Search ChatGPT and extract the full response + hydration JSON that powers the UI. Attaches to a running Chrome instance (port 9222 by default), opens ChatGPT, submits a query, waits for the streamed response, and returns structured data: messages, product cards, hydration JSON, and API calls. Use when asked to "search chatgpt", "ask chatgpt", "chatgpt search", "get chatgpt response", or "scrape chatgpt".

Skill
SeifBenayed

Senior Backend Engineer

100

Designs and implements backend systems including REST APIs, microservices, database architectures, authentication flows, and security hardening. Use when the user asks to "design REST APIs", "optimize database queries", "implement authentication", "build microservices", "review backend code", "set up GraphQL", "handle database migrations", or "load test APIs". Covers Node.js/Express/Fastify development, PostgreSQL optimization, API security, and backend architecture patterns.

Skill
alirezarezvani

Browser Extract

99

Extract structured data via stored browser-templates or one-shot DOM queries, with mandatory AIDefence PII + prompt-injection gates before content reaches the model

Skill
ruvnet

Agent Browser

99

Interact with Slack workspaces using browser automation. Use when the user needs to check unread channels, navigate Slack, send messages, extract data, find information, search conversations, or automate any Slack task. Triggers include "check my Slack", "what channels have unreads", "send a message to", "search Slack for", "extract from Slack", "find who said", or any task requiring programmatic Slack interaction.

Skill
vercel-labs

Dev Browser

99

Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "test the website", "log into", or any browser interaction request.

Skill
SawyerHood

© 2025 SkillRepo · Find the right skill, skip the noise.