Computer Use Agents

Skill Active

Build AI agents that interact with computers like humans do - viewing screens, moving cursors, clicking buttons, and typing text. Covers Anthropic's Computer Use, OpenAI's Operator/CUA, and open-source alternatives. Critical focus on sandboxing, security, and handling the unique challenges of vision-based control. Use when: computer use, desktop automation agent, screen control AI, vision-based agent, GUI automation.

Purpose

To enable developers to build sophisticated and secure AI agents that can automate tasks and interact with computer interfaces by leveraging vision, reasoning, and action execution.

Features

Perception-Reasoning-Action loop architecture
Sandboxed environment patterns for secure execution
Official Anthropic Computer Use implementation details
Code examples for screen capture, mouse/keyboard control, and bash commands

Use Cases

Building AI agents for computer use from scratch
Integrating vision models with desktop control
Understanding and analyzing agent behavior patterns
Deploying and testing agent behavior safely in isolated environments

Non-Goals

Providing a fully managed AI agent runtime
Directly controlling user systems without sandboxing
Handling complex enterprise-specific automation workflows without customization

Documentation

info:Configuration & parameter referenceWhile the `SKILL.md` provides code examples and explains patterns, specific configuration parameters for the `ComputerUseAgent` class or Anthropic tools are not exhaustively documented with defaults.

Scope

info:Minimal I/O surfaceThe code examples show structured inputs for actions and tool calls, but the exact schema for all parameters and responses isn't fully detailed in the markdown for completeness.
info:Tool surface sizeThe `SKILL.md` describes specific patterns and provides code examples for a few core tools (`computer`, `bash`, `text_editor`), rather than exposing a large number of distinct tools.

Trust

warning:Issues AttentionIn the last 90 days, 17 issues were opened and 4 were closed, indicating a low closure rate and potentially slow maintainer response.

Execution

info:ValidationThe code examples show structured input for tool parameters but do not explicitly detail the use of schema validation libraries like Zod or Pydantic within the SKILL.md context.

Code Execution

info:Error HandlingThe provided code snippets show basic error handling, but the SKILL.md does not comprehensively detail error categorization, structured reporting, or specific recovery steps for all potential failure modes.

Errors

info:Actionable error messagesWhile the documentation emphasizes robustness, the SKILL.md does not explicitly detail the format or content of all potential user-facing error messages or provide specific remediation links.

Protocol

info:Idempotent retry & timeoutsThe provided code examples and patterns do not explicitly detail idempotency for mutations or hard per-call timeouts on agent actions, though the Anthropic implementation might handle some of this.

Practical Utility

info:Edge casesThe documentation highlights critical aspects like sandboxing and vision agent limitations but doesn't exhaustively list failure modes and recovery steps for every scenario.

Safety

info:Halt on unexpected stateWhile the emphasis on sandboxing implies controlled execution, the `SKILL.md` does not explicitly detail machine-readable pre-conditions or instructions to halt on unexpected user-side states.

Installation

npx skills add davila7/claude-code-templates

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

88 /100

Analyzed 6 days ago

Trust Signals

Last commit6 days ago

GitHub owner davila7

Stars27.2k

Downloads 23k

LicenseMIT

Websiteaitmpl.com

Status

View Source

Similar Extensions

Azure Container Registry SDK for Python

100

Azure Container Registry SDK for Python. Use for managing container images, artifacts, and repositories. Triggers: "azure-containerregistry", "ContainerRegistryClient", "container images", "docker registry", "ACR".

Skill

microsoft

Lean Ctx

100

Context Runtime for AI Agents — 59 MCP tools, 10 read modes, 95+ shell patterns, tree-sitter AST for 18 languages. Compresses LLM context by up to 99%. Use when reading files, running shell commands, searching code, or exploring directories. Auto-installs if not present.

Skill

yvgude

Manus

100

Delegate complex, long-running tasks to Manus AI agent for autonomous execution. Use when user says 'use manus', 'delegate to manus', 'send to manus', 'have manus do', 'ask manus', 'check manus sessions', or when tasks require deep web research, market analysis, product comparisons, stock analysis, competitive research, document generation, data analysis, or multi-step workflows that benefit from autonomous agent execution with parallel processing.

Skill

sanjay3290

Risk Management

100

Risk management domain knowledge for trading agents — affective state monitoring, position sizing, drawdown management, tilt detection, and behavioral guardrails. Use when checking risk before trades, managing drawdowns, detecting behavioral drift, or enforcing discipline. Triggers on "risk", "drawdown", "tilt", "position size", "lot size", "confidence", "revenge trading", "overtrading", "discipline".

Skill

mnemox-ai

Setup Gbrain

100

Set up gbrain for this coding agent: install the CLI, initialize a local PGLite or Supabase brain, register MCP, capture per-remote trust policy. One command from zero to "gbrain is running, and this agent can call it." Use when: "setup gbrain", "connect gbrain", "start gbrain", "install gbrain", "configure gbrain for this machine". (gstack)

Skill

garrytan

Skill Creator

100

Guide for creating effective skills

Skill

bobmatnyc