Computer Use Agents
Skill ActiveBuild AI agents that interact with computers like humans do - viewing screens, moving cursors, clicking buttons, and typing text. Covers Anthropic's Computer Use, OpenAI's Operator/CUA, and open-source alternatives. Critical focus on sandboxing, security, and handling the unique challenges of vision-based control. Use when: computer use, desktop automation agent, screen control AI, vision-based agent, GUI automation.
To enable developers to build sophisticated and secure AI agents that can automate tasks and interact with computer interfaces by leveraging vision, reasoning, and action execution.
Features
- Perception-Reasoning-Action loop architecture
- Sandboxed environment patterns for secure execution
- Official Anthropic Computer Use implementation details
- Code examples for screen capture, mouse/keyboard control, and bash commands
Use Cases
- Building AI agents for computer use from scratch
- Integrating vision models with desktop control
- Understanding and analyzing agent behavior patterns
- Deploying and testing agent behavior safely in isolated environments
Non-Goals
- Providing a fully managed AI agent runtime
- Directly controlling user systems without sandboxing
- Handling complex enterprise-specific automation workflows without customization
Documentation
- info:Configuration & parameter referenceWhile the `SKILL.md` provides code examples and explains patterns, specific configuration parameters for the `ComputerUseAgent` class or Anthropic tools are not exhaustively documented with defaults.
Scope
- info:Minimal I/O surfaceThe code examples show structured inputs for actions and tool calls, but the exact schema for all parameters and responses isn't fully detailed in the markdown for completeness.
- info:Tool surface sizeThe `SKILL.md` describes specific patterns and provides code examples for a few core tools (`computer`, `bash`, `text_editor`), rather than exposing a large number of distinct tools.
Trust
- warning:Issues AttentionIn the last 90 days, 17 issues were opened and 4 were closed, indicating a low closure rate and potentially slow maintainer response.
Execution
- info:ValidationThe code examples show structured input for tool parameters but do not explicitly detail the use of schema validation libraries like Zod or Pydantic within the SKILL.md context.
Code Execution
- info:Error HandlingThe provided code snippets show basic error handling, but the SKILL.md does not comprehensively detail error categorization, structured reporting, or specific recovery steps for all potential failure modes.
Errors
- info:Actionable error messagesWhile the documentation emphasizes robustness, the SKILL.md does not explicitly detail the format or content of all potential user-facing error messages or provide specific remediation links.
Protocol
- info:Idempotent retry & timeoutsThe provided code examples and patterns do not explicitly detail idempotency for mutations or hard per-call timeouts on agent actions, though the Anthropic implementation might handle some of this.
Practical Utility
- info:Edge casesThe documentation highlights critical aspects like sandboxing and vision agent limitations but doesn't exhaustively list failure modes and recovery steps for every scenario.
Safety
- info:Halt on unexpected stateWhile the emphasis on sandboxing implies controlled execution, the `SKILL.md` does not explicitly detail machine-readable pre-conditions or instructions to halt on unexpected user-side states.
Installation
npx skills add davila7/claude-code-templatesRuns the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Quality Score
Trust Signals
Similar Extensions
Azure Container Registry SDK for Python
100Azure Container Registry SDK for Python. Use for managing container images, artifacts, and repositories. Triggers: "azure-containerregistry", "ContainerRegistryClient", "container images", "docker registry", "ACR".
Lean Ctx
100Context Runtime for AI Agents — 59 MCP tools, 10 read modes, 95+ shell patterns, tree-sitter AST for 18 languages. Compresses LLM context by up to 99%. Use when reading files, running shell commands, searching code, or exploring directories. Auto-installs if not present.
Manus
100Delegate complex, long-running tasks to Manus AI agent for autonomous execution. Use when user says 'use manus', 'delegate to manus', 'send to manus', 'have manus do', 'ask manus', 'check manus sessions', or when tasks require deep web research, market analysis, product comparisons, stock analysis, competitive research, document generation, data analysis, or multi-step workflows that benefit from autonomous agent execution with parallel processing.
Risk Management
100Risk management domain knowledge for trading agents — affective state monitoring, position sizing, drawdown management, tilt detection, and behavioral guardrails. Use when checking risk before trades, managing drawdowns, detecting behavioral drift, or enforcing discipline. Triggers on "risk", "drawdown", "tilt", "position size", "lot size", "confidence", "revenge trading", "overtrading", "discipline".
Setup Gbrain
100Set up gbrain for this coding agent: install the CLI, initialize a local PGLite or Supabase brain, register MCP, capture per-remote trust policy. One command from zero to "gbrain is running, and this agent can call it." Use when: "setup gbrain", "connect gbrain", "start gbrain", "install gbrain", "configure gbrain for this machine". (gstack)
Skill Creator
100Guide for creating effective skills