跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Agent Benchmark Suite

技能 已验证 活跃

Agent skill for benchmark-suite - invoke with $agent-benchmark-suite

目的

To automate and enhance the performance optimization lifecycle for software systems by providing comprehensive benchmarking, regression detection, and validation capabilities.

功能

  • Comprehensive benchmarking framework
  • Automated performance regression detection
  • Automated performance testing and validation
  • Integration with MCP for advanced analysis
  • CLI commands for operational control

使用场景

  • Running performance benchmarks for new features or infrastructure changes
  • Detecting performance regressions before they impact users
  • Validating performance against Service Level Agreements (SLAs)
  • Automating performance testing as part of CI/CD pipelines

非目标

  • Functional testing of application logic
  • Security vulnerability scanning beyond performance-related aspects
  • End-user application support or bug fixing

工作流

  1. Configure benchmark parameters (duration, iterations, baseline)
  2. Execute comprehensive benchmark suite or specific benchmarks
  3. Analyze benchmark results for performance metrics and trends
  4. Detect performance regressions by comparing current results with historical data
  5. Validate performance against predefined criteria (SLAs, scalability)
  6. Generate summary reports and recommendations

实践

  • Performance Optimization
  • Automated Testing
  • Regression Prevention
  • Continuous Integration

先决条件

  • Claude Code environment
  • Access to MCP server (for full functionality)

安装

npx skills add ruvnet/ruflo

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
99 /100
1 day ago 分析

信任信号

最近提交1 day ago
星标50.2k
许可证MIT
状态
查看源代码

类似扩展

Telegram Crabbox E2e Proof

100

Use when reviewing, reproducing, or proving OpenClaw Telegram behavior with a real Telegram user on Crabbox, including PR review workflows that need an agent-controlled Telegram Desktop recording, TDLib user-driver commands, Convex-leased credentials, WebVNC observation, and motion-trimmed artifacts.

技能
steipete

Openclaw Testing

100

Choose, run, rerun, or debug OpenClaw tests, CI checks, Docker E2E lanes, release validation, and the cheapest safe verification path.

技能
steipete

OpenClaw Release Maintainer

100

Prepare or verify OpenClaw stable/beta releases, changelogs, release notes, publish commands, and artifacts.

技能
steipete

ClawSweeper Skill

100

Use for all ClawSweeper work: OpenClaw issue/PR sweep reports, commit-review reports, repair jobs, cloud fix PRs, @clawsweeper maintainer mention commands, trusted ClawSweeper-reviewed autofix/automerge, GitHub Actions monitoring, permissions, gates, and manual backfills.

技能
steipete

Agent Browser

100

AI 代理的浏览器自动化 CLI。当用户需要与网站交互时使用,包括浏览页面、填写表单、点击按钮、截屏、提取数据、测试 Web 应用或自动化任何浏览器任务。触发条件包括请求“打开网站”、“填表”、“点击按钮”、“截屏”、“抓取页面数据”、“测试此 Web 应用”、“登录网站”、“自动化浏览器操作”或任何需要以编程方式进行 Web 交互的任务。

技能
shanraisshan

Benchmark

100

Performance regression detection using the browse daemon. Establishes baselines for page load times, Core Web Vitals, and resource sizes. Compares before/after on every PR. Tracks performance trends over time. Use when: "performance", "benchmark", "page speed", "lighthouse", "web vitals", "bundle size", "load time". (gstack) Voice triggers (speech-to-text aliases): "speed test", "check performance".

技能
garrytan