Minimal Run And Audit

技能已验证活跃

用于 README 优先的 AI 代码库进行可信执行和报告的技能。当任务是专门从选定的 smoke test 或已文档化的推理或评估命令捕获或标准化证据，并写入标准化的 `repro_outputs/` 文件（包括在存储库文件更改时生成补丁说明）时使用。请勿用于训练执行、初始代码库引入、通用环境设置、论文查找、目标选择或单独的端到端编排。

目的

提供一种可信且可审计的方式来执行和报告 AI 研究代码库中的已文档化命令，确保一致地捕获证据以供复现。

功能

AI 代码库复现的可信执行通道
捕获命令输出、错误和文件更改
生成标准化的 `repro_outputs/` 文件
处理执行超时和非零退出码
在存储库文件更改时支持补丁说明

使用场景

验证已文档化的推理或评估命令
捕获 smoke test 的证据
标准化执行结果以供审计
报告命令执行后存储库文件的更改

非目标

初始代码库扫描或引入
通用环境设置
论文查找或目标选择
单独的端到端编排
训练执行或状态管理

安装

npx skills add lllllllama/ai-paper-reproduction-skill

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证

100 /100

1 day ago 分析

信任信号

最近提交5 days ago

GitHub 所有者 lllllllama

星标75

许可证MIT

状态

查看源代码

类似扩展

Context Mode

100

从 GitHub 更新 context-mode 并修复 hooks/settings。拉取最新代码，构建，安装，更新 npm 全局包，配置 hooks。触发器：/context-mode:ctx-upgrade

技能

mksglu

Performance Analysis

100

Comprehensive performance analysis, bottleneck detection, and optimization recommendations for Claude Flow swarms

技能

ruvnet

Cleanup Dashboards

100

Audit and consolidate HubSpot reporting dashboards. Identifies unused, duplicate, or outdated dashboards. Must be performed manually — no dashboard API is available.

技能

TomGranot

Status

100

Display the current state of the FPF knowledge base

技能

NeoLabHQ

Pm Strategic Review

100

End-of-quarter strategic review in narrative style with a bets scorecard. Use when someone says "quarter review", "strategic review", "what happened last quarter", "quarterly retro", "bets scorecard", "review our bets", "end of quarter report".

技能

marfoerst

Ops Revenue

100

Revenue and costs tracker. AWS spend via aws ce, credits tracker, project revenue stages. Shows burn rate, runway estimate, credits expiring.

技能

Lifecycle-Innovations-Limited