跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Document Hunter

技能 活跃

Searches and retrieves documents from free public sources using automated browser navigation. Use when research needs primary source documents like court filings, government reports, or public records.

目的

To automate the time-consuming task of finding and downloading primary source documents from free public archives for research purposes.

功能

  • Searches public document archives systematically
  • Automates browser navigation and PDF downloads
  • Organizes downloaded documents with metadata
  • Reports on found documents, sources searched, and remaining gaps

使用场景

  • Retrieving court filings for legal research
  • Finding government reports for policy analysis
  • Gathering public records for investigative journalism
  • Automating the collection of primary source material for academic research

非目标

  • Accessing documents behind paywalls (e.g., full PACER access without RECAP)
  • Downloading copyrighted material
  • Replacing manual verification of document content
  • Providing legal advice based on retrieved documents

Documentation

  • info:Configuration & parameter referenceThe SKILL.md mentions an argument hint but does not fully document expected arguments or their defaults. The Python script template also implies parameters but they are not explicitly documented in the skill description.

Maintenance

  • warning:Dependency ManagementThe SKILL.md lists Python dependencies ('playwright', 'beautifulsoup4', 'requests') but does not include a lockfile or explicit version pinning, and lacks automated vulnerability scanning.

Security

  • warning:InjectionWhile the skill targets specific websites, it relies on general web scraping techniques and DOM manipulation, which could be vulnerable to injection if not carefully implemented, especially given the `query_selector_all` and `fill` methods used.
  • warning:Transitive Supply-Chain GrenadesThe skill uses Playwright which fetches browser binaries and scripts; while generally safe, it involves external dependencies. The reliance on external websites for content also introduces a risk if those sites are compromised or change their structure.

Portability

  • warning:Structural AssumptionThe `site-patterns.md` describes specific site structures and selectors, which could break if websites change their HTML or JavaScript rendering. The Python code template also assumes specific directory structures for output.
  • warning:Runtime stabilityThe skill requires Playwright and Chromium, and assumes a POSIX environment (Linux or macOS) for installation and execution, as stated in the README. Windows users must use WSL. This limits its cross-platform stability without explicit handling.
  • warning:Stack assumptionsThe SKILL.md mentions requirements like Playwright and Chromium, but does not explicitly declare the runtime surface (e.g., Python version) or minimum version for the bundled scripts. It assumes a POSIX environment.

Versioning

  • warning:Release ManagementThe SKILL.md frontmatter declares a model but not a package version. While the README indicates releases, there is no explicit semver versioning wired into the skill's metadata or installation instructions, and installation defaults to `main`.

Code Execution

  • warning:ValidationThe Python script template includes placeholders for input validation and sanitization, but the provided code does not show explicit schema validation or sanitization for inputs like case names or URLs.
  • warning:Error HandlingThe Python script template includes a basic try-except block for source functions but lacks detailed error categorization, structured error reporting, or specific recovery steps for common issues like site blocking or download failures.
  • warning:LoggingThere is no explicit mention or implementation of local audit logging for destructive actions or outbound calls within the provided SKILL.md or supporting files. The console output during execution is the primary feedback.

Install

  • warning:Installation instructionThe installation instructions are primarily in the parent README and assume familiarity with the larger project. While it details Playwright setup, it lacks specific copy-paste invocations for this skill and does not clearly document authentication requirements for sources like Scribd.

Errors

  • warning:Actionable error messagesThe provided script template shows basic error reporting but lacks detailed, actionable messages for common failure modes like site blocking, download failures, or missing documents. Remediation steps are not clearly defined per error.

Execution

  • warning:Pinned dependenciesThe skill lists Python dependencies but does not provide a lockfile or specify exact versions, which can lead to runtime issues due to dependency drift. The shebang/header for scripts is also not fully detailed.

Protocol

  • warning:Idempotent retry & timeoutsThe Python script template shows basic timeouts for page navigation, but lacks explicit implementation of per-call timeouts for download operations and does not detail idempotency considerations for file operations.

Practical Utility

  • warning:Usage examplesThe SKILL.md provides a Python code template but lacks concrete end-to-end usage examples with specific inputs and expected outputs for the 'document-hunter' skill itself. The README has workflow examples for a larger music project.
  • warning:Edge casesWhile the troubleshooting section in `site-patterns.md` and the main SKILL.md touch on some failure modes (site blocked, no results, download fails), they lack structured documentation of specific symptoms and recovery steps for each edge case as expected for a complete skill.

Safety

  • warning:Halt on unexpected stateThe skill does not explicitly list machine-readable preconditions or instruct to abort on unexpected pre-states like a dirty working tree or missing dependencies beyond the mentioned Python requirements. The workflow does not appear to have explicit rollback procedures.

安装

请先添加 Marketplace

/plugin marketplace add bitwize-music-studio/claude-ai-music-skills
/plugin install claude-ai-music-skills@bitwize-music

质量评分

72 /100
1 day ago 分析

信任信号

最近提交2 days ago
星标174
许可证CC0-1.0
状态
查看源代码

类似扩展

Agent Browser

100

AI 代理的浏览器自动化 CLI。当用户需要与网站交互时使用,包括浏览页面、填写表单、点击按钮、截屏、提取数据、测试 Web 应用或自动化任何浏览器任务。触发条件包括请求“打开网站”、“填表”、“点击按钮”、“截屏”、“抓取页面数据”、“测试此 Web 应用”、“登录网站”、“自动化浏览器操作”或任何需要以编程方式进行 Web 交互的任务。

技能
shanraisshan

Manus

100

Delegate complex, long-running tasks to Manus AI agent for autonomous execution. Use when user says 'use manus', 'delegate to manus', 'send to manus', 'have manus do', 'ask manus', 'check manus sessions', or when tasks require deep web research, market analysis, product comparisons, stock analysis, competitive research, document generation, data analysis, or multi-step workflows that benefit from autonomous agent execution with parallel processing.

技能
sanjay3290

Dev Browser

99

使用持久化页面状态进行浏览器自动化。当用户要求导航网站、填写表单、截屏、提取网页数据、测试 Web 应用或自动化浏览器工作流时使用。触发短语包括“转到 [url]”、“点击”、“填写表单”、“截屏”、“抓取”、“自动化”、“测试网站”、“登录”或任何浏览器交互请求。

技能
SawyerHood

Project Session Manager

100

Worktree-first dev environment manager for issues, PRs, and features with optional tmux sessions

技能
Yeachan-Heo

Public Google Drive

100

无需 OAuth 即可创建公开的 Google Docs 或 Google Sheet 文件。使用此技能创建和编辑 Google Docs 和 Sheets,无需登录 Google。文档可通过共享链接查看。首次使用时自动注册。

技能
zagmoai

Oh My Claudecode

100

Process-first advisor routing for Claude, Codex, or Gemini via `omc ask`, with artifact capture and no raw CLI assembly

技能
Yeachan-Heo