此内容尚未提供您的语言版本,正在以英文显示。

PDF Extraction

Skill 已验证

Extract text, tables, and metadata from PDFs using pdfplumber

AI 摘要

This skill leverages the `pdfplumber` library to precisely extract textual content, tabular data, and document metadata from PDF files. It offers detailed control over extraction parameters and includes examples for common use cases like converting tables to DataFrames and processing invoice data.

Documentation

info:Configuration & parameter referenceWhile the code snippets show usage of pdfplumber with parameters like tolerances, these specific parameters and their default values are not explicitly documented in the SKILL.md or accompanying files.

Code Execution

info:ValidationThe provided code snippets demonstrate basic usage of `pdfplumber` but do not explicitly show the use of a schema validation library for input parameters like file paths or extraction options.

安装

npx skills add claude-office-skills/skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

3 months ago

claude-office-skills

98 stars

MIT

更新于 6 days ago

查看源代码

类似扩展

Document Parser Skill

Skill

claude-office-skills

PDF to DOCX Converter

Convert PDF files to editable Word documents using pdf2docx

Skill

claude-office-skills

Chat with PDF

Answer questions about PDF content, summarize, and extract information

Skill

claude-office-skills

Table Extractor

Skill

claude-office-skills

Smart OCR Skill

Skill

claude-office-skills

GPU Document Processing

Use when processing large PDFs, document collections, or bulk text extraction tasks that benefit from GPU-accelerated processing. Triggers when the user provides large documents or needs bulk document analysis.

Skill

langchain-ai