PDF Extraction
Skill 已验证Extract text, tables, and metadata from PDFs using pdfplumber
This skill leverages the `pdfplumber` library to precisely extract textual content, tabular data, and document metadata from PDF files. It offers detailed control over extraction parameters and includes examples for common use cases like converting tables to DataFrames and processing invoice data.
Documentation
- info:Configuration & parameter referenceWhile the code snippets show usage of pdfplumber with parameters like tolerances, these specific parameters and their default values are not explicitly documented in the SKILL.md or accompanying files.
Code Execution
- info:ValidationThe provided code snippets demonstrate basic usage of `pdfplumber` but do not explicitly show the use of a schema validation library for input parameters like file paths or extraction options.
安装
npx skills add claude-office-skills/skills通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。
类似扩展
Document Parser Skill
92>
PDF to DOCX Converter
98Convert PDF files to editable Word documents using pdf2docx
Chat with PDF
95Answer questions about PDF content, summarize, and extract information
Table Extractor
92>
Smart OCR Skill
92>
GPU Document Processing
85Use when processing large PDFs, document collections, or bulk text extraction tasks that benefit from GPU-accelerated processing. Triggers when the user provides large documents or needs bulk document analysis.