PDF Extraction
Skill · Verified
Extract text, tables, and metadata from PDFs using pdfplumber
This skill uses the `pdfplumber` library to extract text, tables, and document metadata from PDF files. It exposes the library's extraction parameters and includes examples for common use cases such as converting tables to DataFrames and processing invoice data.
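As a rough illustration of the workflow described above, the sketch below collects metadata, page text, and tables from one PDF. It is not taken from the skill itself: the function names `extract_pdf` and `table_to_records` are hypothetical, and the tolerance values of 3 are assumed to match pdfplumber's defaults for `extract_text`.

```python
from typing import Any, Dict, List, Optional


def table_to_records(table: List[List[Optional[str]]]) -> List[Dict[str, str]]:
    """Turn a table (list of rows, first row = header) into a list of dicts.

    This is the row shape pdfplumber returns from extract_tables():
    each cell is a string or None.
    """
    if not table:
        return []
    header = [(cell or "").strip() for cell in table[0]]
    records = []
    for row in table[1:]:
        # Skip rows that are entirely empty (None or whitespace only).
        if not any((cell or "").strip() for cell in row):
            continue
        records.append({h: (cell or "").strip() for h, cell in zip(header, row)})
    return records


def extract_pdf(path: str, x_tolerance: float = 3, y_tolerance: float = 3) -> Dict[str, Any]:
    """Collect metadata, per-page text, and tables from one PDF (hypothetical helper)."""
    # Third-party dependency; imported here so table_to_records works without it.
    import pdfplumber

    result: Dict[str, Any] = {"metadata": {}, "pages": []}
    with pdfplumber.open(path) as pdf:
        result["metadata"] = dict(pdf.metadata or {})
        for page in pdf.pages:
            # Tolerances control how nearby characters are grouped into words and lines.
            text = page.extract_text(x_tolerance=x_tolerance, y_tolerance=y_tolerance) or ""
            tables = [table_to_records(t) for t in page.extract_tables()]
            result["pages"].append(
                {"number": page.page_number, "text": text, "tables": tables}
            )
    return result
```

If pandas is installed, the output of `table_to_records` can be passed straight to `pandas.DataFrame(...)`, since the constructor accepts a list of dicts. Treating the first row as the header is an assumption that will not hold for every PDF.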
Documentation
- info: Configuration & parameter reference. The code snippets pass pdfplumber parameters such as tolerances, but neither SKILL.md nor the accompanying files document these parameters or their default values.
Code Execution
- info: Validation. The provided code snippets demonstrate basic usage of `pdfplumber` but do not show a schema validation library being used for input parameters such as file paths or extraction options.
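One way the validation gap noted above could be closed is sketched below. It uses only the standard library rather than a dedicated schema validation package, and the class `ExtractionOptions` and function `check_exists` are hypothetical names, not part of the skill or of pdfplumber.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class ExtractionOptions:
    """Hypothetical container for the inputs an extraction call might take."""

    path: str
    x_tolerance: float = 3.0  # assumed default, not confirmed by the skill's docs
    y_tolerance: float = 3.0  # assumed default, not confirmed by the skill's docs

    def __post_init__(self) -> None:
        # Reject anything that does not look like a PDF by extension.
        if Path(self.path).suffix.lower() != ".pdf":
            raise ValueError(f"expected a .pdf file, got: {self.path!r}")
        # Tolerances must be non-negative numbers (bool is excluded explicitly).
        for name in ("x_tolerance", "y_tolerance"):
            value = getattr(self, name)
            if isinstance(value, bool) or not isinstance(value, (int, float)) or value < 0:
                raise ValueError(f"{name} must be a non-negative number, got: {value!r}")


def check_exists(options: ExtractionOptions) -> Path:
    """Return the expanded path, failing early if the file is missing."""
    resolved = Path(options.path).expanduser()
    if not resolved.is_file():
        raise FileNotFoundError(resolved)
    return resolved
```

Checking the options at construction time means a bad path or tolerance fails before any PDF is opened. The file-existence check is kept separate so the options object can be built and tested without touching the filesystem.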
Installation
npx skills add claude-office-skills/skills
Runs the Vercel skills CLI (skills.sh) via npx; requires Node.js installed locally and at least one skills-compatible agent (such as Claude Code, Cursor, or Codex). Assumes the repository follows the agentskills.io format.
Similar extensions
Document Parser Skill
92
PDF to DOCX Converter
98: Convert PDF files to editable Word documents using pdf2docx
Chat with PDF
95: Answer questions about PDF content, summarize, and extract information
Table Extractor
92
Smart OCR Skill
92
GPU Document Processing
85: Use when processing large PDFs, document collections, or bulk text extraction tasks that benefit from GPU-accelerated processing. Triggers when the user provides large documents or needs bulk document analysis.