PDF Processing Skill
Extract text from PDFs, fill forms, and merge documents
Gives an agent the ability to work with PDF documents programmatically for tasks such as data extraction, form completion, and document assembly.
Features
- Extract text content from PDFs
- Fill form fields in PDF documents
- Merge multiple PDFs into a single document
- Split PDFs into individual pages
Use Cases
- Automating data extraction from text-based or form-based PDFs.
- Generating filled PDF forms based on provided data.
- Consolidating multiple PDF reports or chapters into a single file.
- Separating a multi-page PDF into individual documents for easier distribution.
Non-Goals
- Performing optical character recognition (OCR) on image-based PDFs; pypdf reads only an existing text layer and does not perform OCR itself.
- Editing or creating complex PDF layouts from scratch.
- Handling encrypted or password-protected PDFs without the password.
Workflow
- Identify the specific PDF processing task required (extraction, form filling, merging, splitting).
- Provide the relevant PDF file(s) and any necessary data (e.g., form field values).
- Execute the skill's functionality, which uses the pypdf library to perform the requested operation.
- Receive the processed PDF file or extracted text as output.
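The workflow above amounts to: pick a task, check its inputs, run it, return the result. A minimal sketch of that dispatch step follows. `PdfTask`, `MIN_FILES`, `validate`, and `run` are all hypothetical names for illustration, and the handlers are passed in by the caller rather than being part of the skill.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class PdfTask:
    kind: str                 # "extract", "fill", "merge", or "split"
    files: List[str]          # input PDF paths
    data: Dict[str, str] = field(default_factory=dict)  # form field values


# Minimum number of input files each task kind needs (an assumption).
MIN_FILES = {"extract": 1, "fill": 1, "merge": 2, "split": 1}


def validate(task: PdfTask) -> None:
    """Reject unknown task kinds and tasks that lack required inputs."""
    if task.kind not in MIN_FILES:
        raise ValueError(f"unsupported task: {task.kind!r}")
    if len(task.files) < MIN_FILES[task.kind]:
        raise ValueError(
            f"{task.kind} needs at least {MIN_FILES[task.kind]} file(s)"
        )
    if task.kind == "fill" and not task.data:
        raise ValueError("fill needs form field values")


def run(task: PdfTask, handlers: Dict[str, Callable[[PdfTask], object]]):
    """Validate the task, then hand it to the handler for its kind."""
    validate(task)
    return handlers[task.kind](task)
```

Validating before dispatch means a missing file or missing field data is reported before any PDF is opened.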
Installation
npx skills add jlowin/fastmcp
Runs the Vercel skills CLI (skills.sh) via npx. Requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Similar Extensions
Nutrient Document Processing (score 98)
Process documents with Nutrient DWS. Use when the user wants to generate PDFs from HTML or URLs, convert Office/images/PDFs, assemble or split packets, OCR scans, extract text/tables/key-value pairs, redact PII, watermark, sign, fill forms, optimize PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF, merge these PDFs, OCR this scan, extract tables, redact PII, sign this PDF, make this PDF/A, or linearize for web delivery.
Azure AI Document Intelligence SDK for Java (score 98)
Azure AI Document Intelligence SDK for Java (com.azure:azure-ai-documentintelligence). Use for extracting text, tables, key-value pairs from documents, receipts, invoices, IDs, or building custom document models. Triggers: "document intelligence java", "form recognizer java", "extract text from PDF java", "OCR document java", "analyze invoice receipt java", "custom document model java", "document classification java".
Generate Restaurant Menu (score 100)
Generate a branded restaurant menu PDF with sections, items, prices, and descriptions.
Extract Fleet Vehicle Registration (score 100)
Extract vehicle identification, owner details, registration dates, and technical specifications from vehicle registration documents.
Convert Resume to Markdown (score 100)
Convert a resume PDF to clean markdown for LLM parsing or candidate pipelines.
Eyeball (score 100)
Document analysis with inline source screenshots. When you ask Copilot to analyze a document, Eyeball generates a Word doc where every factual claim includes a highlighted screenshot from the source material so you can verify it with your own eyes.