PDF Processing Skill

Skill Verified Active

Extract text from PDFs, fill forms, and merge documents

Purpose

To provide an agent with the capability to programmatically interact with PDF documents for tasks such as data extraction, form completion, and document assembly.

Features

Extract text content from PDFs
Fill form fields in PDF documents
Merge multiple PDFs into a single document
Split PDFs into individual pages

Use Cases

Automating data extraction from scanned or form-based PDFs.
Generating filled PDF forms based on provided data.
Consolidating multiple PDF reports or chapters into a single file.
Separating a multi-page PDF into individual documents for easier distribution.

Non-Goals

Performing optical character recognition (OCR) on image-based PDFs (beyond what pypdf provides).
Editing or creating complex PDF layouts from scratch.
Handling encrypted or password-protected PDFs without the password.

Workflow

Identify the specific PDF processing task required (extraction, form filling, merging, splitting).
Provide the relevant PDF file(s) and any necessary data (e.g., form field values).
Execute the skill's functionality, which uses the pypdf library to perform the requested operation.
Receive the processed PDF file or extracted text as output.

Installation

npx skills add jlowin/fastmcp

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

Verified

99 /100

Analyzed about 19 hours ago

Trust Signals

Last commit1 day ago

GitHub owner jlowin

Stars25.1k

LicenseApache-2.0

Websitegofastmcp.com

Status

View Source

Similar Extensions

Nutrient Document Processing

Process documents with Nutrient DWS. Use when the user wants to generate PDFs from HTML or URLs, convert Office/images/PDFs, assemble or split packets, OCR scans, extract text/tables/key-value pairs, redact PII, watermark, sign, fill forms, optimize PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF, merge these PDFs, OCR this scan, extract tables, redact PII, sign this PDF, make this PDF/A, or linearize for web delivery.

Skill

PSPDFKit-labs

Azure AI Document Intelligence SDK for Java

Azure AI Document Intelligence SDK for Java (com.azure:azure-ai-documentintelligence). Use for extracting text, tables, key-value pairs from documents, receipts, invoices, IDs, or building custom document models. Triggers: "document intelligence java", "form recognizer java", "extract text from PDF java", "OCR document java", "analyze invoice receipt java", "custom document model java", "document classification java".

Skill

microsoft

Generate Restaurant Menu

100

Generate a branded restaurant menu PDF with sections, items, prices, and descriptions.

Skill

iterationlayer

Extract Fleet Vehicle Registration

100

Extract vehicle identification, owner details, registration dates, and technical specifications from vehicle registration documents.

Skill

iterationlayer

Convert Resume to Markdown

100

Convert a resume PDF to clean markdown for LLM parsing or candidate pipelines.

Skill

iterationlayer

Eyeball

100

Document analysis with inline source screenshots. When you ask Copilot to analyze a document, Eyeball generates a Word doc where every factual claim includes a highlighted screenshot from the source material so you can verify it with your own eyes.

Skill

github