Skip to main content

PDF Processing Skill

Skill Verified Active

Extract text from PDFs, fill forms, and merge documents

Purpose

To provide an agent with the capability to programmatically interact with PDF documents for tasks such as data extraction, form completion, and document assembly.

Features

  • Extract text content from PDFs
  • Fill form fields in PDF documents
  • Merge multiple PDFs into a single document
  • Split PDFs into individual pages

Use Cases

  • Automating data extraction from scanned or form-based PDFs.
  • Generating filled PDF forms based on provided data.
  • Consolidating multiple PDF reports or chapters into a single file.
  • Separating a multi-page PDF into individual documents for easier distribution.

Non-Goals

  • Performing optical character recognition (OCR) on image-based PDFs (beyond what pypdf provides).
  • Editing or creating complex PDF layouts from scratch.
  • Handling encrypted or password-protected PDFs without the password.

Workflow

  1. Identify the specific PDF processing task required (extraction, form filling, merging, splitting).
  2. Provide the relevant PDF file(s) and any necessary data (e.g., form field values).
  3. Execute the skill's functionality, which uses the pypdf library to perform the requested operation.
  4. Receive the processed PDF file or extracted text as output.

Installation

npx skills add jlowin/fastmcp

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

Verified
99 /100
Analyzed about 19 hours ago

Trust Signals

Last commit1 day ago
Stars25.1k
LicenseApache-2.0
Status
View Source

Similar Extensions

Nutrient Document Processing

98

Process documents with Nutrient DWS. Use when the user wants to generate PDFs from HTML or URLs, convert Office/images/PDFs, assemble or split packets, OCR scans, extract text/tables/key-value pairs, redact PII, watermark, sign, fill forms, optimize PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF, merge these PDFs, OCR this scan, extract tables, redact PII, sign this PDF, make this PDF/A, or linearize for web delivery.

Skill
PSPDFKit-labs

Azure AI Document Intelligence SDK for Java

98

Azure AI Document Intelligence SDK for Java (com.azure:azure-ai-documentintelligence). Use for extracting text, tables, key-value pairs from documents, receipts, invoices, IDs, or building custom document models. Triggers: "document intelligence java", "form recognizer java", "extract text from PDF java", "OCR document java", "analyze invoice receipt java", "custom document model java", "document classification java".

Skill
microsoft

Generate Restaurant Menu

100

Generate a branded restaurant menu PDF with sections, items, prices, and descriptions.

Skill
iterationlayer

Extract Fleet Vehicle Registration

100

Extract vehicle identification, owner details, registration dates, and technical specifications from vehicle registration documents.

Skill
iterationlayer

Convert Resume to Markdown

100

Convert a resume PDF to clean markdown for LLM parsing or candidate pipelines.

Skill
iterationlayer

Eyeball

100

Document analysis with inline source screenshots. When you ask Copilot to analyze a document, Eyeball generates a Word doc where every factual claim includes a highlighted screenshot from the source material so you can verify it with your own eyes.

Skill
github

© 2025 SkillRepo · Find the right skill, skip the noise.