PDF Processing Anthropic
Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.
The skill provides a unified interface and extensive examples for performing virtually any operation on PDF files, from simple text extraction to complex form filling and OCR.
Features
- Extract text and tables from PDFs
- Merge, split, rotate, and watermark PDFs
- Create new PDFs programmatically
- Fillable and non-fillable PDF form processing
- OCR for scanned PDFs
- Extract images from PDFs
- Password protection and decryption
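As an illustration of the split feature listed above, here is a minimal sketch that cuts a PDF into fixed-size chunks. It assumes pypdf (one of the libraries this page names) is installed; the function names `chunk_page_ranges` and `split_pdf` and the output file naming are hypothetical and are not taken from the skill's own scripts.

```python
from pathlib import Path


def chunk_page_ranges(total_pages: int, chunk_size: int) -> list[tuple[int, int]]:
    """Return half-open (start, stop) page index ranges covering the document."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    return [
        (start, min(start + chunk_size, total_pages))
        for start in range(0, total_pages, chunk_size)
    ]


def split_pdf(src: str, out_dir: str, chunk_size: int) -> list[Path]:
    """Write one PDF per chunk of pages and return the paths written."""
    # Imported here so the planning helper above works without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(src)
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    written = []
    for start, stop in chunk_page_ranges(len(reader.pages), chunk_size):
        writer = PdfWriter()
        for index in range(start, stop):
            writer.add_page(reader.pages[index])
        # Hypothetical naming scheme: <stem>_<first page>-<last page>.pdf (1-based).
        target = out / f"{Path(src).stem}_{start + 1}-{stop}.pdf"
        with open(target, "wb") as handle:
            writer.write(handle)
        written.append(target)
    return written
```

Keeping the range planning separate from the file I/O makes the page arithmetic easy to check on its own before any PDF is touched.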
Use Cases
- Automating the processing of bulk PDF documents
- Extracting structured data from PDF reports
- Generating standardized PDF forms
- Archiving and organizing PDF content
Non-Goals
- Operating on file types other than PDF
- Performing complex graphic design within PDFs
- Managing cloud storage for PDFs
License
- critical: License usability. The LICENSE.txt file states 'Proprietary. See LICENSE.txt' and includes significant restrictions on copying, distribution, and creation of derivative works, which are incompatible with standard OSS licenses.
Maintenance
- warning: Dependency Management. The skill relies on several Python libraries (pypdf, pdfplumber, reportlab, pytesseract, pdf2image), but there is no clear indication of dependency pinning or vulnerability checking mechanisms.
Code Execution
- warning: Validation. While the provided scripts include some validation (e.g., `check_bounding_boxes.py`), the core skill logic doesn't explicitly demonstrate schema validation libraries for all inputs and outputs.
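To make the kind of input validation mentioned above concrete, the following sketch checks form-field bounding boxes before they are used. It is not the logic of `check_bounding_boxes.py`, whose contents are not shown on this page; the `Box` type, the `(x0, y0, x1, y1)` coordinate convention, and the function names are all assumptions made for illustration.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Box:
    """A labelled rectangle; (x0, y0) and (x1, y1) are opposite corners (assumed convention)."""
    label: str
    x0: float
    y0: float
    x1: float
    y1: float


def overlaps(a: Box, b: Box) -> bool:
    """True when the two rectangles share interior area (touching edges do not count)."""
    return a.x0 < b.x1 and b.x0 < a.x1 and a.y0 < b.y1 and b.y0 < a.y1


def validate_boxes(boxes, page_width, page_height):
    """Return a list of human-readable problems; an empty list means the boxes pass."""
    errors = []
    for box in boxes:
        if not (box.x0 < box.x1 and box.y0 < box.y1):
            errors.append(f"{box.label}: empty or inverted box")
        if box.x0 < 0 or box.y0 < 0 or box.x1 > page_width or box.y1 > page_height:
            errors.append(f"{box.label}: outside page")
    for a, b in combinations(boxes, 2):
        if overlaps(a, b):
            errors.append(f"{a.label} overlaps {b.label}")
    return errors
```

Returning a list of messages rather than raising on the first problem lets a caller report every bad field in one pass.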
Install
- info: Installation instruction. The SKILL.md provides Python code examples for usage, but explicit installation instructions for the skill itself or its dependencies are not clearly laid out in a copy-paste format.
Execution
- warning: Pinned dependencies. While the SKILL.md lists Python libraries used, there is no explicit mention or demonstration of pinned dependencies or lockfiles to ensure consistent execution.
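One way a user could address the pinning gap locally is to record the versions already installed in their environment. The sketch below uses only the standard library's `importlib.metadata`; the package list is copied from the warnings on this page rather than read from the skill's files, and the helper name `pin_lines` is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version

# Libraries named in the warnings on this page (an assumption about what the skill needs).
PACKAGES = ["pypdf", "pdfplumber", "reportlab", "pytesseract", "pdf2image"]


def pin_lines(packages):
    """Return requirements-style lines, one per distribution name."""
    lines = []
    for name in packages:
        try:
            lines.append(f"{name}=={version(name)}")
        except PackageNotFoundError:
            # Not installed here: emit a comment instead of guessing a version.
            lines.append(f"# {name}: not installed")
    return lines


if __name__ == "__main__":
    print("\n".join(pin_lines(PACKAGES)))
```

Redirecting the output to a `requirements.txt` file gives a reproducible starting point; it does not replace a proper lockfile or a vulnerability scan.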
Installation
First, add the marketplace
/plugin marketplace add lawvable/awesome-legal-skills
Then install the plugin
/plugin install pdf-processing-anthropic@lawvable
Similar Extensions
Document Extraction API (score 99)
Extract structured data from documents using AI-powered field extraction.
Nutrient Document Processing (score 98)
Process documents with Nutrient DWS. Use when the user wants to generate PDFs from HTML or URLs, convert Office/images/PDFs, assemble or split packets, OCR scans, extract text/tables/key-value pairs, redact PII, watermark, sign, fill forms, optimize PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF, merge these PDFs, OCR this scan, extract tables, redact PII, sign this PDF, make this PDF/A, or linearize for web delivery.
Extract Fleet Vehicle Registration (score 100)
Extract vehicle identification, owner details, registration dates, and technical specifications from vehicle registration documents.
Convert Resume to Markdown (score 100)
Convert a resume PDF to clean markdown for LLM parsing or candidate pipelines.
Extract Receipt Data (score 99)
Extract merchant, date, line items, tax, and total from receipts.
Convert Contract To Markdown (score 99)
Convert a contract PDF to clean markdown for clause extraction or LLM analysis.