Pdf Processing Anthropic
技能 警告 活跃Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.
To provide a unified interface and extensive examples for performing virtually any operation on PDF files, from simple text extraction to complex form filling and OCR.
功能
- Extract text and tables from PDFs
- Merge, split, rotate, and watermark PDFs
- Create new PDFs programmatically
- Fillable and non-fillable PDF form processing
- OCR for scanned PDFs
- Extract images from PDFs
- Password protection and decryption
使用场景
- Automating the processing of bulk PDF documents
- Extracting structured data from PDF reports
- Generating standardized PDF forms
- Archiving and organizing PDF content
非目标
- Operating on file types other than PDF
- Performing complex graphic design within PDFs
- Managing cloud storage for PDFs
License
- critical:License usabilityThe LICENSE.txt file states 'Proprietary. See LICENSE.txt' and includes significant restrictions on copying, distribution, and creation of derivative works, which are incompatible with standard OSS licenses.
Maintenance
- warning:Dependency ManagementThe skill relies on several Python libraries (pypdf, pdfplumber, reportlab, pytesseract, pdf2image) but there's no clear indication of dependency pinning or vulnerability checking mechanisms.
Code Execution
- warning:ValidationWhile the provided scripts include some validation (e.g., `check_bounding_boxes.py`), the core skill logic doesn't explicitly demonstrate schema validation libraries for all inputs and outputs.
Install
- info:Installation instructionThe SKILL.md provides Python code examples for usage, but explicit installation instructions for the skill itself or its dependencies are not clearly laid out in a copy-paste format.
Execution
- warning:Pinned dependenciesWhile the SKILL.md lists Python libraries used, there is no explicit mention or demonstration of pinned dependencies or lockfiles to ensure consistent execution.
安装
请先添加 Marketplace
/plugin marketplace add lawvable/awesome-legal-skills/plugin install pdf-processing-anthropic@lawvable质量评分
警告类似扩展
Document Extraction API
99Extract structured data from documents using AI-powered field extraction.
Nutrient Document Processing
98使用 Nutrient DWS 处理文档。当用户希望从 HTML 或 URL 生成 PDF、转换 Office/图像/PDF、组装或拆分文件包、OCR 扫描件、提取文本/表格/键值对、进行 PII 拟态、添加水印、签名、填充表单、优化 PDF 或生成 PDF/A 或 PDF/UA 等合规性输出时使用。触发器包括转换为 PDF、合并这些 PDF、OCR 此扫描件、提取表格、拟态 PII、签名此 PDF、制作此 PDF/A 或为 Web 交付进行线性化。
Extract Fleet Vehicle Registration
100Extract vehicle identification, owner details, registration dates, and technical specifications from vehicle registration documents.
Convert Resume to Markdown
100Convert a resume PDF to clean markdown for LLM parsing or candidate pipelines.
Extract Receipt Data
99Extract merchant, date, line items, tax, and total from receipts.
Convert Contract To Markdown
99Convert a contract PDF to clean markdown for clause extraction or LLM analysis.