PDF OCR Extraction
Skill · Verified
Extract text from scanned PDFs using optical character recognition (OCR).
This skill leverages OCR technology to extract text from scanned PDF documents, supporting various document types, languages, and output formats including plain text, structured data, and searchable PDFs. It also provides guidance on image quality and pre-processing steps for optimal results.
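The skill's own pre-processing guidance is not reproduced on this page. As an illustration only, the sketch below shows binarization with Otsu's threshold, a step commonly applied to scanned pages before OCR. It assumes a grayscale page is already available as rows of 0-255 integers; the function names `otsu_threshold` and `binarize` are hypothetical and not part of the skill.

```python
from typing import List


def otsu_threshold(pixels: List[int]) -> int:
    """Return the grey level t (0-255) that maximises between-class variance.

    Pixels with value <= t are treated as the dark class (ink),
    the rest as the light class (paper).
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0      # number of pixels in the dark class so far
    sum_dark = 0    # sum of grey levels in the dark class so far
    for t in range(256):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        if w_dark == 0:
            continue            # no dark pixels yet
        w_light = total - w_dark
        if w_light == 0:
            break               # every pixel is already in the dark class
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image: List[List[int]]) -> List[List[int]]:
    """Map every pixel to 0 (dark) or 255 (light) using Otsu's threshold."""
    flat = [p for row in image for p in row]
    t = otsu_threshold(flat)
    return [[0 if p <= t else 255 for p in row] for row in image]


# Hypothetical 2x3 "page": dark strokes (30-40) on a light background (200-220).
page = [[30, 40, 200],
        [35, 210, 220]]
clean = binarize(page)
```

In practice the binarized page would then be handed to whichever OCR engine the agent has available; that step is left out here because the page does not name one.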
Maintenance
- Warning: Commit recency. No commits have been made to the repository in the last 12 months (the last commit date is not available, but the repository was last updated in 2026).
Installation
npx skills add claude-office-skills/skills
Runs the Vercel skills CLI (skills.sh) via npx. Requires Node.js installed locally and at least one skills-compatible agent (Claude Code, Cursor, Codex, etc.). Assumes the repository follows the agentskills.io format.
Similar extensions
PaddleOCR Text Recognition
95 · Use this skill whenever the user wants text extracted from images, photos, scans, screenshots, or scanned PDFs. Returns exact machine-readable strings with line-level text and optional bbox coordinates. Strong accuracy for CJK, small print, and handwritten text. Trigger terms: OCR, 文字识别, 图片转文字, 截图识字, 提取图中文字, 扫描识字, 识字, 纯文字, plain text extraction, 坐标, 检测框, bbox, bounding box, image to text, screenshot, photo scan, recognize text.
PDF Compress
98 · Reduce PDF file size while maintaining acceptable quality.
PaddleOCR Document Parsing
98 · Use this skill to extract structured Markdown/JSON from PDFs and document images: tables with cell-level precision, formulas as LaTeX, figures, seals, charts, headers/footers, multi-column layout, and correct reading order. Trigger terms: 文档解析, 版面分析, 版面还原, 表格提取, 公式识别, 多栏排版, 扫描件结构化, 发票, 财报, 复杂 PDF, PDF转Markdown, 图表, 阅读顺序; reading order, formula, LaTeX, layout parsing, structure extraction, PP-StructureV3, PaddleOCR-VL.
Office MCP Server
94 · MCP server with 39 tools for Word, Excel, PowerPoint, PDF, and OCR operations.
Document Parser Skill
92
PDF Processing Guide
Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.