Đi tới nội dung chính
Nội dung này hiện chưa có sẵn bằng ngôn ngữ của bạn và đang được hiển thị bằng tiếng Anh.

PDF OCR Extraction

Skill Đã xác minh
95

Extract text from scanned PDFs using optical character recognition

Tóm tắt từ AI

This skill leverages OCR technology to extract text from scanned PDF documents, supporting various document types, languages, and output formats including plain text, structured data, and searchable PDFs. It also provides guidance on image quality and pre-processing steps for optimal results.

Maintenance

  • warning:Commit recencyNo commits have been made to the repository in the last 12 months (last commit date is not available, but the repository was last updated in 2026).

Cài đặt

npx skills add claude-office-skills/skills

Chạy Vercel skills CLI (skills.sh) qua npx — yêu cầu Node.js trên máy và ít nhất một agent tương thích skills đã được cài (Claude Code, Cursor, Codex, …). Giả định repo tuân theo định dạng agentskills.io.

3 months ago
98 stars
MIT
Cập nhật 2 days ago
Xem mã nguồn

Tiện ích tương tự

PaddleOCR Text Recognition

95

Use this skill whenever the user wants text extracted from images, photos, scans, screenshots, or scanned PDFs. Returns exact machine-readable strings with line-level text and optional bbox coordinates. Strong accuracy for CJK, small print, and handwritten text. Trigger terms: OCR, 文字识别, 图片转文字, 截图识字, 提取图中文字, 扫描识字, 识字, 纯文字, plain text extraction, 坐标, 检测框, bbox, bounding box, image to text, screenshot, photo scan, recognize text.

Skill
aidenwu0209

PDF Compress

98

Reduce PDF file size while maintaining acceptable quality

Skill
claude-office-skills

PaddleOCR Document Parsing

98

Use this skill to extract structured Markdown/JSON from PDFs and document images—tables with cell-level precision, formulas as LaTeX, figures, seals, charts, headers/footers, multi-column layout and correct reading order. Trigger terms: 文档解析, 版面分析, 版面还原, 表格提取, 公式识别, 多栏排版, 扫描件结构化, 发票, 财报, 复杂 PDF, PDF转Markdown, 图表, 阅读顺序; reading order, formula, LaTeX, layout parsing, structure extraction, PP-StructureV3, PaddleOCR-VL.

Skill
aidenwu0209

Office MCP Server

94

MCP server with 39 tools for Word, Excel, PowerPoint, PDF, OCR operations

Skill
claude-office-skills

Document Parser Skill

92

>

Skill
claude-office-skills

PDF Processing Guide

Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.

Skill
anthropics