PaddleOCR Document Parsing

Skill Verified

Use this skill to extract structured Markdown/JSON from PDFs and document images—tables with cell-level precision, formulas as LaTeX, figures, seals, charts, headers/footers, multi-column layout and correct reading order. Trigger terms: 文档解析, 版面分析, 版面还原, 表格提取, 公式识别, 多栏排版, 扫描件结构化, 发票, 财报, 复杂 PDF, PDF转Markdown, 图表, 阅读顺序; reading order, formula, LaTeX, layout parsing, structure extraction, PP-StructureV3, PaddleOCR-VL.

AI Summary

This skill leverages the PaddleOCR API to parse complex documents, extracting text, tables, formulas, and layout information into structured Markdown or JSON. It supports both local files and URLs, with options for output customization and error handling.

Versioning

warning:Release ManagementNo manifest version (SKILL.md, package.json, etc.) or GitHub release tags are present, and installation instructions reference HEAD.

Installation

npx skills add aidenwu0209/paddleocr-skills

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

6 days ago

aidenwu0209

20 stars

Apache-2.0

Updated 6 days ago

View Source

Similar Extensions

Document Parser Skill

Skill

claude-office-skills

PDF to DOCX Converter

Convert PDF files to editable Word documents using pdf2docx

Skill

claude-office-skills

PDF OCR Extraction

Extract text from scanned PDFs using optical character recognition

Skill

claude-office-skills

PaddleOCR Text Recognition

Use this skill whenever the user wants text extracted from images, photos, scans, screenshots, or scanned PDFs. Returns exact machine-readable strings with line-level text and optional bbox coordinates. Strong accuracy for CJK, small print, and handwritten text. Trigger terms: OCR, 文字识别, 图片转文字, 截图识字, 提取图中文字, 扫描识字, 识字, 纯文字, plain text extraction, 坐标, 检测框, bbox, bounding box, image to text, screenshot, photo scan, recognize text.

Skill

aidenwu0209

Office MCP Server

MCP server with 39 tools for Word, Excel, PowerPoint, PDF, OCR operations

Skill

claude-office-skills

Smart OCR Skill

Skill

claude-office-skills