Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

PaddleOCR Document Parsing

Skill Verifiziert

Use this skill to extract structured Markdown/JSON from PDFs and document images—tables with cell-level precision, formulas as LaTeX, figures, seals, charts, headers/footers, multi-column layout and correct reading order. Trigger terms: 文档解析, 版面分析, 版面还原, 表格提取, 公式识别, 多栏排版, 扫描件结构化, 发票, 财报, 复杂 PDF, PDF转Markdown, 图表, 阅读顺序; reading order, formula, LaTeX, layout parsing, structure extraction, PP-StructureV3, PaddleOCR-VL.

KI-Zusammenfassung

This skill leverages the PaddleOCR API to parse complex documents, extracting text, tables, formulas, and layout information into structured Markdown or JSON. It supports both local files and URLs, with options for output customization and error handling.

Versioning

warning:Release ManagementNo manifest version (SKILL.md, package.json, etc.) or GitHub release tags are present, and installation instructions reference HEAD.

Installation

npx skills add aidenwu0209/paddleocr-skills

Führt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.

5 days ago

aidenwu0209

20 stars

Apache-2.0

Aktualisiert 5 days ago

Quellcode ansehen

PaddleOCR Document Parsing

Versioning

Ähnliche Erweiterungen

Document Parser Skill

PDF to DOCX Converter

PDF OCR Extraction

PaddleOCR Text Recognition

Office MCP Server

Smart OCR Skill