Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Document To Markdown Pipeline

Skill Verifiziert Aktiv

Convert PDF, DOCX, HTML, or image documents to clean, structured Markdown.

Zweck

To enable AI agents and development teams to easily convert documents into structured Markdown for ingestion into RAG systems, knowledge bases, or content migration workflows.

Funktionen

  • Convert PDF to Markdown
  • Convert DOCX to Markdown
  • Convert HTML to Markdown
  • Convert Images to Markdown
  • Preserve document structure in Markdown
  • API key authentication

Anwendungsfälle

  • Preparing documents for retrieval-augmented generation (RAG)
  • Importing content into a knowledge base or CMS
  • Migrating documents between platforms
  • Extracting structured text from various file types

Nicht-Ziele

  • Performing complex document editing or manipulation beyond conversion
  • Analyzing or interpreting the content of the documents
  • Directly interacting with local files on the user's machine without upload/URL

Installation

Zuerst Marketplace hinzufügen

/plugin marketplace add iterationlayer/skills
/plugin install skills@iterationlayer-skills

Qualitätspunktzahl

Verifiziert
98 /100
Analysiert about 22 hours ago

Vertrauenssignale

Letzter Commit16 days ago
Sterne0
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

Baoyu Post To Wechat

100

Posts content to WeChat Official Account (微信公众号) via API or Chrome CDP. Supports article posting (文章) with HTML, markdown, or plain text input, and image-text posting (贴图, formerly 图文) with multiple images. Markdown article workflows default to converting ordinary external links into bottom citations for WeChat-friendly output. Use when user mentions "发布公众号", "post to wechat", "微信公众号", or "贴图/图文/文章".

Skill
jimliu

Convert Resume to Markdown

100

Convert a resume PDF to clean markdown for LLM parsing or candidate pipelines.

Skill
iterationlayer

Firecrawl Parse

99

Efficiently extract and convert the contents of any local file—such as PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, or HTML—into clean, well-formatted markdown saved to disk. Use this skill whenever the user requests to parse, read, or extract information from a file on their computer, including phrases like “parse this PDF”, “convert this document”, “read this file”, “extract text from”, or when a local file path (not a URL) is provided. This skill offers advanced options like generating AI-powered summaries and answering questions based on the file's content. Prefer this tool over `scrape` when handling local files to deliver precise, structured outputs for downstream tasks.

Skill
firecrawl

Report Generator

99

Generate PDF/HTML reports from templates and data. Use when: creating client reports; generating weekly summaries; producing marketing performance reports; automating recurring reports

Skill
guia-matthieu

Wiki Builder

99

Start, structure, and grow a persistent research wiki indexed in pro-workflow's SQLite knowledge base. Each wiki is a folder of markdown pages with provenance, plus a shadow FTS5 index so any session can recall it. Use when the user says "start a wiki", "add to wiki", "compile a page", "wiki on X", or wants a long-lived knowledge base on a topic, paper, product, person, project, or codebase.

Skill
rohitg00

PaddleOCR Document Parsing

99

Verwenden Sie diese Fähigkeit, um strukturierte Markdown/JSON aus PDFs und Dokumentbildern zu extrahieren – Tabellen mit präziser Zellendefinition, Formeln als LaTeX, Abbildungen, Siegel, Diagramme, Kopf-/Fußzeilen, mehrspaltiges Layout und korrekte Lesereihenfolge. Trigger-Begriffe: 文档解析, 版面分析, 版面还原, 表格提取, 公式识别, 多栏排版, 扫描件结构化, 发票, 财报, 复杂 PDF, PDF转Markdown, 图表, 阅读顺序; reading order, formula, LaTeX, layout parsing, structure extraction, PP-StructureV3, PaddleOCR-VL.

Skill
PaddlePaddle