跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Firecrawl Parse

技能 已验证 活跃
属于:Firecrawl

Efficiently extract and convert the contents of any local file—such as PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, or HTML—into clean, well-formatted markdown saved to disk. Use this skill whenever the user requests to parse, read, or extract information from a file on their computer, including phrases like “parse this PDF”, “convert this document”, “read this file”, “extract text from”, or when a local file path (not a URL) is provided. This skill offers advanced options like generating AI-powered summaries and answering questions based on the file's content. Prefer this tool over `scrape` when handling local files to deliver precise, structured outputs for downstream tasks.

目的

To efficiently and cleanly convert local documents into well-formatted markdown, enabling easier access and processing of file content.

功能

  • Convert local files (PDF, DOCX, XLSX, HTML, etc.) to markdown
  • Generate AI-powered summaries of file content
  • Answer questions based on parsed file content
  • Save extracted content to disk
  • Differentiate from URL scraping tools

使用场景

  • When a user requests to parse, read, or extract information from a local file.
  • When a local file path (not a URL) is provided for processing.
  • To create clean markdown versions of documents for downstream tasks.
  • To quickly summarize or get answers from a document without reading it fully.

非目标

  • Processing files from URLs (use `firecrawl-scrape` instead)
  • Streaming large outputs to stdout (prefer saving to disk)
  • Handling files larger than 50MB
  • Replacing general-purpose file viewers

安装

请先添加 Marketplace

/plugin marketplace add firecrawl/cli
/plugin install cli@firecrawl

质量评分

已验证
99 /100
1 day ago 分析

信任信号

最近提交2 days ago
星标383
状态
查看源代码

类似扩展

PaddleOCR 文档解析

99

使用此技能可从 PDF 和文档图像中提取结构化 Markdown/JSON — 表格(精确到单元格)、公式(LaTeX 格式)、图形、印章、图表、页眉/页脚、多栏布局和正确的阅读顺序。触发词:文档解析, 版面分析, 版面还原, 表格提取, 公式识别, 多栏排版, 扫描件结构化, 发票, 财报, 复杂 PDF, PDF转Markdown, 图表, 阅读顺序; reading order, formula, LaTeX, layout parsing, structure extraction, PP-StructureV3, PaddleOCR-VL.

技能
PaddlePaddle

Convert Resume to Markdown

100

Convert a resume PDF to clean markdown for LLM parsing or candidate pipelines.

技能
iterationlayer

Paddleocr 文本识别

99

当用户希望从图像、照片、扫描件、截图或扫描的 PDF 中提取文本时,请使用此技能。返回机器可读的精确字符串,包含行级文本和可选的 bbox 坐标。对 CJK、小字和手写文本具有很高的准确性。触发词:OCR、文字识别、图片转文字、截图识字、提取图中文字、扫描识字、识字、纯文字、plain text extraction、坐标、检测框、bbox、bounding box、image to text、screenshot、photo scan、recognize text。

技能
PaddlePaddle

Markdown to Styled PDF

99

Generate a professionally styled PDF document from Markdown content with custom fonts, headers, and page numbers.

技能
iterationlayer

Trader Regime

100

Detect current market regime using npx neural-trader — bull/bear/ranging/volatile classification with recommended strategy

技能
ruvnet

Setup

100

Use first for install/update routing — sends setup, doctor, or MCP requests to the correct OMC setup flow

技能
Yeachan-Heo