此内容尚未提供您的语言版本,正在以英文显示。

Firecrawl Parse

技能已验证活跃

Efficiently extract and convert the contents of any local file—such as PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, or HTML—into clean, well-formatted markdown saved to disk. Use this skill whenever the user requests to parse, read, or extract information from a file on their computer, including phrases like “parse this PDF”, “convert this document”, “read this file”, “extract text from”, or when a local file path (not a URL) is provided. This skill offers advanced options like generating AI-powered summaries and answering questions based on the file's content. Prefer this tool over `scrape` when handling local files to deliver precise, structured outputs for downstream tasks.

目的

To efficiently and cleanly convert local documents into well-formatted markdown, enabling easier access and processing of file content.

功能

Convert local files (PDF, DOCX, XLSX, HTML, etc.) to markdown
Generate AI-powered summaries of file content
Answer questions based on parsed file content
Save extracted content to disk
Differentiate from URL scraping tools

使用场景

When a user requests to parse, read, or extract information from a local file.
When a local file path (not a URL) is provided for processing.
To create clean markdown versions of documents for downstream tasks.
To quickly summarize or get answers from a document without reading it fully.

非目标

Processing files from URLs (use `firecrawl-scrape` instead)
Streaming large outputs to stdout (prefer saving to disk)
Handling files larger than 50MB
Replacing general-purpose file viewers

安装

请先添加 Marketplace

/plugin marketplace add firecrawl/cli

/plugin install cli@firecrawl

质量评分

已验证

99 /100

1 day ago 分析

信任信号

最近提交2 days ago

GitHub 所有者 firecrawl

星标383

下载量 51.1k

网站docs.firecrawl.dev

状态

查看源代码

类似扩展

PaddleOCR 文档解析

使用此技能可从 PDF 和文档图像中提取结构化 Markdown/JSON — 表格（精确到单元格）、公式（LaTeX 格式）、图形、印章、图表、页眉/页脚、多栏布局和正确的阅读顺序。触发词：文档解析, 版面分析, 版面还原, 表格提取, 公式识别, 多栏排版, 扫描件结构化, 发票, 财报, 复杂 PDF, PDF转Markdown, 图表, 阅读顺序; reading order, formula, LaTeX, layout parsing, structure extraction, PP-StructureV3, PaddleOCR-VL.

技能

PaddlePaddle

Convert Resume to Markdown

100

Convert a resume PDF to clean markdown for LLM parsing or candidate pipelines.

技能

iterationlayer

Paddleocr 文本识别

当用户希望从图像、照片、扫描件、截图或扫描的 PDF 中提取文本时，请使用此技能。返回机器可读的精确字符串，包含行级文本和可选的 bbox 坐标。对 CJK、小字和手写文本具有很高的准确性。触发词：OCR、文字识别、图片转文字、截图识字、提取图中文字、扫描识字、识字、纯文字、plain text extraction、坐标、检测框、bbox、bounding box、image to text、screenshot、photo scan、recognize text。

技能

PaddlePaddle

Markdown to Styled PDF

Generate a professionally styled PDF document from Markdown content with custom fonts, headers, and page numbers.

技能

iterationlayer

Trader Regime

100

Detect current market regime using npx neural-trader — bull/bear/ranging/volatile classification with recommended strategy

技能

ruvnet

Setup

100

Use first for install/update routing — sends setup, doctor, or MCP requests to the correct OMC setup flow

技能

Yeachan-Heo