Website Extraction Api
技能 已验证 活跃Extract typed JSON from public website pages using a schema.
To enable users to reliably extract specific, typed data from public web pages into a structured JSON format, automating data collection for analysis or integration.
功能
- Extract typed JSON from public website pages
- Uses a user-defined schema for extraction
- Supports rich field types (IBAN, Address, Currency, etc.)
- Provides confidence scores and source citations
- Handles structured arrays and calculated fields
使用场景
- Extracting pricing tables from competitor websites
- Gathering product details from e-commerce pages
- Collecting job listing information from career pages
- Automating the extraction of specific data points from public reports
非目标
- Extracting data from behind login walls or private sites
- Processing uploaded files (use Document Extraction instead)
- Handling dynamic JavaScript rendering by default (requires specific option)
- Automating interactions beyond data fetching (e.g., form submission)
安装
请先添加 Marketplace
/plugin marketplace add iterationlayer/skills/plugin install skills@iterationlayer-skills质量评分
已验证类似扩展
Extract Supplier Catalog From Website
100Extract SKUs, product names, unit prices, availability, and minimum order quantities from a supplier catalog page.
X Twitter Scraper
100当用户需要通过 Xquik 获取 X (Twitter) 数据或执行需要确认的 X 操作时使用:推文搜索、用户查找、关注者提取、媒体下载、监控、Webhook、MCP、SDK、发布、点赞、私信和个人资料更新。需要 Xquik API 密钥。切勿索要 X 登录凭据。
Slack
100Use the Slack tool to react, pin/unpin, send, edit, delete messages, or fetch Slack member info.
Github
100Use gh for GitHub issues, PR status, CI/logs, comments, reviews, releases, and API queries.
Agent Browser
100AI 代理的浏览器自动化 CLI。当用户需要与网站交互时使用,包括浏览页面、填写表单、点击按钮、截屏、提取数据、测试 Web 应用或自动化任何浏览器任务。触发条件包括请求“打开网站”、“填表”、“点击按钮”、“截屏”、“抓取页面数据”、“测试此 Web 应用”、“登录网站”、“自动化浏览器操作”或任何需要以编程方式进行 Web 交互的任务。
Chatgpt Search
100Search ChatGPT and extract the full response + hydration JSON that powers the UI. Attaches to a running Chrome instance (port 9222 by default), opens ChatGPT, submits a query, waits for the streamed response, and returns structured data: messages, product cards, hydration JSON, and API calls. Use when asked to "search chatgpt", "ask chatgpt", "chatgpt search", "get chatgpt response", or "scrape chatgpt".