Website Extraction Api
Skill Verified ActiveExtract typed JSON from public website pages using a schema.
To enable users to reliably extract specific, typed data from public web pages into a structured JSON format, automating data collection for analysis or integration.
Features
- Extract typed JSON from public website pages
- Uses a user-defined schema for extraction
- Supports rich field types (IBAN, Address, Currency, etc.)
- Provides confidence scores and source citations
- Handles structured arrays and calculated fields
Use Cases
- Extracting pricing tables from competitor websites
- Gathering product details from e-commerce pages
- Collecting job listing information from career pages
- Automating the extraction of specific data points from public reports
Non-Goals
- Extracting data from behind login walls or private sites
- Processing uploaded files (use Document Extraction instead)
- Handling dynamic JavaScript rendering by default (requires specific option)
- Automating interactions beyond data fetching (e.g., form submission)
Installation
First, add the marketplace
/plugin marketplace add iterationlayer/skills/plugin install skills@iterationlayer-skillsQuality Score
VerifiedTrust Signals
Similar Extensions
Extract Supplier Catalog From Website
100Extract SKUs, product names, unit prices, availability, and minimum order quantities from a supplier catalog page.
X Twitter Scraper
100Use when the user needs X (Twitter) data or confirmation-gated X actions through Xquik: tweet search, user lookup, follower extraction, media download, monitoring, webhooks, MCP, SDKs, posting, likes, DMs, and profile updates. Requires a Xquik API key. Never ask for X login material.
Slack
100Use the Slack tool to react, pin/unpin, send, edit, delete messages, or fetch Slack member info.
Github
100Use gh for GitHub issues, PR status, CI/logs, comments, reviews, releases, and API queries.
Agent Browser
100Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.
Chatgpt Search
100Search ChatGPT and extract the full response + hydration JSON that powers the UI. Attaches to a running Chrome instance (port 9222 by default), opens ChatGPT, submits a query, waits for the streamed response, and returns structured data: messages, product cards, hydration JSON, and API calls. Use when asked to "search chatgpt", "ask chatgpt", "chatgpt search", "get chatgpt response", or "scrape chatgpt".