Document Extraction API
Skill | Verified | Active
Extract structured data from documents using AI-powered field extraction.
Extract structured and validated data from diverse document types, automating data entry and analysis tasks.
Features
- Extracts data from 40+ file formats
- Supports rich validated field types (IBAN, ADDRESS, CURRENCY)
- Handles structured arrays and calculated fields
- Provides confidence scores and source citations
- Includes intelligent schema validation and website URL ingestion
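To make the field types above more concrete, here is a minimal sketch of what an extraction schema with validated types and a structured array could look like. Only the type names IBAN, ADDRESS, and CURRENCY come from this page. The key names (`name`, `type`, `item_fields`), the `TEXT` and `ARRAY` type names, and the `schema_problems` helper are assumptions made for illustration, not the documented API format.

```python
# Hypothetical schema shape: key names and the TEXT/ARRAY type names are
# assumptions for illustration, not the documented API format.
KNOWN_TYPES = {"TEXT", "IBAN", "ADDRESS", "CURRENCY", "ARRAY"}

invoice_schema = [
    {"name": "supplier_iban", "type": "IBAN"},
    {"name": "billing_address", "type": "ADDRESS"},
    {"name": "total", "type": "CURRENCY"},
    {
        "name": "line_items",
        "type": "ARRAY",
        "item_fields": [
            {"name": "description", "type": "TEXT"},
            {"name": "amount", "type": "CURRENCY"},
        ],
    },
]


def schema_problems(fields):
    """Return a list of human-readable problems found in a schema sketch."""
    problems = []
    for field in fields:
        name = field.get("name")
        field_type = field.get("type")
        if not name:
            problems.append("field without a name")
        if field_type not in KNOWN_TYPES:
            problems.append(f"{name}: unknown type {field_type!r}")
        if field_type == "ARRAY":
            # Arrays carry their own nested field definitions.
            problems.extend(schema_problems(field.get("item_fields", [])))
    return problems


print(schema_problems(invoice_schema))
```

A local check like this only catches typos before a request is sent; the schema validation the service itself performs may be stricter and may work differently.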
Use Cases
- Automating invoice data entry
- Processing resumes for HR systems
- Extracting metadata from academic papers
- Populating databases from legal or financial documents
Non-Goals
- Performing OCR on scanned images without clear text
- Interpreting unstructured text for sentiment or intent analysis
- Replacing a full document management system
Workflow
- Define extraction schema with desired fields and types
- Submit files (base64 or URL) and schema to the API
- Receive structured data, confidence scores, and citations
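The three steps above can be sketched as follows. This is an illustration under stated assumptions, not the service's client: the request and response shapes, the key names (`files`, `schema`, `base64`, `url`, `confidence`, `citation`), the 0 to 1 confidence scale, and all helper names are hypothetical. No network call is made, since the endpoint is not given on this page.

```python
import base64
import json


def file_entry(name, data=None, url=None):
    """Describe one input file, either inline as base64 or by URL."""
    if (data is None) == (url is None):
        raise ValueError("provide exactly one of data or url")
    if url is not None:
        return {"name": name, "url": url}
    return {"name": name, "base64": base64.b64encode(data).decode("ascii")}


def build_request_body(files, schema):
    """Combine files and schema into a JSON request body (hypothetical shape)."""
    return json.dumps({"files": files, "schema": schema})


def fields_needing_review(result, min_confidence=0.9):
    """Return names of extracted fields whose confidence is below the threshold."""
    return [
        name
        for name, field in result.items()
        if field["confidence"] < min_confidence
    ]


# Steps 1 and 2: define a schema and package it with a file.
schema = [{"name": "total", "type": "CURRENCY"}]
body = build_request_body([file_entry("invoice.pdf", data=b"%PDF-1.7 ...")], schema)

# Step 3: a made-up response, only to show how scores and citations might be used.
sample_result = {
    "total": {"value": "1250.00", "confidence": 0.97, "citation": "page 1"},
    "supplier_iban": {
        "value": "GB82WEST12345698765432",
        "confidence": 0.62,
        "citation": "page 2",
    },
}
print(fields_needing_review(sample_result))
```

Routing low-confidence fields to manual review, with the citation as a pointer back into the source document, is one plausible way to use the scores; the threshold of 0.9 is arbitrary.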
Prerequisites
- Iteration Layer API key
Installation
First, add the marketplace, then install the plugin:
/plugin marketplace add iterationlayer/skills
/plugin install skills@iterationlayer-skills
Similar Extensions
Extract Fleet Vehicle Registration (score: 100)
Extract vehicle identification, owner details, registration dates, and technical specifications from vehicle registration documents.
Extract Receipt Data (score: 99)
Extract merchant, date, line items, tax, and total from receipts.
Nutrient Document Processing (score: 98)
Process documents with Nutrient DWS. Use when the user wants to generate PDFs from HTML or URLs, convert Office/images/PDFs, assemble or split packets, OCR scans, extract text/tables/key-value pairs, redact PII, watermark, sign, fill forms, optimize PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF, merge these PDFs, OCR this scan, extract tables, redact PII, sign this PDF, make this PDF/A, or linearize for web delivery.
Image Transformation API (score: 100)
Transform images with resize, crop, smart crop, upscale, remove background, and 20+ operations.
Website Extraction Api (score: 100)
Extract typed JSON from public website pages using a schema.
Extract Supplier Catalog From Website (score: 100)
Extract SKUs, product names, unit prices, availability, and minimum order quantities from a supplier catalog page.