Naar hoofdinhoud springen
Deze inhoud is nog niet beschikbaar in jouw taal en wordt in het Engels weergegeven.

Data Extractor

Skill Waarschuwing
85

>

AI-samenvatting

This skill leverages the unstructured Python library to process a wide range of document types, including PDFs, Word docs, emails, and HTML. It automatically detects and partitions elements, extracts text and metadata, and supports advanced features like table structure inference, OCR, and semantic chunking for RAG applications.

Scope

  • critical:Description qualityThe description is materially misleading as it contains only a single character ('>') and provides no actual information about the extension's functionality, which is contrary to the provided content in SKILL.md.

Documentation

  • info:Configuration & parameter referenceWhile the SKILL.md provides extensive code examples, it does not explicitly document all configuration options or parameters for the `partition` function or its variations, nor does it detail precedence order for any potential configurations.

Code Execution

  • info:ValidationThe SKILL.md demonstrates the use of `unstructured` library functions, which likely perform internal validation on file paths and parameters, but explicit schema validation within the skill's logic is not showcased.

Compliance

  • info:GDPRThe skill extracts data from documents. While it doesn't explicitly handle personal data, the extracted content could potentially contain PII, which would be submitted to the LLM without additional sanitization by this skill itself.

Installatie

npx skills add claude-office-skills/skills

Voert de Vercel skills CLI (skills.sh) uit via npx — vereist Node.js lokaal en minstens één geïnstalleerde skills-compatibele agent (Claude Code, Cursor, Codex, …). Gaat ervan uit dat de repo het agentskills.io-formaat volgt.

3 months ago
98 stars
MIT
Bijgewerkt op 2 days ago
Broncode bekijken

© 2025 SkillRepo · Vind de juiste skill, sla de ruis over.