Skip to main content

PDF Extraction

Skill Verified
95

Extract text, tables, and metadata from PDFs using pdfplumber

AI Summary

This skill leverages the `pdfplumber` library to precisely extract textual content, tabular data, and document metadata from PDF files. It offers detailed control over extraction parameters and includes examples for common use cases like converting tables to DataFrames and processing invoice data.

Documentation

  • info:Configuration & parameter referenceWhile the code snippets show usage of pdfplumber with parameters like tolerances, these specific parameters and their default values are not explicitly documented in the SKILL.md or accompanying files.

Code Execution

  • info:ValidationThe provided code snippets demonstrate basic usage of `pdfplumber` but do not explicitly show the use of a schema validation library for input parameters like file paths or extraction options.

Installation

npx skills add claude-office-skills/skills

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

3 months ago
98 stars
MIT
Updated 5 days ago
View Source

© 2025 SkillRepo · Find the right skill, skip the noise.