[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"extension-skill-claude-office-skills-pdf-extraction-uk":3,"guides-for-claude-office-skills-pdf-extraction":222,"similar-k171dtxahnz3h8q0jz3gk6akks867ym1":223},{"_creationTime":4,"_id":5,"children":6,"community":7,"display":9,"evaluation":22,"identity":188,"isFallback":193,"parentExtension":194,"providers":195,"relations":199,"repo":201,"workflow":219},1778053148350.4636,"k171dtxahnz3h8q0jz3gk6akks867ym1",[],{"reviewCount":8},0,{"description":10,"installMethods":11,"name":12,"sourceUrl":13,"tags":14},"Extract text, tables, and metadata from PDFs using pdfplumber",{},"PDF Extraction","https://github.com/claude-office-skills/skills/tree/HEAD/pdf-extraction",[15,16,17,18,19,20,21],"pdf","extraction","text","tables","metadata","pdfplumber","document-processing",{"_creationTime":23,"_id":24,"extensionId":5,"locale":25,"result":26,"trustSignals":176,"workflow":186},1778053561145.7153,"kn70fm1jnsgy2xewgtfacjq10d867r2q","en",{"checks":27,"evaluatedAt":166,"extensionSummary":167,"promptVersionExtension":168,"promptVersionScoring":169,"rationale":170,"score":171,"summary":172,"tags":173,"targetMarket":174,"tier":175},[28,33,36,40,44,48,52,55,59,63,67,70,73,76,80,83,86,90,93,96,100,104,107,111,114,117,120,123,126,129,133,136,140,144,147,150,153,156,160,163],{"category":29,"check":30,"severity":31,"summary":32},"Invocation","Precise Purpose","pass","The description accurately states the skill's purpose and provides clear example prompts covering various use cases like extracting text, tables, and specific data from PDFs.",{"category":29,"check":34,"severity":31,"summary":35},"Concise Frontmatter","The frontmatter is concise and effectively summarizes the core capability of extracting text, tables, and metadata from PDFs using pdfplumber.",{"category":37,"check":38,"severity":31,"summary":39},"Documentation","Concise Body","The skill body is well-structured with clear sections for overview, usage, domain knowledge, examples, and limitations, keeping the main SKILL.md file manageable.",{"category":41,"check":42,"severity":31,"summary":43},"Context","Progressive Disclosure","Detailed code examples and explanations are embedded within the SKILL.md, which is appropriate for this skill's scope and complexity. No excessively large blobs or external material are embedded.",{"category":41,"check":45,"severity":46,"summary":47},"Forked exploration","not_applicable","This skill performs direct data extraction and does not involve deep exploration or code review that would necessitate forked context.",{"category":49,"check":50,"severity":31,"summary":51},"Practical Utility","Usage examples","The skill provides sufficient and clear end-to-end examples for various PDF extraction scenarios, including table extraction to DataFrames and invoice data extraction.",{"category":49,"check":53,"severity":31,"summary":54},"Edge cases","The skill clearly documents limitations such as handling scanned PDFs, complex layouts, and font issues, providing context for potential failure modes.",{"category":56,"check":57,"severity":46,"summary":58},"Code Execution","Tool Fallback","The skill relies on the `pdfplumber` library and potentially an MCP server, but does not indicate a need for optional fallback paths or external tool versioning in its documentation.",{"category":60,"check":61,"severity":31,"summary":62},"Safety","Halt on unexpected state","The skill's instructions focus on data extraction and do not describe workflows that would require complex precondition checks or state management that needs explicit halting.",{"category":64,"check":65,"severity":31,"summary":66},"Portability","Cross-skill coupling","The skill is self-contained and focuses on PDF extraction using `pdfplumber`, without implicit reliance on other skills.",{"category":49,"check":68,"severity":31,"summary":69},"Problem relevance","The description clearly states the problem of extracting text, tables, and metadata from PDFs, a common need for document processing.",{"category":49,"check":71,"severity":31,"summary":72},"Unique selling proposition","The skill leverages `pdfplumber`, highlighting its advanced capabilities for precise text and table extraction, offering value beyond basic PDF text extraction.",{"category":49,"check":74,"severity":31,"summary":75},"Production readiness","The skill provides a complete solution for PDF text and table extraction, including installation instructions and detailed examples, making it ready for use.",{"category":77,"check":78,"severity":31,"summary":79},"Scope","Single responsibility principle","The skill focuses exclusively on PDF text and table extraction using the pdfplumber library, adhering to a single responsibility.",{"category":77,"check":81,"severity":31,"summary":82},"Description quality","The description is accurate, concise, and clearly communicates the skill's purpose: extracting text, tables, and metadata from PDFs using pdfplumber.",{"category":29,"check":84,"severity":31,"summary":85},"Scoped tools","The MCP tools listed (`extract_text_from_pdf`, `extract_tables_from_pdf`) are narrow verb-noun specialists, indicating a well-scoped interface.",{"category":37,"check":87,"severity":88,"summary":89},"Configuration & parameter reference","info","While the code snippets show usage of pdfplumber with parameters like tolerances, these specific parameters and their default values are not explicitly documented in the SKILL.md or accompanying files.",{"category":77,"check":91,"severity":31,"summary":92},"Tool naming","The MCP tool names ('extract_text_from_pdf', 'extract_tables_from_pdf') are descriptive verb-noun pairs.",{"category":77,"check":94,"severity":31,"summary":95},"Minimal I/O surface","The skill focuses on providing extracted text and tables, and the examples demonstrate focused inputs (PDF path) and outputs (data structures).",{"category":97,"check":98,"severity":31,"summary":99},"License","License usability","The LICENSE file is present and clearly states the MIT license, which is a permissive open-source license.",{"category":101,"check":102,"severity":46,"summary":103},"Maintenance","Commit recency","No commit date is available for the default branch; therefore, recency cannot be assessed.",{"category":101,"check":105,"severity":31,"summary":106},"Dependency Management","The skill explicitly lists `pdfplumber` and `Pillow` as dependencies and provides installation commands, facilitating dependency management.",{"category":108,"check":109,"severity":46,"summary":110},"Security","Secret Management","The skill does not handle or expose any secrets.",{"category":108,"check":112,"severity":31,"summary":113},"Injection","The skill extracts data from PDFs using a dedicated library; there are no indications of loading or executing untrusted third-party content as instructions.",{"category":108,"check":115,"severity":31,"summary":116},"Transitive Supply-Chain Grenades","The skill relies on locally installed libraries (`pdfplumber`) and does not perform runtime downloads or execute arbitrary remote code.",{"category":108,"check":118,"severity":31,"summary":119},"Sandbox Isolation","The skill performs file operations within its designated scope (reading PDFs) and does not attempt to modify files outside the project folder.",{"category":108,"check":121,"severity":31,"summary":122},"Sandbox escape primitives","The skill's code, focused on PDF parsing, does not contain any apparent sandbox-escape primitives.",{"category":108,"check":124,"severity":31,"summary":125},"Data Exfiltration","The skill's function is to extract data from PDFs and return it locally; there are no outbound calls or mechanisms for data exfiltration.",{"category":108,"check":127,"severity":31,"summary":128},"Hidden Text Tricks","The bundled files do not contain any hidden text tricks or obfuscation techniques designed to steer the model.",{"category":130,"check":131,"severity":31,"summary":132},"Hooks","Opaque code execution","The Python code for pdfplumber interaction is plain and readable, with no signs of obfuscation, base64 encoding, or runtime code fetching.",{"category":64,"check":134,"severity":31,"summary":135},"Structural Assumption","The skill assumes the presence of a PDF file but does not make assumptions about user-specific project organization or OS-specific paths.",{"category":137,"check":138,"severity":46,"summary":139},"Trust","Issues Attention","No issue data is available for this repository.",{"category":141,"check":142,"severity":31,"summary":143},"Versioning","Release Management","The SKILL.md frontmatter includes a `version: \"1.0\"` field, indicating a managed release version.",{"category":56,"check":145,"severity":88,"summary":146},"Validation","The provided code snippets demonstrate basic usage of `pdfplumber` but do not explicitly show the use of a schema validation library for input parameters like file paths or extraction options.",{"category":108,"check":148,"severity":46,"summary":149},"Unguarded Destructive Operations","This skill is read-only and performs no destructive operations.",{"category":56,"check":151,"severity":31,"summary":152},"Error Handling","The Python code snippets in the examples show basic error handling, such as using `or ''` to prevent errors on pages without text and `try-except` blocks for float conversion, which are standard practices.",{"category":56,"check":154,"severity":46,"summary":155},"Logging","The skill is focused on data extraction and does not perform actions that require a local audit log.",{"category":157,"check":158,"severity":46,"summary":159},"Compliance","GDPR","The skill only processes PDF content and does not interact with personal data in a way that triggers GDPR concerns.",{"category":157,"check":161,"severity":31,"summary":162},"Target market","The skill's functionality is global and not specific to any particular geographic or legal jurisdiction.",{"category":64,"check":164,"severity":31,"summary":165},"Runtime stability","The skill relies on standard Python libraries and does not make assumptions about specific shells, editors, or operating systems.",1778053414991,"This skill leverages the `pdfplumber` library to precisely extract textual content, tabular data, and document metadata from PDF files. It offers detailed control over extraction parameters and includes examples for common use cases like converting tables to DataFrames and processing invoice data.","2.0.0","3.4.0","The PDF Extraction skill is well-implemented, adhering to best practices for scope, documentation, and security. It provides clear examples and handles limitations appropriately. A minor improvement could be explicit parameter documentation for pdfplumber's advanced options.",95,"A high-quality skill for extracting text, tables, and metadata from PDF documents using the pdfplumber library.",[15,16,17,18,19,20,21],"global","verified",{"codeQuality":177,"collectedAt":178,"documentation":179,"maintenance":181,"security":182,"testCoverage":185},{},1778053404526,{"descriptionLength":180,"readmeSize":8},61,{},{"hasNpmPackage":183,"license":184,"smitheryVerified":183},false,"MIT",{"hasCi":183,"hasTests":183},{"updatedAt":187},1778053561145,{"githubOwner":189,"githubRepo":190,"locale":25,"slug":191,"type":192},"claude-office-skills","skills","pdf-extraction","skill",true,null,{"extract":196,"llm":198},{"commitSha":197,"license":184},"9c4c7d5cd2813a8936bf2c9fdb174ea883b85a11",{"promptVersionExtension":168,"promptVersionScoring":169,"score":171,"targetMarket":174,"tier":175},{"repoId":200},"kd7fw7xbj58qc2z8whrrjptbed8659db",{"_creationTime":202,"_id":200,"identity":203,"providers":205,"workflow":216},1777995558409.8474,{"githubOwner":189,"githubRepo":190,"sourceUrl":204},"https://github.com/claude-office-skills/skills",{"discover":206,"github":209},{"sources":207},[208],"skills-sh",{"closedIssues90d":8,"forks":210,"license":184,"openIssues90d":211,"pushedAt":212,"readmeSize":213,"stars":214,"topics":215},27,2,1769868236000,29630,98,[],{"discoverAt":217,"extractAt":218,"githubAt":218,"updatedAt":218},1777995558409,1778053155657,{"anyEnrichmentAt":220,"extractAt":221,"githubAt":220,"llmAt":187,"updatedAt":187},1778053151766,1778053148350,[],[224,247,266,287,306,325],{"_creationTime":225,"_id":226,"community":227,"display":228,"identity":239,"providers":241,"relations":245,"workflow":246},1778053148350.4373,"k1776t2fdx4h35mkwpc5h201dd866zms",{"reviewCount":8},{"description":229,"installMethods":230,"name":231,"sourceUrl":232,"tags":233},">",{},"Document Parser Skill","https://github.com/claude-office-skills/skills/tree/HEAD/doc-parser",[234,21,15,235,236,16,237,238],"parsing","python","ocr","layout-analysis","docling",{"githubOwner":189,"githubRepo":190,"locale":25,"slug":240,"type":192},"doc-parser",{"extract":242,"llm":243},{"commitSha":197,"license":184},{"promptVersionExtension":168,"promptVersionScoring":169,"score":244,"targetMarket":174,"tier":175},92,{"repoId":200},{"anyEnrichmentAt":220,"extractAt":221,"githubAt":220,"llmAt":187,"updatedAt":187},{"_creationTime":248,"_id":249,"community":250,"display":251,"identity":259,"providers":261,"relations":264,"workflow":265},1778053148350.4656,"k171nxqak0bb4qq89mkfwf02s5867cf6",{"reviewCount":8},{"description":252,"installMethods":253,"name":254,"sourceUrl":255,"tags":256},"Convert PDF files to editable Word documents using pdf2docx",{},"PDF to DOCX Converter","https://github.com/claude-office-skills/skills/tree/HEAD/pdf-to-docx",[15,257,258,21,235],"docx","conversion",{"githubOwner":189,"githubRepo":190,"locale":25,"slug":260,"type":192},"pdf-to-docx",{"extract":262,"llm":263},{"commitSha":197,"license":184},{"promptVersionExtension":168,"promptVersionScoring":169,"score":214,"targetMarket":174,"tier":175},{"repoId":200},{"anyEnrichmentAt":220,"extractAt":221,"githubAt":220,"llmAt":187,"updatedAt":187},{"_creationTime":267,"_id":268,"community":269,"display":270,"identity":280,"providers":282,"relations":285,"workflow":286},1778053148350.4265,"k171agyyd8nv26rt447dvhy0998669wm",{"reviewCount":8},{"description":271,"installMethods":272,"name":273,"sourceUrl":274,"tags":275},"Answer questions about PDF content, summarize, and extract information",{},"Chat with PDF","https://github.com/claude-office-skills/skills/tree/HEAD/chat-with-pdf",[15,276,16,277,278,279],"qa","summarization","mcp","documentation",{"githubOwner":189,"githubRepo":190,"locale":25,"slug":281,"type":192},"chat-with-pdf",{"extract":283,"llm":284},{"commitSha":197,"license":184},{"promptVersionExtension":168,"promptVersionScoring":169,"score":171,"targetMarket":174,"tier":175},{"repoId":200},{"anyEnrichmentAt":220,"extractAt":221,"githubAt":220,"llmAt":187,"updatedAt":187},{"_creationTime":288,"_id":289,"community":290,"display":291,"identity":298,"providers":300,"relations":304,"workflow":305},1778053148350.4768,"k17c4t5g480bzq5t7qrjgbjsys867fb5",{"reviewCount":8},{"description":229,"installMethods":292,"name":293,"sourceUrl":294,"tags":295},{},"Table Extractor","https://github.com/claude-office-skills/skills/tree/HEAD/table-extractor",[15,16,296,297,234],"table","camelot",{"githubOwner":189,"githubRepo":190,"locale":25,"slug":299,"type":192},"table-extractor",{"extract":301,"llm":302},{"commitSha":197,"license":184},{"promptVersionExtension":168,"promptVersionScoring":169,"score":244,"targetMarket":174,"tier":303},"flagged",{"repoId":200},{"anyEnrichmentAt":220,"extractAt":221,"githubAt":220,"llmAt":187,"updatedAt":187},{"_creationTime":307,"_id":308,"community":309,"display":310,"identity":318,"providers":320,"relations":323,"workflow":324},1778053148350.4734,"k1782aqmjfqy0qysysgq76w9z1867e3x",{"reviewCount":8},{"description":229,"installMethods":311,"name":312,"sourceUrl":313,"tags":314},{},"Smart OCR Skill","https://github.com/claude-office-skills/skills/tree/HEAD/smart-ocr",[236,234,315,316,16,317,21],"multilingual","paddleocr","image-processing",{"githubOwner":189,"githubRepo":190,"locale":25,"slug":319,"type":192},"smart-ocr",{"extract":321,"llm":322},{"commitSha":197,"license":184},{"promptVersionExtension":168,"promptVersionScoring":169,"score":244,"targetMarket":174,"tier":175},{"repoId":200},{"anyEnrichmentAt":220,"extractAt":221,"githubAt":220,"llmAt":187,"updatedAt":187},{"_creationTime":326,"_id":327,"community":328,"display":329,"identity":339,"providers":343,"relations":349,"workflow":351},1778053968286.4924,"k177bv4rmjf0e7ypzrd861xcvn867ysw",{"reviewCount":8},{"description":330,"name":331,"sourceUrl":332,"tags":333},"Use when processing large PDFs, document collections, or bulk text extraction tasks that benefit from GPU-accelerated processing. Triggers when the user provides large documents or needs bulk document analysis.","GPU Document Processing","https://github.com/langchain-ai/deepagents/tree/HEAD/examples/nvidia_deep_agent/skills/gpu-document-processing",[334,21,15,335,16,336,337,338],"gpu","embedding","nvidia-nim","llm","agent",{"githubOwner":340,"githubRepo":341,"locale":25,"slug":342,"type":192},"langchain-ai","deepagents","gpu-document-processing",{"extract":344,"llm":346,"smithery":348},{"commitSha":345,"license":184},"b108c71d0c570e16c7050c1eac482e15dc35a5ed",{"promptVersionExtension":168,"promptVersionScoring":169,"score":347,"targetMarket":174,"tier":175},85,{"qualityScore":8,"totalActivations":8,"uniqueUsers":8,"useCount":8,"verified":183},{"repoId":350},"kd76dna2fvfbnjvzcpd2cwqnyd865xz7",{"anyEnrichmentAt":352,"extractAt":353,"githubAt":354,"llmAt":355,"smitheryAt":352,"updatedAt":355},1778053994907,1778053968286,1778053969344,1778054053159]