Model Pruning
Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.
To reduce LLM size and accelerate inference using techniques like Wanda and SparseGPT, enabling deployment on constrained hardware and efficient serving.
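The simplest of these techniques, one-shot magnitude pruning, makes the approach concrete. Below is a minimal sketch assuming a PyTorch model; the function name and the per-layer thresholding strategy are illustrative, not this skill's actual API.

```python
# Minimal sketch: one-shot magnitude pruning at 50% unstructured sparsity.
# Illustrative only; not this skill's actual API.
import torch
import torch.nn as nn

def magnitude_prune(model: nn.Module, sparsity: float = 0.5) -> None:
    """Zero out the smallest-magnitude weights in every Linear layer."""
    for module in model.modules():
        if isinstance(module, nn.Linear):
            w = module.weight.data
            k = int(w.numel() * sparsity)              # weights to drop
            if k == 0:
                continue
            # The k-th smallest |w| is the per-layer pruning threshold.
            threshold = w.abs().flatten().kthvalue(k).values
            module.weight.data *= (w.abs() > threshold)  # one-shot, no retraining
```

At the same 50% sparsity, activation-aware methods such as Wanda and SparseGPT typically preserve accuracy noticeably better than this magnitude-only baseline, which is why the skill centers on them.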
Features
- Reduce model size by 40-60%
- Accelerate inference with hardware-friendly sparsity
- Deploy on constrained hardware
- Compress models without retraining (one-shot)
- Implement Wanda, SparseGPT, and N:M structured pruning (a Wanda-style scoring sketch follows this list)
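As a hedged illustration of the Wanda feature above, this sketch shows its scoring rule, weight magnitude scaled by the input activation norm, applied to one Linear layer. The signature and `calib_acts` are assumptions, not the skill's real interface.

```python
# Hedged sketch of Wanda-style scoring: |weight| times the L2 norm of the
# matching input activation, pruned per output row (Sun et al., 2023).
# `calib_acts` is an assumed calibration batch of shape [num_tokens, in_features].
import torch
import torch.nn as nn

def wanda_prune_layer(layer: nn.Linear, calib_acts: torch.Tensor,
                      sparsity: float = 0.5) -> None:
    act_norm = calib_acts.norm(p=2, dim=0)          # (in_features,)
    score = layer.weight.data.abs() * act_norm      # (out_features, in_features)
    k = int(score.shape[1] * sparsity)              # weights to drop per row
    # Drop the k lowest-scoring weights within each output row.
    _, idx = torch.topk(score, k, dim=1, largest=False)
    mask = torch.ones_like(score, dtype=torch.bool)
    mask.scatter_(1, idx, False)
    layer.weight.data *= mask
```

The appeal of this rule is that it needs only a small calibration batch: no gradients and no weight updates, hence one-shot.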
Use Cases
- Compressing LLMs for deployment on edge devices
- Achieving faster inference speeds on hardware accelerators
- Reducing memory footprint for efficient LLM serving
- Exploring state-of-the-art model pruning techniques
Non-Goals
- Retraining models after pruning
- Providing a general-purpose model optimization suite
- Handling unstructured sparsity without hardware support for speedup (a hardware-friendly 2:4 pattern is sketched after this list)
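For the last non-goal above: the hardware-friendly alternative to unstructured sparsity is N:M (e.g. 2:4) sparsity, which Ampere-class sparse tensor cores can accelerate. A minimal sketch, with illustrative names:

```python
# Minimal sketch of 2:4 (N:M) structured pruning: within every contiguous
# group of 4 weights along the input dimension, keep the 2 largest
# magnitudes. Names are illustrative, not this skill's API.
import torch
import torch.nn as nn

def nm_prune_layer(layer: nn.Linear, n: int = 2, m: int = 4) -> None:
    w = layer.weight.data                           # (out_features, in_features)
    assert w.shape[1] % m == 0, "in_features must be divisible by m"
    groups = w.abs().reshape(w.shape[0], -1, m)     # (out, in // m, m)
    # Keep the n largest-magnitude weights in each group of m.
    _, idx = torch.topk(groups, n, dim=2)
    mask = torch.zeros_like(groups, dtype=torch.bool)
    mask.scatter_(2, idx, True)
    layer.weight.data *= mask.reshape(w.shape)
```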
Practical Utility
- Edge cases: The SKILL.md names limitations like 'no retraining' and 'activation dependency' but does not detail specific failure modes with symptoms and recovery steps.
Execution
- Validation: While the code uses standard Python libraries, explicit schema validation for all inputs and outputs is not detailed in the documentation.
- Pinned dependencies: Dependencies are listed, but specific version pinning or lockfiles are not explicitly shown in the documentation for the provided examples.
Code Execution
- Error handling: Python scripts generally handle errors, but structured error reporting and fail-closed behavior for the pruning functions are not explicitly documented (an illustrative sketch follows).
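As the note above says, fail-closed behavior is not documented by this skill. Purely as an illustration of what such a wrapper could look like (`prune_fn`, the sparsity check, and `tol` are all hypothetical):

```python
# Illustrative only: nothing like this is documented by the skill. A
# fail-closed wrapper could snapshot the dense weights, validate the achieved
# sparsity, and restore the snapshot on any failure.
import copy
import torch.nn as nn

def prune_fail_closed(model: nn.Module, prune_fn, target_sparsity: float,
                      tol: float = 0.05) -> nn.Module:
    backup = copy.deepcopy(model.state_dict())      # dense snapshot
    try:
        prune_fn(model)
        zeros = sum((p == 0).sum().item() for p in model.parameters())
        total = sum(p.numel() for p in model.parameters())
        achieved = zeros / total
        if abs(achieved - target_sparsity) > tol:
            raise ValueError(f"sparsity {achieved:.2%} misses target "
                             f"{target_sparsity:.2%}")
        return model
    except Exception:
        model.load_state_dict(backup)               # fail closed: keep dense model
        raise
```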
Installation
npx skills add davila7/claude-code-templates
Runs the Vercel skills CLI (skills.sh) via npx; requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repository follows the agentskills.io format.
Similar Extensions
PyTorch Lightning (score 100)
Deep learning framework (PyTorch Lightning). Organize PyTorch code into LightningModules, configure Trainers for multi-GPU/TPU, and implement data pipelines, callbacks, logging (W&B, TensorBoard), and distributed training (DDP, FSDP, DeepSpeed) for scalable neural network training.
Implementing LLMs with LitGPT (score 100)
Implements and trains LLMs using Lightning AI's LitGPT with 20+ pretrained architectures (Llama, Gemma, Phi, Qwen, Mistral). Use when you need clean model implementations, an educational understanding of architectures, or production fine-tuning with LoRA/QLoRA. Single-file implementations, no abstraction layers.
ML Training Recipes (score 99)
Battle-tested PyTorch training recipes for all domains: LLMs, vision, diffusion, medical imaging, protein/drug discovery, spatial omics, genomics. Covers training loops, optimizer selection (AdamW, Muon), LR scheduling, mixed precision, debugging, and systematic experimentation. Use when training or fine-tuning neural networks, debugging loss spikes or OOM, choosing architectures, or optimizing GPU throughput.
Ray Train (score 99)
Distributed training orchestration across clusters. Scales PyTorch/TensorFlow/HuggingFace from laptop to 1000s of nodes. Built-in hyperparameter tuning with Ray Tune, fault tolerance, elastic scaling. Use when training massive models across multiple machines or running distributed hyperparameter sweeps.
PyTorch Lightning (score 99)
High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from laptop to supercomputer with the same code. Use when you want clean training loops with built-in best practices.