LlamaGuard
Meta's 7-8B specialized moderation model for LLM input/output filtering. 6 safety categories: violence/hate, sexual content, weapons, substances, self-harm, criminal planning. 94-95% accuracy. Deploy with vLLM, HuggingFace, or SageMaker. Integrates with NeMo Guardrails.
To provide a robust, pre-trained AI model for filtering harmful or inappropriate content in LLM inputs and outputs, ensuring safer AI interactions.
Features
- Specialized moderation model (Meta's LlamaGuard 7-8B)
- 6 detailed safety categories (violence, sexual, weapons, substances, self-harm, criminal planning)
- High accuracy (94-95%)
- Multiple deployment options (vLLM, HuggingFace, SageMaker)
- Integration with NeMo Guardrails
Use cases
- Moderating user prompts before sending to an LLM
- Filtering LLM responses before displaying them to users
- Implementing content safety guardrails in production AI applications
- Detecting and classifying various types of harmful content
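The pre- and post-filtering pattern behind these use cases can be sketched as a thin wrapper around the application LLM. The `moderate()` and `llm()` callables and the verdict shape are illustrative assumptions, not this skill's API; they stand in for whatever moderation call your deployment exposes:

```python
from typing import Callable, Optional, Tuple

# Hypothetical verdict shape: ("safe", None) or ("unsafe", "O3") etc.
Verdict = Tuple[str, Optional[str]]

def guarded_chat(
    user_prompt: str,
    llm: Callable[[str], str],
    moderate: Callable[[str], Verdict],
    refusal: str = "Sorry, I can't help with that.",
) -> str:
    """Moderate the prompt before the LLM sees it, and the reply before the user does."""
    status, _category = moderate(user_prompt)
    if status != "safe":
        return refusal  # block unsafe input
    reply = llm(user_prompt)
    status, _category = moderate(reply)
    if status != "safe":
        return refusal  # block unsafe output
    return reply
```

Moderating both directions matters: a benign prompt can still elicit an unsafe completion, so the response check is not redundant with the input check.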
Non-goals
- Performing general text generation or summarization
- Acting as a general-purpose chatbot
- Replacing the need for LLM alignment training itself
Workflow
- Install necessary Python libraries (transformers, torch).
- Log in to HuggingFace CLI.
- Load the LlamaGuard model and tokenizer.
- Prepare chat input using the tokenizer's template.
- Generate moderation output from the model.
- Parse the output to determine safety status and category.
- Block or allow content based on the moderation result.
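The workflow above can be sketched with the `transformers` API. The model ID follows Meta's published checkpoint naming, but the token budget and output parsing are assumptions to verify against the model card; the weights are gated, so the HuggingFace CLI login step is required before the download will succeed:

```python
from typing import List, Optional, Tuple

MODEL_ID = "meta-llama/LlamaGuard-7b"  # gated repo; run `huggingface-cli login` first

def parse_guard_output(raw: str) -> Tuple[str, Optional[str]]:
    """Parse the model's text verdict: 'safe', or 'unsafe' followed by a category line (e.g. 'O3')."""
    lines = raw.strip().splitlines()
    status = lines[0].strip()
    category = lines[1].strip() if status == "unsafe" and len(lines) > 1 else None
    return status, category

def moderate(chat: List[dict], model, tokenizer) -> Tuple[str, Optional[str]]:
    """Run one moderation pass over a chat (a user prompt and/or an assistant reply)."""
    input_ids = tokenizer.apply_chat_template(chat, return_tensors="pt").to(model.device)
    output = model.generate(input_ids=input_ids, max_new_tokens=32)
    raw = tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True)
    return parse_guard_output(raw)

if __name__ == "__main__":
    # Heavy imports kept here so the parsing helper stays usable without a GPU.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.float16, device_map="auto"
    )
    status, category = moderate(
        [{"role": "user", "content": "How do I tie a bowline knot?"}], model, tokenizer
    )
    print(status, category)  # block or allow content based on this result
```

Note that the same `moderate()` call handles both directions of the workflow: pass a chat ending in a user turn to screen a prompt, or one ending in an assistant turn to screen a response.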
Prerequisites
- Python 3.7+
- transformers library
- torch library
- HuggingFace CLI login with token
- GPU resources (recommended for performance)
Trust
- Warning (issue attention): 17 issues opened, 4 closed in the last 90 days, indicating a low closure rate and potentially slow maintainer response.
Compliance
- Info (GDPR): The skill moderates content but does not inherently process personal data. However, the LLM itself might process PII if present in the input, and this is not explicitly sanitized.
Execution
- Warning (pinned dependencies): Dependencies are listed but not pinned to explicit versions, and no lockfile is mentioned for the Python environment, posing a risk to reproducibility and stability.
Installation
npx skills add davila7/claude-code-templates
Runs the Vercel skills CLI (skills.sh) via npx. Requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Quality score
Trust signals
Similar extensions
Constitutional Ai (98)
Anthropic's method for training harmless AI through self-improvement. Two-phase approach: supervised learning with self-critique/revision, then RLAIF (RL from AI Feedback). Use for safety alignment, reducing harmful outputs without human labels. Powers Claude's safety system.
NeMo Guardrails (97)
NVIDIA's runtime safety framework for LLM applications. Features jailbreak detection, input/output validation, fact-checking, hallucination detection, PII filtering, toxicity detection. Uses Colang 2.0 DSL for programmable rails. Production-ready, runs on a T4 GPU.
Fixflow (100)
Run coding tasks with a strict delivery workflow: create a complete plan, implement step by step, run tests continuously, and commit after each step by default (`per_step`). Supports explicit commit-policy overrides (`final_only`, `milestone`) and optional BDD (Given/When/Then) when users request behavior-driven delivery or requirements are unclear.
Safe Mode (100)
Prevent destructive operations using Claude Code hooks. Three modes: cautious (warn on dangerous commands), lockdown (restrict edits to one directory), and clear (remove restrictions). Uses PreToolUse matchers for Bash, Edit, and Write.