CLIP
Skill · Verified · Active
OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search, content moderation, or vision-language tasks without fine-tuning. Best for general-purpose image understanding.
Provides a powerful zero-shot capability for understanding and relating images and text, useful for a wide range of AI-driven tasks without requiring custom model training.
Features
- Zero-shot image classification
- Image-text matching and similarity
- Semantic image search
- Content moderation
- Visual question answering
- Cross-modal retrieval
Use Cases
- Use for image search based on natural language queries (see the retrieval sketch after this list).
- Use for content moderation to detect inappropriate or sensitive content.
- Use for classifying images into categories without prior training data.
- Use for visual question answering tasks on images.
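The retrieval use case can be sketched with the Hugging Face transformers CLIP API. The checkpoint name, file paths, and query text below are illustrative assumptions, not values shipped with the skill:

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# Assumed checkpoint; any CLIP checkpoint compatible with transformers works similarly.
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image_paths = ["beach.jpg", "office.jpg", "forest.jpg"]  # placeholder local files
images = [Image.open(p).convert("RGB") for p in image_paths]
query = "a sunny beach with palm trees"

with torch.no_grad():
    image_inputs = processor(images=images, return_tensors="pt")
    image_embeds = model.get_image_features(**image_inputs)
    text_inputs = processor(text=[query], return_tensors="pt", padding=True)
    text_embeds = model.get_text_features(**text_inputs)

# Normalize embeddings and rank images by cosine similarity to the query.
image_embeds = image_embeds / image_embeds.norm(dim=-1, keepdim=True)
text_embeds = text_embeds / text_embeds.norm(dim=-1, keepdim=True)
scores = (image_embeds @ text_embeds.T).squeeze(-1)
for path, score in sorted(zip(image_paths, scores.tolist()), key=lambda x: -x[1]):
    print(f"{score:.3f}  {path}")
```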
Non-Goals
- Not for image segmentation tasks.
- Not for advanced image captioning (use BLIP-2 instead).
- Not for vision-language chat applications (use LLaVA instead).
Workflow
- Load CLIP model and preprocessor.
- Prepare image and text inputs.
- Encode image and/or text features.
- Compute similarity scores or probabilities.
- Interpret results for classification, search, or moderation (see the sketch below).
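A minimal sketch of this workflow for zero-shot classification, assuming the Hugging Face transformers implementation of CLIP and the openai/clip-vit-base-patch32 checkpoint (the image file and candidate labels are placeholders):

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# Load CLIP model and preprocessor.
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

# Prepare image and text inputs (placeholder file and labels).
image = Image.open("photo.jpg").convert("RGB")
labels = ["a photo of a cat", "a photo of a dog", "a photo of a car"]
inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)

# Encode image and text features and compute similarity scores.
with torch.no_grad():
    outputs = model(**inputs)

# logits_per_image holds image-text similarity; softmax turns it into class probabilities.
probs = outputs.logits_per_image.softmax(dim=-1)
for label, p in zip(labels, probs[0].tolist()):
    print(f"{label}: {p:.3f}")
```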
Prerequisites
- Python 3.7+
- PyTorch and torchvision
- transformers library
- Pillow library
Trust
- Issues (attention): There are 17 open issues and 4 closed issues in the last 90 days. The closure rate is low, suggesting maintainers may respond slowly.
Code Execution
- Validation: The Python code includes basic image and text processing, but parameter validation via a schema library is not explicitly demonstrated or used.
- Error handling: The provided Python code includes basic error handling for file operations but does not implement structured error reporting with retryable flags or hints for the agent.
Errors
- Actionable error messages: The Python code includes basic error handling for file loading, but error messages are standard Python exceptions and do not provide specific remediation steps or documentation links for the agent (see the sketch below for what such messages could look like).
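For illustration only, a hypothetical sketch of more actionable error reporting around image loading; the wrapper function, hint text, and retryable flag are assumptions, not part of the skill's actual code:

```python
from PIL import Image, UnidentifiedImageError

def load_image(path: str) -> Image.Image:
    """Hypothetical loader that raises errors with remediation hints for the agent."""
    try:
        return Image.open(path).convert("RGB")
    except FileNotFoundError:
        raise RuntimeError(
            f"Image not found at '{path}'. "
            "Hint: check the path or pass an absolute path. (retryable: no)"
        )
    except UnidentifiedImageError:
        raise RuntimeError(
            f"'{path}' is not a readable image. "
            "Hint: supported formats include JPEG and PNG. (retryable: no)"
        )
```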
Practical Utility
- Edge cases: The 'Limitations' section in SKILL.md names several edge cases, such as dataset biases and limited spatial understanding, but does not provide specific recovery steps for each.
Installation
npx skills add davila7/claude-code-templates
Runs the Vercel skills CLI (skills.sh) via npx; requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Quality Score
Verified · Trust Signals
Similar Extensions
Clip (Score: 98)
OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search, content moderation, or vision-language tasks without fine-tuning. Best for general-purpose image understanding.
Blip 2 Vision Language (Score: 98)
Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text retrieval, or multimodal chat with state-of-the-art zero-shot performance.
Baoyu Imagine (Score: 99)
AI image generation with OpenAI GPT Image 2, Azure OpenAI, Google, OpenRouter, DashScope, Z.AI GLM-Image, MiniMax, Jimeng, Seedream and Replicate APIs. Supports text-to-image, reference images, aspect ratios, and batch generation from saved prompt files. Sequential by default; use batch parallel generation when the user already has multiple prompts or wants stable multi-image throughput. Use when user asks to generate, create, or draw images.
Whisper (Score: 97)
OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.
Llava (Score: 96)
Large Language and Vision Assistant. Enables visual instruction tuning and image-based conversations. Combines CLIP vision encoder with Vicuna/LLaMA language models. Supports multi-turn image chat, visual question answering, and instruction following. Use for vision-language chatbots or image understanding tasks. Best for conversational image analysis.
Segment Anything Model (Score: 95)
Foundation model for image segmentation with zero-shot transfer. Use when you need to segment any object in images using points, boxes, or masks as prompts, or automatically generate all object masks in an image.