P Video Avatar
Skill · Verified · Active
Generate talking head avatar videos with Pruna P-Video-Avatar via the inference.sh CLI. Turn a portrait image into a realistic speaking video with built-in TTS. 18x faster and 6x cheaper than competitors.
Models: P-Video-Avatar, P-Image (for portrait generation).
Capabilities: text-to-avatar, audio-driven avatars, 30 voices, 10 languages, 720p/1080p, built-in TTS, dynamic backgrounds, full-body control.
Use for: AI presenters, product demos, explainer videos, virtual influencers, marketing, education, multilingual content, UGC, gaming avatars.
Triggers: avatar video, talking head, ai avatar, p-video-avatar, pruna avatar, video avatar, ai presenter, digital human, virtual presenter, lipsync, talking avatar, ai spokesperson, heygen alternative, synthesia alternative, veed alternative, fabric alternative, omnihuman alternative
Generates realistic, cost-effective talking head avatar videos for applications such as marketing, education, and social media content.
Features
- Generate avatar videos from portrait images
- Text-to-speech synthesis with 30 voices
- Support for 10 languages
- 720p and 1080p resolution output
- Audio-driven avatar lip-sync
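The features above imply two input modes (a text script spoken by the built-in TTS, or a supplied audio track) and two output resolutions. The following is a minimal Python sketch of how a caller might check a request before submitting it. The `AvatarRequest` class and all of its field names are hypothetical and are not taken from the inference.sh CLI or the P-Video-Avatar API; the rule that exactly one of text or audio is given is also an assumption.

```python
from dataclasses import dataclass
from typing import Optional

# Resolutions named in the feature list above.
SUPPORTED_RESOLUTIONS = {"720p", "1080p"}


@dataclass
class AvatarRequest:
    """Hypothetical request shape; field names are illustrative only."""
    image: str                    # path or URL of the portrait image
    text: Optional[str] = None    # script for the built-in TTS (text-to-avatar)
    audio: Optional[str] = None   # path or URL of an audio track (audio-driven)
    resolution: str = "720p"

    def validate(self) -> str:
        """Return the input mode ("text" or "audio") or raise ValueError."""
        if not self.image:
            raise ValueError("a portrait image is required")
        if self.resolution not in SUPPORTED_RESOLUTIONS:
            raise ValueError(f"unsupported resolution: {self.resolution}")
        # Assumption: a request carries exactly one of text or audio.
        if (self.text is None) == (self.audio is None):
            raise ValueError("provide exactly one of text or audio")
        return "text" if self.text is not None else "audio"
```

For example, `AvatarRequest(image="portrait.png", text="Hello").validate()` returns `"text"`, while a request with `resolution="4k"` raises `ValueError`.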
Use cases
- Creating AI presenters for product demos
- Generating explainer videos for educational content
- Producing virtual influencers for social media marketing
- Localizing content across multiple languages with a single avatar
Non-goals
- Performing advanced video editing beyond avatar animation
- Providing a complete video production suite
- Replacing live actors for all scenarios
Compliance
- GDPR (info): The skill processes image and text data, which may include personal data; no specific sanitization is mentioned beyond what the LLM might provide.
Installation
npx skills add inferen-sh/skills
Runs the Vercel skills CLI (skills.sh) via npx; requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Quality score
Verified
Similar extensions
Google Tts
100 · Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".
Speech Generation Skill
100 · Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.
Openclaw
100 · Generate images and videos from text with multi-provider routing; supports GPT Image 2.0 (near-perfect text rendering), Nanobanana 2, Seedream 5.0, Midjourney V8.1 (unified photorealistic + anime), Flux 2 Klein (cheap drafts), Seedance 2.0 / Happyhorse 1.0 / Veo 3.1 video, and local ComfyUI workflows. Includes 1,446 curated prompts and style-aware prompt enhancement. Use when users want to create images/videos, design assets, animate photos, improve prompts, or manage AI art workflows. NOT for: general chat, code generation, document creation, video editing of existing footage, audio/TTS, or tasks unrelated to AI image/video creation.
Smart Crop Avatar and Remove Background
99 · Smart crop to face, remove the background, and convert to WebP for a clean user avatar.
Sweep Flag Namespace
99 · Bulk-extract every candidate flag from a binary namespace, build an extraction inventory with occurrence counts and call-type tags, cross-reference against a documented set, and track completeness across probe campaigns until the undocumented remainder reaches zero. Covers namespace prefix harvesting, gate-vs-telemetry disambiguation at the call-site level, completeness metrics, DEFAULT-TRUE population reporting, and a final completion confirmation scan. Use upstream of probe-feature-flag-state when you need a complete catalog rather than a sample, or when a prior wave-based campaign needs a verifiable end condition.
Review Skill
99 · Reviews a proposed Agent Skill for structural validity and content quality before it is published. Runs the `skill-validator` CLI to check for structural issues, scores the skill with an LLM judge, and interprets the results to advise subject-matter experts on what to fix. Use this when a user wants to review, validate, or quality-check an Agent Skill.