P Video Avatar
技能 已验证 活跃Generate talking head avatar videos with Pruna P-Video-Avatar via inference.sh CLI. Turn a portrait image into a realistic speaking video with built-in TTS. 18x faster and 6x cheaper than competitors. Models: P-Video-Avatar, P-Image (for portrait generation). Capabilities: text-to-avatar, audio-driven avatars, 30 voices, 10 languages, 720p/1080p, built-in TTS, dynamic backgrounds, full-body control. Use for: AI presenters, product demos, explainer videos, virtual influencers, marketing, education, multilingual content, UGC, gaming avatars. Triggers: avatar video, talking head, ai avatar, p-video-avatar, pruna avatar, video avatar, ai presenter, digital human, virtual presenter, lipsync, talking avatar, ai spokesperson, heygen alternative, synthesia alternative, veed alternative, fabric alternative, omnihuman alternative
To generate realistic and cost-effective talking head avatar videos for various applications like marketing, education, and social media content.
功能
- Generate avatar videos from portrait images
- Text-to-speech synthesis with 30 voices
- Support for 10 languages
- 720p and 1080p resolution output
- Audio-driven avatar lip-sync
使用场景
- Creating AI presenters for product demos
- Generating explainer videos for educational content
- Producing virtual influencers for social media marketing
- Localizing content across multiple languages with a single avatar
非目标
- Performing advanced video editing beyond avatar animation
- Providing a complete video production suite
- Replacing live actors for all scenarios
Compliance
- info:GDPRThe skill processes image and text data, which could potentially include personal data, but no specific sanitization is mentioned beyond what the LLM might provide.
安装
npx skills add inferen-sh/skills通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。
质量评分
已验证类似扩展
Google Tts
100Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".
Speech Generation Skill
100Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.
Openclaw
100使用多提供商路由从文本生成图像和视频 — 支持 GPT Image 2.0(近乎完美的文本渲染)、Nanobanana 2、Seedream 5.0、Midjourney V8.1(统一的逼真+动漫风格)、Flux 2 Klein(经济高效的草稿)、Seedance 2.0 / Happyhorse 1.0 / Veo 3.1 视频,以及本地 ComfyUI 工作流。包含 1,446 个精选提示和风格感知提示增强。当用户想要创建图像/视频、设计素材、制作照片动画、增强提示或管理 AI 艺术工作流时使用。不适用于:通用聊天、代码生成、文档编写、现有素材的视频编辑、音频/TTS,或任何与 AI 图像/视频创建无关的任务。
Smart Crop Avatar and Remove Background
99Smart crop to face, remove the background, and convert to WebP for a clean user avatar.
Sweep Flag Namespace
99Bulk-extract every candidate flag from a binary namespace, build an extraction inventory with occurrence counts and call-type tags, cross- reference against a documented set, and track completeness across probe campaigns until the undocumented remainder reaches zero. Covers namespace prefix harvesting, gate-vs-telemetry disambiguation at the call-site level, completeness metrics, DEFAULT-TRUE population reporting, and a final completion confirmation scan. Use upstream of probe-feature-flag- state when you need a complete catalog rather than a sample, or when a prior wave-based campaign needs a verifiable end condition.
Review Skill
99Review a proposed Agent Skill for structural validity and content quality before publishing. Runs the skill-validator CLI to check for structural issues, scores the skill with an LLM judge, and interprets results to advise SMEs on what to address. Use when a user wants to review, validate, or quality-check an Agent Skill.