跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

P Video Avatar

技能 已验证 活跃

Generate talking head avatar videos with Pruna P-Video-Avatar via inference.sh CLI. Turn a portrait image into a realistic speaking video with built-in TTS. 18x faster and 6x cheaper than competitors. Models: P-Video-Avatar, P-Image (for portrait generation). Capabilities: text-to-avatar, audio-driven avatars, 30 voices, 10 languages, 720p/1080p, built-in TTS, dynamic backgrounds, full-body control. Use for: AI presenters, product demos, explainer videos, virtual influencers, marketing, education, multilingual content, UGC, gaming avatars. Triggers: avatar video, talking head, ai avatar, p-video-avatar, pruna avatar, video avatar, ai presenter, digital human, virtual presenter, lipsync, talking avatar, ai spokesperson, heygen alternative, synthesia alternative, veed alternative, fabric alternative, omnihuman alternative

目的

To generate realistic and cost-effective talking head avatar videos for various applications like marketing, education, and social media content.

功能

  • Generate avatar videos from portrait images
  • Text-to-speech synthesis with 30 voices
  • Support for 10 languages
  • 720p and 1080p resolution output
  • Audio-driven avatar lip-sync

使用场景

  • Creating AI presenters for product demos
  • Generating explainer videos for educational content
  • Producing virtual influencers for social media marketing
  • Localizing content across multiple languages with a single avatar

非目标

  • Performing advanced video editing beyond avatar animation
  • Providing a complete video production suite
  • Replacing live actors for all scenarios

Compliance

  • info:GDPRThe skill processes image and text data, which could potentially include personal data, but no specific sanitization is mentioned beyond what the LLM might provide.

安装

npx skills add inferen-sh/skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
99 /100
1 day ago 分析

信任信号

最近提交1 day ago
星标433
状态
查看源代码

类似扩展

Google Tts

100

Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".

技能
sanjay3290

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

技能
openai

Openclaw

100

使用多提供商路由从文本生成图像和视频 — 支持 GPT Image 2.0(近乎完美的文本渲染)、Nanobanana 2、Seedream 5.0、Midjourney V8.1(统一的逼真+动漫风格)、Flux 2 Klein(经济高效的草稿)、Seedance 2.0 / Happyhorse 1.0 / Veo 3.1 视频,以及本地 ComfyUI 工作流。包含 1,446 个精选提示和风格感知提示增强。当用户想要创建图像/视频、设计素材、制作照片动画、增强提示或管理 AI 艺术工作流时使用。不适用于:通用聊天、代码生成、文档编写、现有素材的视频编辑、音频/TTS,或任何与 AI 图像/视频创建无关的任务。

技能
jau123

Smart Crop Avatar and Remove Background

99

Smart crop to face, remove the background, and convert to WebP for a clean user avatar.

技能
iterationlayer

Sweep Flag Namespace

99

Bulk-extract every candidate flag from a binary namespace, build an extraction inventory with occurrence counts and call-type tags, cross- reference against a documented set, and track completeness across probe campaigns until the undocumented remainder reaches zero. Covers namespace prefix harvesting, gate-vs-telemetry disambiguation at the call-site level, completeness metrics, DEFAULT-TRUE population reporting, and a final completion confirmation scan. Use upstream of probe-feature-flag- state when you need a complete catalog rather than a sample, or when a prior wave-based campaign needs a verifiable end condition.

技能
pjt222

Review Skill

99

Review a proposed Agent Skill for structural validity and content quality before publishing. Runs the skill-validator CLI to check for structural issues, scores the skill with an LLM judge, and interprets results to advise SMEs on what to address. Use when a user wants to review, validate, or quality-check an Agent Skill.

技能
mongodb