此内容尚未提供您的语言版本,正在以英文显示。

P Video Avatar

技能已验证活跃

Generate talking head avatar videos with Pruna P-Video-Avatar via inference.sh CLI. Turn a portrait image into a realistic speaking video with built-in TTS. 18x faster and 6x cheaper than competitors. Models: P-Video-Avatar, P-Image (for portrait generation). Capabilities: text-to-avatar, audio-driven avatars, 30 voices, 10 languages, 720p/1080p, built-in TTS, dynamic backgrounds, full-body control. Use for: AI presenters, product demos, explainer videos, virtual influencers, marketing, education, multilingual content, UGC, gaming avatars. Triggers: avatar video, talking head, ai avatar, p-video-avatar, pruna avatar, video avatar, ai presenter, digital human, virtual presenter, lipsync, talking avatar, ai spokesperson, heygen alternative, synthesia alternative, veed alternative, fabric alternative, omnihuman alternative

目的

To generate realistic and cost-effective talking head avatar videos for various applications like marketing, education, and social media content.

功能

Generate avatar videos from portrait images
Text-to-speech synthesis with 30 voices
Support for 10 languages
720p and 1080p resolution output
Audio-driven avatar lip-sync

使用场景

Creating AI presenters for product demos
Generating explainer videos for educational content
Producing virtual influencers for social media marketing
Localizing content across multiple languages with a single avatar

非目标

Performing advanced video editing beyond avatar animation
Providing a complete video production suite
Replacing live actors for all scenarios

Compliance

info:GDPRThe skill processes image and text data, which could potentially include personal data, but no specific sanitization is mentioned beyond what the LLM might provide.

安装

npx skills add inferen-sh/skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证

99 /100

1 day ago 分析

信任信号

最近提交1 day ago

GitHub 所有者 inferen-sh

星标433

状态

查看源代码

类似扩展

Google Tts

100

Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".

技能

sanjay3290

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

技能

openai

Openclaw

100

使用多提供商路由从文本生成图像和视频 — 支持 GPT Image 2.0（近乎完美的文本渲染）、Nanobanana 2、Seedream 5.0、Midjourney V8.1（统一的逼真+动漫风格）、Flux 2 Klein（经济高效的草稿）、Seedance 2.0 / Happyhorse 1.0 / Veo 3.1 视频，以及本地 ComfyUI 工作流。包含 1,446 个精选提示和风格感知提示增强。当用户想要创建图像/视频、设计素材、制作照片动画、增强提示或管理 AI 艺术工作流时使用。不适用于：通用聊天、代码生成、文档编写、现有素材的视频编辑、音频/TTS，或任何与 AI 图像/视频创建无关的任务。

技能

jau123

Smart Crop Avatar and Remove Background

Smart crop to face, remove the background, and convert to WebP for a clean user avatar.

技能

iterationlayer

Sweep Flag Namespace

Bulk-extract every candidate flag from a binary namespace, build an extraction inventory with occurrence counts and call-type tags, cross- reference against a documented set, and track completeness across probe campaigns until the undocumented remainder reaches zero. Covers namespace prefix harvesting, gate-vs-telemetry disambiguation at the call-site level, completeness metrics, DEFAULT-TRUE population reporting, and a final completion confirmation scan. Use upstream of probe-feature-flag- state when you need a complete catalog rather than a sample, or when a prior wave-based campaign needs a verifiable end condition.

技能

pjt222

Review Skill

Review a proposed Agent Skill for structural validity and content quality before publishing. Runs the skill-validator CLI to check for structural issues, scores the skill with an LLM judge, and interprets results to advise SMEs on what to address. Use when a user wants to review, validate, or quality-check an Agent Skill.

技能

mongodb