Tts
技能 活跃当用户想要将文本转换为语音、从文本生成音频或制作配音时,请使用此技能。触发词包括:提及 'TTS'、'text to speech'、'speak'、'say'、'voice'、'read aloud'、'audio narration'、'voiceover'、'dubbing',或要求将书面内容转换为口头音频。在将 EPUB/PDF/SRT/文章转换为音频、从参考音频克隆声音、控制语音中的情感或语速、将语音与字幕时间线对齐或生成每个片段的语音映射音频时,也请使用。
为用户提供一个多功能、高质量的工具,用于从文本生成语音音频,满足从简单配音到复杂配音的广泛需求。
功能
- 文本到语音转换
- 从参考音频进行声音克隆
- 情感和语速控制
- 从 SRT 进行时间线准确的音频渲染
- 支持 Noiz 云 API 和本地 Kokoro 后端
- Noiz API 的访客模式(无需身份验证)
使用场景
- 为视频或演示文稿生成配音
- 从文本文件或文章创建有声读物
- 为聊天机器人或虚拟助手生成合成语音
- 使用时间对齐的配音来配音视频内容
- 克隆特定声音以用于个性化音频消息
非目标
- 实时对话语音交互
- 超出简单合成和对齐范围的音频编辑
- 与聊天平台直接集成(尽管输出可用于此目的)
Maintenance
- warning:Dependency Management该技能需要 'requests' 包来支持 Noiz 后端,但没有明确提及锁文件或其自动依赖更新。
Trust
- info:Issues Attention过去 90 天内打开了 2 个问题,关闭了 0 个,表明问题上的近期活动很少。
Execution
- warning:Pinned dependencies脚本列出了 'requests' 等必需的包,但没有明确的版本固定或锁文件,这可能导致兼容性问题。
安装
npx skills add NoizAI/skills通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。
质量评分
类似扩展
Speech Generation Skill
100Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.
Google Tts
100Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".
Characteristic Voice
95每当用户希望语音听起来更具人情味、伙伴感或情感表现力时,请使用此技能。触发词包括:任何提及“说得像”、“像...一样说话”、“听起来像”、“伙伴声音”、“安慰我”、“让我开心”、“听起来更像人”、“晚安语音”、“早安语音”,或要求为生成的语音添加填充词、情感或个性。当用户希望模仿特定角色的声音、应用说话风格预设(晚安、早安、安慰、庆祝、聊天)、调整语气等情感参数(如温暖或温柔),或使文本转语音输出听起来像真人说话时,也请使用此技能。如果用户要求“语音消息”、“伙伴音频”、“角色声音”,或者想要带有叹息、笑声、犹豫或真正温暖的声音,请使用此技能。请勿用于没有个性的纯文本转语音、音乐生成、音效或与表情语音无关的常规编码任务。
Sherpa Onnx Tts
99Local text-to-speech via sherpa-onnx (offline, no cloud)
Elevenlabs Tts
99ElevenLabs text-to-speech with 22+ premium voices, multilingual support, and voice tuning via inference.sh CLI. Models: eleven_multilingual_v2 (highest quality), eleven_turbo_v2_5 (low latency), eleven_flash_v2_5 (ultra-fast). Capabilities: text-to-speech, voice selection, stability/style control, 32 languages. Use for: voiceovers, audiobooks, video narration, podcasts, accessibility, IVR. Triggers: elevenlabs, eleven labs, elevenlabs tts, premium tts, professional voice, ai voice, high quality tts, multilingual tts, eleven labs voice, voice generation, natural speech, realistic voice, voice over, speech synthesis
AlterLab FC AI Audio Producer
96This skill should be used when the user asks about "audio production", "ElevenLabs", "voice isolator", "audio post-production", "AI narration", "text to speech production", "voiceover studio", "audio native", "transcription", "Scribe", "multi-track audio", "audio assembly", "batch audio processing", "audio export", "act as an audio producer", "audio producer mode", "TTS production", "podcast audio", "audiobook production", "narration workflow", "content series audio", "multi-tool audio chain", "ElevenLabs Projects", or needs expertise in end-to-end audio production pipelines using ElevenLabs tools. Part of the AlterLab FC Skills collection (GenAI pack).