AlterLab FC AI Audio Producer

Skill · Verified · Active

This skill should be used when the user asks about "audio production", "ElevenLabs", "voice isolator", "audio post-production", "AI narration", "text to speech production", "voiceover studio", "audio native", "transcription", "Scribe", "multi-track audio", "audio assembly", "batch audio processing", "audio export", "act as an audio producer", "audio producer mode", "TTS production", "podcast audio", "audiobook production", "narration workflow", "content series audio", "multi-tool audio chain", "ElevenLabs Projects", or needs expertise in end-to-end audio production pipelines using ElevenLabs tools. Part of the AlterLab FC Skills collection (GenAI pack).

Purpose

To provide users with an expert autonomous agent for producing broadcast-ready audio content, leveraging the full suite of ElevenLabs tools for efficient and high-quality results.

Features

  • Autonomous audio production workflow
  • Integration with ElevenLabs tools (Voice Isolator, Studio 3.0, Scribe v2, Eleven Music)
  • End-to-end audio pipeline management
  • Batch processing for content series
  • Detailed output formats for plans, settings, and templates

Use cases

  • Cleaning noisy interview recordings with Voice Isolator
  • Producing podcast episodes or audiobooks with consistent voice and pacing
  • Embedding AI narration on web content using Audio Native
  • Generating show notes and subtitles via Scribe v2

Non-goals

  • Providing generic audio advice outside the ElevenLabs toolset
  • Performing tasks unrelated to audio post-production
  • Acting as a simple API wrapper without autonomous workflow execution

Workflow

  1. Source audio evaluation
  2. Cleanup and voice generation
  3. Assembly and production
  4. Batch processing for content series
  5. Export, QC, and distribution
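
The five stages above could be sketched as an ordered pipeline. This is a minimal illustration only: the stage names come from the list, while `EpisodeJob`, `run_workflow`, and the handler mapping are hypothetical names introduced here, and no ElevenLabs API is called. How the skill actually runs each stage is not described on this page.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Stage names are copied from the workflow list; everything else is hypothetical.
STAGES: List[str] = [
    "Source audio evaluation",
    "Cleanup and voice generation",
    "Assembly and production",
    "Batch processing for content series",
    "Export, QC, and distribution",
]


@dataclass
class EpisodeJob:
    """Hypothetical record for one episode moving through the workflow."""
    title: str
    completed: List[str] = field(default_factory=list)


def run_workflow(
    job: EpisodeJob,
    handlers: Dict[str, Callable[[EpisodeJob], None]],
) -> EpisodeJob:
    """Visit every stage in order, calling its handler if one is registered.

    A stage with no handler is treated as a no-op placeholder and is still
    recorded, so the order of stages stays visible.
    """
    for stage in STAGES:
        handler = handlers.get(stage)
        if handler is not None:
            handler(job)
        job.completed.append(stage)
    return job


if __name__ == "__main__":
    job = run_workflow(EpisodeJob("Episode 1"), handlers={})
    for number, stage in enumerate(job.completed, start=1):
        print(f"{number}. {stage}")
```

Keeping the stage order in a single list makes it easy to attach real handlers later (for example, a cleanup step) without changing the sequence.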

Practices

  • Audio Production Standards
  • Multi-Tool Chain Orchestration
  • Batch Processing Workflows
  • Quality Control and Review

Versioning

  • Info: Release Management. The README mentions a 'Release v1.2.0' tag, but there is no clear semver versioning in the SKILL.md frontmatter or manifest.

Installation

npx skills add AlterLab-IEU/AlterLab-FC-Skills

Runs the Vercel skills CLI (skills.sh) through npx. Requires a local Node.js installation and at least one skills-compatible agent (Claude Code, Cursor, Codex, etc.). Assumes the repository follows the agentskills.io format.
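
Before running the install command, it may help to confirm that Node.js and npx are on the PATH. The sketch below is one way to do that with Python's standard library. `missing_tools` and `REQUIRED_TOOLS` are hypothetical names introduced here, and the check does not verify that a skills-compatible agent is installed.

```python
import shutil
from typing import Callable, List, Optional

# Executables the install step relies on (Node.js ships with npx).
REQUIRED_TOOLS: List[str] = ["node", "npx"]


def missing_tools(
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the required executables that cannot be found on the PATH.

    `which` is injectable so the check can be exercised without Node.js.
    """
    return [tool for tool in REQUIRED_TOOLS if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing from PATH: " + ", ".join(missing))
    else:
        print("Node.js and npx found; the install command can be run.")
```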

Quality score

Verified
96/100
Analyzed 1 day ago

Trust signals

Last commit: about 2 months ago
Stars: 3
License: MIT

Similar extensions

Dialogue Audio

95

Multi-speaker dialogue audio creation with ElevenLabs and Dia TTS. Covers speaker tags, emotion control, pacing, conversation flow, and post-production. Use for: podcasts, audiobooks, explainers, character dialogue, conversational content. Triggers: dialogue audio, multi speaker, conversation audio, dia tts, two speakers, podcast audio, character voices, voice acting, dialogue generation, conversation tts, multi voice, speaker tags, dialogue recording, elevenlabs dialogue, eleven labs conversation

Skill
inferen-sh

Voice Design

98

Select and create the perfect AI voice for your content using ElevenLabs, Qwen3-TTS, and other platforms—matching voice characteristics to brand personality and audience. Use when: Choosing an AI voice for video narration; Creating a consistent brand voice across content; Cloning a voice for scalable production; Comparing voice synthesis platforms; Designing voice characteristics by description

Skill
guia-matthieu

AI Voice Generation

95

AI voice generation, text-to-speech, and voice synthesis via inference.sh CLI. Models: Inworld TTS-2 (100+ languages, emotion/non-verbal steering), Inworld TTS 1.5 (ultra-low latency), ElevenLabs (22+ premium voices, 32 languages), Kokoro TTS, DIA, Chatterbox, Higgs, VibeVoice for natural speech. Capabilities: multiple voices, emotions, accents, long-form narration, conversation, voice transformation, delivery mode control, character voices. Use for: voiceovers, audiobooks, podcasts, video narration, accessibility, gaming NPCs, avatar audio, UGC. Triggers: voice cloning, tts, text to speech, ai voice, voice generation, voice synthesis, voice over, narration, speech synthesis, ai narrator, elevenlabs, eleven labs, natural voice, realistic speech, voice ai, voice changer, inworld, inworld tts, character voice, npc voice

Skill
inferen-sh

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

Skill
openai

Elevenlabs Tts

99

ElevenLabs text-to-speech with 22+ premium voices, multilingual support, and voice tuning via inference.sh CLI. Models: eleven_multilingual_v2 (highest quality), eleven_turbo_v2_5 (low latency), eleven_flash_v2_5 (ultra-fast). Capabilities: text-to-speech, voice selection, stability/style control, 32 languages. Use for: voiceovers, audiobooks, video narration, podcasts, accessibility, IVR. Triggers: elevenlabs, eleven labs, elevenlabs tts, premium tts, professional voice, ai voice, high quality tts, multilingual tts, eleven labs voice, voice generation, natural speech, realistic voice, voice over, speech synthesis

Skill
inferen-sh

Elevenlabs Dialogue

99

ElevenLabs multi-speaker dialogue generation - create conversations with different voices in a single audio file via inference.sh CLI. Capabilities: multi-voice dialogue, script-based generation, voice direction, conversation audio. Use for: podcasts, audiobooks, explainers, tutorials, character dialogue, video scripts. Triggers: elevenlabs dialogue, eleven labs dialogue, multi speaker, conversation audio, dialogue generation, text to dialogue, multi voice, voice acting, podcast dialogue, character voices, script to audio, elevenlabs conversation, two speakers

Skill
inferen-sh