Skip to main content

ElevenLabs Audio Generation

Skill Active

Generate AI voiceovers, sound effects, and music using ElevenLabs APIs. Use when creating audio content for videos, podcasts, or games. Triggers include generating voiceovers, narration, dialogue, sound effects from descriptions, background music, soundtrack generation, voice cloning, or any audio synthesis task.

Purpose

To empower users to create professional-quality audio content for videos, podcasts, and games using AI, streamlining the process from script to final render.

Features

  • Generate AI voiceovers from text
  • Perform voice cloning for custom voices
  • Create sound effects and music
  • Integrate audio generation into Remotion video projects
  • Utilize ElevenLabs API with detailed examples

Use Cases

  • Creating narration for explainer videos
  • Generating dialogue for game characters
  • Producing background music for podcasts
  • Adding sound effects to video projects

Non-Goals

  • Complex video editing beyond audio integration
  • Direct control of cloud GPU instance management (handled by other skills)
  • Providing an alternative to the ElevenLabs platform itself

Workflow

  1. Read script or description
  2. Call ElevenLabs API for audio generation
  3. Save generated audio file
  4. Integrate audio into video project (e.g., Remotion)

Practices

  • API integration
  • Audio synthesis
  • Video production workflow

Prerequisites

  • ELEVENLABS_API_KEY environment variable
  • Python 3.9+ recommended
  • Node.js 18+ for Remotion integration

Security

  • warning:Secret ManagementThe skill requires an `ELEVENLABS_API_KEY` environment variable, but the documentation (`.claude/skills/elevenlabs/SKILL.md`) does not explicitly detail how this secret should be securely managed or rotated, posing a potential risk if not handled properly.

Installation

npx skills add digitalsamba/claude-code-video-toolkit

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

95 /100
Analyzed 13 days ago

Trust Signals

Last commit15 days ago
Stars1.1k
LicenseMIT
Status
View Source

Similar Extensions

Elevenlabs Tts

99

ElevenLabs text-to-speech with 22+ premium voices, multilingual support, and voice tuning via inference.sh CLI. Models: eleven_multilingual_v2 (highest quality), eleven_turbo_v2_5 (low latency), eleven_flash_v2_5 (ultra-fast). Capabilities: text-to-speech, voice selection, stability/style control, 32 languages. Use for: voiceovers, audiobooks, video narration, podcasts, accessibility, IVR. Triggers: elevenlabs, eleven labs, elevenlabs tts, premium tts, professional voice, ai voice, high quality tts, multilingual tts, eleven labs voice, voice generation, natural speech, realistic voice, voice over, speech synthesis

Skill
inferen-sh

Google Tts

100

Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".

Skill
sanjay3290

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

Skill
openai

Sherpa Onnx Tts

99

Local text-to-speech via sherpa-onnx (offline, no cloud)

Skill
steipete

Audio Editing Fundamentals

99

Master the essential audio post-production techniques—normalization, compression, EQ, and noise reduction—using the correct processing order to achieve professional-quality audio. Use when: Editing podcast episodes or video soundtracks; Cleaning up recorded voiceovers; Improving audio quality for marketing content; Preparing audio files for distribution; Troubleshooting common audio issues

Skill
guia-matthieu

Ffmpeg

99

Video and audio processing with FFmpeg. Use for format conversion, resizing, compression, audio extraction, and preparing assets for Remotion. Triggers include converting GIF to MP4, resizing video, extracting audio, compressing files, or any media transformation task.

Skill
digitalsamba

© 2025 SkillRepo · Find the right skill, skip the noise.