ElevenLabs Voice Changer
Skill सत्यापितTransform the voice in an audio recording into a different target voice while preserving emotion, timing, and delivery using the ElevenLabs Voice Changer (speech-to-speech) API. Use when converting one voice to another, changing the speaker/narrator of an existing recording, dubbing a voice-over in a different voice, creating character voices from a scratch performance, anonymizing a speaker, or any "voice conversion / voice transfer / speech-to-speech" task. Make sure to use this skill whenever the user mentions voice changing, voice conversion, speech-to-speech, swapping a voice in audio, re-voicing a clip, or applying a different voice to an existing recording — even if they don't explicitly say "voice changer".
This skill enables users to change the voice in an audio recording to a different target voice while maintaining the original performance characteristics. It supports various use cases including dubbing, character voice creation, and speaker anonymization, offering examples for Python, JavaScript, and cURL.
Maintenance
- warning:Commit recencyThere are no commits in the last 12 months on the default branch, suggesting the project may be unmaintained.
Versioning
- warning:Release ManagementThere is no explicit versioning information (e.g., version field in manifest, changelog, or release tags) available for this skill.
Compliance
- info:GDPRThe skill processes audio data, which may contain personal data. While it sends this to the ElevenLabs API for processing, there is no indication of direct submission to a third party beyond the API provider itself, and the documentation does not specify explicit sanitization for personal data within the audio.
इंस्टॉलेशन
npx skills add elevenlabs/skillsVercel skills CLI (skills.sh) को npx के माध्यम से चलाता है — स्थानीय रूप से Node.js और कम से कम एक इंस्टॉल किया गया skills-संगत एजेंट (Claude Code, Cursor, Codex, …) ज़रूरी है। यह मानता है कि रिपॉज़िटरी agentskills.io फ़ॉर्मैट का पालन करती है।
मिलते-जुलते एक्सटेंशन
ElevenLabs Text-to-Speech
98Convert text to speech using ElevenLabs voice AI. Use when generating audio from text, creating voiceovers, building voice apps, or synthesizing speech in 70+ languages.
ElevenLabs Music
97Generate music using ElevenLabs Music API. Use when creating instrumental tracks, songs with lyrics, background music, jingles, or any AI-generated music composition. Supports prompt-based generation, composition plans for granular control, and detailed output with metadata.
ElevenLabs Voice Isolator
95Remove background noise and isolate vocals/speech from audio using ElevenLabs Voice Isolator (audio isolation) API. Use when cleaning up noisy recordings, removing music or background ambience from dialogue, isolating speech from field recordings, preparing audio for transcription, extracting vocals, or any "denoise / clean up / isolate voice" task.
Happy Audio Gen
100Universal AI voice / text-to-speech skill supporting OpenAI TTS (gpt-4o-mini-tts, tts-1), ElevenLabs multilingual TTS with voice cloning, Bailian Qwen TTS (qwen-tts / qwen3-tts-vd with voice-design custom voices, long-text chunking built in), MiniMax speech-02-hd, SiliconFlow CosyVoice / SenseVoice, and PlayHT 2.0. Use this skill whenever the user asks to read text aloud, synthesize speech, generate narration, create voice-over, dub a script, or turn any text into audio (mp3 / wav / ogg / flac). Typical phrases include "read this aloud", "generate voice for ...", "create a narration of ...", "tts this", "把这段念出来", "做个配音", "合成语音", or mentions of voices / TTS model names like Alloy, Ash, Cherry, Rachel, CosyVoice, PlayHT. Always use this skill even if the user does not specify a provider — pick one from EXTEND.md defaults or available env keys.
ElevenLabs Speech-to-Text
98Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
FFmpeg for Video Production
95Video and audio processing with FFmpeg. Use for format conversion, resizing, compression, audio extraction, and preparing assets for Remotion. Triggers include converting GIF to MP4, resizing video, extracting audio, compressing files, or any media transformation task.