이 콘텐츠는 아직 사용자의 언어로 제공되지 않아 영어로 표시됩니다.

Edge TTS

Skill 확인됨

Text-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.

AI 요약

This skill converts provided text into audio output using Microsoft Edge's neural TTS service via the `uvx edge-tts` command. It supports customization of voices, speed, pitch, and format, and can also generate subtitles. The skill is designed for scenarios where spoken output is preferred over reading, such as for multitasking or accessibility.

Practical Utility

info:Edge casesThe skill handles common TTS parameters like rate, volume, pitch, and voice selection. However, it does not explicitly document failure modes such as invalid voice names or unavailable languages, only naming the available ones.

Code Execution

warning:Tool FallbackThe skill explicitly requires the `uvx edge-tts` command, which is indicated by the `![Require](https://img.shields.io/badge/uvx-+?label=bin)` badge. There is no declared fallback or mention of its optionality.
info:ValidationThe skill correctly interpolates parameters into the command line for `uvx edge-tts`. However, there is no explicit validation schema mentioned for inputs like `--text` or `--voice` beyond what the underlying tool might provide.

Maintenance

warning:Dependency ManagementThe skill relies on the external `uvx edge-tts` command-line tool, but there is no explicit mention of how this dependency is managed, updated, or checked for vulnerabilities.

Versioning

warning:Release ManagementThere is no manifest version, GitHub release tag, or CHANGELOG. The installation instructions reference `npx skills add aahl/skills`, implying use of the default branch without explicit versioning.

설치

npx skills add aahl/skills

Vercel skills CLI(skills.sh)를 npx로 실행합니다. 로컬에 Node.js와 skills 호환 에이전트(Claude Code, Cursor, Codex 등) 중 하나 이상이 설치되어 있어야 합니다. 저장소가 agentskills.io 형식을 따른다고 가정합니다.

4 days ago

aahl

121 stars

MIT-0

skills.sh

2 days ago에 업데이트됨

소스 코드 보기

유사한 확장

Text-to-Speech (TTS)

Implement text-to-speech (TTS) capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to convert text into natural-sounding speech, create audio content, build voice-enabled applications, or generate spoken audio files. Supports multiple voices, adjustable speed, and various audio formats.

Skill

answerzhao

Happy Audio Gen

100

Universal AI voice / text-to-speech skill supporting OpenAI TTS (gpt-4o-mini-tts, tts-1), ElevenLabs multilingual TTS with voice cloning, Bailian Qwen TTS (qwen-tts / qwen3-tts-vd with voice-design custom voices, long-text chunking built in), MiniMax speech-02-hd, SiliconFlow CosyVoice / SenseVoice, and PlayHT 2.0. Use this skill whenever the user asks to read text aloud, synthesize speech, generate narration, create voice-over, dub a script, or turn any text into audio (mp3 / wav / ogg / flac). Typical phrases include "read this aloud", "generate voice for ...", "create a narration of ...", "tts this", "把这段念出来", "做个配音", "合成语音", or mentions of voices / TTS model names like Alloy, Ash, Cherry, Rachel, CosyVoice, PlayHT. Always use this skill even if the user does not specify a provider — pick one from EXTEND.md defaults or available env keys.

Skill

iamzhihuix

ElevenLabs Text-to-Speech

Convert text to speech using ElevenLabs voice AI. Use when generating audio from text, creating voiceovers, building voice apps, or synthesizing speech in 70+ languages.

Skill

elevenlabs

ElevenLabs Audio Generation

Generate AI voiceovers, sound effects, and music using ElevenLabs APIs. Use when creating audio content for videos, podcasts, or games. Triggers include generating voiceovers, narration, dialogue, sound effects from descriptions, background music, soundtrack generation, voice cloning, or any audio synthesis task.

Skill

digitalsamba

Zai-TTS

Text-to-speech conversion using GLM-TTS service via the `uvx zai-tts` command for generating audio from text. Use when (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, podcast, driving, cooking). (3) Using pre-cloned voices for speech.

Skill

aahl

Characteristic Voice

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 'talk like', 'speak like', 'companion voice', 'comfort me', 'cheer me up', 'sound more human', 'good night voice', 'good morning voice', or requests to add fillers, emotion, or personality to generated speech. Also use when the user wants to mimic a specific character's voice, apply speaking style presets (goodnight, morning, comfort, celebration, chatting), tune emotional parameters like warmth or tenderness, or make TTS output feel like a real person talking. If the user asks for a 'voice message', 'companion audio', 'character voice', or wants speech that sighs, laughs, hesitates, or sounds genuinely warm, use this skill. Do NOT use for plain text-to-speech without personality, music generation, sound effects, or general coding tasks unrelated to expressive speech.

Skill

noizai