الانتقال إلى المحتوى الرئيسي
هذا المحتوى غير متوفر بعد بلغتك ويتم عرضه باللغة الإنجليزية.

Edge TTS

Skill تم التحقق
85

Text-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.

ملخص الذكاء الاصطناعي

This skill converts provided text into audio output using Microsoft Edge's neural TTS service via the `uvx edge-tts` command. It supports customization of voices, speed, pitch, and format, and can also generate subtitles. The skill is designed for scenarios where spoken output is preferred over reading, such as for multitasking or accessibility.

Practical Utility

  • info:Edge casesThe skill handles common TTS parameters like rate, volume, pitch, and voice selection. However, it does not explicitly document failure modes such as invalid voice names or unavailable languages, only naming the available ones.

Code Execution

  • warning:Tool FallbackThe skill explicitly requires the `uvx edge-tts` command, which is indicated by the `![Require](https://img.shields.io/badge/uvx-+?label=bin)` badge. There is no declared fallback or mention of its optionality.
  • info:ValidationThe skill correctly interpolates parameters into the command line for `uvx edge-tts`. However, there is no explicit validation schema mentioned for inputs like `--text` or `--voice` beyond what the underlying tool might provide.

Maintenance

  • warning:Dependency ManagementThe skill relies on the external `uvx edge-tts` command-line tool, but there is no explicit mention of how this dependency is managed, updated, or checked for vulnerabilities.

Versioning

  • warning:Release ManagementThere is no manifest version, GitHub release tag, or CHANGELOG. The installation instructions reference `npx skills add aahl/skills`, implying use of the default branch without explicit versioning.

التثبيت

npx skills add aahl/skills

يشغّل Vercel skills CLI ‏(skills.sh) عبر npx — يتطلب وجود Node.js محليًا ووكيلًا واحدًا على الأقل متوافقًا مع skills (مثل Claude Code أو Cursor أو Codex). يفترض أن المستودع يتبع تنسيق agentskills.io.

تم التحديث في 6 days ago
عرض الكود المصدري

امتدادات مماثلة

Text-to-Speech (TTS)

95

Implement text-to-speech (TTS) capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to convert text into natural-sounding speech, create audio content, build voice-enabled applications, or generate spoken audio files. Supports multiple voices, adjustable speed, and various audio formats.

Skill
answerzhao

Happy Audio Gen

100

Universal AI voice / text-to-speech skill supporting OpenAI TTS (gpt-4o-mini-tts, tts-1), ElevenLabs multilingual TTS with voice cloning, Bailian Qwen TTS (qwen-tts / qwen3-tts-vd with voice-design custom voices, long-text chunking built in), MiniMax speech-02-hd, SiliconFlow CosyVoice / SenseVoice, and PlayHT 2.0. Use this skill whenever the user asks to read text aloud, synthesize speech, generate narration, create voice-over, dub a script, or turn any text into audio (mp3 / wav / ogg / flac). Typical phrases include "read this aloud", "generate voice for ...", "create a narration of ...", "tts this", "把这段念出来", "做个配音", "合成语音", or mentions of voices / TTS model names like Alloy, Ash, Cherry, Rachel, CosyVoice, PlayHT. Always use this skill even if the user does not specify a provider — pick one from EXTEND.md defaults or available env keys.

Skill
iamzhihuix

ElevenLabs Text-to-Speech

98

Convert text to speech using ElevenLabs voice AI. Use when generating audio from text, creating voiceovers, building voice apps, or synthesizing speech in 70+ languages.

Skill
elevenlabs

ElevenLabs Audio Generation

93

Generate AI voiceovers, sound effects, and music using ElevenLabs APIs. Use when creating audio content for videos, podcasts, or games. Triggers include generating voiceovers, narration, dialogue, sound effects from descriptions, background music, soundtrack generation, voice cloning, or any audio synthesis task.

Skill
digitalsamba

Zai-TTS

85

Text-to-speech conversion using GLM-TTS service via the `uvx zai-tts` command for generating audio from text. Use when (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, podcast, driving, cooking). (3) Using pre-cloned voices for speech.

Skill
aahl

Characteristic Voice

98

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 'talk like', 'speak like', 'companion voice', 'comfort me', 'cheer me up', 'sound more human', 'good night voice', 'good morning voice', or requests to add fillers, emotion, or personality to generated speech. Also use when the user wants to mimic a specific character's voice, apply speaking style presets (goodnight, morning, comfort, celebration, chatting), tune emotional parameters like warmth or tenderness, or make TTS output feel like a real person talking. If the user asks for a 'voice message', 'companion audio', 'character voice', or wants speech that sighs, laughs, hesitates, or sounds genuinely warm, use this skill. Do NOT use for plain text-to-speech without personality, music generation, sound effects, or general coding tasks unrelated to expressive speech.

Skill
noizai