Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

C Voice

Skill Aktiv

Convert speech to text using `sag` (ElevenLabs STT) and synthesize speech using `say` (macOS built-in TTS). Enables voice input transcription and audio output.

Zweck

To integrate voice input and output capabilities into Claude Code, allowing for spoken audio transcription and synthesized speech responses.

Funktionen

Speech-to-text transcription via sag (ElevenLabs STT)
Text-to-speech synthesis via say (macOS built-in TTS)
Recording audio from microphone
Processing various audio file formats (MP3, WAV, M4A, FLAC)

Anwendungsfälle

Transcribing spoken commands or dictation into text
Reading Claude's responses or summaries aloud
Capturing audio for analysis or documentation

Nicht-Ziele

Real-time voice chat
Cross-platform speech synthesis (beyond macOS for `say`)

Documentation

warning:Configuration & parameter referenceWhile tools are documented, the explicit requirement for an ElevenLabs API key and its configuration is mentioned in 'Notes' and not as a formal parameter or prerequisite.

Maintenance

warning:Commit recencyThe last commit was over 2 months ago (March 6, 2026), indicating potential lack of recent maintenance.

Security

warning:Secret ManagementThe skill requires an ElevenLabs API key, which is mentioned as an environment variable in the notes but not explicitly detailed in the setup or prerequisites regarding secure handling.

Trust

warning:Issues AttentionThere is 1 open issue from the last 90 days and 0 closed issues, indicating slow or no maintainer response to recent issues.

Versioning

warning:Release ManagementThe extension uses the `main` branch for installation and does not declare a specific version in its frontmatter or manifests, making version pinning difficult.

Compliance

info:GDPRThe skill processes audio and text, which could potentially include personal data if spoken. However, it does not submit this data to a third party without explicit use of the ElevenLabs API.

Portability

warning:Runtime stabilityThe skill explicitly states `say` is macOS built-in, implying it may not function on other operating systems. The `sag` tool's cross-platform compatibility is not detailed.

Install

warning:Installation instructionThe SKILL.md details how to use the tools but assumes the user will install `sag` and have macOS for `say`. It mentions an ElevenLabs API key requirement but lacks explicit installation and setup instructions for `sag` or API key configuration verification.

Execution

warning:Pinned dependenciesThe skill relies on external CLIs (`sag`) and macOS built-ins (`say`). While `sag` might be installed via a package manager, there's no explicit pinning or lockfile mentioned for it, and no side-effect headers are applicable to these non-script tools.

Installation

npx skills add daxaur/openpaw

Führt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.

Qualitätspunktzahl

75 /100

Analysiert 1 day ago

Vertrauenssignale

Letzter Commit2 months ago

GitHub-Inhaber daxaur

Sterne137

Downloads 103

LizenzMIT

Websitenpmjs.com

Status

Quellcode ansehen

Ähnliche Erweiterungen

Google Tts

100

Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".

Skill

sanjay3290

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

Skill

openai

Tts

Verwenden Sie diese Fähigkeit, wann immer der Benutzer Text in Sprache umwandeln, Audio aus Text generieren oder Voiceovers erstellen möchte. Auslöser sind: jede Erwähnung von 'TTS', 'Text to Speech', 'sprechen', 'sagen', 'Stimme', 'laut vorlesen', 'Audio-Narration', 'Voiceover', 'Synchronisation' oder Anfragen, geschriebene Inhalte in gesprochene Audios umzuwandeln. Verwenden Sie es auch, wenn Sie EPUB/PDF/SRT/Artikel in Audio konvertieren, Stimmen aus Referenz-Audios klonen, Emotionen oder Geschwindigkeit in der Sprache steuern, Sprache an Zeitpläne von Untertiteln anpassen oder sprach-zugeordnete Audio pro Segment produzieren.

Skill

NoizAI

Characteristic Voice

Verwenden Sie diese Fähigkeit immer dann, wenn der Benutzer möchte, dass die Sprache menschlicher, begleitender oder emotional ausdrucksstärker klingt. Auslöser sind: jegliche Erwähnung von 'sprechen wie', 'reden wie', 'begleitende Stimme', 'tröste mich', 'muntere mich auf', 'klingt menschlicher', 'Guten-Nacht-Stimme', 'Guten-Morgen-Stimme' oder Aufforderungen, Füllgeräusche, Emotionen oder Persönlichkeit hinzuzufügen. Verwenden Sie dies auch, wenn der Benutzer die Stimme eines bestimmten Charakters nachahmen, Sprechstil-Voreinstellungen anwenden (Gutenacht, Morgen, Komfort, Feier, Chatten), emotionale Parameter wie Wärme oder Zärtlichkeit abstimmen oder die TTS-Ausgabe wie eine echte Person klingen lassen möchte. Wenn der Benutzer nach einer 'Sprachnachricht', 'Begleit-Audio' oder 'Charakterstimme' fragt oder eine Sprache wünscht, die seufzt, lacht, zögert oder aufrichtig warm klingt, verwenden Sie diese Fähigkeit. Verwenden Sie dies NICHT für einfache Text-zu-Sprache ohne Persönlichkeit, Musikgenerierung, Soundeffekte oder allgemeine Codierungsaufgaben, die nichts mit ausdrucksstarker Sprache zu tun haben.

Skill

NoizAI

Sherpa Onnx Tts

Local text-to-speech via sherpa-onnx (offline, no cloud)

Skill

steipete

Elevenlabs Tts

ElevenLabs text-to-speech with 22+ premium voices, multilingual support, and voice tuning via inference.sh CLI. Models: eleven_multilingual_v2 (highest quality), eleven_turbo_v2_5 (low latency), eleven_flash_v2_5 (ultra-fast). Capabilities: text-to-speech, voice selection, stability/style control, 32 languages. Use for: voiceovers, audiobooks, video narration, podcasts, accessibility, IVR. Triggers: elevenlabs, eleven labs, elevenlabs tts, premium tts, professional voice, ai voice, high quality tts, multilingual tts, eleven labs voice, voice generation, natural speech, realistic voice, voice over, speech synthesis

Skill

inferen-sh