Skip to main content

Sherpa Onnx Tts

Skill Verified Active

Local text-to-speech via sherpa-onnx (offline, no cloud)

Purpose

To provide a local and offline text-to-speech capability, free from cloud dependencies, for users who need to generate audio from text on their own devices.

Features

  • Local text-to-speech generation
  • Offline operation, no cloud dependency
  • Supports multiple operating systems (macOS, Linux, Windows)
  • Utilizes the sherpa-onnx engine for TTS

Use Cases

  • Generating audio for local applications without internet access
  • Privacy-conscious TTS generation
  • Integrating TTS into workflows where cloud services are not feasible or desired
  • Users needing a customizable and controllable TTS solution

Non-Goals

  • Providing cloud-based TTS services
  • Supporting real-time speech recognition
  • Offering a wide variety of pre-installed voices out-of-the-box (users must download models)
  • Advanced audio editing or manipulation beyond basic TTS conversion

Installation

npx skills add steipete/clawdis

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

Verified
99 /100
Analyzed about 14 hours ago

Trust Signals

Last commitabout 14 hours ago
Stars371.6k
LicenseMIT
Status
View Source

Similar Extensions

Google Tts

100

Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".

Skill
sanjay3290

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

Skill
openai

Elevenlabs Tts

99

ElevenLabs text-to-speech with 22+ premium voices, multilingual support, and voice tuning via inference.sh CLI. Models: eleven_multilingual_v2 (highest quality), eleven_turbo_v2_5 (low latency), eleven_flash_v2_5 (ultra-fast). Capabilities: text-to-speech, voice selection, stability/style control, 32 languages. Use for: voiceovers, audiobooks, video narration, podcasts, accessibility, IVR. Triggers: elevenlabs, eleven labs, elevenlabs tts, premium tts, professional voice, ai voice, high quality tts, multilingual tts, eleven labs voice, voice generation, natural speech, realistic voice, voice over, speech synthesis

Skill
inferen-sh

Podcast Generation

100

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation from content, or integrating with Azure OpenAI Realtime API for real audio output. Covers full-stack implementation from React frontend to Python FastAPI backend with WebSocket streaming.

Skill
microsoft

YouTube Downloader

100

Download and process YouTube content for research. Use when: downloading competitor videos for analysis; extracting audio for podcasts; getting transcripts for content repurposing; archiving webinars; research content curation

Skill
guia-matthieu

Remote Interview

100

Capture professional-quality remote interviews using double-ender technique and dedicated recording platforms for podcasts, media, and content production. Use when: Setting up remote podcast interviews with guests; Recording media interviews across distances; Creating customer interview content; Producing expert interviews for thought leadership; Conducting research interviews with high audio quality

Skill
guia-matthieu

© 2025 SkillRepo · Find the right skill, skip the noise.