Songsee
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Songsee aims to give users a powerful, easy-to-use tool for creating detailed visual representations of audio data.
Features
- Generate spectrograms from audio files
- Create multi-panel feature visualizations
- Control output size and style
- Extract specific time slices from audio
- Process audio via stdin
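For readers unfamiliar with what a spectrogram encodes, the following is a minimal Python sketch of the two ideas named in the feature list: cutting a time slice out of a signal and turning it into a grid of frequency magnitudes. It is an illustration only and makes no claim about how songsee is implemented. The function names (`time_slice`, `spectrogram`), the window and hop sizes, and the synthetic 1 kHz test tone are all assumptions chosen for the example.

```python
import cmath
import math


def time_slice(samples, sample_rate, start_s, duration_s):
    """Return the samples covering [start_s, start_s + duration_s)."""
    begin = int(start_s * sample_rate)
    end = begin + int(duration_s * sample_rate)
    return samples[begin:end]


def spectrogram(samples, window=64, hop=32):
    """Magnitude spectrogram via a naive DFT on Hann-windowed frames.

    Returns a list of frames; each frame holds window // 2 + 1 magnitudes,
    one per frequency bin from 0 Hz up to the Nyquist frequency.
    """
    hann = [0.5 - 0.5 * math.cos(2 * math.pi * n / window) for n in range(window)]
    frames = []
    for offset in range(0, len(samples) - window + 1, hop):
        frame = [samples[offset + n] * hann[n] for n in range(window)]
        bins = []
        for k in range(window // 2 + 1):
            acc = sum(
                frame[n] * cmath.exp(-2j * math.pi * k * n / window)
                for n in range(window)
            )
            bins.append(abs(acc))
        frames.append(bins)
    return frames


# Hypothetical input: one second of a 1 kHz sine sampled at 8 kHz.
SAMPLE_RATE = 8000
tone = [math.sin(2 * math.pi * 1000 * n / SAMPLE_RATE) for n in range(SAMPLE_RATE)]

# Keep half a second starting at 0.25 s, then analyse it frame by frame.
clip = time_slice(tone, SAMPLE_RATE, start_s=0.25, duration_s=0.5)
spec = spectrogram(clip)

# The strongest bin of the first frame, converted back to Hz.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * SAMPLE_RATE / 64
print(len(clip), len(spec), peak_hz)
```

Each inner list is one column of the image a spectrogram tool would draw: time runs across frames, frequency runs across bins, and the magnitude becomes the pixel intensity. A production tool would use an FFT rather than this naive transform, which is kept here only because it needs nothing beyond the standard library.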
Use Cases
- Visualizing audio content for analysis
- Creating visual assets from music tracks
- Debugging audio processing pipelines
- Generating visual representations of sound events
Non-Goals
- Audio editing or manipulation
- Real-time audio processing
- Speech recognition or analysis
Installation
npx skills add steipete/clawdis
Runs the Vercel skills CLI (skills.sh) via npx. Requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Similar Extensions
Google Tts
Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".
Podcast Generation
Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation from content, or integrating with Azure OpenAI Realtime API for real audio output. Covers full-stack implementation from React frontend to Python FastAPI backend with WebSocket streaming.
Speech Generation Skill
Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.
YouTube Downloader
Download and process YouTube content for research. Use when: downloading competitor videos for analysis; extracting audio for podcasts; getting transcripts for content repurposing; archiving webinars; curating research content.
Remote Interview
Capture professional-quality remote interviews using the double-ender technique and dedicated recording platforms for podcasts, media, and content production. Use when: setting up remote podcast interviews with guests; recording media interviews across distances; creating customer interview content; producing expert interviews for thought leadership; conducting research interviews with high audio quality.
Audio Logo Design
Create memorable sonic logos using design principles from Intel, Netflix, and McDonald's, crafting 2-5 second audio signatures that achieve instant brand recognition. Use when: creating a sonic logo for a brand; evaluating audio logo proposals from agencies; understanding what makes sonic logos effective; briefing sound designers on logo requirements; analyzing competitor sonic logos.