此内容尚未提供您的语言版本,正在以英文显示。

Songsee

技能已验证活跃

Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.

目的

To provide users with a powerful and easy-to-use tool for creating detailed visual representations of audio data.

功能

Generate spectrograms from audio files
Create multi-panel feature visualizations
Control output size and style
Extract specific time slices from audio
Process audio via stdin

使用场景

Visualizing audio content for analysis
Creating visual assets from music tracks
Debugging audio processing pipelines
Generating visual representations of sound events

非目标

Audio editing or manipulation
Real-time audio processing
Speech recognition or analysis

安装

npx skills add steipete/clawdis

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证

99 /100

1 day ago 分析

信任信号

最近提交1 day ago

GitHub 所有者 steipete

星标371.6k

下载量 4.6M

许可证MIT

网站openclaw.ai

状态

查看源代码

类似扩展

Google Tts

100

Convert documents and text to audio using Google Cloud Text-to-Speech. Use this skill when the user wants to: narrate a document, read aloud text, generate audio from a file, convert text to speech, create a recording of documentation or analysis, create a podcast from a document, or use Google TTS/text-to-speech. Trigger phrases: "read this aloud", "narrate this", "create a recording", "text to speech", "TTS", "convert to audio", "audio from document", "listen to this", "generate audio", "google tts", "create a podcast".

技能

sanjay3290

Podcast Generation

100

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation from content, or integrating with Azure OpenAI Realtime API for real audio output. Covers full-stack implementation from React frontend to Python FastAPI backend with WebSocket streaming.

技能

microsoft

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

技能

openai

YouTube Downloader

100

Download and process YouTube content for research. Use when: downloading competitor videos for analysis; extracting audio for podcasts; getting transcripts for content repurposing; archiving webinars; research content curation

技能

guia-matthieu

Remote Interview

100

Capture professional-quality remote interviews using double-ender technique and dedicated recording platforms for podcasts, media, and content production. Use when: Setting up remote podcast interviews with guests; Recording media interviews across distances; Creating customer interview content; Producing expert interviews for thought leadership; Conducting research interviews with high audio quality

技能

guia-matthieu

Audio Logo Design

100

Create memorable sonic logos using design principles from Intel, Netflix, and McDonald's—crafting 2-5 second audio signatures that achieve instant brand recognition. Use when: Creating a sonic logo for a brand; Evaluating audio logo proposals from agencies; Understanding what makes sonic logos effective; Briefing sound designers on logo requirements; Analyzing competitor sonic logos

技能

guia-matthieu