Acestep
技能 已验证 活跃使用 ACE-Step 1.5 进行 AI 音乐生成 — 为视频制作提供背景音乐、人声轨道、翻唱、音轨提取、音频重绘和续写。在生成音乐、配乐、宣传曲或处理音轨时使用。触发词包括背景音乐、配乐、宣传曲、音乐生成、音轨提取、翻唱、风格迁移、重绘、续写或音乐创作任务。
旨在使用户能够为各种应用(包括背景音轨、人声创作、翻唱以及与视频制作工作流程的无缝集成)生成高质量的 AI 音乐。
功能
- 使用 ACE-Step 1.5 进行 AI 音乐生成
- 创建背景音乐、人声轨道和翻唱
- 音轨提取和音频重绘
- 现有音频的续写
- 支持多种云服务提供商(acemusic、Modal、RunPod)
- 用于视频制作的场景预设
- 详细的提示工程和歌词格式化指南
使用场景
- 为视频和演示文稿生成背景音乐
- 创建人声轨道和宣传曲
- 制作不同风格的音乐翻唱
- 编辑和增强现有音频轨道
- 根据文本提示创作音乐作品
非目标
- 语音克隆或语音合成
- 生成音效
- 直接从视频文件中提取音轨
Code Execution
- info:Validation脚本的参数使用 `argparse` 进行解析,提供了一定的验证,但没有明确演示对所有输入的详细基于模式的验证。
安装
npx skills add digitalsamba/claude-code-video-toolkit通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。
质量评分
已验证类似扩展
ElevenLabs 音频生成
95使用 ElevenLabs API 生成 AI 配音、音效和音乐。用于创建视频、播客或游戏的音频内容。触发器包括生成配音、旁白、对话、根据描述生成音效、背景音乐、配乐生成、语音克隆或任何音频合成任务。
Cn Content Matrix
99Chinese multi-platform content matrix generator — given a topic, auto-generate content adapted for Xiaohongshu, WeChat Official Account, Douyin, and Bilibili with true style transfer (not just reformatting). Supports single-platform generation, full-matrix generation, and compliance review. (中文) 中文多平台内容矩阵:小红书、微信公众号、抖音、B站,真正的风格迁移而非简单格式转换。
YouTube for Developer Relations
99When the user wants to create developer YouTube content, technical screencasts, or video tutorials. Trigger phrases include "YouTube," "developer video," "screencast," "video tutorial," "live coding," "YouTube for developers," "tech YouTube," or "YouTube thumbnails."
Voiceover Direction
98Master the art of directing voice talent to deliver performances that match your brand vision, using Anne Ganguzza's storytelling approach and industry best practices. Use when: Hiring and briefing voiceover artists for a project; Giving direction during recording sessions; Writing scripts that are easy for talent to deliver; Matching voice characteristics to brand personality; Reviewing auditions and selecting the right talent
Voice Design
98Select and create the perfect AI voice for your content using ElevenLabs, Qwen3-TTS, and other platforms—matching voice characteristics to brand personality and audience. Use when: Choosing an AI voice for video narration; Creating a consistent brand voice across content; Cloning a voice for scalable production; Comparing voice synthesis platforms; Designing voice characteristics by description
Ai Music Generation
98Generate AI music and songs with ElevenLabs, Diffrythm, Tencent Song Generation via inference.sh CLI. Models: ElevenLabs Music (up to 10 min, commercial license), Diffrythm (fast song generation), Tencent Song Generation (full songs with vocals). Capabilities: text-to-music, song generation, instrumental, lyrics to song, soundtrack creation. Use for: background music, social media content, game soundtracks, podcasts, royalty-free music. Triggers: music generation, ai music, generate song, ai composer, text to music, song generator, create music with ai, suno alternative, udio alternative, ai song, ai soundtrack, generate soundtrack, ai jingle, music ai, beat generator, elevenlabs music, eleven labs music