Audiocraft Audio Generation
Skill | Verified | Active
PyTorch library for audio generation, including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.
This skill enables users to generate high-quality audio, from music to sound effects, using AI models directly within their Python projects.
Features
- Text-to-music generation with MusicGen
- Text-to-sound effects generation with AudioGen
- Melody-conditioned music generation
- Style transfer and stereo audio generation
- High-fidelity neural audio codec (EnCodec)
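As a rough illustration of the text-to-music feature, the sketch below follows the usage pattern from the Audiocraft README: load a pretrained MusicGen checkpoint, set the clip duration, generate from text prompts, and write each clip to disk. The checkpoint name `facebook/musicgen-small`, the 8-second duration, and the helper `output_stems` are assumptions chosen for illustration; the skill itself may wrap these calls differently.

```python
from typing import List


def output_stems(descriptions: List[str], prefix: str = "clip") -> List[str]:
    """Hypothetical helper (not part of Audiocraft): one file stem per prompt."""
    return [f"{prefix}_{i}" for i, _ in enumerate(descriptions)]


def generate_music(
    descriptions: List[str],
    duration: float = 8.0,                        # assumed clip length in seconds
    checkpoint: str = "facebook/musicgen-small",  # assumed checkpoint choice
) -> List[str]:
    """Generate one clip per text prompt and return the written file stems."""
    # Imported lazily so output_stems() works without Audiocraft installed.
    from audiocraft.models import MusicGen
    from audiocraft.data.audio import audio_write

    model = MusicGen.get_pretrained(checkpoint)
    model.set_generation_params(duration=duration)

    # generate() returns a tensor shaped [batch, channels, samples].
    wav = model.generate(descriptions)

    stems = output_stems(descriptions)
    for stem, one_wav in zip(stems, wav):
        # audio_write appends the file extension itself (.wav by default)
        # and applies loudness normalization with this strategy.
        audio_write(stem, one_wav.cpu(), model.sample_rate, strategy="loudness")
    return stems


# Example call (downloads model weights on first use):
# generate_music(["lo-fi hip hop beat with mellow piano"])
```

The model imports sit inside `generate_music` so that the file-naming helper can be exercised on a machine without PyTorch or model weights. A GPU is advisable for anything beyond short clips.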
Use Cases
- Generating background music for videos or games from text descriptions.
- Creating custom sound effects for applications or multimedia projects.
- Prototyping music generation tools and AI-powered audio applications.
- Experimenting with melody-conditioned music generation for unique compositions.
Non-Goals
- Real-time audio processing for live performance.
- Speech-to-text or text-to-speech conversion.
- Audio editing or manipulation beyond generation.
Trust
- Issues (attention): There are 17 open and 4 closed issues in the last 90 days, indicating some level of engagement but potentially slow response to new issues.
Installation
npx skills add davila7/claude-code-templates
Runs the Vercel skills CLI (skills.sh) via npx; needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Similar Extensions
Audiocraft Audio Generation
98 | PyTorch library for audio generation, including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.
AlterLab FC AI Sound Effects Designer
99 | This skill should be used when the user asks about "AI sound effects", "text to SFX", "generate sound effects", "ElevenLabs sound effects", "foley generation", "ambient sounds", "soundscape design", "AI foley", "sound design for film", "generate audio for video", "podcast sound effects", "game audio SFX", "act as a sound effects designer", "sound effects mode", "SFX prompting", or needs expertise in AI-generated sound effects, descriptive audio prompting, soundscape layering, and foley creation on ElevenLabs. Part of the AlterLab FC Skills collection (GenAI pack).
ElevenLabs Sound Effects
98 | Generate AI sound effects from text descriptions with ElevenLabs via inference.sh CLI. Capabilities: text-to-sound-effect, custom duration, royalty-free audio. Use for: video production, game audio, podcasts, films, presentations, social media. Triggers: sound effects, sfx, sound generation, ai sound effects, generate sound, foley, audio effects, sound design, text to sound, elevenlabs sound, eleven labs sfx, ambient sound, cinematic sound, game sound effects
ElevenLabs Audio Generation
95 | Generate AI voiceovers, sound effects, and music using ElevenLabs APIs. Use when creating audio content for videos, podcasts, or games. Triggers include generating voiceovers, narration, dialogue, sound effects from descriptions, background music, soundtrack generation, voice cloning, or any audio synthesis task.
PyTorch Lightning
100 | Deep learning framework (PyTorch Lightning). Organize PyTorch code into LightningModules, configure Trainers for multi-GPU/TPU, implement data pipelines, callbacks, logging (W&B, TensorBoard), distributed training (DDP, FSDP, DeepSpeed), for scalable neural network training.
Implementing Llms Litgpt
100 | Implements and trains LLMs using Lightning AI's LitGPT with 20+ pretrained architectures (Llama, Gemma, Phi, Qwen, Mistral). Use when you need clean model implementations, an educational understanding of architectures, or production fine-tuning with LoRA/QLoRA. Single-file implementations, no abstraction layers.