Audiocraft Audio Generation

Skill Verified Active

PyTorch library for audio generation, including text-to-music (MusicGen) and text-to-sound (AudioGen). Use it when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.

Purpose

To enable users to generate high-quality audio, from music to sound effects, using advanced AI models directly within their Python projects.

Features

Text-to-music generation with MusicGen
Text-to-sound effects generation with AudioGen
Melody-conditioned music generation
Style transfer and stereo audio generation
High-fidelity neural audio codec (EnCodec)
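
As a rough illustration of the text-to-music feature, the sketch below wraps the MusicGen calls from the audiocraft package in a small function. It assumes audiocraft and its PyTorch dependencies are installed and that the "facebook/musicgen-small" checkpoint can be downloaded. The names validate_prompts, generate_music, and out_prefix are hypothetical helpers introduced for this example and are not part of the skill or the library.

```python
from typing import List


def validate_prompts(descriptions: List[str]) -> List[str]:
    """Strip whitespace and reject empty prompts before they reach the model."""
    cleaned = [d.strip() for d in descriptions]
    if not cleaned or any(not d for d in cleaned):
        raise ValueError("every description must be a non-empty string")
    return cleaned


def generate_music(
    descriptions: List[str],
    duration_s: float = 8.0,
    model_name: str = "facebook/musicgen-small",  # assumed checkpoint choice
    out_prefix: str = "musicgen",                 # hypothetical output name
) -> List[str]:
    """Generate one clip per description and write each clip to a WAV file."""
    # Imported lazily so validate_prompts works without audiocraft installed.
    from audiocraft.models import MusicGen
    from audiocraft.data.audio import audio_write

    prompts = validate_prompts(descriptions)
    model = MusicGen.get_pretrained(model_name)
    model.set_generation_params(duration=duration_s)  # clip length in seconds

    wav = model.generate(prompts)  # tensor shaped [batch, channels, samples]

    paths = []
    for idx, one_wav in enumerate(wav):
        stem = f"{out_prefix}_{idx}"
        # audio_write appends the file extension (WAV by default) and
        # normalizes the loudness of the clip before saving.
        audio_write(stem, one_wav.cpu(), model.sample_rate, strategy="loudness")
        paths.append(f"{stem}.wav")
    return paths
```

Calling generate_music(["calm lo-fi beat with soft piano"]) would download the checkpoint on first use and write musicgen_0.wav to the working directory. Generation is slow on CPU, so a GPU is advisable for anything beyond short clips.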

Use Cases

Generating background music for videos or games from text descriptions.
Creating custom sound effects for applications or multimedia projects.
Prototyping music generation tools and AI-powered audio applications.
Experimenting with melody-conditioned music generation for unique compositions.
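
The use cases above map onto different pretrained models. The sketch below shows one possible way to route between them. The task labels ("music", "melody", "sfx") and the helpers pick_checkpoint and generate_clip are hypothetical names for this example. The checkpoint names are public audiocraft checkpoints picked for illustration, and other sizes may suit a given project better.

```python
from typing import Optional

# Assumed mapping from a use case to a pretrained checkpoint name.
CHECKPOINTS = {
    "music": "facebook/musicgen-small",    # text-to-music
    "melody": "facebook/musicgen-melody",  # melody-conditioned music
    "sfx": "facebook/audiogen-medium",     # text-to-sound effects
}


def pick_checkpoint(task: str) -> str:
    """Return the checkpoint name for a task, or raise on an unknown task."""
    try:
        return CHECKPOINTS[task]
    except KeyError:
        raise ValueError(f"unknown task {task!r}; expected one of {sorted(CHECKPOINTS)}") from None


def generate_clip(
    task: str,
    description: str,
    duration_s: float = 5.0,
    melody_path: Optional[str] = None,
):
    """Generate a single clip; returns (waveform tensor, sample rate)."""
    name = pick_checkpoint(task)
    if task == "melody" and melody_path is None:
        raise ValueError("the 'melody' task needs a reference audio file")

    # Imported lazily so pick_checkpoint works without audiocraft installed.
    from audiocraft.models import AudioGen, MusicGen

    model_cls = AudioGen if task == "sfx" else MusicGen
    model = model_cls.get_pretrained(name)
    model.set_generation_params(duration=duration_s)

    if task == "melody":
        import torchaudio

        melody, sr = torchaudio.load(melody_path)  # [channels, samples]
        # melody[None] adds a batch dimension to match the single description.
        wav = model.generate_with_chroma([description], melody[None], sr)
    else:
        wav = model.generate([description])
    return wav, model.sample_rate
```

For instance, generate_clip("sfx", "footsteps on gravel") would return a batch containing one sound-effect clip together with the sample rate needed to save it.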

Non-Goals

Real-time audio processing for live performance.
Speech-to-text or text-to-speech conversion.
Audio editing or manipulation beyond generation.

Trust

Issues (attention): There are 17 open and 4 closed issues in the last 90 days, indicating some level of engagement but potentially slow response to new issues.

Installation

npx skills add davila7/claude-code-templates

Runs the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

Verified

97/100

Analyzed about 18 hours ago

Trust Signals

Last commit: about 20 hours ago

GitHub owner: davila7

Stars: 27.2k

Downloads: 23k

License: MIT

Website: aitmpl.com
