RunPod Cloud GPU
Cloud GPU processing via RunPod serverless. Use when setting up RunPod endpoints, deploying Docker images, managing GPU resources, troubleshooting endpoint issues, or understanding costs. Covers all 5 toolkit images (qwen-edit, realesrgan, propainter, sadtalker, qwen3-tts).
Enables users to leverage cloud GPU processing through RunPod serverless, efficiently deploying and managing AI models and Docker images.
Features
- Setup and deployment of RunPod endpoints
- Management of GPU resources and Docker images
- Troubleshooting endpoint issues
- Detailed RunPod API reference (GraphQL and REST); a GraphQL sketch follows this list
- Cost understanding and optimization guidance
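For illustration, here is a minimal Python sketch of querying the RunPod GraphQL API to list your serverless endpoints. The query fields (`myself.endpoints.{id, name}`) are assumptions about the public schema; verify them against the skill's bundled API reference.

```python
# Minimal sketch: list serverless endpoints via the RunPod GraphQL API.
# The query fields (myself.endpoints.{id, name}) are assumptions about
# the public schema; verify against the skill's API reference.
import os

import requests

query = """
query {
  myself {
    endpoints { id name }
  }
}
"""

resp = requests.post(
    "https://api.runpod.io/graphql",
    params={"api_key": os.environ["RUNPOD_API_KEY"]},
    json={"query": query},
    timeout=30,
)
resp.raise_for_status()
for ep in resp.json()["data"]["myself"]["endpoints"]:
    print(ep["id"], ep["name"])
```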
Use Cases
- When setting up RunPod endpoints for AI model deployment
- Deploying custom Docker images to cloud GPUs
- Managing GPU resources and scaling configurations
- Troubleshooting issues with RunPod endpoints
- Understanding and optimizing cloud GPU costs
Non-Goals
- Directly running AI models locally
- Managing local machine hardware
- Providing a general-purpose cloud management tool
- Replacing the RunPod web console for all tasks
Workflow
- Add RunPod API key to .env
- Run `--setup` for specific tools (image_edit, upscale, etc.)
- Configure endpoint workers (min/max, idleTimeout)
- Manage endpoints via RunPod dashboard or API reference
- Troubleshoot common issues (cold start, OOM, worker availability); see the polling sketch after this list
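As a sketch of the later workflow steps, the snippet below uses the official `runpod` Python SDK (`pip install runpod`) to submit a job and wait out a cold start. The endpoint ID and input payload are placeholders; each toolkit image defines its own payload shape.

```python
# Hedged sketch: submit a job with the official runpod SDK and poll it
# through a cold start. Endpoint ID and payload are placeholders.
import os

import runpod

runpod.api_key = os.environ["RUNPOD_API_KEY"]
endpoint = runpod.Endpoint("YOUR_ENDPOINT_ID")  # hypothetical endpoint ID

# Async submit; the first request after idleTimeout expires may sit in
# IN_QUEUE while a worker cold-starts.
job = endpoint.run({"input": {"prompt": "example"}})  # illustrative payload
print(job.status())             # e.g. IN_QUEUE -> IN_PROGRESS -> COMPLETED
print(job.output(timeout=300))  # block up to 5 minutes for cold starts
```

If a job stays in IN_QUEUE, check worker availability and min/max worker settings before assuming the image itself is broken.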
Prerequisites
- RunPod account and API key
- Cloudflare R2 credentials (optional, for file transfer fallback); see the loading sketch after this list
- Python 3.9+ recommended
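A minimal sketch of loading these credentials from `.env` with python-dotenv (`pip install python-dotenv`). `RUNPOD_API_KEY` is the key named in the workflow above; the R2 variable names here are assumptions, so check the toolkit's docs for the exact keys it reads.

```python
# Minimal sketch: load credentials from .env with python-dotenv.
# The R2 variable names are assumptions; check the toolkit docs.
import os

from dotenv import load_dotenv

load_dotenv()  # reads .env from the current directory

api_key = os.environ["RUNPOD_API_KEY"]         # required
r2_key = os.getenv("R2_ACCESS_KEY_ID")         # optional, hypothetical name
r2_secret = os.getenv("R2_SECRET_ACCESS_KEY")  # optional, hypothetical name
if not (r2_key and r2_secret):
    print("R2 not configured; file-transfer fallback disabled.")
```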
Installation
`npx skills add digitalsamba/claude-code-video-toolkit`
Runs the Vercel skills CLI (skills.sh) via npx; requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Similar Extensions
Modal Serverless Gpu (98)
Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.
Alterlab Modal (98)
Part of the AlterLab Academic Skills suite. Run Python code in the cloud with serverless containers, GPUs, and autoscaling. Use when deploying ML models, running batch processing jobs, scheduling compute-intensive tasks, or serving APIs that require GPU acceleration or dynamic scaling.
Modal (95)
Cloud computing platform for running Python on GPUs and serverless infrastructure. Use when deploying AI/ML models, running GPU-accelerated workloads, serving web endpoints, scheduling batch jobs, or scaling Python code to the cloud. Use this skill whenever the user mentions Modal, serverless GPU compute, deploying ML models to the cloud, serving inference endpoints, running batch processing in the cloud, or needs to scale Python workloads beyond their local machine. Also use when the user wants to run code on H100s, A100s, or other cloud GPUs, or needs to create a web API for a model.