Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Modal Serverless Gpu

Skill Verifiziert Aktiv

Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.

Zweck

To enable users to run ML workloads on-demand with GPU access without managing infrastructure, by leveraging Modal's serverless platform for deployment and batch processing.

Funktionen

  • Serverless GPUs on-demand (T4, A10G, A100, H100, etc.)
  • Python-native infrastructure definition
  • Auto-scaling for ML workloads
  • Deploying ML models as REST APIs
  • Running batch processing jobs with automatic scaling

Anwendungsfälle

  • Running GPU-intensive ML workloads without managing infrastructure
  • Deploying ML models as auto-scaling APIs
  • Running batch processing jobs (training, inference, data processing)
  • Prototyping ML applications quickly

Nicht-Ziele

  • Using alternatives like RunPod for longer-running pods with persistent state
  • Using Lambda Labs for reserved GPU instances
  • Using SkyPilot for multi-cloud orchestration and cost optimization
  • Using Kubernetes for complex multi-service architectures

Installation

Zuerst Marketplace hinzufügen

/plugin marketplace add Orchestra-Research/AI-Research-SKILLs
/plugin install AI-Research-SKILLs@ai-research-skills

Qualitätspunktzahl

Verifiziert
95 /100
Analysiert 1 day ago

Vertrauenssignale

Letzter Commit17 days ago
Sterne8.3k
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

Cloudflare Deploy

99

Deploy applications and infrastructure to Cloudflare using Workers, Pages, and related platform services. Use when the user asks to deploy, host, publish, or set up a project on Cloudflare.

Skill
openai

Render Deploy

99

Deploy applications to Render by analyzing codebases, generating render.yaml Blueprints, and providing Dashboard deeplinks. Use when the user wants to deploy, host, publish, or set up their application on Render's cloud platform.

Skill
openai

Cost Optimization

98

Optimize cloud costs across AWS, Azure, GCP, and OCI through resource rightsizing, tagging strategies, reserved instances, and spending analysis. Use when reducing cloud expenses, analyzing infrastructure costs, or implementing cost governance policies.

Skill
wshobson

Skypilot Multi Cloud Orchestration

98

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage spot instances with auto-recovery, or optimize GPU costs across providers.

Skill
Orchestra-Research

Modal Serverless Gpu

98

Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.

Skill
davila7

RunPod Cloud GPU

98

Cloud-GPU-Verarbeitung über RunPod Serverless. Verwenden Sie dies beim Einrichten von RunPod-Endpunkten, beim Bereitstellen von Docker-Images, beim Verwalten von GPU-Ressourcen, beim Beheben von Endpunktproblemen oder beim Verstehen von Kosten. Beinhaltet alle 5 Toolkit-Images (qwen-edit, realesrgan, propainter, sadtalker, qwen3-tts).

Skill
digitalsamba