跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Modal Serverless Gpu

技能 活跃

Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.

目的

To enable users to run GPU-intensive ML workloads on-demand without managing infrastructure, by leveraging Modal's serverless platform for deployment and batch processing.

功能

  • Serverless GPU access (T4, L4, A10G, A100, H100, etc.)
  • On-demand ML model deployment as APIs
  • Automatic scaling for batch jobs and inference
  • Python-native infrastructure definition
  • Sub-second cold starts and container caching

使用场景

  • Running GPU-intensive ML workloads without managing infrastructure
  • Deploying ML models as auto-scaling APIs
  • Running batch processing jobs (training, inference, data processing)
  • Prototyping ML applications quickly

非目标

  • Providing reserved GPU instances
  • Orchestrating multi-cloud deployments
  • Managing complex multi-service architectures directly

Trust

  • warning:Issues AttentionIn the last 90 days, 17 issues were opened and 4 were closed, indicating a closure rate below 50% and a significant number of open issues, suggesting potential delays in maintainer response.

安装

npx skills add davila7/claude-code-templates

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

98 /100
1 day ago 分析

信任信号

最近提交1 day ago
星标27.2k
许可证MIT
状态
查看源代码

类似扩展

Cloudflare Deploy

99

Deploy applications and infrastructure to Cloudflare using Workers, Pages, and related platform services. Use when the user asks to deploy, host, publish, or set up a project on Cloudflare.

技能
openai

Render Deploy

99

Deploy applications to Render by analyzing codebases, generating render.yaml Blueprints, and providing Dashboard deeplinks. Use when the user wants to deploy, host, publish, or set up their application on Render's cloud platform.

技能
openai

Cost Optimization

98

Optimize cloud costs across AWS, Azure, GCP, and OCI through resource rightsizing, tagging strategies, reserved instances, and spending analysis. Use when reducing cloud expenses, analyzing infrastructure costs, or implementing cost governance policies.

技能
wshobson

Skypilot Multi Cloud Orchestration

98

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage spot instances with auto-recovery, or optimize GPU costs across providers.

技能
Orchestra-Research

RunPod Cloud GPU

98

通过 RunPod serverless 进行云 GPU 处理。在设置 RunPod 端点、部署 Docker 映像、管理 GPU 资源、排查端点问题或了解成本时使用。涵盖所有 5 个工具包映像(qwen-edit、realesrgan、propainter、sadtalker、qwen3-tts)。

技能
digitalsamba

Alterlab Modal

98

Part of the AlterLab Academic Skills suite. Run Python code in the cloud with serverless containers, GPUs, and autoscaling. Use when deploying ML models, running batch processing jobs, scheduling compute-intensive tasks, or serving APIs that require GPU acceleration or dynamic scaling.

技能
AlterLab-IEU