跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Skypilot Multi Cloud Orchestration

技能 活跃

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage spot instances with auto-recovery, or optimize GPU costs across providers.

目的

Orchestrate ML workloads across multiple clouds with automatic cost optimization and spot instance management.

功能

  • Multi-cloud orchestration for ML workloads
  • Automatic cost optimization
  • Spot instance usage with auto-recovery
  • Distributed multi-node training
  • Unified interface for 20+ cloud providers

使用场景

  • Running training or batch jobs across multiple clouds (AWS, GCP, Azure)
  • Leveraging spot instances for cost savings with auto-recovery
  • Managing distributed multi-node training setups
  • Deploying ML models using Sky Serve with autoscaling

非目标

  • Simpler serverless GPU solutions (use Modal)
  • Single-cloud persistent pods (use RunPod)
  • Existing Kubernetes infrastructure management (use Kubernetes native tools)
  • Pure Ray-based orchestration (use Ray)

Trust

  • warning:Issues AttentionIn the last 90 days, 17 issues were opened and 4 were closed, resulting in a closure rate of approximately 23.5%, indicating maintainers respond slowly to open issues.

安装

npx skills add davila7/claude-code-templates

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

95 /100
1 day ago 分析

信任信号

最近提交1 day ago
星标27.2k
许可证MIT
状态
查看源代码

类似扩展

Skypilot Multi Cloud Orchestration

98

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage spot instances with auto-recovery, or optimize GPU costs across providers.

技能
Orchestra-Research

Orchestrate Ml Pipeline

99

Orchestrate end-to-end machine learning pipelines using Prefect or Airflow with DAG construction, task dependencies, retry logic, scheduling, monitoring, and integration with MLflow, DVC, and feature stores for production ML workflows. Use when automating multi-step ML workflows from data ingestion to deployment, scheduling periodic model retraining, coordinating distributed training tasks, or managing retry logic and failure recovery across pipeline stages.

技能
pjt222

Cost Optimization

98

Optimize cloud costs across AWS, Azure, GCP, and OCI through resource rightsizing, tagging strategies, reserved instances, and spending analysis. Use when reducing cloud expenses, analyzing infrastructure costs, or implementing cost governance policies.

技能
wshobson

Janitor Tokens

100

显示每个技能消耗的上下文窗口令牌数量。当用户询问有关令牌成本、上下文预算、技能大小,或希望了解哪些技能浪费了最多的上下文空间时使用。

技能
khendzel

Cloud Architect

100

Designs cloud architectures, creates migration plans, generates cost optimization recommendations, and produces disaster recovery strategies across AWS, Azure, and GCP. Use when designing cloud architectures, planning migrations, or optimizing multi-cloud deployments. Invoke for Well-Architected Framework, cost optimization, disaster recovery, landing zones, security architecture, serverless design.

技能
jeffallan

K8s Manifest Generator

100

Create production-ready Kubernetes manifests for Deployments, Services, ConfigMaps, and Secrets following best practices and security standards. Use when generating Kubernetes YAML manifests, creating K8s resources, or implementing production-grade Kubernetes configurations.

技能
wshobson