跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Lambda Labs GPU Cloud

技能 已验证 活跃

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node clusters for large-scale training.

目的

To enable users to effectively provision, configure, and manage GPU cloud instances on Lambda Labs for machine learning training and inference.

功能

  • Launch and terminate GPU instances
  • Manage SSH access and keys
  • Utilize persistent filesystems
  • Automate workflows via Python API
  • Troubleshoot common issues

使用场景

  • Provisioning dedicated GPU instances for long training jobs
  • Setting up high-performance multi-node clusters
  • Accessing pre-installed ML stacks like Lambda Stack
  • Utilizing simple pricing with no egress fees

非目标

  • Managing multi-cloud environments (use SkyPilot or Modal instead)
  • Serverless, auto-scaling workloads (use Modal instead)
  • Lowest-cost spot instances (use RunPod or Vast.ai instead)

先决条件

  • Lambda Labs account with API key
  • SSH key pair configured
  • Payment method added

安装

请先添加 Marketplace

/plugin marketplace add Orchestra-Research/AI-Research-SKILLs
/plugin install AI-Research-SKILLs@ai-research-skills

质量评分

已验证
97 /100
1 day ago 分析

信任信号

最近提交17 days ago
星标8.3k
许可证MIT
状态
查看源代码

类似扩展

Cloudflare Deploy

99

Deploy applications and infrastructure to Cloudflare using Workers, Pages, and related platform services. Use when the user asks to deploy, host, publish, or set up a project on Cloudflare.

技能
openai

Pytorch Lightning

99

High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from laptop to supercomputer with same code. Use when you want clean training loops with built-in best practices.

技能
Orchestra-Research

Cost Optimization

98

Optimize cloud costs across AWS, Azure, GCP, and OCI through resource rightsizing, tagging strategies, reserved instances, and spending analysis. Use when reducing cloud expenses, analyzing infrastructure costs, or implementing cost governance policies.

技能
wshobson

Skypilot Multi Cloud Orchestration

98

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage spot instances with auto-recovery, or optimize GPU costs across providers.

技能
Orchestra-Research

AWQ Quantization

95

Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited GPU memory, when you need faster inference than GPTQ with better accuracy preservation, or for instruction-tuned and multimodal models. MLSys 2024 Best Paper Award winner.

技能
Orchestra-Research

Lambda Labs Gpu Cloud

94

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node clusters for large-scale training.

技能
davila7