Lambda Labs GPU Cloud

Skill · Verified · Active

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node clusters for large-scale training.

Purpose

Helps users provision, configure, and manage GPU cloud instances on Lambda Labs for machine learning training and inference.

Features

  • Launch and terminate GPU instances
  • Manage SSH access and keys
  • Utilize persistent filesystems
  • Automate workflows via Python API
  • Troubleshoot common issues
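
The launch and terminate operations listed above could be scripted roughly as follows. This is a minimal sketch, not the skill's own code: the base URL, endpoint paths, payload field names, and Bearer-token authentication are assumptions about the Lambda Cloud API that should be checked against the official API reference, and the helper names (`build_request`, `launch_request`, `terminate_request`) are hypothetical. The functions only build requests; nothing is sent until a request is passed to `urllib.request.urlopen`.

```python
import json
import urllib.request

# Assumed base URL for the Lambda Cloud API; verify against the official docs.
API_BASE = "https://cloud.lambdalabs.com/api/v1"


def build_request(api_key, path, payload=None):
    """Build (but do not send) an authenticated JSON request.

    A payload makes it a POST; no payload makes it a GET.
    """
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    req = urllib.request.Request(
        f"{API_BASE}{path}",
        data=data,
        method="POST" if payload is not None else "GET",
    )
    # Assumption: the API accepts the key as a Bearer token.
    req.add_header("Authorization", f"Bearer {api_key}")
    if data is not None:
        req.add_header("Content-Type", "application/json")
    return req


def launch_request(api_key, instance_type, region, ssh_key_name, filesystem=None):
    """Request to launch one instance; field names are assumed, not confirmed."""
    payload = {
        "instance_type_name": instance_type,
        "region_name": region,
        "ssh_key_names": [ssh_key_name],
        "quantity": 1,
    }
    if filesystem:
        # Attach a persistent filesystem by name, if one was given.
        payload["file_system_names"] = [filesystem]
    return build_request(api_key, "/instance-operations/launch", payload)


def terminate_request(api_key, instance_ids):
    """Request to terminate the given instances."""
    payload = {"instance_ids": list(instance_ids)}
    return build_request(api_key, "/instance-operations/terminate", payload)
```

Keeping request construction separate from sending makes the payloads easy to inspect or log before any billable instance is started.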

Use cases

  • Provisioning dedicated GPU instances for long training jobs
  • Setting up high-performance multi-node clusters
  • Accessing pre-installed ML stacks like Lambda Stack
  • Utilizing simple pricing with no egress fees

Non-goals

  • Managing multi-cloud environments (use SkyPilot or Modal instead)
  • Serverless, auto-scaling workloads (use Modal instead)
  • Lowest-cost spot instances (use RunPod or Vast.ai instead)

Prerequisites

  • Lambda Labs account with API key
  • SSH key pair configured
  • Payment method added
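
Two of the prerequisites above can be checked locally before any automation runs. The following is a small sketch under stated assumptions: the environment variable name `LAMBDA_API_KEY` and the function `missing_prerequisites` are hypothetical, and the SSH check only looks for a `*.pub` file in a directory rather than confirming that the key is registered with Lambda Labs. The payment method cannot be verified from the local machine and is not covered.

```python
import os
from pathlib import Path


def missing_prerequisites(env=None, ssh_dir=None):
    """Return a list of human-readable descriptions of missing prerequisites."""
    env = os.environ if env is None else env
    ssh_dir = Path.home() / ".ssh" if ssh_dir is None else Path(ssh_dir)

    missing = []
    # Hypothetical variable name; use whatever your tooling expects.
    if not env.get("LAMBDA_API_KEY"):
        missing.append("API key (LAMBDA_API_KEY is not set)")
    # Any public key file counts; this does not check registration with the provider.
    if not ssh_dir.is_dir() or not any(ssh_dir.glob("*.pub")):
        missing.append(f"SSH public key (no *.pub file in {ssh_dir})")
    return missing


if __name__ == "__main__":
    for item in missing_prerequisites():
        print("Missing:", item)
```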

Installation

First, add the marketplace, then install the plugin:

/plugin marketplace add Orchestra-Research/AI-Research-SKILLs
/plugin install AI-Research-SKILLs@ai-research-skills

Quality score

Verified
97/100
Analyzed 1 day ago

Trust signals

Last commit: 17 days ago
Stars: 8.3k
License: MIT
