
Lambda Labs GPU Cloud

Skill, Verified, Active

Part of: Agent Native Research Artifact (ARA) Tooling

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node clusters for large-scale training.

Purpose

Provision, configure, and manage GPU cloud instances on Lambda Labs for machine learning training and inference.

Features

Launch and terminate GPU instances
Manage SSH access and keys
Utilize persistent filesystems
Automate workflows via Python API
Troubleshoot common issues
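As a rough illustration of the launch, terminate, and Python automation items above, here is a minimal sketch that uses only the standard library. The base URL, endpoint paths, JSON field names, and bearer-token header are assumptions based on Lambda's public Cloud API (v1) and should be checked against the current API reference. The helper names are hypothetical and are not part of this skill.

```python
import json
import urllib.request

# Assumption: base URL of Lambda's Cloud API (v1); verify before use.
API_BASE = "https://cloud.lambdalabs.com/api/v1"


def build_launch_payload(instance_type, region, ssh_key_name, filesystem=None):
    """Build the JSON body for a launch request (field names are assumed)."""
    payload = {
        "instance_type_name": instance_type,
        "region_name": region,
        "ssh_key_names": [ssh_key_name],
        "quantity": 1,
    }
    # Attach a persistent filesystem only when one is requested.
    if filesystem:
        payload["file_system_names"] = [filesystem]
    return payload


def build_terminate_payload(instance_ids):
    """Build the JSON body for a terminate request (field name is assumed)."""
    return {"instance_ids": list(instance_ids)}


def post(path, payload, api_key):
    """POST a JSON payload to an API path and return the decoded JSON reply."""
    request = urllib.request.Request(
        f"{API_BASE}/{path}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

A launch would then look something like post("instance-operations/launch", build_launch_payload("gpu_1x_a100", "us-west-1", "my-key"), api_key), where the instance type, region, and key name are placeholders. Keeping payload construction separate from the network call makes the request bodies easy to inspect before anything billable is started.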

Use cases

Provisioning dedicated GPU instances for long training jobs
Setting up high-performance multi-node clusters
Accessing pre-installed ML stacks like Lambda Stack
Utilizing simple pricing with no egress fees

Non-goals

Managing multi-cloud environments (use SkyPilot or Modal instead)
Serverless, auto-scaling workloads (use Modal instead)
Lowest-cost spot instances (use RunPod or Vast.ai instead)

Prerequisites

Lambda Labs account with API key
SSH key pair configured
Payment method added
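The first two prerequisites can be checked locally before any instance is requested. The sketch below is one possible preflight check. The environment variable name LAMBDA_API_KEY and the function name are hypothetical choices for illustration, and the payment method cannot be verified this way, so it still has to be confirmed in the Lambda dashboard.

```python
import os
from pathlib import Path


def check_prerequisites(env=None, ssh_dir=None):
    """Return a list of problems found; an empty list means the local checks passed."""
    env = os.environ if env is None else env
    ssh_dir = Path.home() / ".ssh" if ssh_dir is None else Path(ssh_dir)
    problems = []

    # Hypothetical variable name: adjust to wherever the API key is stored.
    if not env.get("LAMBDA_API_KEY"):
        problems.append("LAMBDA_API_KEY is not set")

    # Any *.pub file counts as a configured key pair for this rough check.
    if not ssh_dir.is_dir() or not any(ssh_dir.glob("*.pub")):
        problems.append(f"no public key (*.pub) found in {ssh_dir}")

    return problems
```

Calling check_prerequisites() with no arguments inspects the current environment and the default ~/.ssh directory.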

Installation

Add the marketplace first, then install the plugin:

/plugin marketplace add Orchestra-Research/AI-Research-SKILLs

/plugin install AI-Research-SKILLs@ai-research-skills

Quality score

Verified

97/100

Analyzed about 2 months ago

Trust signals

Last commit: 2 months ago

GitHub owner: Orchestra-Research

Stars: 8.3k

Downloads: 0

License: MIT

Website: orchestra-research.com


Similar extensions

Cloudflare Deploy

Deploy applications and infrastructure to Cloudflare using Workers, Pages, and related platform services. Use when the user asks to deploy, host, publish, or set up a project on Cloudflare.

Skill

openai

Pytorch Lightning

High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from laptop to supercomputer with same code. Use when you want clean training loops with built-in best practices.

Skill

Orchestra-Research

Cost Optimization

Optimize cloud costs across AWS, Azure, GCP, and OCI through resource rightsizing, tagging strategies, reserved instances, and spending analysis. Use when reducing cloud expenses, analyzing infrastructure costs, or implementing cost governance policies.

Skill

wshobson

Skypilot Multi Cloud Orchestration

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage spot instances with auto-recovery, or optimize GPU costs across providers.

Skill

Orchestra-Research

AWQ Quantization

Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited GPU memory, when you need faster inference than GPTQ with better accuracy preservation, or for instruction-tuned and multimodal models. MLSys 2024 Best Paper Award winner.

Skill

Orchestra-Research

Lambda Labs Gpu Cloud

Skill

davila7