Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Pytorch Fsdp2

Skill Verifiziert Aktiv

Teil von:Agent Native Research Artifact (ARA) Tooling

Adds PyTorch FSDP2 (fully_shard) to training scripts with correct init, sharding, mixed precision/offload config, and distributed checkpointing. Use when models exceed single-GPU memory or when you need DTensor-based sharding with DeviceMesh.

Zweck

Enables users to correctly implement PyTorch FSDP2 in their training scripts to manage large models that exceed single-GPU memory or require advanced sharding strategies.

Funktionen

Adds PyTorch FSDP2 integration to training scripts.
Correct initialization, sharding, and mixed precision configuration.
Guidance on distributed checkpointing (DCP).
Handles models exceeding single-GPU memory.
Supports DTensor-based sharding with DeviceMesh.

Anwendungsfälle

When models do not fit on a single GPU.
For users needing DTensor-based per-parameter sharding.
When composing DP with Tensor Parallelism using DeviceMesh.
Integrating FSDP2 into existing PyTorch training loops.

Nicht-Ziele

Replacing standard DistributedDataParallel (DDP) when model fits on one GPU.
Using older FSDP1 versions.
Providing backwards-compatible checkpoints across PyTorch versions without careful management.
Supporting PyTorch versions without the FSDP2 stack.

Installation

Zuerst Marketplace hinzufügen

/plugin marketplace add Orchestra-Research/AI-Research-SKILLs

/plugin install AI-Research-SKILLs@ai-research-skills

Qualitätspunktzahl

Verifiziert

95 /100

Analysiert 1 day ago

Vertrauenssignale

Letzter Commit17 days ago

GitHub-Inhaber Orchestra-Research

Sterne8.3k

Downloads 0

LizenzMIT

Websiteorchestra-research.com

Status

Quellcode ansehen

Pytorch Fsdp2

Funktionen

Anwendungsfälle

Nicht-Ziele

Qualitätspunktzahl

Vertrauenssignale

Ähnliche Erweiterungen

Pytorch Lightning

Huggingface Accelerate

TorchTitan Distributed LLM Pretraining

HuggingFace Accelerate

PyTorch Lightning

Nnsight Remote Interpretability