
Implementing LLMs with LitGPT

Skill Verified Active

Part of: Agent Native Research Artifact (ARA) Tooling

Implements and trains LLMs using Lightning AI's LitGPT, which provides 20+ pretrained architectures (Llama, Gemma, Phi, Qwen, Mistral). Use when you need clean model implementations, an educational understanding of architectures, or production fine-tuning with LoRA/QLoRA. Implementations are single-file, with no abstraction layers.

Purpose

Lets users implement, train, and fine-tune LLMs with LitGPT, offering readable code for learning how architectures work and production-ready workflows for ML research and development.

Features

Implement and train LLMs with LitGPT
Utilize 20+ pretrained architectures (Llama, Gemma, Phi, etc.)
Production-ready fine-tuning with LoRA/QLoRA
Pretrain new models from scratch
Deploy models via API
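The features above map onto subcommands of the `litgpt` command-line tool. The following is a minimal sketch that only builds and prints the command lines, without running them. The model name, dataset file, and output directory are hypothetical placeholders, and the flags follow the LitGPT README as an assumption; they may differ between LitGPT versions, so check `litgpt --help` before relying on them.

```python
import shlex


def litgpt_command(subcommand, target, **options):
    """Build an argument list for the `litgpt` CLI without executing it."""
    args = ["litgpt", subcommand, target]
    for key, value in options.items():
        args += [f"--{key}", str(value)]
    return args


# Hypothetical placeholders, not taken from the listing.
MODEL = "microsoft/phi-2"
OUT_DIR = "out/phi-2-lora"

steps = [
    # Fetch pretrained weights.
    litgpt_command("download", MODEL),
    # LoRA fine-tuning on a JSON dataset (flag names are an assumption).
    litgpt_command(
        "finetune_lora",
        MODEL,
        **{"data": "JSON", "data.json_path": "my_dataset.json", "out_dir": OUT_DIR},
    ),
    # Serve the fine-tuned checkpoint over an HTTP API
    # (the "final" subdirectory is an assumption about the output layout).
    litgpt_command("serve", f"{OUT_DIR}/final"),
]

for step in steps:
    print(shlex.join(step))
```

Each list can be passed to `subprocess.run(step, check=True)` once LitGPT is installed and the paths point at real files.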

Use cases

When you need a clean, educational view of LLM architectures
For production fine-tuning with efficient methods like LoRA/QLoRA
When prototyping new model ideas or adapting existing architectures
To leverage a unified framework for various LLM training tasks
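To illustrate why LoRA counts as an efficient fine-tuning method, here is a back-of-the-envelope sketch of the trainable parameter count for a single linear layer. It assumes the standard LoRA formulation, in which a frozen weight of shape d_out x d_in is adapted by two low-rank factors of shapes d_out x r and r x d_in. The layer size and rank are hypothetical, and the helper names are illustrative rather than part of LitGPT.

```python
def lora_trainable_params(d_in, d_out, rank):
    """Parameters in the two low-rank factors: A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


def full_params(d_in, d_out):
    """Parameters in the frozen dense weight matrix (bias ignored)."""
    return d_in * d_out


# Hypothetical layer size and rank, chosen only for illustration.
d_in = d_out = 4096
rank = 8

lora = lora_trainable_params(d_in, d_out, rank)
full = full_params(d_in, d_out)

print(lora, full, f"{lora / full:.2%}")  # prints: 65536 16777216 0.39%
```

For this layer, the adapter trains well under 1% of the weights that full fine-tuning would update. QLoRA additionally stores the frozen weights in quantized form to reduce memory further.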

Non-goals

Providing abstraction layers beyond clean model implementations
Supporting LLM architectures that LitGPT does not cover
Complex, multi-agent research orchestration (handled by other skills)

Installation

Add the marketplace first

/plugin marketplace add Orchestra-Research/AI-Research-SKILLs

/plugin install AI-Research-SKILLs@ai-research-skills

Quality score

Verified

98/100

Analyzed 1 day ago

Trust signals

Last commit: 17 days ago

GitHub owner: Orchestra-Research

Stars: 8.3k

Downloads: 0

License: MIT

Website: orchestra-research.com

Similar extensions

Implementing LLMs with LitGPT

100

Skill

davila7

PEFT Fine Tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. Hugging Face's official library, integrated with the transformers ecosystem.

Skill

Orchestra-Research

Unsloth

Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization

Skill

Orchestra-Research

Fine Tuning Expert

Use when fine-tuning LLMs, training custom models, or adapting foundation models for specific tasks. Invoke for configuring LoRA/QLoRA adapters, preparing JSONL training datasets, setting hyperparameters for fine-tuning runs, adapter training, transfer learning, finetuning with Hugging Face PEFT, OpenAI fine-tuning, instruction tuning, RLHF, DPO, or quantizing and deploying fine-tuned models. Trigger terms include: LoRA, QLoRA, PEFT, finetuning, fine-tuning, adapter tuning, LLM training, model training, custom model.

Skill

jeffallan

PEFT Fine Tuning

Skill

davila7

OpenVLA OFT Fine Tuning and Evaluation

Fine-tunes and evaluates OpenVLA-OFT and OpenVLA-OFT+ policies for robot action generation with continuous action heads, LoRA adaptation, and FiLM conditioning on LIBERO simulation and ALOHA real-world setups. Use when reproducing OpenVLA-OFT paper results, training custom VLA action heads (L1 or diffusion), deploying server-client inference for ALOHA, or debugging normalization, LoRA merge, and cross-GPU issues.

Skill

Orchestra-Research