此内容尚未提供您的语言版本,正在以英文显示。

Long Context

技能已验证活跃

属于:Agent Native Research Artifact (ARA) Tooling

Extend context windows of transformer models using RoPE, YaRN, ALiBi, and position interpolation techniques. Use when processing long documents (32k-128k+ tokens), extending pre-trained models beyond original context limits, or implementing efficient positional encodings. Covers rotary embeddings, attention biases, interpolation methods, and extrapolation strategies for LLMs.

目的

To enable users to process extremely long documents or extend pre-trained models beyond their original context limits by implementing efficient positional encoding techniques.

功能

Extend context windows with RoPE, YaRN, ALiBi, and Position Interpolation
Implement efficient positional encodings
Train models with length extrapolation capabilities
Fine-tune existing models for longer contexts
Inference with long context models

使用场景

Process long documents (32k-128k+ tokens)
Extend context windows of pre-trained models
Implement efficient positional encodings
Train models with length extrapolation

非目标

Modifying the core transformer architecture beyond positional encodings
Providing a generic LLM fine-tuning framework
Covering techniques unrelated to context window extension

Execution

info:Pinned dependenciesThe SKILL.md lists dependencies like `transformers`, `torch`, `einops`, `flash-attn`, but does not explicitly pin versions or mention lockfiles for these Python packages.
info:Pinned dependenciesDependencies are listed in SKILL.md but are not pinned, and there's no explicit mention of vulnerability scanning or update mechanisms for these Python packages.

Maintenance

info:Dependency ManagementDependencies are listed in SKILL.md but are not pinned, and there's no explicit mention of vulnerability scanning or update mechanisms for these Python packages.

安装

请先添加 Marketplace

/plugin marketplace add Orchestra-Research/AI-Research-SKILLs

/plugin install AI-Research-SKILLs@ai-research-skills

质量评分

已验证

95 /100

about 2 months ago 分析

信任信号

最近提交2 months ago

GitHub 所有者 Orchestra-Research

星标8.3k

下载量 0

许可证MIT

网站orchestra-research.com

状态

查看源代码

类似扩展

This skill should be used when the user asks to "compress context", "summarize conversation history", "implement compaction", "reduce token usage", or mentions context compression, structured summarization, tokens-per-task optimization, or long-running agent sessions exceeding context limits.

技能

muratcankoylan