[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"extension-skill-Orchestra-Research-ml-paper-writing-zh-CN":3,"guides-for-Orchestra-Research-ml-paper-writing":1911,"similar-k173mxkapfej2d5crv2zbwvc9986mx6e-zh-CN":1912},{"_creationTime":4,"_id":5,"children":6,"community":7,"display":9,"evaluation":15,"identity":299,"isFallback":280,"parentExtension":305,"providers":362,"relations":366,"repo":367,"tags":1909,"workflow":1910},1778695116697.1975,"k173mxkapfej2d5crv2zbwvc9986mx6e",[],{"reviewCount":8},0,{"description":10,"installMethods":11,"name":13,"sourceUrl":14},"Write publication-ready ML/AI papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM. Use when drafting papers from research repos, structuring arguments, verifying citations, or preparing camera-ready submissions. For systems venues (OSDI, NSDI, ASPLOS, SOSP), use systems-paper-writing instead.",{"claudeCode":12},"Orchestra-Research/AI-Research-SKILLs","ML Paper Writing","https://github.com/Orchestra-Research/AI-Research-SKILLs",{"_creationTime":16,"_id":17,"extensionId":5,"locale":18,"result":19,"trustSignals":278,"workflow":297},1778697448935.3792,"kn78e2xkvz45ktsmywn158w5d186nk3e","en",{"checks":20,"evaluatedAt":225,"extensionSummary":226,"features":227,"nonGoals":233,"practices":237,"prerequisites":242,"promptVersionExtension":246,"promptVersionScoring":247,"purpose":248,"rationale":249,"score":250,"summary":251,"tags":252,"targetMarket":262,"tier":263,"useCases":264,"workflow":269},[21,26,29,32,36,39,43,46,50,54,58,61,64,68,72,76,80,84,87,91,94,97,100,103,106,109,113,116,120,124,128,131,135,139,142,145,148,151,154,157,161,165,167,169,171,174,177,180,183,184,185,186,187,188,189,190,191,192,193,194,195,196,197,198,199,200,201,202,203,204,205,206,208,209,210,211,212,213,214,215,216,217,218,219,220,221,222,223,224],{"category":22,"check":23,"severity":24,"summary":25},"Practical Utility","Problem relevance","pass","The description clearly identifies the user problem of writing publication-ready ML/AI papers for specific top-tier conferences and names the artifact operated on (ML/AI papers) and the user intent (drafting, structuring, verifying, preparing).",{"category":22,"check":27,"severity":24,"summary":28},"Unique selling proposition","The skill offers significant value beyond a basic LLM by combining researcher philosophies, providing programmatic citation verification (preventing hallucinations), offering specific LaTeX templates per venue, and guiding iterative drafting based on repository analysis, which is far beyond default LLM capabilities.",{"category":22,"check":30,"severity":24,"summary":31},"Production readiness","The skill is production-ready, covering the complete lifecycle from initial repository exploration and drafting to citation management, iterative feedback, and final submission formatting using venue-specific templates and checklists.",{"category":33,"check":34,"severity":24,"summary":35},"Scope","Single responsibility principle","The skill focuses exclusively on ML/AI paper writing for specified conferences, offering guidance, templates, and citation tools within this domain. It correctly delegates systems venues to a different skill.",{"category":33,"check":37,"severity":24,"summary":38},"Description quality","The description accurately reflects the skill's capabilities, clearly stating its purpose for ML/AI paper writing, listing target conferences, and providing specific use cases for drafting, structuring, verifying citations, and preparing submissions.",{"category":40,"check":41,"severity":24,"summary":42},"Invocation","Precise Purpose","The purpose is precisely stated, naming the artifact (ML/AI papers for NeurIPS, ICML, etc.) and the user intent (writing publication-ready papers, drafting, structuring, verifying citations, preparing submissions), with clear boundaries set by excluding systems venues.",{"category":40,"check":44,"severity":24,"summary":45},"Concise Frontmatter","The frontmatter is dense and self-contained, clearly summarizing the core capability of writing ML/AI papers for top conferences, followed by specific use cases and exclusions.",{"category":47,"check":48,"severity":24,"summary":49},"Documentation","Concise Body","The skill body is well-structured with deep dives delegated to reference files, keeping the main SKILL.md concise and focused on workflows and core principles.",{"category":51,"check":52,"severity":24,"summary":53},"Context","Progressive Disclosure","Complex workflows and detailed guidance (e.g., citation workflow, template usage, writing philosophy) are effectively broken down into separate reference files linked within the main SKILL.md.",{"category":51,"check":55,"severity":56,"summary":57},"Forked exploration","not_applicable","The skill does not involve deep exploration or multi-file inspection beyond its own bundled references, making `context: fork` not applicable.",{"category":22,"check":59,"severity":24,"summary":60},"Usage examples","Sufficient examples are included, demonstrating workflows for repository exploration, citation verification via Python code, and LaTeX template setup. Plausible outputs are described for each.",{"category":22,"check":62,"severity":24,"summary":63},"Edge cases","The skill handles edge cases by explicitly documenting the critical rule of never hallucinating citations and providing a clear recovery step (marking placeholders) for when citations cannot be programmatically verified. It also lists common issues with solutions.",{"category":65,"check":66,"severity":56,"summary":67},"Code Execution","Tool Fallback","The skill uses internal tools and APIs (like Semantic Scholar, CrossRef) and does not rely on an external MCP server, making fallbacks not applicable.",{"category":69,"check":70,"severity":24,"summary":71},"Safety","Halt on unexpected state","The skill emphasizes correctness and procedural rigor, particularly around citation management and following conference guidelines, implying a halt on unexpected states that could lead to errors or misconduct.",{"category":73,"check":74,"severity":24,"summary":75},"Portability","Cross-skill coupling","The skill is self-contained for ML/AI paper writing and explicitly delegates systems venue papers to a different skill, avoiding implicit reliance on other skills.",{"category":77,"check":78,"severity":24,"summary":79},"License","License usability","The extension is distributed under the MIT license, as detected from the bundled LICENSE file and confirmed in the trust signals, which is a permissive OSS license.",{"category":81,"check":82,"severity":24,"summary":83},"Maintenance","Commit recency","The last commit was on April 28, 2026, which is within the last 3 months.",{"category":81,"check":85,"severity":24,"summary":86},"Dependency Management","Dependencies like semanticscholar, arxiv, habanero, and requests are declared and appear to be managed appropriately for skill usage.",{"category":88,"check":89,"severity":56,"summary":90},"Security","Secret Management","The skill does not appear to handle or require any secrets.",{"category":88,"check":92,"severity":24,"summary":93},"Injection","The skill focuses on structured API calls and code execution within its own bundle, and emphasizes programmatic fetching of citations, mitigating risks associated with untrusted external content.",{"category":88,"check":95,"severity":24,"summary":96},"Transitive Supply-Chain Grenades","The skill relies on well-defined API clients and bundled LaTeX templates, avoiding runtime downloads of code or data that could compromise supply chain integrity.",{"category":88,"check":98,"severity":24,"summary":99},"Sandbox Isolation","The skill operates within defined scopes, primarily managing text generation, API calls, and local file operations for LaTeX documents, without attempting to modify files outside its project folder.",{"category":88,"check":101,"severity":24,"summary":102},"Sandbox escape primitives","No evidence of detached-process spawns or deny-retry loops that could indicate sandbox escape primitives was found in the skill's instructions or scripts.",{"category":88,"check":104,"severity":24,"summary":105},"Data Exfiltration","The skill's outbound calls are limited to documented APIs (Semantic Scholar, CrossRef, arXiv) for citation retrieval, with no indication of undocumented or malicious data exfiltration.",{"category":88,"check":107,"severity":24,"summary":108},"Hidden Text Tricks","The bundled content and descriptions are free of hidden-steering tricks, invisible characters, or ANSI escapes, ensuring clarity and safety for model interpretation.",{"category":110,"check":111,"severity":24,"summary":112},"Hooks","Opaque code execution","The skill's bundled scripts and instructions are in plain, readable code and prose, avoiding obfuscation techniques like base64 payloads or runtime code fetching.",{"category":73,"check":114,"severity":24,"summary":115},"Structural Assumption","The skill operates primarily on text and API interactions, and its LaTeX template usage assumes standard LaTeX environments, avoiding assumptions about user-specific project structures.",{"category":117,"check":118,"severity":24,"summary":119},"Trust","Issues Attention","With 4 issues opened and 8 closed in the last 90 days, the closure rate is 66.7%, indicating good maintainer engagement.",{"category":121,"check":122,"severity":24,"summary":123},"Versioning","Release Management","The skill declares a version (1.2.0) in its frontmatter and has a changelog, indicating clear release management.",{"category":125,"check":126,"severity":24,"summary":127},"Execution","Pinned dependencies","Dependencies like semanticscholar, arxiv, habanero, and requests are listed and appear to be managed appropriately for the skill's execution.",{"category":33,"check":129,"severity":56,"summary":130},"Dry-run preview","The skill does not involve state-changing operations or outbound data sending that would require a dry-run mode.",{"category":132,"check":133,"severity":24,"summary":134},"Protocol","Idempotent retry & timeouts","The skill's operations involve API calls and local file manipulation, which are generally idempotent or managed within standard tool execution timeouts, and no long-awaiting operations were observed.",{"category":136,"check":137,"severity":56,"summary":138},"Compliance","GDPR","The skill does not appear to operate on personal data.",{"category":136,"check":140,"severity":24,"summary":141},"Target market","The skill's target market is global, as it provides tools and guidance applicable to researchers worldwide and does not contain region-specific logic or APIs.",{"category":73,"check":143,"severity":24,"summary":144},"Runtime stability","The skill assumes standard Python environments and LaTeX installations, with no apparent reliance on specific operating systems, shells, or editors beyond common tooling.",{"category":47,"check":146,"severity":24,"summary":147},"README","A detailed README exists within the templates directory, providing comprehensive instructions for compiling LaTeX documents and referencing the various conference templates.",{"category":33,"check":149,"severity":56,"summary":150},"Tool surface size","This is a skill, not a CLI or MCP server, and does not expose tools in the same manner. The focus is on guidance and workflows.",{"category":40,"check":152,"severity":56,"summary":153},"Overlapping near-synonym tools","As a skill, it does not expose multiple tools with overlapping functions; rather, it provides a unified workflow.",{"category":47,"check":155,"severity":24,"summary":156},"Phantom features","All features described in the SKILL.md, READMEs, and reference documents (e.g., citation workflow, LaTeX templates, writing guidance) have corresponding implementations or detailed explanations.",{"category":158,"check":159,"severity":24,"summary":160},"Install","Installation instruction","Installation instructions are implicitly covered by the `claudeCode` install method and the clear structure of the repository, with detailed usage instructions provided within the SKILL.md and README files.",{"category":162,"check":163,"severity":24,"summary":164},"Errors","Actionable error messages","The skill provides actionable guidance for common issues, such as compilation errors, bibliography problems, and citation verification failures, often with specific recovery steps or links to resources.",{"category":125,"check":126,"severity":24,"summary":166},"Dependencies are declared in the SKILL.md and appear to be standard Python libraries and LaTeX packages, with no specific version pinning explicitly required for the skill's core functionality.",{"category":33,"check":129,"severity":56,"summary":168},"The skill provides guidance and templates; it does not perform state-changing operations that would require a dry-run feature.",{"category":132,"check":133,"severity":56,"summary":170},"The skill primarily offers guidance and templates and does not involve network calls or state-changing operations that require idempotency or timeouts.",{"category":65,"check":172,"severity":24,"summary":173},"Validation","The skill's execution involves API calls and LaTeX processing, which rely on validated inputs and standard library functions. The citation manager class includes logic for handling potential data variations.",{"category":88,"check":175,"severity":56,"summary":176},"Unguarded Destructive Operations","The skill does not perform any destructive operations like file deletion or modification outside of its own template files.",{"category":65,"check":178,"severity":24,"summary":179},"Error Handling","The skill provides clear guidance on handling potential errors, such as citation verification failures and LaTeX compilation issues, with suggested recovery steps.",{"category":65,"check":181,"severity":56,"summary":182},"Logging","As a guidance and template skill, it does not perform destructive actions or outbound calls that would necessitate a local audit log.",{"category":47,"check":48,"severity":24,"summary":49},{"category":51,"check":52,"severity":24,"summary":53},{"category":51,"check":55,"severity":56,"summary":57},{"category":22,"check":59,"severity":24,"summary":60},{"category":22,"check":62,"severity":24,"summary":63},{"category":65,"check":66,"severity":56,"summary":67},{"category":69,"check":70,"severity":24,"summary":71},{"category":73,"check":74,"severity":24,"summary":75},{"category":77,"check":78,"severity":24,"summary":79},{"category":81,"check":82,"severity":24,"summary":83},{"category":81,"check":85,"severity":24,"summary":86},{"category":88,"check":89,"severity":56,"summary":90},{"category":88,"check":92,"severity":24,"summary":93},{"category":88,"check":95,"severity":24,"summary":96},{"category":88,"check":98,"severity":24,"summary":99},{"category":88,"check":101,"severity":24,"summary":102},{"category":88,"check":104,"severity":24,"summary":105},{"category":88,"check":107,"severity":24,"summary":108},{"category":110,"check":111,"severity":24,"summary":112},{"category":73,"check":114,"severity":24,"summary":115},{"category":117,"check":118,"severity":24,"summary":119},{"category":121,"check":122,"severity":24,"summary":123},{"category":125,"check":126,"severity":24,"summary":166},{"category":33,"check":129,"severity":56,"summary":207},"The skill provides guidance and templates; it does not perform state-changing operations that would require a dry-run mode.",{"category":132,"check":133,"severity":56,"summary":170},{"category":136,"check":137,"severity":56,"summary":138},{"category":136,"check":140,"severity":24,"summary":141},{"category":73,"check":143,"severity":24,"summary":144},{"category":47,"check":146,"severity":24,"summary":147},{"category":33,"check":149,"severity":56,"summary":150},{"category":40,"check":152,"severity":56,"summary":153},{"category":47,"check":155,"severity":24,"summary":156},{"category":158,"check":159,"severity":24,"summary":160},{"category":162,"check":163,"severity":24,"summary":164},{"category":125,"check":126,"severity":24,"summary":166},{"category":33,"check":129,"severity":56,"summary":168},{"category":132,"check":133,"severity":56,"summary":170},{"category":65,"check":172,"severity":24,"summary":173},{"category":88,"check":175,"severity":56,"summary":176},{"category":65,"check":178,"severity":24,"summary":179},{"category":65,"check":181,"severity":56,"summary":182},1778697448801,"This skill provides detailed guidance, LaTeX templates, and workflows for writing publication-ready ML/AI papers for top conferences like NeurIPS, ICML, and ICLR. It includes a strict citation verification process to prevent hallucinations and offers advice on structure, writing style, and submission requirements.",[228,229,230,231,232],"Write ML/AI papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM","Programmatic citation verification to prevent hallucinations","LaTeX templates for various top conferences","Guidance on paper structure, narrative, and writing style","Workflows for repository analysis and iterative drafting",[234,235,236],"Writing papers for systems venues (use systems-paper-writing skill instead)","Generating citations from memory","Providing generic LLM capabilities without domain-specific structure",[238,239,240,241],"Academic Writing","Citation Management","LaTeX Formatting","Research Workflow",[243,244,245],"Access to LaTeX environment (e.g., TeX Live, Overleaf)","Python environment for citation management scripts","Access to necessary APIs (Semantic Scholar, CrossRef, arXiv)","3.0.0","4.4.0","To assist researchers in drafting, structuring, and preparing publication-ready ML/AI papers for top-tier conferences by providing expert guidance, templates, and tools.","No critical or warning findings were identified. The 'Issues Attention' check passed with a strong closure rate. The overall quality of documentation, utility, and safety measures leads to a high score.",98,"Excellent skill for ML/AI paper writing, providing comprehensive guidance, templates, and tools.",[253,254,255,256,257,258,259,260,261],"academic-writing","paper-writing","latex","citations","research","neurips","icml","iclr","acl","global","verified",[265,266,267,268],"Drafting papers from research repositories","Structuring arguments and identifying core contributions","Finding and verifying citations programmatically","Preparing camera-ready submissions using venue-specific templates",[270,271,272,273,274,275,276,277],"Explore research repository to understand project","Identify existing citations within codebase","Clarify main contribution with scientist","Search for relevant literature using APIs","Verify and retrieve citations programmatically","Draft paper sections iteratively with feedback","Format paper using venue-specific LaTeX templates","Complete conference checklists and submit",{"codeQuality":279,"collectedAt":281,"documentation":282,"maintenance":285,"popularity":292,"security":293,"testCoverage":296},{"hasLockfile":280},true,1778697423070,{"descriptionLength":283,"readmeSize":284},289,45313,{"closedIssues90d":286,"forks":287,"hasChangelog":280,"manifestVersion":288,"openIssues90d":289,"pushedAt":290,"stars":291},8,640,"1.2.0",4,1777352967000,8343,{"npmDownloads":8},{"hasNpmPackage":280,"license":294,"smitheryVerified":295},"MIT",false,{"hasCi":280,"hasTests":295},{"updatedAt":298},1778697448935,{"basePath":300,"githubOwner":301,"githubRepo":302,"locale":18,"slug":303,"type":304},"20-ml-paper-writing/ml-paper-writing","Orchestra-Research","AI-Research-SKILLs","ml-paper-writing","skill",{"_creationTime":306,"_id":307,"community":308,"display":309,"identity":313,"parentExtension":316,"providers":349,"relations":358,"tags":359,"workflow":360},1778695116697.1702,"k17155ws9qc0hw7a568bg79sfd86max8",{"reviewCount":8},{"description":310,"installMethods":311,"name":312,"sourceUrl":14},"LLM architectures and implementations including LitGPT, Mamba, NanoGPT, RWKV, and TorchTitan. Use when implementing, training, or understanding transformer and alternative architectures.",{"claudeCode":302},"Agent-Native Research Artifact (ARA) Tooling",{"basePath":314,"githubOwner":301,"githubRepo":302,"locale":18,"slug":302,"type":315},"","plugin",{"_creationTime":317,"_id":318,"community":319,"display":320,"identity":324,"providers":326,"relations":343,"tags":345,"workflow":346},1778695116697.17,"k17755pkhk2ktxts0edcsj00s586nmvk",{"reviewCount":8},{"description":321,"installMethods":322,"name":323,"sourceUrl":14},"Comprehensive library of 98 AI research engineering skills enabling autonomous AI research from hypothesis to experimental verification",{"claudeCode":12},"AI Research Skills Library",{"basePath":314,"githubOwner":301,"githubRepo":302,"locale":18,"slug":302,"type":325},"marketplace",{"evaluate":327,"extract":336},{"promptVersionExtension":328,"promptVersionScoring":247,"score":329,"tags":330,"targetMarket":262,"tier":263},"3.1.0",99,[331,332,333,334,335],"ai-research","mlops","llm-skills","autonomous-agents","research-orchestration",{"commitSha":337,"license":294,"marketplace":338,"plugin":341},"HEAD",{"name":339,"pluginCount":340},"ai-research-skills",1,{"mcpCount":8,"provider":342,"skillCount":8},"classify",{"repoId":344},"kd70hj1y80mhra5xm5g188j5n586mg18",[331,334,333,332,335],{"evaluatedAt":347,"extractAt":348,"updatedAt":347},1778695131103,1778695116697,{"evaluate":350,"extract":355},{"promptVersionExtension":246,"promptVersionScoring":247,"score":250,"tags":351,"targetMarket":262,"tier":263},[257,352,353,354,331],"artifact","provenance","review",{"commitSha":337,"license":294,"plugin":356},{"mcpCount":8,"provider":342,"skillCount":357},96,{"parentExtensionId":318,"repoId":344},[331,352,353,257,354],{"evaluatedAt":361,"extractAt":348,"updatedAt":361},1778695555085,{"evaluate":363,"extract":365},{"promptVersionExtension":246,"promptVersionScoring":247,"score":250,"tags":364,"targetMarket":262,"tier":263},[253,254,255,256,257,258,259,260,261],{"commitSha":337,"license":294},{"parentExtensionId":307,"repoId":344},{"_creationTime":368,"_id":344,"identity":369,"providers":370,"workflow":1904},1778695107142.3535,{"githubOwner":301,"githubRepo":302,"sourceUrl":14},{"classify":371,"discover":1882,"extract":1885,"github":1886,"npm":1903},{"commitSha":337,"extensions":372},[373,386,393,400,407,414,421,428,435,442,449,456,463,470,476,483,490,497,504,511,518,525,531,538,545,569,585,599,613,626,641,656,666,682,698,710,725,738,749,760,771,783,794,807,818,834,848,858,868,884,894,902,910,918,926,940,954,986,1000,1010,1020,1030,1040,1050,1064,1075,1085,1097,1107,1126,1139,1154,1170,1184,1198,1211,1224,1237,1249,1261,1274,1293,1303,1316,1329,1342,1351,1361,1370,1380,1390,1402,1415,1428,1440,1450,1460,1470,1480,1490,1502,1511,1529,1545,1555,1565,1575,1585,1599,1612,1622,1635,1647,1661,1766,1776,1813,1821,1829,1843,1857,1867],{"basePath":314,"description":321,"displayName":339,"installMethods":374,"rationale":375,"selectedPaths":376,"source":385,"sourceLanguage":18,"type":325},{"claudeCode":12},"marketplace.json at .claude-plugin/marketplace.json",[377,380,382],{"path":378,"priority":379},".claude-plugin/marketplace.json","mandatory",{"path":381,"priority":379},"README.md",{"path":383,"priority":384},"LICENSE","high","rule",{"basePath":314,"description":310,"displayName":387,"installMethods":388,"rationale":389,"selectedPaths":390,"source":385,"sourceLanguage":18,"type":315},"model-architecture",{"claudeCode":302},"inline plugin source from marketplace.json at /",[391,392],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":394,"displayName":395,"installMethods":396,"rationale":389,"selectedPaths":397,"source":385,"sourceLanguage":18,"type":315},"Text tokenization for LLMs including HuggingFace Tokenizers and SentencePiece. Use when training custom tokenizers or handling multilingual text.","tokenization",{"claudeCode":302},[398,399],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":401,"displayName":402,"installMethods":403,"rationale":389,"selectedPaths":404,"source":385,"sourceLanguage":18,"type":315},"LLM fine-tuning frameworks including Axolotl, LLaMA-Factory, PEFT, and Unsloth. Use when fine-tuning models with LoRA, QLoRA, or full fine-tuning.","fine-tuning",{"claudeCode":302},[405,406],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":408,"displayName":409,"installMethods":410,"rationale":389,"selectedPaths":411,"source":385,"sourceLanguage":18,"type":315},"Neural network interpretability tools including TransformerLens, SAELens, NNSight, and pyvene. Use when analyzing model internals, finding circuits, or understanding how models compute.","mechanistic-interpretability",{"claudeCode":302},[412,413],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":415,"displayName":416,"installMethods":417,"rationale":389,"selectedPaths":418,"source":385,"sourceLanguage":18,"type":315},"Data curation and processing at scale including NeMo Curator and Ray Data. Use when preparing training datasets or processing large-scale data.","data-processing",{"claudeCode":302},[419,420],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":422,"displayName":423,"installMethods":424,"rationale":389,"selectedPaths":425,"source":385,"sourceLanguage":18,"type":315},"RLHF and preference alignment including TRL, GRPO, OpenRLHF, SimPO, verl, slime, miles, and torchforge. Use when aligning models with human preferences, training reward models, or large-scale RL training.","post-training",{"claudeCode":302},[426,427],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":429,"displayName":430,"installMethods":431,"rationale":389,"selectedPaths":432,"source":385,"sourceLanguage":18,"type":315},"AI safety and content moderation including Constitutional AI, LlamaGuard, NeMo Guardrails, and Prompt Guard. Use when implementing safety filters, content moderation, or prompt injection detection.","safety-alignment",{"claudeCode":302},[433,434],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":436,"displayName":437,"installMethods":438,"rationale":389,"selectedPaths":439,"source":385,"sourceLanguage":18,"type":315},"Multi-GPU and multi-node training including DeepSpeed, PyTorch FSDP, Accelerate, Megatron-Core, PyTorch Lightning, and Ray Train. Use when training large models across GPUs.","distributed-training",{"claudeCode":302},[440,441],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":443,"displayName":444,"installMethods":445,"rationale":389,"selectedPaths":446,"source":385,"sourceLanguage":18,"type":315},"GPU cloud and compute orchestration including Modal, Lambda Labs, and SkyPilot. Use when deploying training jobs or managing GPU resources.","infrastructure",{"claudeCode":302},[447,448],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":450,"displayName":451,"installMethods":452,"rationale":389,"selectedPaths":453,"source":385,"sourceLanguage":18,"type":315},"Model optimization and quantization including Flash Attention, bitsandbytes, GPTQ, AWQ, GGUF, and HQQ. Use when reducing memory, accelerating inference, or quantizing models.","optimization",{"claudeCode":302},[454,455],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":457,"displayName":458,"installMethods":459,"rationale":389,"selectedPaths":460,"source":385,"sourceLanguage":18,"type":315},"LLM benchmarking and evaluation including lm-evaluation-harness, BigCode Evaluation Harness, and NeMo Evaluator. Use when benchmarking models or measuring performance.","evaluation",{"claudeCode":302},[461,462],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":464,"displayName":465,"installMethods":466,"rationale":389,"selectedPaths":467,"source":385,"sourceLanguage":18,"type":315},"Production LLM inference including vLLM, TensorRT-LLM, llama.cpp, and SGLang. Use when deploying models for production inference.","inference-serving",{"claudeCode":302},[468,469],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":471,"displayName":332,"installMethods":472,"rationale":389,"selectedPaths":473,"source":385,"sourceLanguage":18,"type":315},"ML experiment tracking and lifecycle including Weights & Biases, MLflow, and TensorBoard. Use when tracking experiments or managing models.",{"claudeCode":302},[474,475],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":477,"displayName":478,"installMethods":479,"rationale":389,"selectedPaths":480,"source":385,"sourceLanguage":18,"type":315},"LLM agent frameworks including LangChain, LlamaIndex, CrewAI, and AutoGPT. Use when building chatbots, autonomous agents, or tool-using systems.","agents",{"claudeCode":302},[481,482],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":484,"displayName":485,"installMethods":486,"rationale":389,"selectedPaths":487,"source":385,"sourceLanguage":18,"type":315},"Retrieval-Augmented Generation including Chroma, FAISS, Pinecone, Qdrant, and Sentence Transformers. Use when building semantic search or document retrieval systems.","rag",{"claudeCode":302},[488,489],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":491,"displayName":492,"installMethods":493,"rationale":389,"selectedPaths":494,"source":385,"sourceLanguage":18,"type":315},"Structured LLM outputs including DSPy, Instructor, Guidance, and Outlines. Use when extracting structured data or constraining LLM outputs.","prompt-engineering",{"claudeCode":302},[495,496],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":498,"displayName":499,"installMethods":500,"rationale":389,"selectedPaths":501,"source":385,"sourceLanguage":18,"type":315},"LLM application monitoring including LangSmith and Phoenix. Use when debugging LLM apps or monitoring production systems.","observability",{"claudeCode":302},[502,503],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":505,"displayName":506,"installMethods":507,"rationale":389,"selectedPaths":508,"source":385,"sourceLanguage":18,"type":315},"Vision, audio, and multimodal models including CLIP, Whisper, LLaVA, BLIP-2, Segment Anything, Stable Diffusion, AudioCraft, Cosmos Policy, OpenPI, and OpenVLA-OFT. Use when working with images, audio, multimodal tasks, or vision-language-action robot policies.","multimodal",{"claudeCode":302},[509,510],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":512,"displayName":513,"installMethods":514,"rationale":389,"selectedPaths":515,"source":385,"sourceLanguage":18,"type":315},"Advanced ML techniques including MoE Training, Model Merging, Long Context, Speculative Decoding, Knowledge Distillation, and Model Pruning. Use when implementing cutting-edge optimization or architecture techniques.","emerging-techniques",{"claudeCode":302},[516,517],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":519,"displayName":520,"installMethods":521,"rationale":389,"selectedPaths":522,"source":385,"sourceLanguage":18,"type":315},"Autonomous research orchestration using a two-loop architecture. Manages the full research lifecycle from literature survey to paper writing, routing to domain-specific skills for execution. Use when starting a research project, running autonomous experiments, or managing multi-hypothesis research.","autoresearch",{"claudeCode":302},[523,524],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":526,"displayName":303,"installMethods":527,"rationale":389,"selectedPaths":528,"source":385,"sourceLanguage":18,"type":315},"Write publication-ready ML/AI/Systems papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM, OSDI, NSDI, ASPLOS, SOSP. Includes LaTeX templates, citation verification, reviewer guidelines, publication-quality figure generation, systems paper structural blueprints, and conference presentation slides.",{"claudeCode":302},[529,530],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":532,"displayName":533,"installMethods":534,"rationale":389,"selectedPaths":535,"source":385,"sourceLanguage":18,"type":315},"Research ideation frameworks including structured brainstorming and creative thinking. Use when exploring new research directions, generating novel ideas, or seeking fresh angles on existing work.","ideation",{"claudeCode":302},[536,537],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":314,"description":539,"displayName":540,"installMethods":541,"rationale":389,"selectedPaths":542,"source":385,"sourceLanguage":18,"type":315},"Agent-Native Research Artifact (ARA) tooling: compile any research input (paper, repo, notes) into a structured artifact, record session provenance as a post-task epilogue, and run Seal Level 2 epistemic review. Use when ingesting research into a falsifiable, agent-traversable artifact, capturing how a research project actually evolved, or auditing an ARA for evidence-claim alignment.","agent-native-research-artifact",{"claudeCode":302},[543,544],{"path":381,"priority":379},{"path":383,"priority":384},{"basePath":546,"description":547,"displayName":520,"installMethods":548,"rationale":549,"selectedPaths":550,"source":385,"sourceLanguage":18,"type":304},"0-autoresearch-skill","Orchestrates end-to-end autonomous AI research projects using a two-loop architecture. The inner loop runs rapid experiment iterations with clear optimization targets. The outer loop synthesizes results, identifies patterns, and steers research direction. Routes to domain-specific skills for execution, supports continuous agent operation via Claude Code /loop and OpenClaw heartbeat, and produces research presentations and papers. Use when starting a research project, running autonomous experiments, or managing a multi-hypothesis research effort.",{"claudeCode":12},"SKILL.md frontmatter at 0-autoresearch-skill/SKILL.md",[551,553,556,558,560,563,565,567],{"path":552,"priority":379},"SKILL.md",{"path":554,"priority":555},"references/agent-continuity.md","medium",{"path":557,"priority":555},"references/progress-reporting.md",{"path":559,"priority":555},"references/skill-routing.md",{"path":561,"priority":562},"templates/findings.md","low",{"path":564,"priority":562},"templates/progress-presentation.html",{"path":566,"priority":562},"templates/research-log.md",{"path":568,"priority":562},"templates/research-state.yaml",{"basePath":570,"description":571,"displayName":572,"installMethods":573,"rationale":574,"selectedPaths":575,"source":385,"sourceLanguage":18,"type":304},"01-model-architecture/litgpt","Implements and trains LLMs using Lightning AI's LitGPT with 20+ pretrained architectures (Llama, Gemma, Phi, Qwen, Mistral). Use when need clean model implementations, educational understanding of architectures, or production fine-tuning with LoRA/QLoRA. Single-file implementations, no abstraction layers.","implementing-llms-litgpt",{"claudeCode":12},"SKILL.md frontmatter at 01-model-architecture/litgpt/SKILL.md",[576,577,579,581,583],{"path":552,"priority":379},{"path":578,"priority":555},"references/custom-models.md",{"path":580,"priority":555},"references/distributed-training.md",{"path":582,"priority":555},"references/supported-models.md",{"path":584,"priority":555},"references/training-recipes.md",{"basePath":586,"description":587,"displayName":588,"installMethods":589,"rationale":590,"selectedPaths":591,"source":385,"sourceLanguage":18,"type":304},"01-model-architecture/mamba","State-space model with O(n) complexity vs Transformers' O(n²). 5× faster inference, million-token sequences, no KV cache. Selective SSM with hardware-aware design. Mamba-1 (d_state=16) and Mamba-2 (d_state=128, multi-head). Models 130M-2.8B on HuggingFace.","mamba-architecture",{"claudeCode":12},"SKILL.md frontmatter at 01-model-architecture/mamba/SKILL.md",[592,593,595,597],{"path":552,"priority":379},{"path":594,"priority":555},"references/architecture-details.md",{"path":596,"priority":555},"references/benchmarks.md",{"path":598,"priority":555},"references/training-guide.md",{"basePath":600,"description":601,"displayName":602,"installMethods":603,"rationale":604,"selectedPaths":605,"source":385,"sourceLanguage":18,"type":304},"01-model-architecture/nanogpt","Educational GPT implementation in ~300 lines. Reproduces GPT-2 (124M) on OpenWebText. Clean, hackable code for learning transformers. By Andrej Karpathy. Perfect for understanding GPT architecture from scratch. Train on Shakespeare (CPU) or OpenWebText (multi-GPU).","nanogpt",{"claudeCode":12},"SKILL.md frontmatter at 01-model-architecture/nanogpt/SKILL.md",[606,607,609,611],{"path":552,"priority":379},{"path":608,"priority":555},"references/architecture.md",{"path":610,"priority":555},"references/data.md",{"path":612,"priority":555},"references/training.md",{"basePath":614,"description":615,"displayName":616,"installMethods":617,"rationale":618,"selectedPaths":619,"source":385,"sourceLanguage":18,"type":304},"01-model-architecture/rwkv","RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel), infer like RNN (sequential). Linux Foundation AI project. Production at Windows, Office, NeMo. RWKV-7 (March 2025). Models up to 14B parameters.","rwkv-architecture",{"claudeCode":12},"SKILL.md frontmatter at 01-model-architecture/rwkv/SKILL.md",[620,621,622,624],{"path":552,"priority":379},{"path":594,"priority":555},{"path":623,"priority":555},"references/rwkv7.md",{"path":625,"priority":555},"references/state-management.md",{"basePath":627,"description":628,"displayName":629,"installMethods":630,"rationale":631,"selectedPaths":632,"source":385,"sourceLanguage":18,"type":304},"01-model-architecture/torchtitan","Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use when pretraining Llama 3.1, DeepSeek V3, or custom models at scale from 8 to 512+ GPUs with Float8, torch.compile, and distributed checkpointing.","distributed-llm-pretraining-torchtitan",{"claudeCode":12},"SKILL.md frontmatter at 01-model-architecture/torchtitan/SKILL.md",[633,634,636,637,639],{"path":552,"priority":379},{"path":635,"priority":555},"references/checkpoint.md",{"path":578,"priority":555},{"path":638,"priority":555},"references/float8.md",{"path":640,"priority":555},"references/fsdp.md",{"basePath":642,"description":643,"displayName":644,"installMethods":645,"rationale":646,"selectedPaths":647,"source":385,"sourceLanguage":18,"type":304},"02-tokenization/huggingface-tokenizers","Fast tokenizers optimized for research and production. Rust-based implementation tokenizes 1GB in \u003C20 seconds. Supports BPE, WordPiece, and Unigram algorithms. Train custom vocabularies, track alignments, handle padding/truncation. Integrates seamlessly with transformers. Use when you need high-performance tokenization or custom tokenizer training.","huggingface-tokenizers",{"claudeCode":12},"SKILL.md frontmatter at 02-tokenization/huggingface-tokenizers/SKILL.md",[648,649,651,653,655],{"path":552,"priority":379},{"path":650,"priority":555},"references/algorithms.md",{"path":652,"priority":555},"references/integration.md",{"path":654,"priority":555},"references/pipeline.md",{"path":612,"priority":555},{"basePath":657,"description":658,"displayName":659,"installMethods":660,"rationale":661,"selectedPaths":662,"source":385,"sourceLanguage":18,"type":304},"02-tokenization/sentencepiece","Language-independent tokenizer treating text as raw Unicode. Supports BPE and Unigram algorithms. Fast (50k sentences/sec), lightweight (6MB memory), deterministic vocabulary. Used by T5, ALBERT, XLNet, mBART. Train on raw text without pre-tokenization. Use when you need multilingual support, CJK languages, or reproducible tokenization.","sentencepiece",{"claudeCode":12},"SKILL.md frontmatter at 02-tokenization/sentencepiece/SKILL.md",[663,664,665],{"path":552,"priority":379},{"path":650,"priority":555},{"path":612,"priority":555},{"basePath":667,"description":668,"displayName":669,"installMethods":670,"rationale":671,"selectedPaths":672,"source":385,"sourceLanguage":18,"type":304},"03-fine-tuning/axolotl","Expert guidance for fine-tuning LLMs with Axolotl - YAML configs, 100+ models, LoRA/QLoRA, DPO/KTO/ORPO/GRPO, multimodal support","axolotl",{"claudeCode":12},"SKILL.md frontmatter at 03-fine-tuning/axolotl/SKILL.md",[673,674,676,678,680],{"path":552,"priority":379},{"path":675,"priority":555},"references/api.md",{"path":677,"priority":555},"references/dataset-formats.md",{"path":679,"priority":555},"references/index.md",{"path":681,"priority":555},"references/other.md",{"basePath":683,"description":684,"displayName":685,"installMethods":686,"rationale":687,"selectedPaths":688,"source":385,"sourceLanguage":18,"type":304},"03-fine-tuning/llama-factory","Expert guidance for fine-tuning LLMs with LLaMA-Factory - WebUI no-code, 100+ models, 2/3/4/5/6/8-bit QLoRA, multimodal support","llama-factory",{"claudeCode":12},"SKILL.md frontmatter at 03-fine-tuning/llama-factory/SKILL.md",[689,690,692,694,696,697],{"path":552,"priority":379},{"path":691,"priority":555},"references/_images.md",{"path":693,"priority":555},"references/advanced.md",{"path":695,"priority":555},"references/getting_started.md",{"path":679,"priority":555},{"path":681,"priority":555},{"basePath":699,"description":700,"displayName":701,"installMethods":702,"rationale":703,"selectedPaths":704,"source":385,"sourceLanguage":18,"type":304},"03-fine-tuning/peft","Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train \u003C1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.","peft-fine-tuning",{"claudeCode":12},"SKILL.md frontmatter at 03-fine-tuning/peft/SKILL.md",[705,706,708],{"path":552,"priority":379},{"path":707,"priority":555},"references/advanced-usage.md",{"path":709,"priority":555},"references/troubleshooting.md",{"basePath":711,"description":712,"displayName":713,"installMethods":714,"rationale":715,"selectedPaths":716,"source":385,"sourceLanguage":18,"type":304},"03-fine-tuning/unsloth","Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization","unsloth",{"claudeCode":12},"SKILL.md frontmatter at 03-fine-tuning/unsloth/SKILL.md",[717,718,719,721,723],{"path":552,"priority":379},{"path":679,"priority":555},{"path":720,"priority":555},"references/llms-full.md",{"path":722,"priority":555},"references/llms-txt.md",{"path":724,"priority":555},"references/llms.md",{"basePath":726,"description":727,"displayName":728,"installMethods":729,"rationale":730,"selectedPaths":731,"source":385,"sourceLanguage":18,"type":304},"04-mechanistic-interpretability/nnsight","Provides guidance for interpreting and manipulating neural network internals using nnsight with optional NDIF remote execution. Use when needing to run interpretability experiments on massive models (70B+) without local GPU resources, or when working with any PyTorch architecture.","nnsight-remote-interpretability",{"claudeCode":12},"SKILL.md frontmatter at 04-mechanistic-interpretability/nnsight/SKILL.md",[732,733,735,736],{"path":552,"priority":379},{"path":734,"priority":555},"references/README.md",{"path":675,"priority":555},{"path":737,"priority":555},"references/tutorials.md",{"basePath":739,"description":740,"displayName":741,"installMethods":742,"rationale":743,"selectedPaths":744,"source":385,"sourceLanguage":18,"type":304},"04-mechanistic-interpretability/pyvene","Provides guidance for performing causal interventions on PyTorch models using pyvene's declarative intervention framework. Use when conducting causal tracing, activation patching, interchange intervention training, or testing causal hypotheses about model behavior.","pyvene-interventions",{"claudeCode":12},"SKILL.md frontmatter at 04-mechanistic-interpretability/pyvene/SKILL.md",[745,746,747,748],{"path":552,"priority":379},{"path":734,"priority":555},{"path":675,"priority":555},{"path":737,"priority":555},{"basePath":750,"description":751,"displayName":752,"installMethods":753,"rationale":754,"selectedPaths":755,"source":385,"sourceLanguage":18,"type":304},"04-mechanistic-interpretability/saelens","Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.","sparse-autoencoder-training",{"claudeCode":12},"SKILL.md frontmatter at 04-mechanistic-interpretability/saelens/SKILL.md",[756,757,758,759],{"path":552,"priority":379},{"path":734,"priority":555},{"path":675,"priority":555},{"path":737,"priority":555},{"basePath":761,"description":762,"displayName":763,"installMethods":764,"rationale":765,"selectedPaths":766,"source":385,"sourceLanguage":18,"type":304},"04-mechanistic-interpretability/transformer-lens","Provides guidance for mechanistic interpretability research using TransformerLens to inspect and manipulate transformer internals via HookPoints and activation caching. Use when reverse-engineering model algorithms, studying attention patterns, or performing activation patching experiments.","transformer-lens-interpretability",{"claudeCode":12},"SKILL.md frontmatter at 04-mechanistic-interpretability/transformer-lens/SKILL.md",[767,768,769,770],{"path":552,"priority":379},{"path":734,"priority":555},{"path":675,"priority":555},{"path":737,"priority":555},{"basePath":772,"description":773,"displayName":774,"installMethods":775,"rationale":776,"selectedPaths":777,"source":385,"sourceLanguage":18,"type":304},"05-data-processing/nemo-curator","GPU-accelerated data curation for LLM training. Supports text/image/video/audio. Features fuzzy deduplication (16× faster), quality filtering (30+ heuristics), semantic deduplication, PII redaction, NSFW detection. Scales across GPUs with RAPIDS. Use for preparing high-quality training datasets, cleaning web data, or deduplicating large corpora.","nemo-curator",{"claudeCode":12},"SKILL.md frontmatter at 05-data-processing/nemo-curator/SKILL.md",[778,779,781],{"path":552,"priority":379},{"path":780,"priority":555},"references/deduplication.md",{"path":782,"priority":555},"references/filtering.md",{"basePath":784,"description":785,"displayName":786,"installMethods":787,"rationale":788,"selectedPaths":789,"source":385,"sourceLanguage":18,"type":304},"05-data-processing/ray-data","Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.","ray-data",{"claudeCode":12},"SKILL.md frontmatter at 05-data-processing/ray-data/SKILL.md",[790,791,792],{"path":552,"priority":379},{"path":652,"priority":555},{"path":793,"priority":555},"references/transformations.md",{"basePath":795,"description":796,"displayName":797,"installMethods":798,"rationale":799,"selectedPaths":800,"source":385,"sourceLanguage":18,"type":304},"06-post-training/grpo-rl-training","Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training","grpo-rl-training",{"claudeCode":12},"SKILL.md frontmatter at 06-post-training/grpo-rl-training/SKILL.md",[801,802,803,805],{"path":552,"priority":379},{"path":381,"priority":384},{"path":804,"priority":562},"examples/reward_functions_library.py",{"path":806,"priority":562},"templates/basic_grpo_training.py",{"basePath":808,"description":809,"displayName":810,"installMethods":811,"rationale":812,"selectedPaths":813,"source":385,"sourceLanguage":18,"type":304},"06-post-training/miles","Provides guidance for enterprise-grade RL training using miles, a production-ready fork of slime. Use when training large MoE models with FP8/INT4, needing train-inference alignment, or requiring speculative RL for maximum throughput.","miles-rl-training",{"claudeCode":12},"SKILL.md frontmatter at 06-post-training/miles/SKILL.md",[814,815,817],{"path":552,"priority":379},{"path":816,"priority":555},"references/api-reference.md",{"path":709,"priority":555},{"basePath":819,"description":820,"displayName":821,"installMethods":822,"rationale":823,"selectedPaths":824,"source":385,"sourceLanguage":18,"type":304},"06-post-training/openrlhf","High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.","openrlhf-training",{"claudeCode":12},"SKILL.md frontmatter at 06-post-training/openrlhf/SKILL.md",[825,826,828,830,832],{"path":552,"priority":379},{"path":827,"priority":555},"references/algorithm-comparison.md",{"path":829,"priority":555},"references/custom-rewards.md",{"path":831,"priority":555},"references/hybrid-engine.md",{"path":833,"priority":555},"references/multi-node-training.md",{"basePath":835,"description":836,"displayName":837,"installMethods":838,"rationale":839,"selectedPaths":840,"source":385,"sourceLanguage":18,"type":304},"06-post-training/simpo","Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4 points on AlpacaEval 2.0). No reference model needed, more efficient than DPO. Use for preference alignment when want simpler, faster training than DPO/PPO.","simpo-training",{"claudeCode":12},"SKILL.md frontmatter at 06-post-training/simpo/SKILL.md",[841,842,844,846],{"path":552,"priority":379},{"path":843,"priority":555},"references/datasets.md",{"path":845,"priority":555},"references/hyperparameters.md",{"path":847,"priority":555},"references/loss-functions.md",{"basePath":849,"description":850,"displayName":851,"installMethods":852,"rationale":853,"selectedPaths":854,"source":385,"sourceLanguage":18,"type":304},"06-post-training/slime","Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation workflows, or needing tight Megatron-LM integration for RL scaling.","slime-rl-training",{"claudeCode":12},"SKILL.md frontmatter at 06-post-training/slime/SKILL.md",[855,856,857],{"path":552,"priority":379},{"path":816,"priority":555},{"path":709,"priority":555},{"basePath":859,"description":860,"displayName":861,"installMethods":862,"rationale":863,"selectedPaths":864,"source":385,"sourceLanguage":18,"type":304},"06-post-training/torchforge","Provides guidance for PyTorch-native agentic RL using torchforge, Meta's library separating infra from algorithms. Use when you want clean RL abstractions, easy algorithm experimentation, or scalable training with Monarch and TorchTitan.","torchforge-rl-training",{"claudeCode":12},"SKILL.md frontmatter at 06-post-training/torchforge/SKILL.md",[865,866,867],{"path":552,"priority":379},{"path":816,"priority":555},{"path":709,"priority":555},{"basePath":869,"description":870,"displayName":871,"installMethods":872,"rationale":873,"selectedPaths":874,"source":385,"sourceLanguage":18,"type":304},"06-post-training/trl-fine-tuning","Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when need RLHF, align model with preferences, or train from human feedback. Works with HuggingFace Transformers.","fine-tuning-with-trl",{"claudeCode":12},"SKILL.md frontmatter at 06-post-training/trl-fine-tuning/SKILL.md",[875,876,878,880,882],{"path":552,"priority":379},{"path":877,"priority":555},"references/dpo-variants.md",{"path":879,"priority":555},"references/online-rl.md",{"path":881,"priority":555},"references/reward-modeling.md",{"path":883,"priority":555},"references/sft-training.md",{"basePath":885,"description":886,"displayName":887,"installMethods":888,"rationale":889,"selectedPaths":890,"source":385,"sourceLanguage":18,"type":304},"06-post-training/verl","Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.","verl-rl-training",{"claudeCode":12},"SKILL.md frontmatter at 06-post-training/verl/SKILL.md",[891,892,893],{"path":552,"priority":379},{"path":816,"priority":555},{"path":709,"priority":555},{"basePath":895,"description":896,"displayName":897,"installMethods":898,"rationale":899,"selectedPaths":900,"source":385,"sourceLanguage":18,"type":304},"07-safety-alignment/constitutional-ai","Anthropic's method for training harmless AI through self-improvement. Two-phase approach - supervised learning with self-critique/revision, then RLAIF (RL from AI Feedback). Use for safety alignment, reducing harmful outputs without human labels. Powers Claude's safety system.","constitutional-ai",{"claudeCode":12},"SKILL.md frontmatter at 07-safety-alignment/constitutional-ai/SKILL.md",[901],{"path":552,"priority":379},{"basePath":903,"description":904,"displayName":905,"installMethods":906,"rationale":907,"selectedPaths":908,"source":385,"sourceLanguage":18,"type":304},"07-safety-alignment/llamaguard","Meta's 7-8B specialized moderation model for LLM input/output filtering. 6 safety categories - violence/hate, sexual content, weapons, substances, self-harm, criminal planning. 94-95% accuracy. Deploy with vLLM, HuggingFace, Sagemaker. Integrates with NeMo Guardrails.","llamaguard",{"claudeCode":12},"SKILL.md frontmatter at 07-safety-alignment/llamaguard/SKILL.md",[909],{"path":552,"priority":379},{"basePath":911,"description":912,"displayName":913,"installMethods":914,"rationale":915,"selectedPaths":916,"source":385,"sourceLanguage":18,"type":304},"07-safety-alignment/nemo-guardrails","NVIDIA's runtime safety framework for LLM applications. Features jailbreak detection, input/output validation, fact-checking, hallucination detection, PII filtering, toxicity detection. Uses Colang 2.0 DSL for programmable rails. Production-ready, runs on T4 GPU.","nemo-guardrails",{"claudeCode":12},"SKILL.md frontmatter at 07-safety-alignment/nemo-guardrails/SKILL.md",[917],{"path":552,"priority":379},{"basePath":919,"description":920,"displayName":921,"installMethods":922,"rationale":923,"selectedPaths":924,"source":385,"sourceLanguage":18,"type":304},"07-safety-alignment/prompt-guard","Meta's 86M prompt injection and jailbreak detector. Filters malicious prompts and third-party data for LLM apps. 99%+ TPR, \u003C1% FPR. Fast (\u003C2ms GPU). Multilingual (8 languages). Deploy with HuggingFace or batch processing for RAG security.","prompt-guard",{"claudeCode":12},"SKILL.md frontmatter at 07-safety-alignment/prompt-guard/SKILL.md",[925],{"path":552,"priority":379},{"basePath":927,"description":928,"displayName":929,"installMethods":930,"rationale":931,"selectedPaths":932,"source":385,"sourceLanguage":18,"type":304},"08-distributed-training/accelerate","Simplest distributed training API. 4 lines to add distributed support to any PyTorch script. Unified API for DeepSpeed/FSDP/Megatron/DDP. Automatic device placement, mixed precision (FP16/BF16/FP8). Interactive config, single launch command. HuggingFace ecosystem standard.","huggingface-accelerate",{"claudeCode":12},"SKILL.md frontmatter at 08-distributed-training/accelerate/SKILL.md",[933,934,936,938],{"path":552,"priority":379},{"path":935,"priority":555},"references/custom-plugins.md",{"path":937,"priority":555},"references/megatron-integration.md",{"path":939,"priority":555},"references/performance.md",{"basePath":941,"description":942,"displayName":943,"installMethods":944,"rationale":945,"selectedPaths":946,"source":385,"sourceLanguage":18,"type":304},"08-distributed-training/megatron-core","Trains large language models (2B-462B parameters) using NVIDIA Megatron-Core with advanced parallelism strategies. Use when training models >1B parameters, need maximum GPU efficiency (47% MFU on H100), or require tensor/pipeline/sequence/context/expert parallelism. Production-ready framework used for Nemotron, LLaMA, DeepSeek.","training-llms-megatron",{"claudeCode":12},"SKILL.md frontmatter at 08-distributed-training/megatron-core/SKILL.md",[947,948,949,951,953],{"path":552,"priority":379},{"path":596,"priority":555},{"path":950,"priority":555},"references/parallelism-guide.md",{"path":952,"priority":555},"references/production-examples.md",{"path":584,"priority":555},{"basePath":955,"description":956,"displayName":957,"installMethods":958,"rationale":959,"selectedPaths":960,"source":385,"sourceLanguage":18,"type":304},"08-distributed-training/pytorch-fsdp2","Adds PyTorch FSDP2 (fully_shard) to training scripts with correct init, sharding, mixed precision/offload config, and distributed checkpointing. Use when models exceed single-GPU memory or when you need DTensor-based sharding with DeviceMesh.","pytorch-fsdp2",{"claudeCode":12},"SKILL.md frontmatter at 08-distributed-training/pytorch-fsdp2/SKILL.md",[961,962,964,966,968,970,972,974,976,978,980,982,984],{"path":552,"priority":379},{"path":963,"priority":555},"references/pytorch_dcp_async_recipe.md",{"path":965,"priority":555},"references/pytorch_dcp_overview.md",{"path":967,"priority":555},"references/pytorch_dcp_recipe.md",{"path":969,"priority":555},"references/pytorch_ddp_notes.md",{"path":971,"priority":555},"references/pytorch_device_mesh_tutorial.md",{"path":973,"priority":555},"references/pytorch_examples_fsdp2.md",{"path":975,"priority":555},"references/pytorch_fsdp1_api.md",{"path":977,"priority":555},"references/pytorch_fsdp2_tutorial.md",{"path":979,"priority":555},"references/pytorch_fully_shard_api.md",{"path":981,"priority":555},"references/pytorch_tp_tutorial.md",{"path":983,"priority":555},"references/ray_train_fsdp2_example.md",{"path":985,"priority":555},"references/torchtitan_fsdp_notes.md",{"basePath":987,"description":988,"displayName":989,"installMethods":990,"rationale":991,"selectedPaths":992,"source":385,"sourceLanguage":18,"type":304},"08-distributed-training/pytorch-lightning","High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from laptop to supercomputer with same code. Use when you want clean training loops with built-in best practices.","pytorch-lightning",{"claudeCode":12},"SKILL.md frontmatter at 08-distributed-training/pytorch-lightning/SKILL.md",[993,994,996,998],{"path":552,"priority":379},{"path":995,"priority":555},"references/callbacks.md",{"path":997,"priority":555},"references/distributed.md",{"path":999,"priority":555},"references/hyperparameter-tuning.md",{"basePath":1001,"description":1002,"displayName":1003,"installMethods":1004,"rationale":1005,"selectedPaths":1006,"source":385,"sourceLanguage":18,"type":304},"08-distributed-training/ray-train","Distributed training orchestration across clusters. Scales PyTorch/TensorFlow/HuggingFace from laptop to 1000s of nodes. Built-in hyperparameter tuning with Ray Tune, fault tolerance, elastic scaling. Use when training massive models across multiple machines or running distributed hyperparameter sweeps.","ray-train",{"claudeCode":12},"SKILL.md frontmatter at 08-distributed-training/ray-train/SKILL.md",[1007,1008],{"path":552,"priority":379},{"path":1009,"priority":555},"references/multi-node.md",{"basePath":1011,"description":1012,"displayName":1013,"installMethods":1014,"rationale":1015,"selectedPaths":1016,"source":385,"sourceLanguage":18,"type":304},"09-infrastructure/lambda-labs","Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node clusters for large-scale training.","lambda-labs-gpu-cloud",{"claudeCode":12},"SKILL.md frontmatter at 09-infrastructure/lambda-labs/SKILL.md",[1017,1018,1019],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1021,"description":1022,"displayName":1023,"installMethods":1024,"rationale":1025,"selectedPaths":1026,"source":385,"sourceLanguage":18,"type":304},"09-infrastructure/modal","Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.","modal-serverless-gpu",{"claudeCode":12},"SKILL.md frontmatter at 09-infrastructure/modal/SKILL.md",[1027,1028,1029],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1031,"description":1032,"displayName":1033,"installMethods":1034,"rationale":1035,"selectedPaths":1036,"source":385,"sourceLanguage":18,"type":304},"09-infrastructure/skypilot","Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage spot instances with auto-recovery, or optimize GPU costs across providers.","skypilot-multi-cloud-orchestration",{"claudeCode":12},"SKILL.md frontmatter at 09-infrastructure/skypilot/SKILL.md",[1037,1038,1039],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1041,"description":1042,"displayName":1043,"installMethods":1044,"rationale":1045,"selectedPaths":1046,"source":385,"sourceLanguage":18,"type":304},"10-optimization/awq","Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited GPU memory, when you need faster inference than GPTQ with better accuracy preservation, or for instruction-tuned and multimodal models. MLSys 2024 Best Paper Award winner.","awq-quantization",{"claudeCode":12},"SKILL.md frontmatter at 10-optimization/awq/SKILL.md",[1047,1048,1049],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1051,"description":1052,"displayName":1053,"installMethods":1054,"rationale":1055,"selectedPaths":1056,"source":385,"sourceLanguage":18,"type":304},"10-optimization/bitsandbytes","Quantizes LLMs to 8-bit or 4-bit for 50-75% memory reduction with minimal accuracy loss. Use when GPU memory is limited, need to fit larger models, or want faster inference. Supports INT8, NF4, FP4 formats, QLoRA training, and 8-bit optimizers. Works with HuggingFace Transformers.","quantizing-models-bitsandbytes",{"claudeCode":12},"SKILL.md frontmatter at 10-optimization/bitsandbytes/SKILL.md",[1057,1058,1060,1062],{"path":552,"priority":379},{"path":1059,"priority":555},"references/memory-optimization.md",{"path":1061,"priority":555},"references/qlora-training.md",{"path":1063,"priority":555},"references/quantization-formats.md",{"basePath":1065,"description":1066,"displayName":1067,"installMethods":1068,"rationale":1069,"selectedPaths":1070,"source":385,"sourceLanguage":18,"type":304},"10-optimization/flash-attention","Optimizes transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction. Use when training/running transformers with long sequences (>512 tokens), encountering GPU memory issues with attention, or need faster inference. Supports PyTorch native SDPA, flash-attn library, H100 FP8, and sliding window attention.","optimizing-attention-flash",{"claudeCode":12},"SKILL.md frontmatter at 10-optimization/flash-attention/SKILL.md",[1071,1072,1073],{"path":552,"priority":379},{"path":596,"priority":555},{"path":1074,"priority":555},"references/transformers-integration.md",{"basePath":1076,"description":1077,"displayName":1078,"installMethods":1079,"rationale":1080,"selectedPaths":1081,"source":385,"sourceLanguage":18,"type":304},"10-optimization/gguf","GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.","gguf-quantization",{"claudeCode":12},"SKILL.md frontmatter at 10-optimization/gguf/SKILL.md",[1082,1083,1084],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1086,"description":1087,"displayName":1088,"installMethods":1089,"rationale":1090,"selectedPaths":1091,"source":385,"sourceLanguage":18,"type":304},"10-optimization/gptq","Post-training 4-bit quantization for LLMs with minimal accuracy loss. Use for deploying large models (70B, 405B) on consumer GPUs, when you need 4× memory reduction with \u003C2% perplexity degradation, or for faster inference (3-4× speedup) vs FP16. Integrates with transformers and PEFT for QLoRA fine-tuning.","gptq",{"claudeCode":12},"SKILL.md frontmatter at 10-optimization/gptq/SKILL.md",[1092,1093,1095,1096],{"path":552,"priority":379},{"path":1094,"priority":555},"references/calibration.md",{"path":652,"priority":555},{"path":709,"priority":555},{"basePath":1098,"description":1099,"displayName":1100,"installMethods":1101,"rationale":1102,"selectedPaths":1103,"source":385,"sourceLanguage":18,"type":304},"10-optimization/hqq","Half-Quadratic Quantization for LLMs without calibration data. Use when quantizing models to 4/3/2-bit precision without needing calibration datasets, for fast quantization workflows, or when deploying with vLLM or HuggingFace Transformers.","hqq-quantization",{"claudeCode":12},"SKILL.md frontmatter at 10-optimization/hqq/SKILL.md",[1104,1105,1106],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1108,"description":1109,"displayName":1110,"installMethods":1111,"rationale":1112,"selectedPaths":1113,"source":385,"sourceLanguage":18,"type":304},"10-optimization/ml-training-recipes","Battle-tested PyTorch training recipes for all domains — LLMs, vision, diffusion, medical imaging, protein/drug discovery, spatial omics, genomics. Covers training loops, optimizer selection (AdamW, Muon), LR scheduling, mixed precision, debugging, and systematic experimentation. Use when training or fine-tuning neural networks, debugging loss spikes or OOM, choosing architectures, or optimizing GPU throughput.","ml-training-recipes",{"claudeCode":12},"SKILL.md frontmatter at 10-optimization/ml-training-recipes/SKILL.md",[1114,1115,1116,1118,1120,1122,1124],{"path":552,"priority":379},{"path":608,"priority":555},{"path":1117,"priority":555},"references/biomedical.md",{"path":1119,"priority":555},"references/domain-specific.md",{"path":1121,"priority":555},"references/experiment-loop.md",{"path":1123,"priority":555},"references/optimizers.md",{"path":1125,"priority":555},"references/scaling-and-selection.md",{"basePath":1127,"description":1128,"displayName":1129,"installMethods":1130,"rationale":1131,"selectedPaths":1132,"source":385,"sourceLanguage":18,"type":304},"11-evaluation/bigcode-evaluation-harness","Evaluates code generation models across HumanEval, MBPP, MultiPL-E, and 15+ benchmarks with pass@k metrics. Use when benchmarking code models, comparing coding abilities, testing multi-language support, or measuring code generation quality. Industry standard from BigCode Project used by HuggingFace leaderboards.","evaluating-code-models",{"claudeCode":12},"SKILL.md frontmatter at 11-evaluation/bigcode-evaluation-harness/SKILL.md",[1133,1134,1135,1137],{"path":552,"priority":379},{"path":596,"priority":555},{"path":1136,"priority":555},"references/custom-tasks.md",{"path":1138,"priority":555},"references/issues.md",{"basePath":1140,"description":1141,"displayName":1142,"installMethods":1143,"rationale":1144,"selectedPaths":1145,"source":385,"sourceLanguage":18,"type":304},"11-evaluation/lm-evaluation-harness","Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking training progress. Industry standard used by EleutherAI, HuggingFace, and major labs. Supports HuggingFace, vLLM, APIs.","evaluating-llms-harness",{"claudeCode":12},"SKILL.md frontmatter at 11-evaluation/lm-evaluation-harness/SKILL.md",[1146,1147,1149,1151,1152],{"path":552,"priority":379},{"path":1148,"priority":555},"references/api-evaluation.md",{"path":1150,"priority":555},"references/benchmark-guide.md",{"path":1136,"priority":555},{"path":1153,"priority":555},"references/distributed-eval.md",{"basePath":1155,"description":1156,"displayName":1157,"installMethods":1158,"rationale":1159,"selectedPaths":1160,"source":385,"sourceLanguage":18,"type":304},"11-evaluation/nemo-evaluator","Evaluates LLMs across 100+ benchmarks from 18+ harnesses (MMLU, HumanEval, GSM8K, safety, VLM) with multi-backend execution. Use when needing scalable evaluation on local Docker, Slurm HPC, or cloud platforms. NVIDIA's enterprise-grade platform with container-first architecture for reproducible benchmarking.","nemo-evaluator-sdk",{"claudeCode":12},"SKILL.md frontmatter at 11-evaluation/nemo-evaluator/SKILL.md",[1161,1162,1164,1166,1168],{"path":552,"priority":379},{"path":1163,"priority":555},"references/adapter-system.md",{"path":1165,"priority":555},"references/configuration.md",{"path":1167,"priority":555},"references/custom-benchmarks.md",{"path":1169,"priority":555},"references/execution-backends.md",{"basePath":1171,"description":1172,"displayName":1173,"installMethods":1174,"rationale":1175,"selectedPaths":1176,"source":385,"sourceLanguage":18,"type":304},"12-inference-serving/llama-cpp","Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization (1.5-8 bit) for reduced memory and 4-10× speedup vs PyTorch on CPU.","llama-cpp",{"claudeCode":12},"SKILL.md frontmatter at 12-inference-serving/llama-cpp/SKILL.md",[1177,1178,1180,1182],{"path":552,"priority":379},{"path":1179,"priority":555},"references/optimization.md",{"path":1181,"priority":555},"references/quantization.md",{"path":1183,"priority":555},"references/server.md",{"basePath":1185,"description":1186,"displayName":1187,"installMethods":1188,"rationale":1189,"selectedPaths":1190,"source":385,"sourceLanguage":18,"type":304},"12-inference-serving/sglang","Fast structured generation and serving for LLMs with RadixAttention prefix caching. Use for JSON/regex outputs, constrained decoding, agentic workflows with tool calls, or when you need 5× faster inference than vLLM with prefix sharing. Powers 300,000+ GPUs at xAI, AMD, NVIDIA, and LinkedIn.","sglang",{"claudeCode":12},"SKILL.md frontmatter at 12-inference-serving/sglang/SKILL.md",[1191,1192,1194,1196],{"path":552,"priority":379},{"path":1193,"priority":555},"references/deployment.md",{"path":1195,"priority":555},"references/radix-attention.md",{"path":1197,"priority":555},"references/structured-generation.md",{"basePath":1199,"description":1200,"displayName":1201,"installMethods":1202,"rationale":1203,"selectedPaths":1204,"source":385,"sourceLanguage":18,"type":304},"12-inference-serving/tensorrt-llm","Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than PyTorch, or for serving models with quantization (FP8/INT4), in-flight batching, and multi-GPU scaling.","tensorrt-llm",{"claudeCode":12},"SKILL.md frontmatter at 12-inference-serving/tensorrt-llm/SKILL.md",[1205,1206,1208,1209],{"path":552,"priority":379},{"path":1207,"priority":555},"references/multi-gpu.md",{"path":1179,"priority":555},{"path":1210,"priority":555},"references/serving.md",{"basePath":1212,"description":1213,"displayName":1214,"installMethods":1215,"rationale":1216,"selectedPaths":1217,"source":385,"sourceLanguage":18,"type":304},"12-inference-serving/vllm","Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with limited GPU memory. Supports OpenAI-compatible endpoints, quantization (GPTQ/AWQ/FP8), and tensor parallelism.","serving-llms-vllm",{"claudeCode":12},"SKILL.md frontmatter at 12-inference-serving/vllm/SKILL.md",[1218,1219,1220,1221,1223],{"path":552,"priority":379},{"path":1179,"priority":555},{"path":1181,"priority":555},{"path":1222,"priority":555},"references/server-deployment.md",{"path":709,"priority":555},{"basePath":1225,"description":1226,"displayName":1227,"installMethods":1228,"rationale":1229,"selectedPaths":1230,"source":385,"sourceLanguage":18,"type":304},"13-mlops/mlflow","Track ML experiments, manage model registry with versioning, deploy models to production, and reproduce experiments with MLflow - framework-agnostic ML lifecycle platform","mlflow",{"claudeCode":12},"SKILL.md frontmatter at 13-mlops/mlflow/SKILL.md",[1231,1232,1233,1235],{"path":552,"priority":379},{"path":1193,"priority":555},{"path":1234,"priority":555},"references/model-registry.md",{"path":1236,"priority":555},"references/tracking.md",{"basePath":1238,"description":1239,"displayName":1240,"installMethods":1241,"rationale":1242,"selectedPaths":1243,"source":385,"sourceLanguage":18,"type":304},"13-mlops/swanlab","Provides guidance for experiment tracking with SwanLab. Use when you need open-source run tracking, local or self-hosted dashboards, and lightweight media logging for ML workflows.","experiment-tracking-swanlab",{"claudeCode":12},"SKILL.md frontmatter at 13-mlops/swanlab/SKILL.md",[1244,1245,1247],{"path":552,"priority":379},{"path":1246,"priority":555},"references/integrations.md",{"path":1248,"priority":555},"references/visualization.md",{"basePath":1250,"description":1251,"displayName":1252,"installMethods":1253,"rationale":1254,"selectedPaths":1255,"source":385,"sourceLanguage":18,"type":304},"13-mlops/tensorboard","Visualize training metrics, debug models with histograms, compare experiments, visualize model graphs, and profile performance with TensorBoard - Google's ML visualization toolkit","tensorboard",{"claudeCode":12},"SKILL.md frontmatter at 13-mlops/tensorboard/SKILL.md",[1256,1257,1258,1260],{"path":552,"priority":379},{"path":1246,"priority":555},{"path":1259,"priority":555},"references/profiling.md",{"path":1248,"priority":555},{"basePath":1262,"description":1263,"displayName":1264,"installMethods":1265,"rationale":1266,"selectedPaths":1267,"source":385,"sourceLanguage":18,"type":304},"13-mlops/weights-and-biases","Track ML experiments with automatic logging, visualize training in real-time, optimize hyperparameters with sweeps, and manage model registry with W&B - collaborative MLOps platform","weights-and-biases",{"claudeCode":12},"SKILL.md frontmatter at 13-mlops/weights-and-biases/SKILL.md",[1268,1269,1271,1272],{"path":552,"priority":379},{"path":1270,"priority":555},"references/artifacts.md",{"path":1246,"priority":555},{"path":1273,"priority":555},"references/sweeps.md",{"basePath":1275,"description":1276,"displayName":1277,"installMethods":1278,"rationale":1279,"selectedPaths":1280,"source":385,"sourceLanguage":18,"type":304},"14-agents/a-evolve","Provides guidance for automatically evolving and optimizing AI agents across any domain using LLM-driven evolution algorithms. Use when building self-improving agents, optimizing agent prompts and skills against benchmarks, or implementing automated agent evaluation loops.","evolving-ai-agents",{"claudeCode":12},"SKILL.md frontmatter at 14-agents/a-evolve/SKILL.md",[1281,1282,1283,1284,1285,1287,1289,1290,1292],{"path":552,"priority":379},{"path":734,"priority":555},{"path":675,"priority":555},{"path":608,"priority":555},{"path":1286,"priority":555},"references/design-patterns.md",{"path":1288,"priority":555},"references/examples.md",{"path":1138,"priority":555},{"path":1291,"priority":555},"references/releases.md",{"path":737,"priority":555},{"basePath":1294,"description":1295,"displayName":1296,"installMethods":1297,"rationale":1298,"selectedPaths":1299,"source":385,"sourceLanguage":18,"type":304},"14-agents/autogpt","Autonomous AI agent platform for building and deploying continuous agents. Use when creating visual workflow agents, deploying persistent autonomous agents, or building complex multi-step AI automation systems.","autogpt-agents",{"claudeCode":12},"SKILL.md frontmatter at 14-agents/autogpt/SKILL.md",[1300,1301,1302],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1304,"description":1305,"displayName":1306,"installMethods":1307,"rationale":1308,"selectedPaths":1309,"source":385,"sourceLanguage":18,"type":304},"14-agents/crewai","Multi-agent orchestration framework for autonomous AI collaboration. Use when building teams of specialized agents working together on complex tasks, when you need role-based agent collaboration with memory, or for production workflows requiring sequential/hierarchical execution. Built without LangChain dependencies for lean, fast execution.","crewai-multi-agent",{"claudeCode":12},"SKILL.md frontmatter at 14-agents/crewai/SKILL.md",[1310,1311,1313,1315],{"path":552,"priority":379},{"path":1312,"priority":555},"references/flows.md",{"path":1314,"priority":555},"references/tools.md",{"path":709,"priority":555},{"basePath":1317,"description":1318,"displayName":1319,"installMethods":1320,"rationale":1321,"selectedPaths":1322,"source":385,"sourceLanguage":18,"type":304},"14-agents/langchain","Framework for building LLM-powered applications with agents, chains, and RAG. Supports multiple providers (OpenAI, Anthropic, Google), 500+ integrations, ReAct agents, tool calling, memory management, and vector store retrieval. Use for building chatbots, question-answering systems, autonomous agents, or RAG applications. Best for rapid prototyping and production deployments.","langchain",{"claudeCode":12},"SKILL.md frontmatter at 14-agents/langchain/SKILL.md",[1323,1324,1326,1327],{"path":552,"priority":379},{"path":1325,"priority":555},"references/agents.md",{"path":652,"priority":555},{"path":1328,"priority":555},"references/rag.md",{"basePath":1330,"description":1331,"displayName":1332,"installMethods":1333,"rationale":1334,"selectedPaths":1335,"source":385,"sourceLanguage":18,"type":304},"14-agents/llamaindex","Data framework for building LLM applications with RAG. Specializes in document ingestion (300+ connectors), indexing, and querying. Features vector indices, query engines, agents, and multi-modal support. Use for document Q&A, chatbots, knowledge retrieval, or building RAG pipelines. Best for data-centric LLM applications.","llamaindex",{"claudeCode":12},"SKILL.md frontmatter at 14-agents/llamaindex/SKILL.md",[1336,1337,1338,1340],{"path":552,"priority":379},{"path":1325,"priority":555},{"path":1339,"priority":555},"references/data_connectors.md",{"path":1341,"priority":555},"references/query_engines.md",{"basePath":1343,"description":1344,"displayName":1345,"installMethods":1346,"rationale":1347,"selectedPaths":1348,"source":385,"sourceLanguage":18,"type":304},"15-rag/chroma","Open-source embedding database for AI applications. Store embeddings and metadata, perform vector and full-text search, filter by metadata. Simple 4-function API. Scales from notebooks to production clusters. Use for semantic search, RAG applications, or document retrieval. Best for local development and open-source projects.","chroma",{"claudeCode":12},"SKILL.md frontmatter at 15-rag/chroma/SKILL.md",[1349,1350],{"path":552,"priority":379},{"path":652,"priority":555},{"basePath":1352,"description":1353,"displayName":1354,"installMethods":1355,"rationale":1356,"selectedPaths":1357,"source":385,"sourceLanguage":18,"type":304},"15-rag/faiss","Facebook's library for efficient similarity search and clustering of dense vectors. Supports billions of vectors, GPU acceleration, and various index types (Flat, IVF, HNSW). Use for fast k-NN search, large-scale vector retrieval, or when you need pure similarity search without metadata. Best for high-performance applications.","faiss",{"claudeCode":12},"SKILL.md frontmatter at 15-rag/faiss/SKILL.md",[1358,1359],{"path":552,"priority":379},{"path":1360,"priority":555},"references/index_types.md",{"basePath":1362,"description":1363,"displayName":1364,"installMethods":1365,"rationale":1366,"selectedPaths":1367,"source":385,"sourceLanguage":18,"type":304},"15-rag/pinecone","Managed vector database for production AI applications. Fully managed, auto-scaling, with hybrid search (dense + sparse), metadata filtering, and namespaces. Low latency (\u003C100ms p95). Use for production RAG, recommendation systems, or semantic search at scale. Best for serverless, managed infrastructure.","pinecone",{"claudeCode":12},"SKILL.md frontmatter at 15-rag/pinecone/SKILL.md",[1368,1369],{"path":552,"priority":379},{"path":1193,"priority":555},{"basePath":1371,"description":1372,"displayName":1373,"installMethods":1374,"rationale":1375,"selectedPaths":1376,"source":385,"sourceLanguage":18,"type":304},"15-rag/qdrant","High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.","qdrant-vector-search",{"claudeCode":12},"SKILL.md frontmatter at 15-rag/qdrant/SKILL.md",[1377,1378,1379],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1381,"description":1382,"displayName":1383,"installMethods":1384,"rationale":1385,"selectedPaths":1386,"source":385,"sourceLanguage":18,"type":304},"15-rag/sentence-transformers","Framework for state-of-the-art sentence, text, and image embeddings. Provides 5000+ pre-trained models for semantic similarity, clustering, and retrieval. Supports multilingual, domain-specific, and multimodal models. Use for generating embeddings for RAG, semantic search, or similarity tasks. Best for production embedding generation.","sentence-transformers",{"claudeCode":12},"SKILL.md frontmatter at 15-rag/sentence-transformers/SKILL.md",[1387,1388],{"path":552,"priority":379},{"path":1389,"priority":555},"references/models.md",{"basePath":1391,"description":1392,"displayName":1393,"installMethods":1394,"rationale":1395,"selectedPaths":1396,"source":385,"sourceLanguage":18,"type":304},"16-prompt-engineering/dspy","Build complex AI systems with declarative programming, optimize prompts automatically, create modular RAG systems and agents with DSPy - Stanford NLP's framework for systematic LM programming","dspy",{"claudeCode":12},"SKILL.md frontmatter at 16-prompt-engineering/dspy/SKILL.md",[1397,1398,1399,1401],{"path":552,"priority":379},{"path":1288,"priority":555},{"path":1400,"priority":555},"references/modules.md",{"path":1123,"priority":555},{"basePath":1403,"description":1404,"displayName":1405,"installMethods":1406,"rationale":1407,"selectedPaths":1408,"source":385,"sourceLanguage":18,"type":304},"16-prompt-engineering/guidance","Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance - Microsoft Research's constrained generation framework","guidance",{"claudeCode":12},"SKILL.md frontmatter at 16-prompt-engineering/guidance/SKILL.md",[1409,1410,1412,1414],{"path":552,"priority":379},{"path":1411,"priority":555},"references/backends.md",{"path":1413,"priority":555},"references/constraints.md",{"path":1288,"priority":555},{"basePath":1416,"description":1417,"displayName":1418,"installMethods":1419,"rationale":1420,"selectedPaths":1421,"source":385,"sourceLanguage":18,"type":304},"16-prompt-engineering/instructor","Extract structured data from LLM responses with Pydantic validation, retry failed extractions automatically, parse complex JSON with type safety, and stream partial results with Instructor - battle-tested structured output library","instructor",{"claudeCode":12},"SKILL.md frontmatter at 16-prompt-engineering/instructor/SKILL.md",[1422,1423,1424,1426],{"path":552,"priority":379},{"path":1288,"priority":555},{"path":1425,"priority":555},"references/providers.md",{"path":1427,"priority":555},"references/validation.md",{"basePath":1429,"description":1430,"displayName":1431,"installMethods":1432,"rationale":1433,"selectedPaths":1434,"source":385,"sourceLanguage":18,"type":304},"16-prompt-engineering/outlines","Guarantee valid JSON/XML/code structure during generation, use Pydantic models for type-safe outputs, support local models (Transformers, vLLM), and maximize inference speed with Outlines - dottxt.ai's structured generation library","outlines",{"claudeCode":12},"SKILL.md frontmatter at 16-prompt-engineering/outlines/SKILL.md",[1435,1436,1437,1438],{"path":552,"priority":379},{"path":1411,"priority":555},{"path":1288,"priority":555},{"path":1439,"priority":555},"references/json_generation.md",{"basePath":1441,"description":1442,"displayName":1443,"installMethods":1444,"rationale":1445,"selectedPaths":1446,"source":385,"sourceLanguage":18,"type":304},"17-observability/langsmith","LLM observability platform for tracing, evaluation, and monitoring. Use when debugging LLM applications, evaluating model outputs against datasets, monitoring production systems, or building systematic testing pipelines for AI applications.","langsmith-observability",{"claudeCode":12},"SKILL.md frontmatter at 17-observability/langsmith/SKILL.md",[1447,1448,1449],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1451,"description":1452,"displayName":1453,"installMethods":1454,"rationale":1455,"selectedPaths":1456,"source":385,"sourceLanguage":18,"type":304},"17-observability/phoenix","Open-source AI observability platform for LLM tracing, evaluation, and monitoring. Use when debugging LLM applications with detailed traces, running evaluations on datasets, or monitoring production AI systems with real-time insights.","phoenix-observability",{"claudeCode":12},"SKILL.md frontmatter at 17-observability/phoenix/SKILL.md",[1457,1458,1459],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1461,"description":1462,"displayName":1463,"installMethods":1464,"rationale":1465,"selectedPaths":1466,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/audiocraft","PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.","audiocraft-audio-generation",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/audiocraft/SKILL.md",[1467,1468,1469],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1471,"description":1472,"displayName":1473,"installMethods":1474,"rationale":1475,"selectedPaths":1476,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/blip-2","Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text retrieval, or multimodal chat with state-of-the-art zero-shot performance.","blip-2-vision-language",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/blip-2/SKILL.md",[1477,1478,1479],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1481,"description":1482,"displayName":1483,"installMethods":1484,"rationale":1485,"selectedPaths":1486,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/clip","OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search, content moderation, or vision-language tasks without fine-tuning. Best for general-purpose image understanding.","clip",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/clip/SKILL.md",[1487,1488],{"path":552,"priority":379},{"path":1489,"priority":555},"references/applications.md",{"basePath":1491,"description":1492,"displayName":1493,"installMethods":1494,"rationale":1495,"selectedPaths":1496,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/cosmos-policy","Evaluates NVIDIA Cosmos Policy on LIBERO and RoboCasa simulation environments. Use when setting up cosmos-policy for robot manipulation evaluation, running headless GPU evaluations with EGL rendering, or profiling inference latency on cluster or local GPU machines.","evaluating-cosmos-policy",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/cosmos-policy/SKILL.md",[1497,1498,1500],{"path":552,"priority":379},{"path":1499,"priority":555},"references/libero-commands.md",{"path":1501,"priority":555},"references/robocasa-commands.md",{"basePath":1503,"description":1504,"displayName":1505,"installMethods":1506,"rationale":1507,"selectedPaths":1508,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/llava","Large Language and Vision Assistant. Enables visual instruction tuning and image-based conversations. Combines CLIP vision encoder with Vicuna/LLaMA language models. Supports multi-turn image chat, visual question answering, and instruction following. Use for vision-language chatbots or image understanding tasks. Best for conversational image analysis.","llava",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/llava/SKILL.md",[1509,1510],{"path":552,"priority":379},{"path":612,"priority":555},{"basePath":1512,"description":1513,"displayName":1514,"installMethods":1515,"rationale":1516,"selectedPaths":1517,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/openpi","Fine-tune and serve Physical Intelligence OpenPI models (pi0, pi0-fast, pi0.5) using JAX or PyTorch backends for robot policy inference across ALOHA, DROID, and LIBERO environments. Use when adapting pi0 models to custom datasets, converting JAX checkpoints to PyTorch, running policy inference servers, or debugging norm stats and GPU memory issues.","fine-tuning-serving-openpi",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/openpi/SKILL.md",[1518,1519,1521,1523,1525,1527],{"path":552,"priority":379},{"path":1520,"priority":555},"references/checkpoints-and-env-map.md",{"path":1522,"priority":555},"references/config-recipes.md",{"path":1524,"priority":555},"references/pytorch-gotchas.md",{"path":1526,"priority":555},"references/remote-client-pattern.md",{"path":1528,"priority":555},"references/training-debugging.md",{"basePath":1530,"description":1531,"displayName":1532,"installMethods":1533,"rationale":1534,"selectedPaths":1535,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/openvla-oft","Fine-tunes and evaluates OpenVLA-OFT and OpenVLA-OFT+ policies for robot action generation with continuous action heads, LoRA adaptation, and FiLM conditioning on LIBERO simulation and ALOHA real-world setups. Use when reproducing OpenVLA-OFT paper results, training custom VLA action heads (L1 or diffusion), deploying server-client inference for ALOHA, or debugging normalization, LoRA merge, and cross-GPU issues.","fine-tuning-openvla-oft",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/openvla-oft/SKILL.md",[1536,1537,1539,1541,1543],{"path":552,"priority":379},{"path":1538,"priority":555},"references/aloha-workflow.md",{"path":1540,"priority":555},"references/config-troubleshooting.md",{"path":1542,"priority":555},"references/libero-workflow.md",{"path":1544,"priority":555},"references/paper-and-checkpoints.md",{"basePath":1546,"description":1547,"displayName":1548,"installMethods":1549,"rationale":1550,"selectedPaths":1551,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/segment-anything","Foundation model for image segmentation with zero-shot transfer. Use when you need to segment any object in images using points, boxes, or masks as prompts, or automatically generate all object masks in an image.","segment-anything-model",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/segment-anything/SKILL.md",[1552,1553,1554],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1556,"description":1557,"displayName":1558,"installMethods":1559,"rationale":1560,"selectedPaths":1561,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/stable-diffusion","State-of-the-art text-to-image generation with Stable Diffusion models via HuggingFace Diffusers. Use when generating images from text prompts, performing image-to-image translation, inpainting, or building custom diffusion pipelines.","stable-diffusion-image-generation",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/stable-diffusion/SKILL.md",[1562,1563,1564],{"path":552,"priority":379},{"path":707,"priority":555},{"path":709,"priority":555},{"basePath":1566,"description":1567,"displayName":1568,"installMethods":1569,"rationale":1570,"selectedPaths":1571,"source":385,"sourceLanguage":18,"type":304},"18-multimodal/whisper","OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.","whisper",{"claudeCode":12},"SKILL.md frontmatter at 18-multimodal/whisper/SKILL.md",[1572,1573],{"path":552,"priority":379},{"path":1574,"priority":555},"references/languages.md",{"basePath":1576,"description":1577,"displayName":1578,"installMethods":1579,"rationale":1580,"selectedPaths":1581,"source":385,"sourceLanguage":18,"type":304},"19-emerging-techniques/knowledge-distillation","Compress large language models using knowledge distillation from teacher to student models. Use when deploying smaller models with retained performance, transferring GPT-4 capabilities to open-source models, or reducing inference costs. Covers temperature scaling, soft targets, reverse KLD, logit distillation, and MiniLLM training strategies.","knowledge-distillation",{"claudeCode":12},"SKILL.md frontmatter at 19-emerging-techniques/knowledge-distillation/SKILL.md",[1582,1583],{"path":552,"priority":379},{"path":1584,"priority":555},"references/minillm.md",{"basePath":1586,"description":1587,"displayName":1588,"installMethods":1589,"rationale":1590,"selectedPaths":1591,"source":385,"sourceLanguage":18,"type":304},"19-emerging-techniques/long-context","Extend context windows of transformer models using RoPE, YaRN, ALiBi, and position interpolation techniques. Use when processing long documents (32k-128k+ tokens), extending pre-trained models beyond original context limits, or implementing efficient positional encodings. Covers rotary embeddings, attention biases, interpolation methods, and extrapolation strategies for LLMs.","long-context",{"claudeCode":12},"SKILL.md frontmatter at 19-emerging-techniques/long-context/SKILL.md",[1592,1593,1595,1597],{"path":552,"priority":379},{"path":1594,"priority":555},"references/extension_methods.md",{"path":1596,"priority":555},"references/fine_tuning.md",{"path":1598,"priority":555},"references/rope.md",{"basePath":1600,"description":1601,"displayName":1602,"installMethods":1603,"rationale":1604,"selectedPaths":1605,"source":385,"sourceLanguage":18,"type":304},"19-emerging-techniques/model-merging","Merge multiple fine-tuned models using mergekit to combine capabilities without retraining. Use when creating specialized models by blending domain-specific expertise (math + coding + chat), improving performance beyond single models, or experimenting rapidly with model variants. Covers SLERP, TIES-Merging, DARE, Task Arithmetic, linear merging, and production deployment strategies.","model-merging",{"claudeCode":12},"SKILL.md frontmatter at 19-emerging-techniques/model-merging/SKILL.md",[1606,1607,1609,1610],{"path":552,"priority":379},{"path":1608,"priority":555},"references/evaluation.md",{"path":1288,"priority":555},{"path":1611,"priority":555},"references/methods.md",{"basePath":1613,"description":1614,"displayName":1615,"installMethods":1616,"rationale":1617,"selectedPaths":1618,"source":385,"sourceLanguage":18,"type":304},"19-emerging-techniques/model-pruning","Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.","model-pruning",{"claudeCode":12},"SKILL.md frontmatter at 19-emerging-techniques/model-pruning/SKILL.md",[1619,1620],{"path":552,"priority":379},{"path":1621,"priority":555},"references/wanda.md",{"basePath":1623,"description":1624,"displayName":1625,"installMethods":1626,"rationale":1627,"selectedPaths":1628,"source":385,"sourceLanguage":18,"type":304},"19-emerging-techniques/moe-training","Train Mixture of Experts (MoE) models using DeepSpeed or HuggingFace. Use when training large-scale models with limited compute (5× cost reduction vs dense models), implementing sparse architectures like Mixtral 8x7B or DeepSeek-V3, or scaling model capacity without proportional compute increase. Covers MoE architectures, routing mechanisms, load balancing, expert parallelism, and inference optimization.","moe-training",{"claudeCode":12},"SKILL.md frontmatter at 19-emerging-techniques/moe-training/SKILL.md",[1629,1630,1632,1634],{"path":552,"priority":379},{"path":1631,"priority":555},"references/architectures.md",{"path":1633,"priority":555},"references/inference.md",{"path":612,"priority":555},{"basePath":1636,"description":1637,"displayName":1638,"installMethods":1639,"rationale":1640,"selectedPaths":1641,"source":385,"sourceLanguage":18,"type":304},"19-emerging-techniques/speculative-decoding","Accelerate LLM inference using speculative decoding, Medusa multiple heads, and lookahead decoding techniques. Use when optimizing inference speed (1.5-3.6× speedup), reducing latency for real-time applications, or deploying models with limited compute. Covers draft models, tree-based attention, Jacobi iteration, parallel token generation, and production deployment strategies.","speculative-decoding",{"claudeCode":12},"SKILL.md frontmatter at 19-emerging-techniques/speculative-decoding/SKILL.md",[1642,1643,1645],{"path":552,"priority":379},{"path":1644,"priority":555},"references/lookahead.md",{"path":1646,"priority":555},"references/medusa.md",{"basePath":1648,"description":1649,"displayName":1650,"installMethods":1651,"rationale":1652,"selectedPaths":1653,"source":385,"sourceLanguage":18,"type":304},"20-ml-paper-writing/academic-plotting","Generates publication-quality figures for ML papers from research context. Given a paper section or description, extracts system components and relationships to generate architecture diagrams via Gemini. Given experiment results or data, auto-selects chart type and generates data-driven figures via matplotlib/seaborn. Use when creating any figure for a conference paper.","academic-plotting",{"claudeCode":12},"SKILL.md frontmatter at 20-ml-paper-writing/academic-plotting/SKILL.md",[1654,1655,1657,1659],{"path":552,"priority":379},{"path":1656,"priority":555},"references/data-visualization.md",{"path":1658,"priority":555},"references/diagram-generation.md",{"path":1660,"priority":555},"references/style-guide.md",{"basePath":300,"description":10,"displayName":303,"installMethods":1662,"rationale":1663,"selectedPaths":1664,"source":385,"sourceLanguage":18,"type":304},{"claudeCode":12},"SKILL.md frontmatter at 20-ml-paper-writing/ml-paper-writing/SKILL.md",[1665,1666,1668,1670,1672,1674,1676,1678,1680,1682,1684,1686,1688,1690,1692,1694,1696,1698,1700,1702,1704,1706,1708,1710,1712,1714,1716,1718,1720,1722,1724,1726,1728,1730,1732,1734,1736,1738,1740,1742,1744,1746,1748,1750,1752,1754,1756,1758,1760,1762,1764],{"path":552,"priority":379},{"path":1667,"priority":555},"references/checklists.md",{"path":1669,"priority":555},"references/citation-workflow.md",{"path":1671,"priority":555},"references/reviewer-guidelines.md",{"path":1673,"priority":555},"references/sources.md",{"path":1675,"priority":555},"references/writing-guide.md",{"path":1677,"priority":562},"templates/README.md",{"path":1679,"priority":562},"templates/aaai2026/README.md",{"path":1681,"priority":562},"templates/aaai2026/aaai2026-unified-supp.tex",{"path":1683,"priority":562},"templates/aaai2026/aaai2026-unified-template.tex",{"path":1685,"priority":562},"templates/aaai2026/aaai2026.bib",{"path":1687,"priority":562},"templates/aaai2026/aaai2026.bst",{"path":1689,"priority":562},"templates/aaai2026/aaai2026.sty",{"path":1691,"priority":562},"templates/acl/README.md",{"path":1693,"priority":562},"templates/acl/acl.sty",{"path":1695,"priority":562},"templates/acl/acl_latex.tex",{"path":1697,"priority":562},"templates/acl/acl_lualatex.tex",{"path":1699,"priority":562},"templates/acl/acl_natbib.bst",{"path":1701,"priority":562},"templates/acl/anthology.bib.txt",{"path":1703,"priority":562},"templates/acl/custom.bib",{"path":1705,"priority":562},"templates/acl/formatting.md",{"path":1707,"priority":562},"templates/colm2025/README.md",{"path":1709,"priority":562},"templates/colm2025/colm2025_conference.bib",{"path":1711,"priority":562},"templates/colm2025/colm2025_conference.bst",{"path":1713,"priority":562},"templates/colm2025/colm2025_conference.pdf",{"path":1715,"priority":562},"templates/colm2025/colm2025_conference.sty",{"path":1717,"priority":562},"templates/colm2025/colm2025_conference.tex",{"path":1719,"priority":562},"templates/colm2025/fancyhdr.sty",{"path":1721,"priority":562},"templates/colm2025/math_commands.tex",{"path":1723,"priority":562},"templates/colm2025/natbib.sty",{"path":1725,"priority":562},"templates/iclr2026/fancyhdr.sty",{"path":1727,"priority":562},"templates/iclr2026/iclr2026_conference.bib",{"path":1729,"priority":562},"templates/iclr2026/iclr2026_conference.bst",{"path":1731,"priority":562},"templates/iclr2026/iclr2026_conference.pdf",{"path":1733,"priority":562},"templates/iclr2026/iclr2026_conference.sty",{"path":1735,"priority":562},"templates/iclr2026/iclr2026_conference.tex",{"path":1737,"priority":562},"templates/iclr2026/math_commands.tex",{"path":1739,"priority":562},"templates/iclr2026/natbib.sty",{"path":1741,"priority":562},"templates/icml2026/algorithm.sty",{"path":1743,"priority":562},"templates/icml2026/algorithmic.sty",{"path":1745,"priority":562},"templates/icml2026/example_paper.bib",{"path":1747,"priority":562},"templates/icml2026/example_paper.pdf",{"path":1749,"priority":562},"templates/icml2026/example_paper.tex",{"path":1751,"priority":562},"templates/icml2026/fancyhdr.sty",{"path":1753,"priority":562},"templates/icml2026/icml2026.bst",{"path":1755,"priority":562},"templates/icml2026/icml2026.sty",{"path":1757,"priority":562},"templates/icml2026/icml_numpapers.pdf",{"path":1759,"priority":562},"templates/neurips2025/Makefile",{"path":1761,"priority":562},"templates/neurips2025/extra_pkgs.tex",{"path":1763,"priority":562},"templates/neurips2025/main.tex",{"path":1765,"priority":562},"templates/neurips2025/neurips.sty",{"basePath":1767,"description":1768,"displayName":1769,"installMethods":1770,"rationale":1771,"selectedPaths":1772,"source":385,"sourceLanguage":18,"type":304},"20-ml-paper-writing/presenting-conference-talks","Generates conference presentation slides (Beamer LaTeX PDF and editable PPTX) from a compiled paper with speaker notes and talk script. Use when preparing oral talks, spotlight presentations, or invited talks for ML and systems conferences.","presenting-conference-talks",{"claudeCode":12},"SKILL.md frontmatter at 20-ml-paper-writing/presenting-conference-talks/SKILL.md",[1773,1774],{"path":552,"priority":379},{"path":1775,"priority":555},"references/slide-templates.md",{"basePath":1777,"description":1778,"displayName":1779,"installMethods":1780,"rationale":1781,"selectedPaths":1782,"source":385,"sourceLanguage":18,"type":304},"20-ml-paper-writing/systems-paper-writing","Comprehensive guide for writing systems papers targeting OSDI, SOSP, ASPLOS, NSDI, and EuroSys. Provides paragraph-level structural blueprints, writing patterns, venue-specific checklists, reviewer guidelines, LaTeX templates, and conference deadlines. Use this skill for all systems conference paper writing.","systems-paper-writing",{"claudeCode":12},"SKILL.md frontmatter at 20-ml-paper-writing/systems-paper-writing/SKILL.md",[1783,1784,1786,1787,1789,1791,1793,1795,1797,1799,1801,1803,1805,1807,1809,1811],{"path":552,"priority":379},{"path":1785,"priority":555},"references/checklist.md",{"path":1671,"priority":555},{"path":1788,"priority":555},"references/section-blueprints.md",{"path":1790,"priority":555},"references/systems-conferences.md",{"path":1792,"priority":555},"references/writing-patterns.md",{"path":1794,"priority":562},"templates/asplos2027/main.tex",{"path":1796,"priority":562},"templates/asplos2027/references.bib",{"path":1798,"priority":562},"templates/nsdi2027/main.tex",{"path":1800,"priority":562},"templates/nsdi2027/references.bib",{"path":1802,"priority":562},"templates/nsdi2027/usenix-2020-09.sty",{"path":1804,"priority":562},"templates/osdi2026/main.tex",{"path":1806,"priority":562},"templates/osdi2026/references.bib",{"path":1808,"priority":562},"templates/osdi2026/usenix-2020-09.sty",{"path":1810,"priority":562},"templates/sosp2026/main.tex",{"path":1812,"priority":562},"templates/sosp2026/references.bib",{"basePath":1814,"description":1815,"displayName":1816,"installMethods":1817,"rationale":1818,"selectedPaths":1819,"source":385,"sourceLanguage":18,"type":304},"21-research-ideation/brainstorming-research-ideas","Guides researchers through structured ideation frameworks to discover high-impact research directions. Use when exploring new problem spaces, pivoting between projects, or seeking novel angles on existing work.","brainstorming-research-ideas",{"claudeCode":12},"SKILL.md frontmatter at 21-research-ideation/brainstorming-research-ideas/SKILL.md",[1820],{"path":552,"priority":379},{"basePath":1822,"description":1823,"displayName":1824,"installMethods":1825,"rationale":1826,"selectedPaths":1827,"source":385,"sourceLanguage":18,"type":304},"21-research-ideation/creative-thinking-for-research","Applies cognitive science frameworks for creative thinking to CS and AI research ideation. Use when seeking genuinely novel research directions by leveraging combinatorial creativity, analogical reasoning, constraint manipulation, and other empirically grounded creative strategies.","creative-thinking-for-research",{"claudeCode":12},"SKILL.md frontmatter at 21-research-ideation/creative-thinking-for-research/SKILL.md",[1828],{"path":552,"priority":379},{"basePath":1830,"description":1831,"displayName":1832,"installMethods":1833,"rationale":1834,"selectedPaths":1835,"source":385,"sourceLanguage":18,"type":304},"22-agent-native-research-artifact/compiler","Compiles any research input — PDF papers, GitHub repositories, experiment logs, code directories, or raw notes — into a complete Agent-Native Research Artifact (ARA) with cognitive layer (claims, concepts, heuristics), physical layer (configs, code stubs), exploration graph, and grounded evidence. Use when ingesting a paper or codebase into a structured, machine-executable knowledge package, building an ARA from scratch, or converting research outputs into a falsifiable, agent-traversable form.","ara-compiler",{"claudeCode":12},"SKILL.md frontmatter at 22-agent-native-research-artifact/compiler/SKILL.md",[1836,1837,1839,1841],{"path":552,"priority":379},{"path":1838,"priority":555},"references/ara-schema.md",{"path":1840,"priority":555},"references/exploration-tree-spec.md",{"path":1842,"priority":555},"references/validation-checklist.md",{"basePath":1844,"description":1845,"displayName":1846,"installMethods":1847,"rationale":1848,"selectedPaths":1849,"source":385,"sourceLanguage":18,"type":304},"22-agent-native-research-artifact/research-manager","Records research provenance as a post-task epilogue, scanning conversation history at the end of a coding or research session to extract decisions, experiments, dead ends, claims, heuristics, and pivots, and writing them into the ara/ directory with user-vs-AI provenance tags. Use as a session epilogue — never during execution — to maintain a faithful, auditable trace of how a research project actually evolved.","ara-research-manager",{"claudeCode":12},"SKILL.md frontmatter at 22-agent-native-research-artifact/research-manager/SKILL.md",[1850,1851,1853,1855],{"path":552,"priority":379},{"path":1852,"priority":555},"references/event-taxonomy.md",{"path":1854,"priority":555},"references/provenance-tags.md",{"path":1856,"priority":555},"references/session-protocol.md",{"basePath":1858,"description":1859,"displayName":1860,"installMethods":1861,"rationale":1862,"selectedPaths":1863,"source":385,"sourceLanguage":18,"type":304},"22-agent-native-research-artifact/rigor-reviewer","Performs ARA Seal Level 2 semantic epistemic review on Agent-Native Research Artifacts, scoring six dimensions (evidence relevance, falsifiability, scope calibration, argument coherence, exploration integrity, methodological rigor) and producing a constructive, severity-ranked report with a Strong Accept-to-Reject recommendation. Use after Level 1 structural validation passes, when an ARA needs an objective epistemic critique before publication or release.","ara-rigor-reviewer",{"claudeCode":12},"SKILL.md frontmatter at 22-agent-native-research-artifact/rigor-reviewer/SKILL.md",[1864,1865],{"path":552,"priority":379},{"path":1866,"priority":555},"references/review-dimensions.md",{"basePath":1868,"description":1869,"displayName":1870,"installMethods":1871,"license":294,"rationale":1872,"selectedPaths":1873,"source":385,"sourceLanguage":18,"type":1881},"packages/ai-research-skills","Install AI research engineering skills to your coding agents (Claude Code, OpenCode, Cursor, Gemini CLI, Hermes Agent, and more)","@orchestra-research/ai-research-skills",{"npm":1870},"cli ecosystem detected at packages/ai-research-skills",[1874,1876,1877,1879],{"path":1875,"priority":379},"package.json",{"path":381,"priority":379},{"path":1878,"priority":555},"bin/cli.js",{"path":1880,"priority":562},"src/index.js","cli",{"sources":1883},[1884],"manual",{"npmPackage":339},{"closedIssues90d":286,"description":1887,"forks":287,"homepage":1888,"license":294,"openIssues90d":289,"pushedAt":290,"readmeSize":284,"stars":291,"topics":1889},"Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepower. Maintained by Orchestra Research.","http://orchestra-research.com",[1890,331,1891,1892,1893,1894,1895,1896,1897,1898,1899,1900,1901,1902],"ai","claude","claude-code","claude-skills","codex","gemini","gpt-5","grpo","huggingface","machine-leanring","megatron","skills","vllm",{"downloads":8},{"classifiedAt":1905,"discoverAt":1906,"extractAt":1907,"githubAt":1907,"npmAt":1908,"updatedAt":1905},1778695115942,1778695107142,1778695112108,1778695113836,[253,261,256,260,259,255,258,254,257],{"evaluatedAt":298,"extractAt":348,"updatedAt":298},[],[1913,1938,1966,1990,2012,2041],{"_creationTime":1914,"_id":1915,"community":1916,"display":1917,"identity":1922,"providers":1926,"relations":1932,"tags":1934,"workflow":1935},1778685991755.7175,"k17f8rmpn5qwjjz3a7kmhkhbxn86m0bh",{"reviewCount":8},{"description":1918,"installMethods":1919,"name":13,"sourceUrl":1921},"Write publication-ready ML/AI papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM. Use when drafting papers from research repos, structuring arguments, verifying citations, or preparing camera-ready submissions. Includes LaTeX templates, reviewer guidelines, and citation verification workflows.",{"claudeCode":1920},"davila7/claude-code-templates","https://github.com/davila7/claude-code-templates",{"basePath":1923,"githubOwner":1924,"githubRepo":1925,"locale":18,"slug":303,"type":304},"cli-tool/components/skills/ai-research/ml-paper-writing","davila7","claude-code-templates",{"evaluate":1927,"extract":1931},{"promptVersionExtension":246,"promptVersionScoring":247,"score":1928,"tags":1929,"targetMarket":262,"tier":263},95,[253,258,259,260,255,1930,257,254],"citation",{"commitSha":337,"license":294},{"repoId":1933},"kd71fzn4s7r0269fkw47wt670n86ndz0",[253,1930,260,259,255,258,254,257],{"evaluatedAt":1936,"extractAt":1937,"updatedAt":1936},1778687755118,1778685991755,{"_creationTime":1939,"_id":1940,"community":1941,"display":1942,"identity":1948,"providers":1952,"relations":1960,"tags":1962,"workflow":1963},1778675145461.8848,"k17721ymkyebk1ppx80e1wqfp186n5s7",{"reviewCount":8},{"description":1943,"installMethods":1944,"name":1946,"sourceUrl":1947},"Part of the AlterLab Academic Skills suite. Structured hypothesis formulation from observations. Use when you have experimental observations or data and need to formulate testable hypotheses with predictions, propose mechanisms, and design experiments to test them. Follows scientific method framework. For open-ended ideation use scientific-brainstorming; for automated LLM-driven hypothesis testing on datasets use hypogenic.",{"claudeCode":1945},"AlterLab-IEU/AlterLab-Academic-Skills","alterlab-hypothesis-gen","https://github.com/AlterLab-IEU/AlterLab-Academic-Skills",{"basePath":1949,"githubOwner":1950,"githubRepo":1951,"locale":18,"slug":1946,"type":304},"skills/writing-tools/alterlab-hypothesis-gen","AlterLab-IEU","AlterLab-Academic-Skills",{"evaluate":1953,"extract":1959},{"promptVersionExtension":246,"promptVersionScoring":247,"score":1928,"tags":1954,"targetMarket":262,"tier":1958},[1955,1956,1957,253,255],"scientific-research","hypothesis-generation","experimental-design","community",{"commitSha":337},{"repoId":1961},"kd7fqvj70pvyn4r3q9kctpnd7d86mfqd",[253,1957,1956,255,1955],{"evaluatedAt":1964,"extractAt":1965,"updatedAt":1964},1778682894119,1778675145461,{"_creationTime":1967,"_id":1968,"community":1969,"display":1970,"identity":1974,"providers":1977,"relations":1986,"tags":1987,"workflow":1988},1778675145461.854,"k172kvmxv3wy036913pax9mmv186n3nb",{"reviewCount":8},{"description":1971,"installMethods":1972,"name":1973,"sourceUrl":1947},"Part of the AlterLab Academic Skills suite for faculty and researchers. Academic paper writing skill with 12-agent pipeline. v2.4: LaTeX output formatting hardening — mandatory apa7 class, text justification fix, table column width formula, bilingual abstract centering, standardized font stack, PDF must compile from LaTeX. Supports IMRaD, literature review, theoretical, case study, policy brief, and conference paper structures. APA 7.0 (default), Chicago, MLA, IEEE, Vancouver citation formats. Bilingual abstracts (zh-TW + EN). Multi-format output (LaTeX, DOCX, PDF, Markdown). Triggers on: write paper, academic paper, paper outline, write abstract, revise paper, check citations, convert to LaTeX, guide my paper, parse reviews, revision roadmap, 寫論文, 學術論文, 論文大綱, 寫摘要, 修改論文, 檢查引用, 引導我寫論文, 帶我規劃論文, 逐章規劃, 論文架構, 審查意見, 修訂路線圖.",{"claudeCode":1945},"AlterLab Paper Writer",{"basePath":1975,"githubOwner":1950,"githubRepo":1951,"locale":18,"slug":1976,"type":304},"skills/core/alterlab-paper-writer","alterlab-paper-writer",{"evaluate":1978,"extract":1985},{"promptVersionExtension":246,"promptVersionScoring":247,"score":1928,"tags":1979,"targetMarket":262,"tier":1958},[253,255,1980,1981,1982,1983,1984],"apa-7","citation-management","research-tools","writing-assistance","bilingual",{"commitSha":337,"license":294},{"repoId":1961},[253,1980,1984,1981,255,1982,1983],{"evaluatedAt":1989,"extractAt":1965,"updatedAt":1989},1778676316330,{"_creationTime":1991,"_id":1992,"community":1993,"display":1994,"identity":1998,"providers":2000,"relations":2008,"tags":2009,"workflow":2010},1778675145461.887,"k173bvxadrv7gcx73cbt9tw4zs86n9wt",{"reviewCount":8},{"description":1995,"installMethods":1996,"name":1997,"sourceUrl":1947},"Part of the AlterLab Academic Skills suite. Core skill for the deep research and writing tool. Write scientific manuscripts in full paragraphs (never bullet points). Use two-stage process with (1) section outlines with key points using research-lookup then (2) convert to flowing prose. IMRAD structure, citations (APA/AMA/Vancouver), figures/tables, reporting guidelines (CONSORT/STROBE/PRISMA), for research papers and journal submissions.",{"claudeCode":1945},"alterlab-scientific-writing",{"basePath":1999,"githubOwner":1950,"githubRepo":1951,"locale":18,"slug":1997,"type":304},"skills/writing-tools/alterlab-scientific-writing",{"evaluate":2001,"extract":2007},{"promptVersionExtension":246,"promptVersionScoring":247,"score":2002,"tags":2003,"targetMarket":262,"tier":263},100,[2004,2005,2006,255,1982],"scientific-writing","manuscript-preparation","academic-publishing",{"commitSha":337},{"repoId":1961},[2006,255,2005,1982,2004],{"evaluatedAt":2011,"extractAt":1965,"updatedAt":2011},1778683103843,{"_creationTime":2013,"_id":2014,"community":2015,"display":2016,"identity":2022,"providers":2026,"relations":2034,"tags":2037,"workflow":2038},1778699234184.6135,"k175frmf44tn80mcd6gvw1c1th86ngq9",{"reviewCount":8},{"description":2017,"installMethods":2018,"name":2020,"sourceUrl":2021},"Invoke parallel document-specialist agents for external web searches and documentation lookup",{"claudeCode":2019},"Yeachan-Heo/oh-my-claudecode","external-context","https://github.com/Yeachan-Heo/oh-my-claudecode",{"basePath":2023,"githubOwner":2024,"githubRepo":2025,"locale":18,"slug":2020,"type":304},"skills/external-context","Yeachan-Heo","oh-my-claudecode",{"evaluate":2027,"extract":2033},{"promptVersionExtension":246,"promptVersionScoring":247,"score":2002,"tags":2028,"targetMarket":262,"tier":263},[2029,2030,257,2031,2032],"search","documentation","information-retrieval","multi-agent",{"commitSha":337},{"parentExtensionId":2035,"repoId":2036},"k17brg5egdw1jbncj1j4wfv3fh86n639","kd74zv63fryf9prygtq7gf4es986n22y",[2030,2031,2032,257,2029],{"evaluatedAt":2039,"extractAt":2040,"updatedAt":2039},1778699449790,1778699234184,{"_creationTime":2042,"_id":2043,"community":2044,"display":2045,"identity":2048,"providers":2050,"relations":2057,"tags":2058,"workflow":2059},1778695116697.199,"k17cex5hqwje7qgvts5evct1d186nd4m",{"reviewCount":8},{"description":1845,"installMethods":2046,"name":2047,"sourceUrl":14},{"claudeCode":12},"ARA Research Manager",{"basePath":1844,"githubOwner":301,"githubRepo":302,"locale":18,"slug":2049,"type":304},"research-manager",{"evaluate":2051,"extract":2056},{"promptVersionExtension":246,"promptVersionScoring":247,"score":2002,"tags":2052,"targetMarket":262,"tier":263},[257,353,2053,2054,2055],"knowledge-management","session-logging","ara",{"commitSha":337,"license":294},{"parentExtensionId":307,"repoId":344},[2055,2053,353,257,2054],{"evaluatedAt":2060,"extractAt":348,"updatedAt":2060},1778697541177]