LLM Application Development
Plugin Verified ActiveLLM application development with LangGraph, RAG systems, vector search, and AI agent architectures for Claude 4.6 and GPT-5.4
Enables developers to build production-ready LLM applications, advanced RAG systems, and intelligent agents with modern AI patterns.
Features
- LangGraph StateGraph workflows
- Production RAG systems with hybrid search
- AI agent architectures with memory and tool use
- Vector search and embedding strategies
- Advanced prompt engineering techniques
Use Cases
- Building production-grade LLM applications
- Implementing advanced RAG systems
- Developing intelligent AI agents
- Optimizing prompts for LLM performance
Non-Goals
- Providing a full-fledged IDE for LLM development
- Replacing core LLM model providers
- Managing cloud infrastructure deployments
Workflow
- Select embedding model and vector database
- Design chunking and retrieval strategy
- Implement RAG pipeline with LangGraph
- Integrate LLM and tools for agent
- Test and optimize prompt engineering
- Deploy and monitor the application
Practices
- Prompt Engineering
- Agent Design
- RAG Implementation
- Vector Search Optimization
Prerequisites
- LangChain >= 1.2.0
- LangGraph >= 0.3.0
- Python 3.11+
Documentation
- info:Configuration & parameter referenceWhile requirements are listed, specific plugin configuration parameters and their precedence are not explicitly detailed in the README.
Installation
First, add the marketplace
/plugin marketplace add wshobson/agents/plugin install llm-application-dev@claude-code-workflowsContains 8 extensions
Skill (8)
Select and optimize embedding models for semantic search and RAG applications. Use when choosing embedding models, implementing chunking strategies, or optimizing embedding quality for specific domains.
Combine vector and keyword search for improved retrieval. Use when implementing RAG systems, building search engines, or when neither approach alone provides sufficient recall.
Design LLM applications using LangChain 1.x and LangGraph for agents, memory, and tool integration. Use when building LangChain applications, implementing AI agents, or creating complex LLM workflows.
Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or establishing evaluation frameworks.
Master advanced prompt engineering techniques to maximize LLM performance, reliability, and controllability in production. Use when optimizing prompts, improving LLM outputs, or designing production prompt templates.
Build Retrieval-Augmented Generation (RAG) systems for LLM applications with vector databases and semantic search. Use when implementing knowledge-grounded AI, building document Q&A systems, or integrating LLMs with external knowledge bases.
Implement efficient similarity search with vector databases. Use when building semantic search, implementing nearest neighbor queries, or optimizing retrieval performance.
Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.
Quality Score
VerifiedTrust Signals
Similar Extensions
Ruflo Rag Memory
99RuVector memory with HNSW search, AgentDB, and semantic retrieval
Microsoft Learn MCP Server
100Access official Microsoft documentation, API references, and code samples for Azure, .NET, Windows, and more.
Agents Design Experience
99Agents for UI/UX design, accessibility, and user experience optimization
Plugin Development Toolkit
99Comprehensive toolkit for developing Claude Code plugins. Includes 7 expert skills covering hooks, MCP integration, commands, agents, and best practices. AI-assisted plugin creation and validation.
Claude Cost Optimizer
99Cost-conscious mode for Claude Code. Saves 30-60% on costs through concise responses, model routing, and efficient workflow patterns.
Brave Search Skills
99Official Brave Search API skills for AI coding agents