Senior Ml Engineer

Skill Verified Active

ML engineering skill for productionizing models, building MLOps pipelines, and integrating LLMs. Covers model deployment, feature stores, drift monitoring, RAG systems, and cost optimization. Use when the user asks about deploying ML models to production, setting up MLOps infrastructure (MLflow, Kubeflow, Kubernetes, Docker), monitoring model performance or drift, building RAG pipelines, or integrating LLM APIs with retry logic and cost controls. Focused on production and operational concerns rather than model research or initial training.

Purpose

To provide ML engineers with robust patterns and tools for deploying, monitoring, and managing ML models and LLM integrations in production environments.

Features

Model deployment workflows and containerization
MLOps pipeline setup with feature stores and experiment tracking
LLM integration with retry logic and cost controls
RAG system implementation guidance
Model monitoring for drift and performance degradation

Use Cases

Deploying trained ML models to production environments.
Setting up automated MLOps infrastructure (MLflow, Kubeflow, Kubernetes).
Integrating LLM APIs into applications with advanced controls.
Building and optimizing Retrieval-Augmented Generation (RAG) systems.
Monitoring model performance and detecting data/model drift.

Non-Goals

Initial model research and development.
Model training script generation.
Abstracting away core MLOps tooling entirely (provides patterns, not a black box).

Installation

First, add the marketplace

/plugin marketplace add alirezarezvani/claude-skills

/plugin install engineering-team@claude-code-skills

Quality Score

Verified

98 /100

Analyzed about 19 hours ago

Trust Signals

Last commitabout 23 hours ago

GitHub owner alirezarezvani

Stars14.6k

LicenseMIT

Websitealirezarezvani.medium.com

Status

View Source

Similar Extensions

Chat Format

100

Format prompts for different LLM providers with chat templates and HNSW-powered context retrieval

Skill

ruvnet

Rag Architect

100

Use when the user asks to design RAG pipelines, optimize retrieval strategies, choose embedding models, implement vector search, or build knowledge retrieval systems.

Skill

alirezarezvani

TimesFM Forecasting

100

Zero-shot time series forecasting with Google's TimesFM foundation model. Use for any univariate time series (sales, sensors, energy, vitals, weather) without training a custom model. Supports CSV/DataFrame/array inputs with point forecasts and prediction intervals. Includes a preflight system checker script to verify RAM/GPU before first use.

Skill

K-Dense-AI

Azure Monitor Query Py

100

Azure Monitor Query SDK for Python. Use for querying Log Analytics workspaces and Azure Monitor metrics. Triggers: "azure-monitor-query", "LogsQueryClient", "MetricsQueryClient", "Log Analytics", "Kusto queries", "Azure metrics".

Skill

microsoft

Embedding Strategies

100

Select and optimize embedding models for semantic search and RAG applications. Use when choosing embedding models, implementing chunking strategies, or optimizing embedding quality for specific domains.

Skill

wshobson

Build Feature Store

Build a feature store using Feast for centralized feature management, configure offline and online stores for batch and real-time serving, define feature views with transformations, and implement point-in-time correct joins for ML pipelines. Use when managing features for multiple ML models, ensuring training-serving consistency, serving low-latency features for real-time inference, reusing feature definitions across projects, or building a feature catalog for discovery and governance.

Skill

pjt222