Scikit Learn
技能 已验证 活跃Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices.
Empowers users to perform a wide range of machine learning tasks in Python using the scikit-learn library, from basic model building to complex pipeline construction.
功能
- Comprehensive scikit-learn algorithm documentation
- Examples for classification, regression, and clustering
- Detailed preprocessing and feature engineering guidance
- Model evaluation, cross-validation, and hyperparameter tuning
- ML pipeline construction and best practices
使用场景
- Building classification or regression models for prediction tasks
- Performing clustering and dimensionality reduction on unlabeled data
- Preprocessing and transforming data for machine learning models
- Tuning hyperparameters and evaluating models for optimal performance
- Creating reproducible ML pipelines for production workflows
非目标
- Deep learning frameworks like TensorFlow or PyTorch
- Reinforcement learning algorithms
- Advanced statistical modeling beyond scikit-learn's scope
- Deploying models into production environments
工作流
- Load and explore data
- Split data into training and testing sets
- Create preprocessing pipeline for features
- Train and compare different models using cross-validation
- Tune hyperparameters for the best model
- Evaluate the final model on the test set
- Analyze feature importance or visualize results
实践
- Model selection
- Data preprocessing
- Hyperparameter tuning
- ML pipelines
- Evaluation metrics
先决条件
- Python 3.11+ (3.12+ recommended)
- uv package manager
- scikit-learn package installed
Practical Utility
- info:Unique selling propositionThe skill provides comprehensive documentation and examples for scikit-learn, which is a standard library. While valuable, it primarily offers structured access to existing functionality rather than a unique selling proposition beyond curated expertise.
Code Execution
- info:ValidationThe provided scripts demonstrate good practices like using Pipelines and explicit type handling, but schema validation libraries like Zod or Pydantic are not explicitly used or mentioned for input/output sanitization.
安装
npx skills add K-Dense-AI/claude-scientific-skills通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。
质量评分
已验证类似扩展
Alterlab Aeon
98Part of the AlterLab Academic Skills suite. This skill should be used for time series machine learning tasks including classification, regression, clustering, forecasting, anomaly detection, segmentation, and similarity search. Use when working with temporal data, sequential patterns, or time-indexed observations requiring specialized algorithms beyond standard ML approaches. Particularly suited for univariate and multivariate time series analysis with scikit-learn compatible APIs.
AlterLab scikit Learn
96Part of the AlterLab Academic Skills suite. Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices.
SHAP Model Interpretability
100Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model.
TimesFM Forecasting
100Zero-shot time series forecasting with Google's TimesFM foundation model. Use for any univariate time series (sales, sensors, energy, vitals, weather) without training a custom model. Supports CSV/DataFrame/array inputs with point forecasts and prediction intervals. Includes a preflight system checker script to verify RAM/GPU before first use.
Molfeat
99Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.
Aeon Time Series Machine Learning
95This skill should be used for time series machine learning tasks including classification, regression, clustering, forecasting, anomaly detection, segmentation, and similarity search. Use when working with temporal data, sequential patterns, or time-indexed observations requiring specialized algorithms beyond standard ML approaches. Particularly suited for univariate and multivariate time series analysis with scikit-learn compatible APIs.