跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Scikit Learn

技能 已验证 活跃

Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices.

目的

Empowers users to perform a wide range of machine learning tasks in Python using the scikit-learn library, from basic model building to complex pipeline construction.

功能

  • Comprehensive scikit-learn algorithm documentation
  • Examples for classification, regression, and clustering
  • Detailed preprocessing and feature engineering guidance
  • Model evaluation, cross-validation, and hyperparameter tuning
  • ML pipeline construction and best practices

使用场景

  • Building classification or regression models for prediction tasks
  • Performing clustering and dimensionality reduction on unlabeled data
  • Preprocessing and transforming data for machine learning models
  • Tuning hyperparameters and evaluating models for optimal performance
  • Creating reproducible ML pipelines for production workflows

非目标

  • Deep learning frameworks like TensorFlow or PyTorch
  • Reinforcement learning algorithms
  • Advanced statistical modeling beyond scikit-learn's scope
  • Deploying models into production environments

工作流

  1. Load and explore data
  2. Split data into training and testing sets
  3. Create preprocessing pipeline for features
  4. Train and compare different models using cross-validation
  5. Tune hyperparameters for the best model
  6. Evaluate the final model on the test set
  7. Analyze feature importance or visualize results

实践

  • Model selection
  • Data preprocessing
  • Hyperparameter tuning
  • ML pipelines
  • Evaluation metrics

先决条件

  • Python 3.11+ (3.12+ recommended)
  • uv package manager
  • scikit-learn package installed

Practical Utility

  • info:Unique selling propositionThe skill provides comprehensive documentation and examples for scikit-learn, which is a standard library. While valuable, it primarily offers structured access to existing functionality rather than a unique selling proposition beyond curated expertise.

Code Execution

  • info:ValidationThe provided scripts demonstrate good practices like using Pipelines and explicit type handling, but schema validation libraries like Zod or Pydantic are not explicitly used or mentioned for input/output sanitization.

安装

npx skills add K-Dense-AI/claude-scientific-skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
95 /100
1 day ago 分析

信任信号

最近提交3 days ago
星标21k
许可证BSD-3-Clause
状态
查看源代码

类似扩展

Alterlab Aeon

98

Part of the AlterLab Academic Skills suite. This skill should be used for time series machine learning tasks including classification, regression, clustering, forecasting, anomaly detection, segmentation, and similarity search. Use when working with temporal data, sequential patterns, or time-indexed observations requiring specialized algorithms beyond standard ML approaches. Particularly suited for univariate and multivariate time series analysis with scikit-learn compatible APIs.

技能
AlterLab-IEU

AlterLab scikit Learn

96

Part of the AlterLab Academic Skills suite. Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices.

技能
AlterLab-IEU

SHAP Model Interpretability

100

Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model.

技能
K-Dense-AI

TimesFM Forecasting

100

Zero-shot time series forecasting with Google's TimesFM foundation model. Use for any univariate time series (sales, sensors, energy, vitals, weather) without training a custom model. Supports CSV/DataFrame/array inputs with point forecasts and prediction intervals. Includes a preflight system checker script to verify RAM/GPU before first use.

技能
K-Dense-AI

Molfeat

99

Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.

技能
K-Dense-AI

Aeon Time Series Machine Learning

95

This skill should be used for time series machine learning tasks including classification, regression, clustering, forecasting, anomaly detection, segmentation, and similarity search. Use when working with temporal data, sequential patterns, or time-indexed observations requiring specialized algorithms beyond standard ML approaches. Particularly suited for univariate and multivariate time series analysis with scikit-learn compatible APIs.

技能
K-Dense-AI