Scikit Learn
Skill Verifiziert AktivMachine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices.
Empowers users to perform a wide range of machine learning tasks in Python using the scikit-learn library, from basic model building to complex pipeline construction.
Funktionen
- Comprehensive scikit-learn algorithm documentation
- Examples for classification, regression, and clustering
- Detailed preprocessing and feature engineering guidance
- Model evaluation, cross-validation, and hyperparameter tuning
- ML pipeline construction and best practices
Anwendungsfälle
- Building classification or regression models for prediction tasks
- Performing clustering and dimensionality reduction on unlabeled data
- Preprocessing and transforming data for machine learning models
- Tuning hyperparameters and evaluating models for optimal performance
- Creating reproducible ML pipelines for production workflows
Nicht-Ziele
- Deep learning frameworks like TensorFlow or PyTorch
- Reinforcement learning algorithms
- Advanced statistical modeling beyond scikit-learn's scope
- Deploying models into production environments
Workflow
- Load and explore data
- Split data into training and testing sets
- Create preprocessing pipeline for features
- Train and compare different models using cross-validation
- Tune hyperparameters for the best model
- Evaluate the final model on the test set
- Analyze feature importance or visualize results
Praktiken
- Model selection
- Data preprocessing
- Hyperparameter tuning
- ML pipelines
- Evaluation metrics
Voraussetzungen
- Python 3.11+ (3.12+ recommended)
- uv package manager
- scikit-learn package installed
Practical Utility
- info:Unique selling propositionThe skill provides comprehensive documentation and examples for scikit-learn, which is a standard library. While valuable, it primarily offers structured access to existing functionality rather than a unique selling proposition beyond curated expertise.
Code Execution
- info:ValidationThe provided scripts demonstrate good practices like using Pipelines and explicit type handling, but schema validation libraries like Zod or Pydantic are not explicitly used or mentioned for input/output sanitization.
Installation
npx skills add K-Dense-AI/claude-scientific-skillsFührt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.
Qualitätspunktzahl
VerifiziertVertrauenssignale
Ähnliche Erweiterungen
Alterlab Aeon
98Part of the AlterLab Academic Skills suite. This skill should be used for time series machine learning tasks including classification, regression, clustering, forecasting, anomaly detection, segmentation, and similarity search. Use when working with temporal data, sequential patterns, or time-indexed observations requiring specialized algorithms beyond standard ML approaches. Particularly suited for univariate and multivariate time series analysis with scikit-learn compatible APIs.
AlterLab scikit Learn
96Part of the AlterLab Academic Skills suite. Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices.
SHAP Model Interpretability
100Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model.
TimesFM Forecasting
100Zero-shot time series forecasting with Google's TimesFM foundation model. Use for any univariate time series (sales, sensors, energy, vitals, weather) without training a custom model. Supports CSV/DataFrame/array inputs with point forecasts and prediction intervals. Includes a preflight system checker script to verify RAM/GPU before first use.
Molfeat
99Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.
Aeon Time Series Machine Learning
95This skill should be used for time series machine learning tasks including classification, regression, clustering, forecasting, anomaly detection, segmentation, and similarity search. Use when working with temporal data, sequential patterns, or time-indexed observations requiring specialized algorithms beyond standard ML approaches. Particularly suited for univariate and multivariate time series analysis with scikit-learn compatible APIs.