跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Molfeat

技能 已验证 活跃

Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.

目的

To provide a unified and efficient way to convert molecular structures into numerical representations for machine learning applications.

功能

  • 100+ molecular featurizers (fingerprints, descriptors, embeddings)
  • Pretrained models (ChemBERTa, GIN, etc.)
  • Scikit-learn compatible transformers
  • Parallel processing for batch featurization
  • Support for SMILES and RDKit molecules

使用场景

  • Building QSAR/QSPR models
  • Performing virtual screening and similarity searching
  • Generating molecular embeddings for deep learning
  • Converting SMILES strings to ML-ready numerical features

非目标

  • Performing molecular simulations
  • De novo molecular generation (though some models can be used for this)
  • Directly handling experimental data beyond molecular structures

安装

npx skills add K-Dense-AI/claude-scientific-skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
99 /100
about 17 hours ago 分析

信任信号

最近提交3 days ago
星标21k
许可证Apache-2.0
状态
查看源代码

类似扩展

Deepchem

99

Molecular ML with diverse featurizers and pre-built datasets. Use for property prediction (ADMET, toxicity) with traditional ML or GNNs when you want extensive featurization options and MoleculeNet benchmarks. Best for quick experiments with pre-trained models, diverse molecular representations. For graph-first PyTorch workflows use torchdrug; for benchmark datasets use pytdc.

技能
K-Dense-AI

PyTDC (Therapeutics Data Commons)

99

Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction.

技能
K-Dense-AI

Alterlab Rdkit

98

Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms. Part of the AlterLab Academic Skills suite.

技能
AlterLab-IEU

SHAP Model Interpretability

100

Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model.

技能
K-Dense-AI

TimesFM Forecasting

100

Zero-shot time series forecasting with Google's TimesFM foundation model. Use for any univariate time series (sales, sensors, energy, vitals, weather) without training a custom model. Supports CSV/DataFrame/array inputs with point forecasts and prediction intervals. Includes a preflight system checker script to verify RAM/GPU before first use.

技能
K-Dense-AI

RDKit Cheminformatics Toolkit

99

Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms.

技能
K-Dense-AI