跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Datamol Cheminformatics Skill

技能 已验证 活跃

Pythonic wrapper around RDKit with simplified interface and sensible defaults. Preferred for standard drug discovery including SMILES parsing, standardization, descriptors, fingerprints, clustering, 3D conformers, parallel processing. Returns native rdkit.Chem.Mol objects. For advanced control or custom parameters, use rdkit directly. Part of the AlterLab Academic Skills suite.

目的

To simplify complex molecular operations for drug discovery and related research by offering a user-friendly interface with sensible defaults over the RDKit library.

功能

  • SMILES parsing and standardization
  • Molecular descriptor and fingerprint computation
  • 3D conformer generation and analysis
  • Clustering and diversity selection
  • Scaffold and fragment analysis
  • Support for various molecular file formats (SDF, SMILES, CSV, Excel)
  • Remote file support via fsspec
  • Parallel processing for batch operations

使用场景

  • Standardizing and cleaning molecular datasets
  • Calculating molecular properties for filtering or QSAR
  • Generating diverse sets of molecules for screening
  • Analyzing scaffold diversity in compound libraries
  • Visualizing molecular structures and conformers

非目标

  • Providing a full replacement for RDKit's advanced control
  • Handling highly specialized or non-standard cheminformatics tasks
  • Advanced quantum chemistry calculations

工作流

  1. Load molecular data from files or strings
  2. Standardize and sanitize molecules
  3. Compute descriptors or fingerprints
  4. Perform clustering, diversity selection, or scaffold analysis
  5. Generate 3D conformers if needed
  6. Visualize results or export processed data

实践

  • Data standardization
  • Molecular descriptor calculation
  • Similarity analysis
  • Structure-activity relationship analysis
  • Machine learning feature generation

先决条件

  • Python environment
  • uv pip installed

安装

npx skills add AlterLab-IEU/AlterLab-Academic-Skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
99 /100
1 day ago 分析

信任信号

最近提交17 days ago
星标15
许可证MIT
状态
查看源代码

类似扩展

RDKit Cheminformatics Toolkit

99

Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms.

技能
K-Dense-AI

Alterlab Rdkit

98

Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms. Part of the AlterLab Academic Skills suite.

技能
AlterLab-IEU

Datamol

97

Pythonic wrapper around RDKit with simplified interface and sensible defaults. Preferred for standard drug discovery including SMILES parsing, standardization, descriptors, fingerprints, clustering, 3D conformers, parallel processing. Returns native rdkit.Chem.Mol objects. For advanced control or custom parameters, use rdkit directly.

技能
K-Dense-AI

Fit Drift Diffusion Model

100

Fit cognitive drift-diffusion models (Ratcliff DDM) to reaction time and accuracy data with parameter estimation (drift rate, boundary separation, non-decision time), model comparison, and parameter recovery validation. Use when modeling binary decision-making with reaction time data, estimating cognitive parameters from experimental data, comparing sequential sampling model variants, or decomposing speed-accuracy tradeoff effects into latent cognitive components.

技能
pjt222

PyTDC (Therapeutics Data Commons)

99

Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction.

技能
K-Dense-AI

Molfeat

99

Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.

技能
K-Dense-AI