Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Datamol Cheminformatics Skill

Skill Verifiziert Aktiv

Pythonic wrapper around RDKit with simplified interface and sensible defaults. Preferred for standard drug discovery including SMILES parsing, standardization, descriptors, fingerprints, clustering, 3D conformers, parallel processing. Returns native rdkit.Chem.Mol objects. For advanced control or custom parameters, use rdkit directly. Part of the AlterLab Academic Skills suite.

Zweck

To simplify complex molecular operations for drug discovery and related research by offering a user-friendly interface with sensible defaults over the RDKit library.

Funktionen

  • SMILES parsing and standardization
  • Molecular descriptor and fingerprint computation
  • 3D conformer generation and analysis
  • Clustering and diversity selection
  • Scaffold and fragment analysis
  • Support for various molecular file formats (SDF, SMILES, CSV, Excel)
  • Remote file support via fsspec
  • Parallel processing for batch operations

Anwendungsfälle

  • Standardizing and cleaning molecular datasets
  • Calculating molecular properties for filtering or QSAR
  • Generating diverse sets of molecules for screening
  • Analyzing scaffold diversity in compound libraries
  • Visualizing molecular structures and conformers

Nicht-Ziele

  • Providing a full replacement for RDKit's advanced control
  • Handling highly specialized or non-standard cheminformatics tasks
  • Advanced quantum chemistry calculations

Workflow

  1. Load molecular data from files or strings
  2. Standardize and sanitize molecules
  3. Compute descriptors or fingerprints
  4. Perform clustering, diversity selection, or scaffold analysis
  5. Generate 3D conformers if needed
  6. Visualize results or export processed data

Praktiken

  • Data standardization
  • Molecular descriptor calculation
  • Similarity analysis
  • Structure-activity relationship analysis
  • Machine learning feature generation

Voraussetzungen

  • Python environment
  • uv pip installed

Installation

npx skills add AlterLab-IEU/AlterLab-Academic-Skills

Führt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.

Qualitätspunktzahl

Verifiziert
99 /100
Analysiert 1 day ago

Vertrauenssignale

Letzter Commit17 days ago
Sterne15
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

RDKit Cheminformatics Toolkit

99

Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms.

Skill
K-Dense-AI

Alterlab Rdkit

98

Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms. Part of the AlterLab Academic Skills suite.

Skill
AlterLab-IEU

Datamol

97

Pythonic wrapper around RDKit with simplified interface and sensible defaults. Preferred for standard drug discovery including SMILES parsing, standardization, descriptors, fingerprints, clustering, 3D conformers, parallel processing. Returns native rdkit.Chem.Mol objects. For advanced control or custom parameters, use rdkit directly.

Skill
K-Dense-AI

Fit Drift Diffusion Model

100

Fit cognitive drift-diffusion models (Ratcliff DDM) to reaction time and accuracy data with parameter estimation (drift rate, boundary separation, non-decision time), model comparison, and parameter recovery validation. Use when modeling binary decision-making with reaction time data, estimating cognitive parameters from experimental data, comparing sequential sampling model variants, or decomposing speed-accuracy tradeoff effects into latent cognitive components.

Skill
pjt222

PyTDC (Therapeutics Data Commons)

99

Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction.

Skill
K-Dense-AI

Molfeat

99

Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.

Skill
K-Dense-AI