Datamol Cheminformatics Skill
Skill Verifiziert AktivPythonic wrapper around RDKit with simplified interface and sensible defaults. Preferred for standard drug discovery including SMILES parsing, standardization, descriptors, fingerprints, clustering, 3D conformers, parallel processing. Returns native rdkit.Chem.Mol objects. For advanced control or custom parameters, use rdkit directly. Part of the AlterLab Academic Skills suite.
To simplify complex molecular operations for drug discovery and related research by offering a user-friendly interface with sensible defaults over the RDKit library.
Funktionen
- SMILES parsing and standardization
- Molecular descriptor and fingerprint computation
- 3D conformer generation and analysis
- Clustering and diversity selection
- Scaffold and fragment analysis
- Support for various molecular file formats (SDF, SMILES, CSV, Excel)
- Remote file support via fsspec
- Parallel processing for batch operations
Anwendungsfälle
- Standardizing and cleaning molecular datasets
- Calculating molecular properties for filtering or QSAR
- Generating diverse sets of molecules for screening
- Analyzing scaffold diversity in compound libraries
- Visualizing molecular structures and conformers
Nicht-Ziele
- Providing a full replacement for RDKit's advanced control
- Handling highly specialized or non-standard cheminformatics tasks
- Advanced quantum chemistry calculations
Workflow
- Load molecular data from files or strings
- Standardize and sanitize molecules
- Compute descriptors or fingerprints
- Perform clustering, diversity selection, or scaffold analysis
- Generate 3D conformers if needed
- Visualize results or export processed data
Praktiken
- Data standardization
- Molecular descriptor calculation
- Similarity analysis
- Structure-activity relationship analysis
- Machine learning feature generation
Voraussetzungen
- Python environment
- uv pip installed
Installation
npx skills add AlterLab-IEU/AlterLab-Academic-SkillsFührt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.
Qualitätspunktzahl
VerifiziertVertrauenssignale
Ähnliche Erweiterungen
RDKit Cheminformatics Toolkit
99Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms.
Alterlab Rdkit
98Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms. Part of the AlterLab Academic Skills suite.
Datamol
97Pythonic wrapper around RDKit with simplified interface and sensible defaults. Preferred for standard drug discovery including SMILES parsing, standardization, descriptors, fingerprints, clustering, 3D conformers, parallel processing. Returns native rdkit.Chem.Mol objects. For advanced control or custom parameters, use rdkit directly.
Fit Drift Diffusion Model
100Fit cognitive drift-diffusion models (Ratcliff DDM) to reaction time and accuracy data with parameter estimation (drift rate, boundary separation, non-decision time), model comparison, and parameter recovery validation. Use when modeling binary decision-making with reaction time data, estimating cognitive parameters from experimental data, comparing sequential sampling model variants, or decomposing speed-accuracy tradeoff effects into latent cognitive components.
PyTDC (Therapeutics Data Commons)
99Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction.
Molfeat
99Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.