Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

PyTDC (Therapeutics Data Commons)

Skill Verifiziert Aktiv

Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction. Part of the AlterLab Academic Skills suite.

Zweck

To empower researchers in drug discovery and therapeutic ML by providing easy access to standardized datasets, benchmarks, and evaluation tools.

Funktionen

  • Access AI-ready drug discovery datasets (ADME, toxicity, DTI)
  • Utilize standardized benchmarks and data splits
  • Perform molecular property prediction and generation
  • Integrates with the PyTDC Python library for programmatic access

Anwendungsfälle

  • Working with drug discovery or therapeutic ML datasets
  • Benchmarking machine learning models on pharmaceutical tasks
  • Predicting molecular properties (ADME, toxicity, bioactivity)
  • Accessing curated datasets with proper train/test splits

Nicht-Ziele

  • Performing actual wet-lab experiments
  • Replacing dedicated cheminformatics software for advanced molecular modeling
  • Providing real-time clinical decision support

Workflow

  1. Install PyTDC via pip
  2. Import relevant modules from tdc
  3. Load a specific dataset (e.g., ADME, DTI)
  4. Get data split (scaffold, random, cold splits)
  5. Process data (convert formats, filter molecules)
  6. Train a model (user-implemented)
  7. Evaluate model using TDC evaluators

Praktiken

  • Data Curation
  • Machine Learning Benchmarking
  • Reproducible Research

Voraussetzungen

  • Python environment
  • pip or uv for package installation
  • PyTDC library installed

Installation

npx skills add AlterLab-IEU/AlterLab-Academic-Skills

Führt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.

Qualitätspunktzahl

Verifiziert
99 /100
Analysiert 1 day ago

Vertrauenssignale

Letzter Commit17 days ago
Sterne15
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

PyTDC (Therapeutics Data Commons)

99

Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction.

Skill
K-Dense-AI

Molfeat

99

Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.

Skill
K-Dense-AI

Deepchem

99

Molecular ML with diverse featurizers and pre-built datasets. Use for property prediction (ADMET, toxicity) with traditional ML or GNNs when you want extensive featurization options and MoleculeNet benchmarks. Best for quick experiments with pre-trained models, diverse molecular representations. For graph-first PyTorch workflows use torchdrug; for benchmark datasets use pytdc.

Skill
K-Dense-AI

Open Targets Platform Query Skill

100

Query Open Targets Platform for target-disease associations, drug target discovery, tractability/safety data, genetics/omics evidence, known drugs, for therapeutic target identification. Part of the AlterLab Academic Skills suite.

Skill
AlterLab-IEU

RDKit Cheminformatics Toolkit

99

Cheminformatics toolkit for fine-grained molecular control. SMILES/SDF parsing, descriptors (MW, LogP, TPSA), fingerprints, substructure search, 2D/3D generation, similarity, reactions. For standard workflows with simpler interface, use datamol (wrapper around RDKit). Use rdkit for advanced control, custom sanitization, specialized algorithms.

Skill
K-Dense-AI

Medchem

99

Medicinal chemistry filters. Apply drug-likeness rules (Lipinski, Veber), PAINS filters, structural alerts, complexity metrics, for compound prioritization and library filtering.

Skill
K-Dense-AI