跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Geniml

技能 已验证 活跃

Part of the AlterLab Academic Skills suite. This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

目的

To enable researchers to perform advanced machine learning tasks on genomic interval data by providing specialized tools for embedding generation, analysis, and interpretation.

功能

  • Train region embeddings (Region2Vec)
  • Joint region and metadata embeddings (BEDspace)
  • Single-cell ATAC-seq embedding (scEmbed)
  • Build consensus peak universes
  • Genomic region tokenization utilities
  • Data caching and randomization tools

使用场景

  • Training embeddings for genomic regions for ML tasks
  • Analyzing scATAC-seq data for cell-type clustering
  • Building statistically rigorous reference peak sets (universes)
  • Performing metadata-aware similarity searches on genomic regions

非目标

  • General-purpose bioinformatics pipeline
  • Analysis of non-genomic interval data
  • Direct biological interpretation without ML models

工作流

  1. Prepare genomic data (BED files, AnnData)
  2. Tokenize regions using a reference universe
  3. Train embedding models (Region2Vec, BEDspace, scEmbed)
  4. Generate embeddings for analysis
  5. Perform downstream tasks (clustering, similarity search, visualization)

安装

npx skills add AlterLab-IEU/AlterLab-Academic-Skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
99 /100
1 day ago 分析

信任信号

最近提交17 days ago
星标15
许可证MIT
状态
查看源代码

类似扩展

Geniml

99

This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

技能
K-Dense-AI

Gtars

99

High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications.

技能
K-Dense-AI

PyDESeq2

100

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

技能
K-Dense-AI

SHAP Model Interpretability

100

Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model.

技能
K-Dense-AI

Scanpy

99

Standard single-cell RNA-seq analysis pipeline. Use for QC, normalization, dimensionality reduction (PCA/UMAP/t-SNE), clustering, differential expression, and visualization. Best for exploratory scRNA-seq analysis with established workflows. For deep learning models use scvi-tools; for data format questions use anndata.

技能
K-Dense-AI

Pysam

99

Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.

技能
K-Dense-AI