跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Tiledbvcf

技能 已验证 活跃

Efficient storage and retrieval of genomic variant data using TileDB. Scalable VCF/BCF ingestion, incremental sample addition, compressed storage, parallel queries, and export capabilities for population genomics.

目的

To enable researchers and bioinformaticians to efficiently manage and query large genomic variant datasets using the TileDB-VCF framework, streamlining population genomics analyses.

功能

  • Scalable VCF/BCF ingestion
  • Incremental sample addition
  • Compressed storage
  • Parallel querying of genomic regions and samples
  • Data export to VCF and TSV formats
  • Cloud storage integration (S3, Azure, GCS)

使用场景

  • Building population genomics databases
  • Performing genome-wide association studies (GWAS)
  • Efficiently querying specific genomic regions across many samples
  • Integrating new samples into existing variant datasets incrementally
  • Exporting subsets of large VCF datasets for downstream analysis

非目标

  • Direct execution of arbitrary C++ TileDB-VCF library functions not exposed through the Python or CLI interfaces
  • Replacing comprehensive genome browsers or visualization tools
  • Performing complex statistical modeling or machine learning directly on variant data (requires export to other tools)

Errors

  • info:Actionable error messagesWhile the documentation mentions common pitfalls, it does not explicitly detail actionable error messages for every failure path, only general recovery steps.

安装

npx skills add K-Dense-AI/claude-scientific-skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
98 /100
1 day ago 分析

信任信号

最近提交3 days ago
星标21k
许可证MIT
状态
查看源代码

类似扩展

Alterlab Tiledbvcf

96

Efficient storage and retrieval of genomic variant data using TileDB. Scalable VCF/BCF ingestion, incremental sample addition, compressed storage, parallel queries, and export capabilities for population genomics. Part of the AlterLab Academic Skills suite.

技能
AlterLab-IEU

PyDESeq2

100

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

技能
K-Dense-AI

Scanpy

99

Standard single-cell RNA-seq analysis pipeline. Use for QC, normalization, dimensionality reduction (PCA/UMAP/t-SNE), clustering, differential expression, and visualization. Best for exploratory scRNA-seq analysis with established workflows. For deep learning models use scvi-tools; for data format questions use anndata.

技能
K-Dense-AI

Pysam

99

Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.

技能
K-Dense-AI

Polars Bio

99

High-performance genomic interval operations and bioinformatics file I/O on Polars DataFrames. Overlap, nearest, merge, coverage, complement, subtract for BED/VCF/BAM/GFF intervals. Streaming, cloud-native, faster bioframe alternative.

技能
K-Dense-AI

Gtars

99

High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications.

技能
K-Dense-AI