
Alterlab Tiledbvcf

Skill Active

Efficient storage and retrieval of genomic variant data using TileDB. Scalable VCF/BCF ingestion, incremental sample addition, compressed storage, parallel queries, and export capabilities for population genomics. Part of the AlterLab Academic Skills suite.

Purpose

To enable researchers to efficiently manage and query large genomic variant datasets using TileDB, facilitating population genomics and other bioinformatic analyses.

Features

  • Scalable VCF/BCF ingestion
  • Incremental sample addition
  • Compressed storage
  • Parallel queries
  • Export to VCF and TSV
  • Cloud storage integration
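As a rough illustration of the ingestion and incremental-addition features listed above, the sketch below uses the `tiledbvcf` Python package. The helper names (`list_variant_files`, `ingest_samples`), the dataset URI, and the directory layout are hypothetical; the sketch also assumes the inputs are single-sample, compressed, indexed VCF/BCF files and that `tiledbvcf` is installed.

```python
from pathlib import Path


def list_variant_files(directory):
    """Return sorted paths of .vcf.gz and .bcf files found in a directory.

    Hypothetical helper: index files (.tbi, .csi) and anything else are skipped.
    """
    suffixes = (".vcf.gz", ".bcf")
    return sorted(
        str(p) for p in Path(directory).iterdir() if p.name.endswith(suffixes)
    )


def ingest_samples(dataset_uri, vcf_paths, create=False):
    """Ingest single-sample VCF/BCF files into a TileDB-VCF dataset.

    Pass create=True for the first batch; later batches can be appended
    to the same dataset with create=False (incremental sample addition).
    """
    import tiledbvcf  # imported lazily; assumed to be installed

    ds = tiledbvcf.Dataset(dataset_uri, mode="w")
    if create:
        ds.create_dataset()
    ds.ingest_samples(sample_uris=vcf_paths)


# Example usage (paths and URI are placeholders):
# ingest_samples("my_dataset", list_variant_files("batch1"), create=True)
# ingest_samples("my_dataset", list_variant_files("batch2"))
```

Keeping file discovery separate from ingestion makes it easy to check which files a batch will pick up before anything is written.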

Use Cases

  • Building population genomics databases
  • Performing genome-wide association studies
  • Querying specific genomic regions across many samples
  • Prototyping genomics analyses
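The region-query use case above might look roughly like the following sketch with the `tiledbvcf` Python package. The helper names (`format_region`, `query_regions`), the dataset URI, and the sample names are hypothetical, and the region strings are assumed to be 1-based and inclusive in the form `contig:start-end`.

```python
def format_region(contig, start, end):
    """Build a 'contig:start-end' region string.

    Hypothetical helper; coordinates are assumed 1-based and inclusive.
    """
    if start < 1 or end < start:
        raise ValueError(f"invalid interval: {start}-{end}")
    return f"{contig}:{start}-{end}"


def query_regions(dataset_uri, regions, samples=None):
    """Read variants overlapping the given regions into a pandas DataFrame.

    samples=None leaves the sample filter unset, so no samples are excluded.
    """
    import tiledbvcf  # imported lazily; assumed to be installed

    ds = tiledbvcf.Dataset(dataset_uri, mode="r")
    return ds.read(
        attrs=["sample_name", "contig", "pos_start", "pos_end", "alleles"],
        regions=regions,
        samples=samples,
    )


# Example usage (URI and sample names are placeholders):
# regions = [format_region("chr1", 1000, 2000)]
# df = query_regions("my_dataset", regions, samples=["sampleA", "sampleB"])
```

Validating the interval before querying catches swapped or zero-based coordinates early, instead of silently returning an empty result.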

Non-Goals

  • Handling multi-sample VCFs directly during ingestion
  • Replacing core VCF manipulation tools for simple tasks
  • Providing a web UI for data exploration (focus is on programmatic access)

Trust

  • Warning (Issues Attention): There are 2 open issues and 0 closed issues in the last 90 days, indicating slow responsiveness to new issues.

Installation

npx skills add AlterLab-IEU/AlterLab-Academic-Skills

Runs the Vercel skills CLI (skills.sh) via npx. Requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.

Quality Score

96/100
Analyzed 1 day ago

Trust Signals

Last commit: 17 days ago
Stars: 15
License: MIT

Similar Extensions

Tiledbvcf

98

Efficient storage and retrieval of genomic variant data using TileDB. Scalable VCF/BCF ingestion, incremental sample addition, compressed storage, parallel queries, and export capabilities for population genomics.

Skill
K-Dense-AI

PyDESeq2

100

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

Skill
K-Dense-AI

Scanpy

99

Standard single-cell RNA-seq analysis pipeline. Use for QC, normalization, dimensionality reduction (PCA/UMAP/t-SNE), clustering, differential expression, and visualization. Best for exploratory scRNA-seq analysis with established workflows. For deep learning models use scvi-tools; for data format questions use anndata.

Skill
K-Dense-AI

Pysam

99

Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.

Skill
K-Dense-AI

Polars Bio

99

High-performance genomic interval operations and bioinformatics file I/O on Polars DataFrames. Overlap, nearest, merge, coverage, complement, subtract for BED/VCF/BAM/GFF intervals. Streaming, cloud-native, faster bioframe alternative.

Skill
K-Dense-AI

Gtars

99

High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications.

Skill
K-Dense-AI

© 2025 SkillRepo · Find the right skill, skip the noise.