
Pysam

Skill · Verified · Active

Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.

Purpose

Enable AI agents to carry out genomic data processing and analysis tasks with the `pysam` Python library, streamlining NGS pipelines.

Features

Read/write SAM/BAM/CRAM alignment files
Read/write VCF/BCF variant files
Read FASTA/FASTQ sequence files
Extract genomic regions and sequences
Calculate coverage and perform pileup analysis
Access and manipulate read/variant attributes and tags
Integrated bioinformatics workflows
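
As a minimal sketch of the coverage feature listed above, the helpers below sum the per-base counts returned by `AlignmentFile.count_coverage` into a per-position depth. The names `depth_per_position`, `mean_depth`, and `report_mean_depth` are illustrative and do not come from the skill itself. The sketch assumes an indexed BAM and pysam's 0-based, half-open coordinates. Because `count_coverage` counts only A, C, G, and T bases that pass its base-quality threshold, the result can be lower than the raw read depth.

```python
def depth_per_position(bam, contig: str, start: int, stop: int) -> list[int]:
    """Return depth at each position of the 0-based, half-open interval [start, stop).

    `bam` is expected to behave like an indexed pysam.AlignmentFile.
    """
    # count_coverage returns four arrays (A, C, G, T) with one count per position.
    a, c, g, t = bam.count_coverage(contig, start, stop)
    return [sum(counts) for counts in zip(a, c, g, t)]


def mean_depth(bam, contig: str, start: int, stop: int) -> float:
    """Average depth over the interval; 0.0 if the interval is empty."""
    depths = depth_per_position(bam, contig, start, stop)
    return sum(depths) / len(depths) if depths else 0.0


def report_mean_depth(bam_path: str, contig: str, start: int, stop: int) -> float:
    """Open a BAM file (hypothetical path supplied by the caller) and report mean depth."""
    import pysam  # third-party; imported lazily so the helpers above work without it

    with pysam.AlignmentFile(bam_path, "rb") as bam:
        return mean_depth(bam, contig, start, stop)
```

A call such as `report_mean_depth("sample.bam", "chr1", 1000, 2000)` would need a real BAM file with a `.bai` index next to it; the file name here is only a placeholder.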

Use cases

Analyzing sequencing alignment results
Processing genetic variants for analysis or annotation
Extracting gene sequences or regions of interest
Calculating read depth and coverage statistics
Quality control of genomic data
Implementing bioinformatics analysis pipelines
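
For the variant-processing use case, one possible sketch tallies records in a region by a deliberately simple type rule. The function name `count_variants_by_type` and the three labels are assumptions made for illustration. The rule treats a record as an SNV only when the reference and every alternate allele are a single character, so symbolic alleles such as `<DEL>` fall under "other". It assumes `vcf` behaves like a `pysam.VariantFile` opened on an indexed VCF/BCF.

```python
from collections import Counter


def count_variants_by_type(vcf, contig: str, start: int, stop: int) -> Counter:
    """Tally records overlapping the 0-based, half-open interval [start, stop).

    Labels (illustrative only): "snv", "other", "no_alt".
    """
    counts: Counter = Counter()
    for rec in vcf.fetch(contig, start, stop):
        alts = rec.alts or ()  # rec.alts is None when the ALT column is "."
        if not alts:
            counts["no_alt"] += 1
        elif len(rec.ref) == 1 and all(len(alt) == 1 for alt in alts):
            counts["snv"] += 1
        else:
            counts["other"] += 1
    return counts


def count_variants_in_file(vcf_path: str, contig: str, start: int, stop: int) -> Counter:
    """Open a VCF/BCF (hypothetical path supplied by the caller) and tally a region."""
    import pysam  # third-party; imported lazily so the helper above works without it

    with pysam.VariantFile(vcf_path) as vcf:
        return count_variants_by_type(vcf, contig, start, stop)
```

Region queries on a compressed VCF need a tabix or CSI index; without one, iterating over the whole file is the fallback.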

Non-goals

Performing wet-lab experimental design
Executing complex statistical modeling beyond basic data extraction
Replacing dedicated GUI-based genome browsers

Workflow

Open genomic file (BAM, VCF, FASTA)
Fetch data by region or iterate through records
Process/analyze data (e.g., extract sequence, count variants, calculate coverage)
Optionally write modified data to new file
Close file handle
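
The five steps above can be sketched as follows for a BAM file. The function names, the MAPQ cutoff of 20, and the file paths passed to `filter_bam_region` are all illustrative assumptions. The `with` block takes care of the final step by closing both handles, and `fetch` requires the input BAM to be indexed.

```python
def copy_region(in_bam, out_bam, contig: str, start: int, stop: int,
                min_mapq: int = 20) -> int:
    """Copy mapped reads with MAPQ >= min_mapq that overlap [start, stop).

    `in_bam` and `out_bam` are expected to behave like pysam.AlignmentFile
    objects opened for reading and writing. Returns the number of reads written.
    """
    kept = 0
    # Step 2: fetch records by region (0-based, half-open coordinates).
    for read in in_bam.fetch(contig, start, stop):
        # Step 3: process each record; here, a simple quality filter.
        if read.is_unmapped or read.mapping_quality < min_mapq:
            continue
        # Step 4: write the retained record to the new file.
        out_bam.write(read)
        kept += 1
    return kept


def filter_bam_region(in_path: str, out_path: str, contig: str,
                      start: int, stop: int) -> int:
    """Run the whole workflow on real files (paths supplied by the caller)."""
    import pysam  # third-party; imported lazily so copy_region works without it

    # Step 1: open the files; template=in_bam reuses the input header.
    with pysam.AlignmentFile(in_path, "rb") as in_bam, \
            pysam.AlignmentFile(out_path, "wb", template=in_bam) as out_bam:
        return copy_region(in_bam, out_bam, contig, start, stop)
    # Step 5: both handles are closed when the with block exits.
```

Keeping the per-read logic in `copy_region` separate from file opening makes it easy to exercise with stand-in objects before pointing it at real data.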

Prerequisites

Python 3.11+

Code Execution

Info (Validation): While input parameters in the examples are generally well-defined, explicit schema validation libraries such as Zod or Pydantic are not demonstrated for command-line arguments or file contents.
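
Where a full schema library is more than a script needs, a small hand-written check can still catch malformed input early. The sketch below is not part of the skill: `parse_region` is a hypothetical helper that validates a samtools-style region string (1-based, inclusive) and converts it to the 0-based, half-open coordinates that pysam's `fetch(contig, start, stop)` expects. It assumes contig names contain no colon, which does not hold for every reference (some HLA contigs, for example).

```python
import re

# contig:start-end, with optional thousands separators in the numbers
_REGION_RE = re.compile(r"^(?P<contig>[^:\s]+):(?P<start>\d[\d,]*)-(?P<end>\d[\d,]*)$")


def parse_region(region: str) -> tuple[str, int, int]:
    """Validate 'contig:start-end' (1-based, inclusive) and return
    (contig, start, stop) as a 0-based, half-open interval."""
    match = _REGION_RE.match(region.strip())
    if match is None:
        raise ValueError(f"malformed region string: {region!r}")
    start_1based = int(match["start"].replace(",", ""))
    end_1based = int(match["end"].replace(",", ""))
    if start_1based < 1:
        raise ValueError(f"region start must be >= 1: {region!r}")
    if end_1based < start_1based:
        raise ValueError(f"region end precedes start: {region!r}")
    # 1-based inclusive [s, e] corresponds to 0-based half-open [s - 1, e).
    return match["contig"], start_1based - 1, end_1based
```

For example, `parse_region("chr1:1,000-2,000")` returns `("chr1", 999, 2000)`, while `parse_region("chr1:20-10")` raises `ValueError`.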

Installation

npx skills add K-Dense-AI/claude-scientific-skills

Runs the Vercel skills CLI (skills.sh) via npx. Requires a local Node.js installation and at least one skills-compatible agent (Claude Code, Cursor, Codex, etc.). Assumes the repository follows the agentskills.io format.

Quality score

Verified

99/100

Analyzed 1 day ago

Trust signals

Last commit: 3 days ago

GitHub owner: K-Dense-AI

Stars: 21k

License: MIT

Website: k-dense.ai

Similar extensions

Polars Bio

High-performance genomic interval operations and bioinformatics file I/O on Polars DataFrames. Overlap, nearest, merge, coverage, complement, subtract for BED/VCF/BAM/GFF intervals. Streaming, cloud-native, faster bioframe alternative.

Skill

K-Dense-AI

Biopython

Comprehensive molecular biology toolkit. Use for sequence manipulation, file parsing (FASTA/GenBank/PDB), phylogenetics, and programmatic NCBI/PubMed access (Bio.Entrez). Best for batch processing, custom bioinformatics pipelines, BLAST automation. For quick lookups use gget; for multi-service integration use bioservices.

Skill

K-Dense-AI

PyDESeq2

100

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

Skill

K-Dense-AI

Scanpy

Standard single-cell RNA-seq analysis pipeline. Use for QC, normalization, dimensionality reduction (PCA/UMAP/t-SNE), clustering, differential expression, and visualization. Best for exploratory scRNA-seq analysis with established workflows. For deep learning models use scvi-tools; for data format questions use anndata.

Skill

K-Dense-AI

Gtars

High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications.

Skill

K-Dense-AI

Geniml

Machine learning on genomic interval data (BED files). Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

Skill

K-Dense-AI