AlterLab Biopython Bioinformatics Skill
技能 已验证 活跃Comprehensive molecular biology toolkit. Use for sequence manipulation, file parsing (FASTA/GenBank/PDB), phylogenetics, and programmatic NCBI/PubMed access (Bio.Entrez). Best for batch processing, custom bioinformatics pipelines, BLAST automation. For quick lookups use gget; for multi-service integration use bioservices. Part of the AlterLab Academic Skills suite.
To serve as a powerful, programmatic toolkit for a wide range of computational molecular biology tasks, suitable for batch processing and custom bioinformatics pipelines.
功能
- Sequence manipulation and analysis
- Parsing of biological file formats (FASTA, GenBank, PDB)
- Programmatic access to NCBI databases (Entrez)
- BLAST search automation and result parsing
- Phylogenetic tree analysis
- Structural bioinformatics analysis
使用场景
- Working with biological sequences (DNA, RNA, protein)
- Reading, writing, or converting biological file formats
- Accessing and querying NCBI databases
- Performing BLAST searches or analyzing results
- Analyzing protein structures and phylogenetic trees
非目标
- Replacing dedicated GUI tools for quick lookups (e.g., gget)
- Providing multi-service integration beyond Biopython's scope (suggests bioservices)
- Serving as a general-purpose programming assistant
实践
- Sequence handling
- Database access
- Sequence alignment
- Phylogenetics
- Structural bioinformatics
先决条件
- Python 3
- NumPy
- Biopython 1.85+
Execution
- info:Pinned dependenciesThe installation instruction uses `uv pip install biopython`, which typically installs the latest version but does not explicitly pin dependencies or provide a lockfile.
安装
npx skills add AlterLab-IEU/AlterLab-Academic-Skills通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。
质量评分
已验证类似扩展
Biopython
99Comprehensive molecular biology toolkit. Use for sequence manipulation, file parsing (FASTA/GenBank/PDB), phylogenetics, and programmatic NCBI/PubMed access (Bio.Entrez). Best for batch processing, custom bioinformatics pipelines, BLAST automation. For quick lookups use gget; for multi-service integration use bioservices.
Pysam
99Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.
PyDESeq2
100Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.
Scanpy
99Standard single-cell RNA-seq analysis pipeline. Use for QC, normalization, dimensionality reduction (PCA/UMAP/t-SNE), clustering, differential expression, and visualization. Best for exploratory scRNA-seq analysis with established workflows. For deep learning models use scvi-tools; for data format questions use anndata.
Polars Bio
99High-performance genomic interval operations and bioinformatics file I/O on Polars DataFrames. Overlap, nearest, merge, coverage, complement, subtract for BED/VCF/BAM/GFF intervals. Streaming, cloud-native, faster bioframe alternative.
Gtars
99High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications.