Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Pysam

Skill Verifiziert Aktiv

Genomic file toolkit. Read/write SAM/BAM/CRAM alignments, VCF/BCF variants, FASTA/FASTQ sequences, extract regions, calculate coverage, for NGS data processing pipelines.

Zweck

To enable AI agents to perform complex genomic data processing and analysis tasks by leveraging the powerful `pysam` Python library, streamlining NGS pipelines.

Funktionen

  • Read/write SAM/BAM/CRAM alignment files
  • Read/write VCF/BCF variant files
  • Read FASTA/FASTQ sequence files
  • Extract genomic regions and sequences
  • Calculate coverage and perform pileup analysis
  • Access and manipulate read/variant attributes and tags
  • Integrated bioinformatics workflows

Anwendungsfälle

  • Analyzing sequencing alignment results
  • Processing genetic variants for analysis or annotation
  • Extracting gene sequences or regions of interest
  • Calculating read depth and coverage statistics
  • Quality control of genomic data
  • Implementing bioinformatics analysis pipelines

Nicht-Ziele

  • Performing wet-lab experimental design
  • Executing complex statistical modeling beyond basic data extraction
  • Replacing dedicated GUI-based genome browsers

Workflow

  1. Open genomic file (BAM, VCF, FASTA)
  2. Fetch data by region or iterate through records
  3. Process/analyze data (e.g., extract sequence, count variants, calculate coverage)
  4. Optionally write modified data to new file
  5. Close file handle

Voraussetzungen

  • Python 3.11+

Code Execution

  • info:ValidationWhile input parameters in examples are generally well-defined, explicit schema validation libraries like Zod or Pydantic are not demonstrated for command-line arguments or file contents.

Installation

npx skills add K-Dense-AI/claude-scientific-skills

Führt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.

Qualitätspunktzahl

Verifiziert
99 /100
Analysiert 1 day ago

Vertrauenssignale

Letzter Commit3 days ago
Sterne21k
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

Polars Bio

99

High-performance genomic interval operations and bioinformatics file I/O on Polars DataFrames. Overlap, nearest, merge, coverage, complement, subtract for BED/VCF/BAM/GFF intervals. Streaming, cloud-native, faster bioframe alternative.

Skill
K-Dense-AI

Biopython

99

Comprehensive molecular biology toolkit. Use for sequence manipulation, file parsing (FASTA/GenBank/PDB), phylogenetics, and programmatic NCBI/PubMed access (Bio.Entrez). Best for batch processing, custom bioinformatics pipelines, BLAST automation. For quick lookups use gget; for multi-service integration use bioservices.

Skill
K-Dense-AI

PyDESeq2

100

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

Skill
K-Dense-AI

Scanpy

99

Standard single-cell RNA-seq analysis pipeline. Use for QC, normalization, dimensionality reduction (PCA/UMAP/t-SNE), clustering, differential expression, and visualization. Best for exploratory scRNA-seq analysis with established workflows. For deep learning models use scvi-tools; for data format questions use anndata.

Skill
K-Dense-AI

Gtars

99

High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications.

Skill
K-Dense-AI

Geniml

99

This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

Skill
K-Dense-AI