跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Exploratory Data Analysis Skill

技能 已验证 活跃

Perform comprehensive exploratory data analysis on scientific data files across 200+ file formats. This skill should be used when analyzing any scientific data file to understand its structure, content, quality, and characteristics. Automatically detects file type and generates detailed markdown reports with format-specific analysis, quality metrics, and downstream analysis recommendations. Covers chemistry, bioinformatics, microscopy, spectroscopy, proteomics, metabolomics, and general scientific data formats.

目的

To enable AI agents to perform thorough exploratory data analysis on a wide variety of scientific data files, providing insights into their structure, content, quality, and characteristics.

功能

  • Automated detection of 200+ scientific file formats
  • Format-specific analysis and metadata extraction
  • Data quality and integrity assessment
  • Generation of detailed markdown reports
  • Recommendations for downstream analysis and visualization

使用场景

  • When analyzing any scientific data file to understand its structure, content, quality, and characteristics.
  • When a user asks to 'explore', 'analyze', or 'summarize' a data file.
  • When needing a comprehensive report before planning further analysis.
  • When assessing data quality or completeness of scientific datasets.

非目标

  • Performing specific downstream scientific analysis (e.g., differential expression, docking).
  • Interpreting the scientific meaning of the data beyond structural and quality characteristics.
  • Modifying or transforming the original data file.

工作流

  1. Detect file type and category based on extension.
  2. Load format-specific information from reference files.
  3. Perform format-specific data analysis using provided script or custom logic.
  4. Generate a comprehensive markdown report including analysis results and recommendations.
  5. Save the report to a specified file or print to console.

实践

  • Data Quality Assessment
  • Scientific Data Handling
  • Report Generation

先决条件

  • Python 3.11+ (3.12+ recommended)
  • uv package manager
  • Agent supporting Agent Skills standard (e.g., Cursor, Claude Code)

安装

npx skills add K-Dense-AI/claude-scientific-skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
97 /100
1 day ago 分析

信任信号

最近提交3 days ago
星标21k
许可证MIT
状态
查看源代码

类似扩展

Alterlab Eda

90

Part of the AlterLab Academic Skills suite. Perform comprehensive exploratory data analysis on scientific data files across 200+ file formats. This skill should be used when analyzing any scientific data file to understand its structure, content, quality, and characteristics. Automatically detects file type and generates detailed markdown reports with format-specific analysis, quality metrics, and downstream analysis recommendations. Covers chemistry, bioinformatics, microscopy, spectroscopy, proteomics, metabolomics, and general scientific data formats.

技能
AlterLab-IEU

Survey Insect Population

100

Design and execute insect population surveys covering survey design, sampling methods, field execution, specimen identification, diversity index calculation including Shannon-Wiener and Simpson indices, statistical analysis, and reporting. Covers defining survey objectives, selecting study sites, determining sampling intensity and replication, choosing sampling methods appropriate to target taxa, standardizing collection effort, recording environmental covariates, identifying specimens to the lowest practical taxonomic level, calculating species richness, Shannon-Wiener diversity (H'), Simpson diversity (1-D), evenness, rarefaction curves, multivariate ordination, and producing survey reports with species lists and conservation implications. Use when conducting baseline biodiversity assessments, monitoring insect populations over time, comparing insect communities across habitats or treatments, assessing environmental impact, or supporting conservation planning with quantitative ecological data.

技能
pjt222

Fit Drift Diffusion Model

100

Fit cognitive drift-diffusion models (Ratcliff DDM) to reaction time and accuracy data with parameter estimation (drift rate, boundary separation, non-decision time), model comparison, and parameter recovery validation. Use when modeling binary decision-making with reaction time data, estimating cognitive parameters from experimental data, comparing sequential sampling model variants, or decomposing speed-accuracy tradeoff effects into latent cognitive components.

技能
pjt222

Measure Experiment Design

100

Designs an A/B test or experiment with clear hypothesis, variants, success metrics, sample size, and duration. Use when planning experiments to validate product changes or test hypotheses.

技能
product-on-purpose

PyDESeq2

100

Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.

技能
K-Dense-AI

Ops Yolo

100

YOLO mode. Spawns 4 parallel C-suite agents (CEO, CTO, CFO, COO). Each analyzes the business from their perspective using ALL available data. Produces unfiltered Hard Truths report. After user types YOLO, autonomously runs the business for a day using /loop.

技能
Lifecycle-Innovations-Limited