Explore Data
技能 活跃Profile and explore a dataset to understand its shape, quality, and patterns. Use when encountering a new table or file, checking null rates and column distributions, spotting data quality issues like duplicates or suspicious values, or deciding which dimensions and metrics to analyze.
To enable users to quickly understand and assess the quality of new datasets before in-depth analysis, by providing a structured profile and identifying potential issues.
功能
- Data profiling for tables and files
- Column type classification (Identifier, Dimension, Metric, Temporal, Text, Boolean)
- Data quality issue detection (null rates, cardinality, suspicious values)
- Pattern discovery (distributions, temporal trends, correlations)
- Recommendations for analysis and follow-up
使用场景
- Encountering a new table or file for the first time
- Checking null rates and column distributions
- Spotting data quality issues like duplicates or suspicious values
- Deciding which dimensions and metrics to analyze
非目标
- Performing complex statistical modeling
- Generating final reports or visualizations
- Directly cleaning or imputing data without user guidance
Trust
- warning:Issues Attention29 issues opened and 4 closed in the last 90 days indicates a closure rate below 50%, suggesting slow maintainer response.
安装
请先添加 Marketplace
/plugin marketplace add anthropics/knowledge-work-plugins/plugin install data@knowledge-work-plugins质量评分
类似扩展
Fit Drift Diffusion Model
100Fit cognitive drift-diffusion models (Ratcliff DDM) to reaction time and accuracy data with parameter estimation (drift rate, boundary separation, non-decision time), model comparison, and parameter recovery validation. Use when modeling binary decision-making with reaction time data, estimating cognitive parameters from experimental data, comparing sequential sampling model variants, or decomposing speed-accuracy tradeoff effects into latent cognitive components.
Data Quality Auditor
97Audit datasets for completeness, consistency, accuracy, and validity. Profile data distributions, detect anomalies and outliers, surface structural issues, and produce an actionable remediation plan.
Survey Insect Population
100Design and execute insect population surveys covering survey design, sampling methods, field execution, specimen identification, diversity index calculation including Shannon-Wiener and Simpson indices, statistical analysis, and reporting. Covers defining survey objectives, selecting study sites, determining sampling intensity and replication, choosing sampling methods appropriate to target taxa, standardizing collection effort, recording environmental covariates, identifying specimens to the lowest practical taxonomic level, calculating species richness, Shannon-Wiener diversity (H'), Simpson diversity (1-D), evenness, rarefaction curves, multivariate ordination, and producing survey reports with species lists and conservation implications. Use when conducting baseline biodiversity assessments, monitoring insect populations over time, comparing insect communities across habitats or treatments, assessing environmental impact, or supporting conservation planning with quantitative ecological data.
Measure Experiment Design
100Designs an A/B test or experiment with clear hypothesis, variants, success metrics, sample size, and duration. Use when planning experiments to validate product changes or test hypotheses.
PyDESeq2
100Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.
Ops Yolo
100YOLO mode. Spawns 4 parallel C-suite agents (CEO, CTO, CFO, COO). Each analyzes the business from their perspective using ALL available data. Produces unfiltered Hard Truths report. After user types YOLO, autonomously runs the business for a day using /loop.