跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Ab Test Analysis

技能 已验证 活跃

Analyze A/B test results with statistical significance, sample size validation, confidence intervals, and ship/extend/stop recommendations. Use when evaluating experiment results, checking if a test reached significance, interpreting split test data, or deciding whether to ship a variant.

目的

Analyze A/B test results with statistical rigor to translate findings into clear product decisions, ensuring data-driven choices for product development.

功能

  • Statistical significance testing
  • Sample size and duration validation
  • Confidence interval calculation
  • Guardrail metric checking
  • Ship/extend/stop recommendation generation

使用场景

  • Evaluating experiment results with statistical rigor
  • Interpreting split test data for product decisions
  • Determining whether to ship a variant based on test outcomes
  • Validating test setup for sample size and duration

非目标

  • Hypothesis generation for A/B tests
  • Designing initial experiment setups
  • Performing qualitative user research

安装

请先添加 Marketplace

/plugin marketplace add phuryn/pm-skills
/plugin install pm-data-analytics@pm-skills

质量评分

已验证
98 /100
about 16 hours ago 分析

信任信号

最近提交22 days ago
星标11.2k
许可证MIT
状态
查看源代码

类似扩展

Measure Experiment Design

100

Designs an A/B test or experiment with clear hypothesis, variants, success metrics, sample size, and duration. Use when planning experiments to validate product changes or test hypotheses.

技能
product-on-purpose

Game Analytics Setup

100

Invoke when the user needs to set up analytics, define telemetry events, establish KPIs, build dashboards, configure A/B testing, or implement data-driven design capabilities. Triggers on: "analytics", "telemetry", "KPIs", "metrics", "player data", "retention", "DAU", "dashboard", "A/B testing", "funnel analysis". Do NOT invoke for balance tuning (use game-balance-check) or economy design (use game-economy-designer). Part of the AlterLab GameForge collection.

技能
AlterLab-IEU

Measure Dashboard Requirements

100

Specifies requirements for an analytics dashboard including metrics, visualizations, filters, and data sources. Use when requesting dashboards from data teams, defining KPI tracking, or documenting reporting needs.

技能
product-on-purpose

Acquisition Channel Advisor

100

Evaluate acquisition channels using unit economics, customer quality, and scalability. Use when deciding whether to scale, test, or kill a growth channel.

技能
deanpeters

Fit Drift Diffusion Model

100

Fit cognitive drift-diffusion models (Ratcliff DDM) to reaction time and accuracy data with parameter estimation (drift rate, boundary separation, non-decision time), model comparison, and parameter recovery validation. Use when modeling binary decision-making with reaction time data, estimating cognitive parameters from experimental data, comparing sequential sampling model variants, or decomposing speed-accuracy tradeoff effects into latent cognitive components.

技能
pjt222

Experiment Design

99

A discipline for designing experiments (A/B tests, multivariate, holdouts) so the results actually answer the question you asked. Hypothesis writing, sample size, duration, segment analysis, interpretation, decision-making, and the common failure modes that produce confidently wrong shipping decisions.

技能
rampstackco