Pm Ab Test
Skill Verifiziert AktivDesign rigorous A/B tests with hypothesis formulation, sample size calculation, success criteria, guardrail metrics, and rollout planning. Includes Bayesian vs frequentist guidance and compliance-aware staged rollout for ERP features. Use when someone says "A/B test", "experiment", "split test", "hypothesis", "test this feature", "should we experiment", "sample size", "statistical significance".
To design scientifically sound and compliant A/B tests that yield trustworthy results, preventing common experimentation pitfalls for product teams.
Funktionen
- Hypothesis formulation with quality checklist
- Variant definition and randomization strategy selection
- Sample size calculation with B2B SaaS volume reality checks
- Detailed success criteria, primary, secondary, and guardrail metrics
- Staged rollout protocols for compliance-aware features
- Bayesian vs. Frequentist guidance tailored for B2B SaaS
- Pre-committed experiment rules to prevent misuse
Anwendungsfälle
- Designing an A/B test for a new feature rollout
- Determining the appropriate sample size and duration for an experiment
- Planning a staged rollout for a compliance-affecting feature
- Establishing clear success criteria and guardrail metrics before an experiment begins
Nicht-Ziele
- Running the A/B test itself
- Analyzing raw test results or statistical significance during runtime
- Implementing the product changes being tested
- Making the final go/no-go decision (the skill provides the framework for it)
Compliance
- info:GDPRThe skill handles experimental design and does not explicitly operate on personal data, but no specific sanitization is mentioned if such data were incidentally provided.
Practical Utility
- info:Usage examplesWhile the skill is interactive and guides the user, explicit end-to-end examples with inputs and outputs are not provided within the documentation.
Installation
Zuerst Marketplace hinzufügen
/plugin marketplace add marfoerst/the-pragmatic-pm/plugin install the-pragmatic-pm@the-pragmatic-pmQualitätspunktzahl
VerifiziertVertrauenssignale
Ähnliche Erweiterungen
Measure Experiment Design
100Designs an A/B test or experiment with clear hypothesis, variants, success metrics, sample size, and duration. Use when planning experiments to validate product changes or test hypotheses.
Experiment Designer
99Use when planning product experiments, writing testable hypotheses, estimating sample size, prioritizing tests, or interpreting A/B outcomes with practical statistical rigor.
Fit Drift Diffusion Model
100Fit cognitive drift-diffusion models (Ratcliff DDM) to reaction time and accuracy data with parameter estimation (drift rate, boundary separation, non-decision time), model comparison, and parameter recovery validation. Use when modeling binary decision-making with reaction time data, estimating cognitive parameters from experimental data, comparing sequential sampling model variants, or decomposing speed-accuracy tradeoff effects into latent cognitive components.
Brainstorm Experiments New
100Design lean startup experiments (pretotypes) for a new product. Creates XYZ hypotheses and suggests low-effort validation methods like landing pages, explainer videos, and pre-orders. Use when validating a new product idea, creating pretotypes, or testing market demand.
OraClaw Bandit
99A/B-Tests und Funktionsoptimierung für KI-Agenten. Wählen Sie automatisch die beste Option mit Multi-Armed Bandits und kontextbezogenen Bandits (LinUCB). Kein Data Warehouse erforderlich – funktioniert ab der Anfrage.
Statistical Analyst
99Run hypothesis tests, analyze A/B experiment results, calculate sample sizes, and interpret statistical significance with effect sizes. Use when you need to validate whether observed differences are real, size an experiment correctly before launch, or interpret test results with confidence.