ab-test-setup
by coreyhaines31

ab-test-setup helps you plan and design statistically sound A/B and multivariate experiments, from hypothesis through sample size and metrics, before you implement tracking or code changes.
Overview
What is ab-test-setup?
ab-test-setup is a skill for designing rigorous A/B and multivariate experiments before anything goes live. It guides an AI assistant to act as an experimentation specialist: clarifying test goals, crafting strong hypotheses, choosing appropriate metrics, and planning sample size and duration using structured references.
Instead of jumping straight into running a split test, ab-test-setup helps you create a solid test plan so the results are statistically valid and actionable, not just noise.
Who is this skill for?
Use ab-test-setup if you are:
- Growth or product marketing teams planning experiments on landing pages, onboarding flows, or pricing pages.
- Performance marketers optimizing ads, campaign creatives, or funnels and needing statistically sound tests.
- SEO and content teams testing headlines, layouts, or calls to action on high-value pages.
- Developers and product managers who support experimentation and want a consistent, documented planning framework.
If you simply need ideas for copy or layout changes without testing them, this skill is overkill; use your content or CRO skill instead.
What problems does ab-test-setup solve?
This skill is designed for situations where a user says things like:
- "We want to A/B test our homepage headline."
- "Should we run a multivariate test on these elements?"
- "Which version is better, and how should we test it?"
- "How long should we run this experiment?"
- "Do we have enough traffic for this test?"
ab-test-setup focuses on:
- Clarifying context: what you’re trying to improve, baseline performance, and constraints.
- Building a strong hypothesis using a structured framework.
- Choosing test type (A/B vs. A/B/n vs. multivariate) based on traffic and goals.
- Planning sample size and duration, using the included sample-size guide.
- Defining metrics (primary, secondary, and guardrail) that match your business objectives.
- Avoiding common pitfalls like testing too many variants at low traffic or making decisions too early (“peeking”).
For tracking implementation, use the analytics-tracking skill. For page-level conversion optimization ideas, use page-cro alongside ab-test-setup.
When is ab-test-setup a good fit?
This skill is a good fit when:
- You are comparing two or more approaches and need to measure which performs better.
- You have or expect enough traffic to run a meaningful A/B test.
- You care about statistical significance and avoiding false wins.
- Multiple stakeholders need a clear, documented test plan.
It is not a great fit when:
- You have extremely low traffic where meaningful A/B testing is unrealistic.
- You are making one-off design changes without measurement.
- You only need analytics setup or event tracking (use `analytics-tracking` instead).
How to Use
Installation
Install ab-test-setup into your agent environment using the skills CLI:
npx skills add https://github.com/coreyhaines31/marketingskills --skill ab-test-setup
After installation:
- Open the `skills/ab-test-setup` directory in your editor or file viewer.
- Start with `SKILL.md` to understand how the assistant should approach A/B test planning.
- Review the `references/` and `evals/` folders to see the supporting material and expected behavior.
Key files and folders
To get value quickly, focus on these files:
- `SKILL.md` – Core instructions. Defines the experimentation mindset, initial assessment questions, and core principles like starting with a hypothesis and testing one thing at a time.
- `references/sample-size-guide.md` – Guidelines for calculating or estimating sample sizes, understanding minimum detectable effect (MDE), and planning test duration.
- `references/test-templates.md` – Ready-to-use templates for test plans, results documentation, and stakeholder updates.
- `evals/evals.json` – Example prompts and expected outputs that show how the skill should behave in real-world scenarios.
Use these as a reference when configuring your agent, or to align your internal experimentation documentation with the same structure.
Typical workflow with ab-test-setup
The skill is designed around a repeatable experimentation workflow.
1. Gather context
When a user asks for an A/B test, the agent should first understand:
- Test context – What page, feature, or channel is being tested? What change is being considered?
- Current state – Baseline conversion rate or key metric, current traffic volume.
- Constraints – Technical limitations, implementation complexity, timelines, and tools (e.g., Optimizely, Google Optimize alternatives, in-house framework).
If you have a shared product marketing context file (for example, product-marketing-context.md described in the repo), the agent should read it first and only ask for information that is missing or test-specific.
2. Define a strong hypothesis
ab-test-setup promotes a structured hypothesis format, as seen in evals/evals.json and references/test-templates.md:
Because [observation], we believe [change] will cause [outcome], which we'll measure by [metric].
In practice, the agent should:
- Turn vague ideas ("try a benefit headline") into specific predictions.
- Link each hypothesis to data or clear observations (analytics, research, user feedback).
- Tie the outcome directly to a primary business metric (e.g., signup rate, add-to-cart rate).
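As a small illustration of keeping hypotheses consistent, the template can be filled programmatically. The function name and example values below are hypothetical, not part of the skill itself:

```python
def build_hypothesis(observation: str, change: str, outcome: str, metric: str) -> str:
    """Fill the four-part hypothesis template with specific, testable parts."""
    return (
        f"Because {observation}, we believe {change} will cause "
        f"{outcome}, which we'll measure by {metric}."
    )

hypothesis = build_hypothesis(
    observation="62% of visitors bounce before scrolling past the headline",
    change="replacing the feature-focused headline with a benefit-focused one",
    outcome="more visitors to continue into the signup flow",
    metric="signup rate",
)
print(hypothesis)
```

Forcing every hypothesis through the same four slots makes it obvious when a test idea is missing an observation or a measurable outcome.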
3. Choose the right test design
Using the principles in SKILL.md and the examples in evals/evals.json, the agent helps decide:
- A/B vs. A/B/n vs. multivariate – For example, discouraging testing four button colors at tiny traffic levels if that would underpower the test.
- Single-variable focus – Encouraging testing one main change at a time, so results are interpretable.
- Traffic allocation – Typically 50/50 for simple A/B, but the templates support more complex setups.
This is particularly useful for marketing and SEO teams who might be tempted to test many elements at once.
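One common way to implement random assignment with a fixed traffic split is to hash a stable user ID into buckets. This is a sketch of that general technique, not code from the skill; the experiment name and weights are illustrative:

```python
import hashlib

def assign_variant(user_id: str, experiment: str, weights: dict[str, float]) -> str:
    """Deterministically assign a user to a variant.

    Hashing (experiment + user_id) means the same user always sees the
    same variant, and different experiments split traffic independently.
    """
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).hexdigest()
    bucket = int(digest[:8], 16) / 0xFFFFFFFF  # roughly uniform float in [0, 1]
    cumulative = 0.0
    for variant, weight in weights.items():
        cumulative += weight
        if bucket <= cumulative:
            return variant
    return variant  # guard against floating-point rounding at the boundary

v = assign_variant("user-123", "homepage-headline", {"control": 0.5, "challenger": 0.5})
```

Deterministic assignment also makes the test auditable: given the user ID and experiment name, you can always reproduce which variant a user saw.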
4. Plan sample size and duration
The references/sample-size-guide.md file gives the agent a framework to:
- Explain baseline conversion rate, MDE, significance, and power.
- Use quick reference tables or formulas to estimate sample size per variant.
- Translate that into an approximate test duration based on traffic.
- Highlight common mistakes, such as underpowered tests and ignoring multiple-variant adjustments.
For example, in an evaluation prompt, the agent is expected to estimate the required sample size for 15,000 visitors/month and a 3.2% baseline, and then recommend a realistic test duration.
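To make that arithmetic concrete, here is a sketch of the standard two-proportion sample size approximation applied to the eval scenario. The 20% relative MDE, 95% confidence, and 80% power are assumptions for illustration; the skill's guide may use different tables or defaults:

```python
import math
from statistics import NormalDist

def sample_size_per_variant(baseline: float, mde_relative: float,
                            alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate visitors needed per variant for a two-proportion z-test."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # ~1.96 for 95% confidence
    z_beta = NormalDist().inv_cdf(power)           # ~0.84 for 80% power
    p1 = baseline
    p2 = baseline * (1 + mde_relative)
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    n = (z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2
    return math.ceil(n)

# The eval scenario: 3.2% baseline, assuming a 20% relative MDE.
n = sample_size_per_variant(baseline=0.032, mde_relative=0.20)
total = 2 * n                                # control plus one challenger
weeks = total / 15_000 * (365.25 / 12 / 7)   # 15,000 visitors/month
print(n, round(weeks, 1))  # about 13,000 per variant, roughly 7.5 weeks
```

With those inputs the test needs around 26,000 total visitors, so at 15,000 visitors/month a realistic recommendation is roughly two months, or a larger MDE if that is too long.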
5. Define metrics and guardrails
Using the patterns in test-templates.md, the agent should help you:
- Pick a primary metric that represents the main outcome (e.g., signup rate).
- Add secondary metrics for deeper understanding (e.g., click-through rate, micro-conversions).
- Set guardrail metrics to avoid harmful impacts (e.g., bounce rate, error rate, revenue per visitor).
This is especially valuable for ad optimization and SEO content experiments, where local gains can hurt overall performance if guardrails are ignored.
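As an illustration of how guardrails might be applied when reading results, here is a minimal check. The thresholds and metric names are made up for the example, not taken from the templates:

```python
def check_guardrails(control: dict[str, float], variant: dict[str, float],
                     max_degradation: dict[str, float]) -> list[str]:
    """Return guardrail metrics the variant degraded beyond tolerance.

    `max_degradation` maps metric name -> allowed relative drop, e.g. 0.05
    means the variant may be at most 5% worse than control. For
    "lower is better" metrics (like bounce rate), invert the values upstream.
    """
    violations = []
    for metric, tolerance in max_degradation.items():
        relative_change = (variant[metric] - control[metric]) / control[metric]
        if relative_change < -tolerance:
            violations.append(metric)
    return violations

violations = check_guardrails(
    control={"revenue_per_visitor": 1.80, "pages_per_session": 3.1},
    variant={"revenue_per_visitor": 1.62, "pages_per_session": 3.0},
    max_degradation={"revenue_per_visitor": 0.05, "pages_per_session": 0.10},
)
print(violations)  # ['revenue_per_visitor'], a 10% revenue drop breaks the 5% guardrail
```

A variant that wins on the primary metric but trips a guardrail like this should be investigated, not shipped.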
6. Produce a structured test plan
With the information collected, the agent can output a plan using the templates from references/test-templates.md, including:
- Overview and owner details.
- Hypothesis and rationale.
- Test design and implementation notes.
- Variant descriptions (control and challenger(s)).
- Metrics definitions and segmentation plan.
You can paste this plan into your experimentation tool, internal docs, or JIRA ticket to keep tests consistent and reviewable.
How ab-test-setup works with other skills
- With `analytics-tracking`: ab-test-setup defines what and why you test; analytics-tracking defines how to capture events, goals, or conversions.
- With `page-cro`: page-cro helps generate ideas for what to change; ab-test-setup decides which ideas to test first and how.
Use them together for a full experimentation workflow: ideation → prioritization → test design → implementation → analysis.
FAQ
When should I use ab-test-setup instead of just changing the page?
Use ab-test-setup when:
- The change could have meaningful business impact (e.g., core funnel steps, high-traffic pages).
- Stakeholders will ask, "Did this actually work?" and you need credible evidence.
- You’re optimizing ongoing marketing or SEO efforts and want a repeatable process.
For trivial or cosmetic tweaks where you don’t plan to measure impact, a full A/B test plan isn’t needed.
Does ab-test-setup calculate exact sample sizes?
The skill does not contain a dedicated calculator library. Instead, it uses the logic and examples in references/sample-size-guide.md to:
- Explain what inputs you need.
- Estimate reasonable sample sizes or guide you to online calculators.
- Warn you when your traffic is likely too low for reliable tests.
For mission-critical or highly regulated contexts, you should still validate calculations with your analytics or data science team.
Can I use ab-test-setup for more than two variants?
Yes. While the core idea is A/B testing, the documentation and templates support A/B/n and multivariate experiments. The skill also emphasizes that adding more variants requires larger sample sizes and longer durations, which are covered in the sample-size guide.
How does ab-test-setup handle “peeking” and early stopping?
The evaluation prompts explicitly require the agent to:
- Warn about the peeking problem (checking results too frequently and stopping early).
- Recommend a fixed test duration or sample threshold before declaring a winner.
This helps maintain statistical validity, especially for high-stakes marketing and product decisions.
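A rough way to see why peeking is dangerous: if each look at the results were an independent chance to cross the significance threshold, k looks at alpha = 0.05 would inflate the false-positive rate toward 1 - (1 - alpha)^k. Real looks are correlated, so the true inflation is smaller than this bound, but it is still severe:

```python
def naive_peeking_bound(alpha: float, looks: int) -> float:
    """Upper bound on the chance that at least one of `looks` independent
    significance checks crosses alpha when there is no real effect."""
    return 1 - (1 - alpha) ** looks

for k in (1, 5, 10, 20):
    print(k, round(naive_peeking_bound(0.05, k), 3))
# 1  -> 0.05
# 10 -> ~0.4: checking daily for two weeks can make noise look like a winner
```

This is why the skill insists on committing to a sample size or duration up front and only reading the result once it is reached.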
Is ab-test-setup only for web pages?
No. The principles apply to:
- Website and landing page experiments.
- In-app product tests.
- Email and lifecycle journey tests.
- Ad creative and messaging experiments.
Anywhere you can randomly assign users to variants and track outcomes, ab-test-setup can help design the experiment.
How do I know if I have enough traffic for an A/B test?
Use the guidance in references/sample-size-guide.md:
- Start with your baseline conversion rate and monthly visitors.
- Decide on a minimum detectable effect — how big a change is worth detecting.
- Use the tables or formulas to estimate required sample size per variant.
- Compare that to your traffic to see if the test would take a reasonable time.
If the required duration is extremely long, the agent may recommend:
- Combining similar pages or campaigns to increase sample size.
- Testing bigger, more impactful changes (larger MDE).
- Using other research methods (qualitative feedback, user testing) instead of A/B testing.
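The trade-off in the second bullet (larger MDE means smaller required sample) can be made concrete with the standard two-proportion approximation. The 10% baseline and the MDE values below are illustrative assumptions:

```python
import math
from statistics import NormalDist

def per_variant_n(baseline: float, mde_relative: float) -> int:
    """Approximate sample size per variant at 95% confidence and 80% power."""
    z = NormalDist().inv_cdf(0.975) + NormalDist().inv_cdf(0.80)
    p1, p2 = baseline, baseline * (1 + mde_relative)
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil(z ** 2 * variance / (p2 - p1) ** 2)

# Doubling the MDE roughly quarters the required sample size.
for mde in (0.05, 0.10, 0.20, 0.40):
    print(f"{mde:.0%} relative lift -> {per_variant_n(0.10, mde):,} per variant")
```

So if traffic is tight, testing a bold change you expect to move the metric by 20% can be feasible where detecting a 5% lift would take the better part of a year.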
What if I only want copy ideas or design suggestions?
ab-test-setup assumes you want to measure which version wins. If you just want copy or layout ideas without running a test:
- Use your content or CRO-focused skill (such as `page-cro`) to generate ideas.
- Optionally come back to ab-test-setup later if you decide to validate those ideas via testing.
Where can I see examples of good output from this skill?
Check evals/evals.json in the ab-test-setup folder. It includes realistic prompts (e.g., testing homepage headlines or button colors) and detailed expectations for how the agent should respond, including:
- Hypothesis structure.
- Sample size and duration reasoning.
- Metric selection.
- Warnings about common pitfalls.
You can use these as benchmarks when you integrate or customize the skill in your own environment.
