Home > Technical & Creative Skills > Tracking Campaigns

How long do you wait before picking a winner when split-testing? (6)

12-03-2014 04:52 AM #1 xckt56 (AMC Alumnus)
How long do you wait before picking a winner when split-testing?

I typically wait for 30 conversions before accepting a winner from an A/B test. But I just read this interesting article on ConversionXL that makes me think I'm being lazy with my testing, 12 A/B Split Testing Mistakes I See Businesses Make All The Time - http://conversionxl.com/12-ab-split-...-all-the-time/

For more accurate split-testing, he says:

- You should not call tests before you’ve reached 95% or higher.

- A good ballpark is to aim for at least 100 conversions per variation before looking at statistical confidence. If you have a lot of traffic, go for at least 250 conversions per variations. It’ll be more accurate if it’s 350-400 conversions per variation.

- If we didn’t test for full weeks, the results would be inaccurate. So this is what you must always do: run tests for 7 days at a time. If confidence is not achieved within the first 7 days, run it another 7 days.

How much of this do you guys think is overkill? How many conversions do you typically wait for a winner, and do you also always run tests for a full week?

12-03-2014 05:57 AM #2 andyvon (AMC Alumnus)

Picking an arbitrary number of conversions to "wait for" isn't a good idea in my opinion, no matter how high/low the number.

Just use a statistical significance calculator like this one, decide how high you want the confidence to be and act accordingly.

As for running tests for 7 days at a time, I don't see how that makes too much of a difference as long as you run both variations over the same timeframe and not e.g. variation A on Monday and variation B on Tuesday.

12-03-2014 07:42 AM #3 xckt56 (AMC Alumnus)

Good link there (thanks), caurmen says the same thing in that post though:

The way we're doing it at the moment, split-tests need a lot of data to be significant. We tend to recommend 100 clicks on an LP as an absolute minimum, and realistically, we're usually underestimating that: 300 - 500 is more like it.

Note that the calculator will give you results even on very small sample sizes. This is a purely mathematical model, so isn't taking into account the fact that you're dealing with the real world here.

Since your ads aren't being shown to perfectly spherical, frictionless visitors of uniform mass, it's best to get to at least 50 clicks and a few conversions before taking the data seriously, ideally over a couple of days.

Old adwords guru Amit Mehta posted about this once, got it bookmarked - http://www.superaffiliatemindset.com...sting-dilemma/

Optimizing for CTR is fast since it’s based on clicks. Optimizing for Conversion Rate takes a LONG TIME, in most cases you need at least 20-30 conversions on each ad to make a statistically valid decision, compared to 20-30 clicks on each ad to make a decision as to which one has a higher CTR.

I'm currently testing 2 variations against a default landing page for a campaign, running search traffic to the LPs and rotating them with Voluum, and right now I'm not focused on testing the ads, but my goal now is to improve the conversion % on my lander.

I'm sure it varies based on the offer/payout too... just wondering if anyone else has some good rules of thumb for testing LP variations for conversions

12-03-2014 07:57 AM #4 andyvon (AMC Alumnus)

Why would it be any different for conversions? It's even more useful and actually in my opinion the only metric you should base your split tests on since CTR won't make you any money

My point was not that running for 100 conversions is too much or too little, but that it entirely depends on the specific campaigns and that general statements like "run tests for xxx number of conversion to be statistically significant" are just bad.

If the calculator tells you that you have 95 % confidence of a variation outperforming the other, it doesn't matter if it's after 100 conversions or 20.

edit: just saw that you edited your second post, so this respnse doesn't make nearly as much sense anymore

btw: the payout doesn't matter at all as long as you don't split test offers with different payouts

12-03-2014 10:44 AM #5 milobanski (AMC Alumnus)

Here's a hopefully interesting example as to why looking at CTR (click through rate) vs CR (conversion rate) can be extremely misleading. Results on this test are statistically significant. This was a small test on the order form of an advertiser. Had a huge impact on CTR to the checkout page, but surprisingly didn't lead to higher conversions at all.

CTR:

CV:

12-05-2014 06:10 PM #6 xckt56 (AMC Alumnus)

Originally Posted by andyvon

Why would it be any different for conversions? It's even more useful and actually in my opinion the only metric you should base your split tests on since CTR won't make you any money

My point was not that running for 100 conversions is too much or too little, but that it entirely depends on the specific campaigns and that general statements like "run tests for xxx number of conversion to be statistically significant" are just bad.

A buddy of mine also said that it sort of depends on the specific campaigns too, but he'll still run variations for a week or two straight and get a lot of conversions in each before picking a winner. He said you need enough sample size/data before you can take the test findings seriously.

@milobanski Yup, exactly what I'm asking this. I'm focused on testing a few variations for high conversions and want to make sure I don't call tests too early.

Home > Technical & Creative Skills > Tracking Campaigns