Rules for a successful multivariate test (Billy’s Optimization Guide Part 3)

No Comments Methodology, Testing Concerns, Testing Techniques

Rules of Six Detail

If you missed it, see Part 1 (A/B Split Testing) and Part 2 (Multivariate Test Basics).

With the basics of part 2 down, it’s time to start designing a multivariate test.  Every optimization project has different challenges and goals, luckily though, there are a few rules that apply to every multivariate test design.  These rules fit into two categories: technical rules and content rules.

Technical rules:

  1. Choose the appropriate multivariate test type (full or fractional factorial)
  2. Determine the number of factors and levels that can be tested based on estimated conversion traffic (choose a test array)
  3. Stop the test when it has stabilized, not based on your earlier estimations

These rules ensure statistical significance by constraining the test to the appropriate size at the beginning and then letting the test gather the proper amount of data at the end.

Running a test full factorial, if your traffic supports it, may be a good choice if you’re testing content that you believe to have many interactions or if you only want to test 2 factors with 2 levels each.  (Note: the smallest fractional factorial test size is 3 factors with 2 levels each.)  Typically though, you’ll want to run a fractional factorial test to save time and expand the number of factors and levels you can test.

In order to find out how many factors and levels you can test, you need to have some idea of your predicted page views, conversions, as well as an estimate of lift.  The reason that lift matters, is that a large lift will get you more conversions and so your test will stabilize quicker.  Because of this, I would be conservative with lift estimates to ensure that the test is not designed too large.  At Widemile, we have a large list of arrays available to our tool and have calculated the approximate conversions needed to stabilize, allowing me to look at the three criteria I listed and find the arrays that are statistically viable for testing.  You should look for something similar with your tool of choice.

To figure out when a test is stabilized, I prefer to primarily look at level influence stabilization with experiment conversion rate stabilization for support.  Widemile Optimize shows this using graphs, so I simply look for horizontal trending of lines, meaning winning levels and experiments stay winners and their level of influence or conversion rates stay fairly constant (look horizontal) over 3-5 days.  If you don’t have graphs available,  the historical cumulative conversion rate for your experiments and see if there is a lot of variance between the latest few days of your test.

Content rules:

  1. Every item you test should answer an important question
  2. Test variety not quantity
  3. Test opposites first then refine
  4. Remember you can run more than one test

The content rules are closely tied together.  In effect, they ensure that the items selected for testing have purpose and that they don’t needlessly expand the size of your test, reducing its efficiency.  I begin designing tests by creating hypothesis regarding issues with the page and then choose factors and design levels to address those issues.

An example hypothesis is “Having a hero shot on the right side of the page causes users to ignore the important value proposition on the left side.”  To test this, I would choose hero shot position as a factor and then have “left side hero shot” as the baseline level and “right side hero shot” as the second level.  This example also illustrates that, other than headlines and images, testing layout is possible with creative use of CSS and sometimes JavaScript.  As long as you can revert from one to another and it matches the other factors and levels, you are at liberty to test anything.

Coming back to the rules, make sure that you are testing as few items as possible to find out what you need.  Before testing a collection of lifestyle hero shots, choose one and test it against an iconic hero shot.  This will save you the time of going down a path of testing something that may not work.

Lastly, you aren’t going to be able to get the best page on the first run or even second, third, etc.  If you knew what your audience liked 100% of the time then you wouldn’t need testing.  Remember to think of your overall test plan beyond just the first run, so that you can answer all the questions you need without having to force everything into one test.

In summary, determine what you’re trying to achieve, select the proper testing method to meet those goals and then make sure to be purposeful and efficient with the content you end up testing in front of your visitors.  Testing and optimization is not difficult, although it can be tough to start.  Follow these rules and you’ll be on your way to conquering conversion rates, bounce rates, funnel drop-offs and many other metrics.

Photo credit: Aranda\Lasch (CC)

Share and Enjoy:
  • Sphinn
  • del.icio.us
  • Facebook
  • Digg
  • Google Bookmarks
  • Technorati

My response to Google’s Lead Designer leaving because of testing culture

No Comments Methodology

design is dead

I recently read Douglas Bowman’s blog, Google’s former Visual Design Lead, about why he left Google.  In it, he describes how the engineering culture contributed to his decision to leave:

When a company is filled with engineers, it turns to engineering to solve problems. Reduce each decision to a simple logic problem. Remove all subjectivity and just look at the data. [...] that data eventually becomes a crutch for every decision, paralyzing the company and preventing it from making any daring design decisions.

He then references Google testing 41 shades of blue and a recent debate he had over “whether a border should be 3, 4 or 5 pixels wide” and was asked to provide data to back up that decision.

Bowman’s post brought up some feelings of disappointment towards Google because despite having their own optimization tool, they did not create a culture to encourage their lead designer to expand his work and actually drove him away.  Optimization and testing is still in its early stage, so mistakes will be common, however I hope news like this doesn’t scare others away from testing.

Rather, I hope companies can learn from Bowman’s experience.  Instead of holding designers to every detail, testing should allow them to explore, learn and refine their ideas.  Testing should not prevent “any daring design decisions,” I feel it should actually encourage them.  As I said before, gamble with your conversions to raise them.

In the end, it’s all about having an understanding of how testing should and should not be used.  You can use testing to find the best shade of blue, but that doesn’t necessarily mean that’s what you should be testing right now.  Don’t be afraid to take a step back and try something new rather than fiddling with details, testing tools give you that freedom.  Big risks, reap big rewards in optimization.  Not taking risks leads to inefficient testing and, in Google’s case, a designer’s resignation.

Photo credit: i-marco (CC)

Share and Enjoy:
  • Sphinn
  • del.icio.us
  • Facebook
  • Digg
  • Google Bookmarks
  • Technorati

How to pick a page to test and optimize

No Comments Methodology

pick1 449x300

Selecting the right page to test is possibly the most important decision of an optimization campaign.  You can have great ideas, the technology and talent behind you, but if you pick the wrong page you could be doing a lot of work for minimal return.

So how do you get the biggest bang for your buck with testing?  Here’s a quick list of things to look for in a page:

  • A single, specific and easy to measure conversion goal
  • Sizable conversion traffic (at least 200 conversions in a week)
  • A page that suffers from poor design or unclear conversion goal
  • No large technical hurdles to implementing and executing the test
  • A conversion rate that’s lower than comparable pages

Attacking pages with these attributes will get you some easy wins and help establish testing in your company.  Typically landing pages are the best pages to optimize, especially if they have the end conversion goal on the page, e.g. a form submission, download or click-out.

From there, I would move onto other pages in the funnel, taking a look at bounce rates to help determine where you need to help push visitors further into the funnel.  If there are no other pages in the funnel, find other poor or under performing pages on your site and take a look at them according to the rules above.

The main idea is to see that testing is a process and that just because you have ideas to improve a page, does not mean it is the best page to spend your time improving.

Photo credit: lepiaf.geo (CC)

Share and Enjoy:
  • Sphinn
  • del.icio.us
  • Facebook
  • Digg
  • Google Bookmarks
  • Technorati

Breaking down multivariate testing (Billy’s Optimization Guide Part 2)

No Comments Methodology, Terminology

If you missed it, see Part 1 (A/B Split Testing).  Update: Part 3 on Rules for a Successful Multivariate Test is here.

The technical and statistical aspects of multivariate testing can be complicated but in order to design successful tests you don’t need to know everything, just the basics of how it works and some guidelines.  I’m assuming you already have some understanding of multivariate testing, however I want to cover the basics and make sure we’re on the same level before going into how to design good multivariate tests.

Check out the wireframe below.  Pretty standard for a landing page, right?  To properly design a multivariate test, we have to look at the page in a certain way.  Using three key terms, factors, levels and experiments, we can break down a test and describe its framework.

page

Factor: An element of the Web page (headline, image, text) being tested.  The element can also be groups of content, e.g. left column, button and hero shot together, or all banner ads on the page.

Level: Content that is assigned to a specific factor to be tested.  For example, one variation of a hero shot.

Below are 4 factors from our example page (headline, hero shot, offer and button) and then each of those factors with 4 levels represented by the different colors.  Note that the levels of one factor do not have to relate in anyway to the levels of other factors.

factors and levels 450x156

The last term, experiments, makes use of both factors and levels.

Experiment: A unique combination of levels used during a test.

Here you can see 4 different experiments.  Each experiment is different and holds different combinations of levels.  Note that there actually are many more variations (4×4x4×4=256 combinations).

experiments example 400x300

Essentially a multivariate test involves showing these experiments randomly to live traffic, while tracking how each experiment performs.  The one that performs the best wins.  Each experiment is shown to many people, but each person only sees one experiment.  (There is some complexity in this, if you are still confused or want to know more, go to my primer on full and fractional factorial testing.)

In my next post, I will use these terms to outline the rules to creating a great multivariate test.

Share and Enjoy:
  • Sphinn
  • del.icio.us
  • Facebook
  • Digg
  • Google Bookmarks
  • Technorati

3 ways to use an a/b split test (Billy’s Optimization Guide Part 1)

3 Comments About, Site News, Testing Techniques, Why Test?

Update: Check out Part 2 on Breaking Down Multivariate Testing and Part 3 on Rules for a Successful Multivariate Test.

Testing is not hard, but there are fundamentals that guarantee a successful optimization campaign.  To help get marketers up to speed with the basics, starting today, I will be writing about one topic per post and put together what I call Billy’s Optimization Guide.

The natural place to start is with a/b split tests, so let’s begin there.

ab split test 450x141

First, a quick useful definition of an a/b split test: the competition of two distinct pages, where a portion of live traffic, usually 50%, is sent to one page and the rest to the other.  The winner is the page that provides the highest conversion rate, or whatever KPI is appropriate.

I want to emphasize that a good a/b split test requires distinct pages.  If that’s too vague, a simple rule that we follow at Widemile is:

You should be able to tell the difference between the 2 pages from 15 feet away.

Anything else isn’t a big enough change to be efficient in a split test and likely should go into a multivariate test.

With that definition in mind, here are three essential types of a/b split tests.  These are three of the tools in the testing toolbox that you should consider when putting together your optimization campaign.

  1. Template test: Test the same general content (hero shot, copy, and button color) with a different layout and/or creative treatment.  The goal is to have a new template that better emphasizes the value proposition, improves readability and sets up well for a multivariate test.

    template test

    Use this when… you want to make sure you have a solid design, before or after testing messaging.  The majority of the time this should be your first test.

  2. New concept test: Test a totally new approach.  Don’t let anything hold you back, test what you think will work best and see if it beats the original.  The approach here is to break out of the box and create a page that’s holistically designed around a new marketing concept.  Sometimes this involves introducing new functionality, animation, interactivity and other dramatic steps.  However it can also be on the smaller scale, such as introducing new messaging that requires a complete redesign.

    new concept

    Use this when… your current page has already been tested many times and beating it has become difficult or you believe the way to really grab your visitors is through a big change.  This should only be done when the benefits of multivariate testing (knowing individual factor influences) are outweighed by the possible gains.

  3. Funnel test: Send users to different multi-page experiences, e.g. no registration vs. requiring registration (below) and a one page form vs. a 3 page form. A funnel test can also be done with a multivariate but is simpler as an a/b split test.

    funnel test

    Use this when… you want to test content that extends past one page.  This should be done earlier in the testing process so that you don’t end up optimizing a page and then find out it’s a suboptimal experience.  It can be more technically demanding to do this sort of test though.

Every optimization campaign is different and so knowing what kinds of tests are available is one of the most important places to start.  For my next post, I will talk about the different ways to use a multivariate test.  Please post in the comments if you have any questions or contact me via Twitter @billysblog.

Share and Enjoy:
  • Sphinn
  • del.icio.us
  • Facebook
  • Digg
  • Google Bookmarks
  • Technorati