Log-in | Contact Jeff | Email Updates
Measuring Usability Homepage
Quantitative Usability, Statistics & Six Sigma by Jeff Sauro

If 1 of 5 users has a problem in a usability test will it impact 1% or 20% of all users? View More Blog Posts

Using probability to make informed decisions in usability tests

by Jeff Sauro | February 1, 2010 :: Get RSS Feed Subscribe to RSS Feed of Measuring Usability Updates


Let's imagine you are testing five users as part of an iterative testing approach to find and fix problems. During the test only one user encounters a problem with logging in. To fix this particular problem would take a lot of effort and the small sample size is met with skepticism from the overburdened and overcommitted development team. They say "We really don't know whether this problem will affect 1 out of 5 users (20%) or 1 out of 100 users (1%) and don't have the resources to fix all edge cases." They send you packing to ponder the merits of larger sample sizes or changing careers.

While many people have come to accept testing with small sample sizes in usability testing, there is still a lot of discomfort when extrapolating the results to the entire user population.  It turns out the developers are right: we don't know whether it will affect 1%, 20% or another percentage. But consider these uncertainties:
  • Insurance companies don't know if you're going to die next year when they issue you a policy.
  • Drug companies don't know for sure if that drug will work or cause a fatal side-effect.
  • You don't know that you'll win a hand of black-jack with your two queens.

Life is full of uncertainly and usability testing is no exception. Small sample sizes don't preclude us from using the same method as insurance companies, drug companies and compulsive gamblers use to understand the uncertainly and make informed decisions.

For this particular question we can use probability to help decide. Doing so is what makes a usability evaluation quantitative.  We can use the Binomial Probability Mass function to compare the probability 1 out of 5 users encountering a problem comes from an actual occurrences in the user-population of 20% versus 1%.

The probability the actual occurrence of the problem is 20% is about 41%. You can use the Excel formula =BINOMDIST(1,5,0.2,FALSE) to get the answer. And the probability the actual occurrence is 1% is a probability of 4.8%. Use the formula BINOMDIST(1,5,0.01,FALSE). I put together a calculator which will do the calculations for you.

Therefore, it is about 8.5 times more likely the problem you saw in your test of five users will affect 20% of users than 1% of users (41% divided by 4.8%). Now, it is also not likely that it will impact exactly 1% or 20%, but we can generate a binomial confidence interval around the problem occurrence to know that we can be 95% sure the problem will affect between 2% and 64% of users.

If you don't like the thought of using formulas you can think of it conceptually. With small sample sizes of around 5 users, you are more likely to observe frequently occurring problems and miss the infrequent ones. If the problem didn't occur frequently then you probably wouldn't have seen it in a test. Of course there is a chance that you just happened to see a rare problem with five users, but it is not very likely. In other words, when you see a problem from a small sample of users, put your money on it affecting more than a tiny percentage of users. You could be wrong, just like drug companies, insurance companies and gamblers get it wrong, but using probability as part of a quantitative testing strategy means you at least know the risk you're taking.

View More Blog Posts | Updates by Email | Get RSS Feed | Follow on Twitter

You Might Also Be Interested In:

What five users can tell you that 5000 cannot
Will five users really find 85% of all usability problems?
A Brief History of the Magic Number 5 in Usability Testing

Rate this Blog

Avg. Rating 8.82 (11)

Poor         Excellent
012345678910

Comments

There are 3 Comments

February 16, 2010 | Jeff Sauro wrote:
Asbjrn. That sounds like a typical result. The 95% confidence interval around a 1 out of 5 problem suggests the actual occurrence is between 2 and 64%. You are seeing around a 10% rate after 20 users. At 4x the sample size you cut your uncertainty in half to between 2% and 31% suggesting this problem occurs less frequently. It is still more likely to affect 10%-20% of users than 1-2%.
 
February 4, 2010 | Francis Norton wrote:
Good post - useful to have a good mathematical response to this kind of challenge.
 
February 3, 2010 | Asbjrn wrote:
Thanks for an intersting post. I have given the topic you address some thought earlier, based on highly preliminary findings in a small number of datasets with approx. 20 users. Here, problems observed by 1 in 5 were unlikely to be observed by more than 2 in 20.

It would be interesting to know if you - or any other - have done more thorough empirical studies of this kind.
 

Post a Comment

Comment
Name
Email Address
Not Published, sold or abused

To prevent comment spam, please answer the following question before submitting (tags not permitted) :
What is 5 + 4: (enter the number)