Jeff Sauro • September 17, 2004
We already saw how a manageable sample of users can provide meaningful data for discrete-binary data like task completion. With continuous data like task times, the sample size can be even smaller.| Time (in seconds) | |
| You | 101 |
| Colleague 1 | 132 |
| Colleague 2 | 125 |
| Colleague 3 | 145 |
| Mean | 125.75 |
| St Deviation | 18.46 |
| Range | 44 |
![]()
To arrive at the elusive "significant" sample size, you need to try a few reasonable sample sizes and see which ones fall within the limits of the confidence interval. The values (n) you choose will affect the the critical value for t and the Standard Error since both use n in their equation. We'll use 25, 20, 15, 10 and 5 and which ever value has a confidence interval at about 10 seconds we'll use as the ideal sample. (Again all this assumes that our internal sample did a good job of determining the standard deviation of the larger population).
| Sample | 95% CI | SE | SQRT N | Stdev | t * |
| 25 | 7.61 | 3.692 | 5 | 18.46 | 2.063 |
| 20 | 8.63 | 4.12 | 4.47 | 18.46 | 2.093 |
| 15 | 10.22 | 4.76 | 3.87 | 18.46 | 2.144 |
| 10 | 13.20 | 5.83 | 3.16 | 18.46 | 2.262 |
| 5 | 22.92 | 8.25 | 2.23 | 18.46 | 2.776 |
At about 15 users, the conifdence interval narrows close enough to ten seconds that it will probably be sufficient. I'd use this 15 as the approximate number of users you'd need to sample and know that to get more precise, you'd need to sample more than 15 users. This result is much better than thinking you need to test 100 or 1000 in order to get "statistically significant results. If +/- 10 seconds isn't precise enough you can:
Sample Sizes in the Real World of Usability Testing
If you've run enough usability tests, in many cases your sample size is usually determined ahead of time--that is, you know your budget and time frame and therefore approximately how many users you'll be sampling--usually somewhere between 10 and 30. I then approach sampling as getting as many users as I can within that range and then compute the statistics later.
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
![]()
| mean time of your sample (126.6) | |
| true mean time of the entire population of users | |
| n | number of users in the sample (15) |
| s | the standard deviation of the sample (16.33) |
| t* | t statistic = (2.144789) or use the excel function =TINV(.05,14) [confidence level(.05) and degrees of freedom n-1 (14) ] |
Plugging in the numbers, for the estimated mean of the total population of users on this task we get:
= 126.6 + or - 9.08
Jeff Sauro is the founding principal of Measuring Usability LLC, a company providing statistics and usability consulting to
Fortune 1000 companies.
He is the author of over
15 journal articles and 3 books on statistics and the user-experience.
More about Jeff...
Distrust in Social Networks: Google+, Twitter, Facebook
5 Examples of Quantifying Qualitative Data
Confidence Interval Calculator for a Completion Rate
25 Resources for Measuring Usability
8 Ways to Show Design Changes Improved the User Experience
Top 10 Research-Based Usability Findings of 2010
10 Things to Know about Usability Problems
Why you only need to test with five users (explained)
What five users can tell you that 5000 cannot
A Brief History of the Magic Number 5 in Usability Testing
How common are usability problems?
97 Things to Know about Usability
The Five Most Influential Papers in Usability