Jeff Sauro • June 6, 2008
It is common to think of time-on-task data gathered only during summative evaluations because, during a formative evaluation, the focus is on finding and fixing problems, or at least finding the problems and delivering a report. For a variety of reasons, time-on-task measures often get left out of the mix. In this article, I show that time-on-task can be a valuable diagnostic and comparative tool during formative evaluations.
The three most common reasons I've heard for not using time-on-task in formative studies are:
Below I discuss why these reasons should NOT prevent you from collecting time-on-task in your next formative evaluation.
Getting an accurate and stable measure of the actual user time-on-task is more problematic that comparing designs. One would expect task times to increase as users are asked to think-aloud while completing tasks. The published data, however, is mixed, with some published studies actually showing faster performance while thinking-aloud possibly due to the invocation of cognitive processes that improve rather than degrade performance (Berry and Broadbent (1990). For a good summary of the evidence, see Lewis 2006 p. 1282. More research is needed to draw a conclusion on this aspect. Regardless, I recommend focusing on relative task time improvements between designs because this avoids this issues altogether.

Figure 1: Time to cancel a reservation on a hotel-website (in log-transformed seconds). One user took over 4 times the mean time to complete the task. Red solid line is the geometric mean and the green-dashed lines are the upper and lower bounds of the 95% Confidence Interval.
In graphing the report we quickly see that one user took over 4 times longer than the mean time to cancel the reservation (I graphed the data using the Graph and Calculator for Confidence Intervals for Task Times). This simple graph of the task times allows the investigator and reader of a report to zero in on potential causes of such a long task time (relative to the other users). While it's unclear from the report as to what was occurring during this task, an analysis of this user's profile shows that she had never visited a hotel website or ever made a reservation at a hotel website prior to the test. Her comments also reinforce her being a "novice" Internet user: "I feel that my inexperience with the web had a lot to do with difficulties." Whether it was just the user's inexperience or some specific interface problems, perhaps particularly damaging to a novice, it is clear this user had trouble during the task. A few pixels tell the story.
Time-on-task is an under-utilized tool for formative evaluations. It costs nothing (just start and stop the time), is useful with any-number of users and it can be a valuable tool for diagnosing problems as well as making objective comparisons between iterations. I encourage you to collect time-on-task during your next formative evaluation.
Jeff Sauro is the founding principal of Measuring Usability LLC, a company providing statistics and usability consulting to
Fortune 1000 companies.
He is the author of over
15 journal articles and 3 books on statistics and the user-experience.
More about Jeff...
Distrust in Social Networks: Google+, Twitter, Facebook
5 Examples of Quantifying Qualitative Data
Does better usability increase customer loyalty?
The Five Most Influential Papers in Usability
A Brief History of the Magic Number 5 in Usability Testing
How common are usability problems?
What five users can tell you that 5000 cannot
Should you use 5 or 7 point scales?
Why you only need to test with five users (explained)
Top 10 Research-Based Usability Findings of 2010
8 Ways to Show Design Changes Improved the User Experience
25 Resources for Measuring Usability
10 Things to Know about Usability Problems