<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN">
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<script src="https://fred-wang.github.io/TeXZilla/TeXZilla-min.js"></script>
<script src="https://fred-wang.github.io/TeXZilla/examples/customElement.js"></script>
<title>
Fitts' Law
</title>

</head>

<body>
<blockquote>
<hr>
<font size="-1">
MacKenzie, I. S. (2018). Fitts' law. In 
K. L. Norman & J. Kirakowski (Eds.), 
<i>Handbook of human-computer interaction</i>, pp. 349-370.
Hoboken, NJ: Wiley.
doi:10.1002/9781118976005
[<a href="hhci2018.pdf">PDF</a>]
[<a href="http://www.yorku.ca/mack/FittsLawSoftware/">software</a>]
</font>
<hr>
<p>

<center>
<h1>
Fitts' Law
</h1>

<h3>
<a href="http://www.yorku.ca/mack/">I. Scott MacKenzie</a>
</h3>



York University<br>
Dept of Electrical Engineering and Computer Science<br>
Toronto, Canada<br>
mack@cse.yorku.ca
<p>
</center>



<H2>Introduction</H2>
<p>

Human movement is ubiquitous in computing.  Our arms, wrists, and fingers busy 
themselves on keyboards, desktops, and contact-sensitive displays.  And so, 
matching the movement limits and capabilities of humans with interaction 
techniques on computing systems is an important area of research in 
human-computer interaction (HCI).  Considerable HCI research is directed at 
modeling, predicting, and measuring human performance.  In the realm of human 
movement, Fitts' law is the pre-eminent model for this research.
<p>

The full spectrum of human movement applicable to Fitts' law is broader than 
the three examples &ndash; arms, wrists, and fingers &ndash; given in the preceding 
paragraph.  In contexts such as gaming, virtual reality, or accessible 
computing, movements may also involve the torso, legs, feet, eyes, face, 
tongue, lip, skin, head, and so on.   Notably, for each of these input 
modalities, there are examples where Fitts' law was used to explore the design 
space or to quantify human performance.
<p>

This chapter provides an overview of Fitts' law.  As we shall see, Fitts' law 
is a model both for predicting and measuring.  For predicting, Fitts' law is an 
equation giving the time to acquire and select a target based on the distance 
moved and the size of the target.  For measuring, Fitts' law provides a method 
to quantify human performance in a single measure, "throughput".  Throughput, 
when calculated as described later in this chapter, combines speed and accuracy 
in performing a target acquisition task. 
<p>

We begin with background details and a brief tour of Fitts' law, and follow by 
describing refinements to correct flaws or to improve the model's prediction 
power or theoretical basis.  Fitts' law evaluations of computer input 
techniques are more consistent in recent years due to the emergence of ISO 
9241-9, an ISO standard for evaluating input devices.  The Fitts' law methods 
used in the standard are summarized and software tools are presented that 
implement the methods.  Since Fitts' throughput is the main performance measure 
for such evaluations, we also detail the calculation of throughput according to 
best-practice methods.  We then present an example of the use of Fitts' law and 
ISO 9241-9 for measuring human performance.  The example involves touch-based 
target selection on a mobile phone with a contact-sensitive display.
<p>

<H2>Background</H2>
<p>

Like many psychologists in the 1950s, Fitts was motivated to investigate 
whether human performance could be quantified using a metaphor from the new and 
exciting field of information theory.   This field emerged from the work of 
Shannon, Wiener, and other mathematicians in the 1940s.  The terms <I>probability</I>, 
<I>redundancy</I>, <I>bits</I>, <I>noise</I>, and <I>channels</I> entered the vocabulary of experimental 
psychologists as they explored the latest technique of measuring and modeling 
human behavior.  Two well-known models in this vein are the Hick-Hyman law for 
choice reaction time (Hick, 1952; Hyman, 1953) and Fitts' law for the 
information capacity of the human motor system (Fitts, 1954).
<p>

Fitts' particular interest was rapid-aimed movements, where a human operator 
acquires or selects targets of a certain size over a certain distance.  Fitts 
proposed a model &ndash; now "law" &ndash; that is widely used in fields such as 
ergonomics, engineering, psychology, and human-computer interaction.  The 
starting point for Fitts' law is an equation known as Shannon's <I>Theorem 17</I>, 
which gives the information capacity <I>C</I> (in bits/s) of a communications channel 
of bandwidth <I>B</I> (in s<sup>-1</sup> or Hz) as
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
      <la-tex>
	  C = B \: \log_{2}\left(\frac{S}{N} +1\right)
	  </la-tex>
<!-- <img src="hhci2018-eq1.jpg"> -->
<td width=10% align="right">(17.1)
</table>
<p>

where <i>S</i>  is the signal power and <I>N</I> is the noise power 
(Shannon & Weaver, 1949, pp. 100-103). Fitts reasoned that a human operator 
that performs a movement over a certain amplitude to acquire a target of a 
certain width is demonstrating a "rate of information transfer" (Fitts, 1954, 
p. 381).  In Fitts' analogy, movement amplitudes are like signals and target 
tolerances or widths are like noise.
<p>

Fitts proposed an <I>index of difficulty</I> (<i>ID</i>) for a target acquisition task using 
a log-term slightly rearranged from Eq. 17.1.  Signal power (<I>S</I>) and noise 
power (<I>N</I>) are replaced by movement amplitude (<I>A</I>) and target width (<I>W</I>), 
respectively:

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
      <la-tex>
	  ID = \log_{2}\left(\frac{2A}{W}\right)
	  </la-tex>
<!-- <img src="hhci2018-eq2.jpg"> -->
<td width=10% align="right">(17.2)
</table>
<p>

Fitts referred to the target width as the "permissible 
variability" or the "movement tolerance" (Fitts, 1954, p. 382).  This is the 
region within which a movement is terminated.  As with the log-term in 
Eq. 17.1, the units for <i>ID</i> are <I>bits</I> because the ratio within the parentheses 
is unitless and the log is taken to base 2.
<p>
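As a quick numerical illustration of Eq. 17.2, the following minimal sketch computes <i>ID</i> for a few amplitude-width pairs.  The function name and the sample values are illustrative assumptions, not part of Fitts' paper.  It shows that doubling <I>A</I> or halving <I>W</I> adds exactly one bit.
<p>

```python
import math


def index_of_difficulty(amplitude, width):
    """Fitts' index of difficulty (Eq. 17.2), in bits.

    amplitude and width must be in the same unit so the ratio is unitless.
    """
    return math.log2(2 * amplitude / width)


# Illustrative values only (same unit for A and W).
print(index_of_difficulty(8, 1))    # 2A/W = 16
print(index_of_difficulty(16, 1))   # doubling A adds one bit
print(index_of_difficulty(8, 0.5))  # halving W adds one bit
```
<p>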

Fitts' idea was novel for two reasons:  First, it suggested that the difficulty 
of a target selection task could be quantified using the information metric 
<I>bits</I>.  Second, it introduced the idea that the act of performing a target 
selection task is akin to transmitting information through a channel &ndash; a human 
channel.  Fitts called the rate of transmission the <I>index of performance</I>, 
although today the term <I>throughput</I> (<i>TP</i>) is more common.
(For consistency, the term <i>throughput</i> is used throughout this chapter.)
<p>
 
Throughput is calculated over a sequence of trials as a simple quotient.  The 
index of difficulty (<i>ID</i>) of the task is the numerator and the mean movement 
time (<I>MT</I>) is the denominator:
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
      <la-tex>
	  TP = \left(\frac{ID}{MT}\right)
	  </la-tex>
<!-- <img src="hhci2018-eq3.jpg"> -->
<td width=10% align="right">(17.3)
</table>
<p>

With <i>ID</i> in bits and <I>MT</I> in seconds, <i>TP</i> has units <I>bits per second</I> 
or <I>bits/s</I>.  A central thesis in Fitts' work is that throughput is independent 
of movement amplitude and target width, as embedded in <i>ID</i>.  In other words, as 
<i>ID</i> changes (due to changes in <I>A</I> or <I>W</I>), <I>MT</I> changes in an opposing manner and <i>TP</i> 
remains more or less constant. 
<p>

Of course, throughput is expected to be influenced by other factors, such as 
device, interaction property, or environment.  Two devices were compared in 
Fitts' original experiment (see next section).  In HCI, a myriad of factors, or 
independent variables, can be explored using Fitts' throughput as a dependent 
variable.  Examples include "device" (mouse vs. stylus vs. trackball &ndash; see 
MacKenzie, Sellen, & Buxton, 1991), "dwell interval" with an eye tracker (700 
ms vs. 500 ms &ndash; see Zhang & MacKenzie, 2007), or "device position" (supported 
vs. mobile &ndash; see MacKenzie, 2015).  Throughput is particularly appealing as a 
dependent variable because it combines speed and accuracy in a single measure 
(using a technique described shortly).
<p>

Of the two uses of Fitts' law noted above &ndash; predicting and measuring &ndash; 
throughput exemplifies the use of Fitts' law for measuring.
<p>

<H2>Fitts' Experiments</H2>
<p>

 The original investigation (Fitts, 1954) involved four experiment conditions: 
 two reciprocal or serial tapping tasks (1-oz stylus and 1-lb stylus), a disc 
 transfer task, and a pin transfer task.  For the tapping condition, a 
 participant moved a stylus back and forth between two plates as quickly as 
 possible and tapped the plates at their centers (see Figure 17.1a).  Fitts later 
 devised a discrete variation of the task (Fitts & Peterson, 1964).  For the 
 discrete task, the participant selected one of two targets in response to a 
 stimulus light (see Figure 17.1b).  The tasks in Figure 17.1 are commonly called the 
 "Fitts' paradigm".  It is easy to imagine how to update Fitts' apparatus using 
 contemporary computing technology.
<p>

<center>
<blockquote>
<a name="figure1"></a>
(a) <a href="hhci2018-f1a.jpg"><img src="hhci2018-f1a.jpg" alt="figure 1a" width=400></a>
(b) <a href="hhci2018-f1b.jpg"><img src="hhci2018-f1b.jpg" alt="figure 1b" width=300></a>
<br>
<B>Figure 17.1.</B> The Fitts paradigm. (a) serial tapping task (after Fitts, 1954) (b) 
discrete task (after Fitts & Peterson, 1964).
</blockquote>
</center>
<p>

Fitts published summary data for his 1954 experiments, so a re-examination of 
his results is possible.  For the stylus tapping conditions, four target 
amplitudes (<I>A</I>) were crossed with four target widths (<I>W</I>).  For each <I>A-W</I> 
condition, participants performed two sequences of trials lasting 15 s 
each. (In current practice, a "sequence" is usually a specified number of trials, for instance 25, 
rather than a specified time interval.)  The summary data for the 1-oz stylus condition are given in Table 17.1.  
As well as <I>A</I> and <I>W</I>, the table includes the error rate (<I>ER</I>), index of difficulty 
(<i>ID</i>), movement time (<I>MT</I>), and throughput (<i>TP</i>).  The effective target width (<i>W</i><sub>e</sub>) 
column was added, as discussed shortly. 
<p>

<center>
<blockquote>
<B>Table 17.1</B><br>
Data from Fitts' (1954) serial tapping task experiment with a 1-oz stylus. An 
extra column shows the effective target width (<i>W</i><sub>e</sub>) after adjusting <I>W</I> for the 
percentage errors (see text).<br>

<table border="1px" cellspacing=0 cellpadding=5 width="49%">

<tr align=center bgcolor="#f0f0e0"><th><I>A</I><br>(in)<th><I>W</I><br>(in)<th><I>W</I><sub>e</sub><br>(in)
<th><I>ER</I><br>(%)<th><I>ID</I><br>(bits)<th><I>MT</I><br>(ms)<th><I>TP</I><br>(bits/s)

<tr align="center"><td width="7%">2<td width="7%">0.25<td width="7%">0.243<td width="7%">3.35<td width="7%">4<td width="7%">392<td width="7%">10.20

<tr align="center"><td>2<td>0.50<td>0.444<td>1.99<td>3<td>281<td>10.68

<tr align="center"><td>2<td>1.00<td>0.725<td>0.44<td>2<td>212<td>9.43

<tr align="center"><td>2<td>2.00<td>1.020<td>0.00<td>1<td>180<td>5.56

<tr align="center"><td>4<td>0.25<td>0.244<td>3.41<td>5<td>484<td>10.33

<tr align="center"><td>4<td>0.50<td>0.468<td>2.72<td>4<td>372<td>10.75

<tr align="center"><td>4<td>1.00<td>0.812<td>1.09<td>3<td>260<td>11.54

<tr align="center"><td>4<td>2.00<td>1.233<td>0.08<td>2<td>203<td>9.85

<tr align="center"><td>8<td>0.25<td>0.235<td>2.78<td>6<td>580<td>10.34

<tr align="center"><td>8<td>0.50<td>0.446<td>2.05<td>5<td>469<td>10.66

<tr align="center"><td>8<td>1.00<td>0.914<td>2.38<td>4<td>357<td>11.20

<tr align="center"><td>8<td>2.00<td>1.576<td>0.87<td>3<td>279<td>10.75

<tr align="center"><td>16<td>0.25<td>0.247<td>3.65<td>7<td>731<td>9.58

<tr align="center"><td>16<td>0.50<td>0.468<td>2.73<td>6<td>595<td>10.08

<tr align="center"><td>16<td>1.00<td>0.832<td>1.30<td>5<td>481<td>10.40

<tr align="center"><td>16<td>2.00<td>1.519<td>0.65<td>4<td>388<td>10.31

<tr><td colspan=5 align="right">Mean<td align="center">391.5<td align="center">10.10

<tr><td colspan=5 align="right"><I>SD</I><td align="center">157.3<td align="center">1.33

</table>
</blockquote>
</center>
<p>

The combination of conditions in Table 17.1 yields task difficulties ranging from 
1 bit to 7 bits.  The mean <I>MT</I>s observed ranged from 180 ms (<i>ID</i> = 1 bit) to 731 
ms (<i>ID</i> = 7 bits), with each mean derived from more than 600 observations over 
16 participants.  The standard deviation in the <I>MT</I> values was 157.3 ms, which 
is 40.2% of the mean.  This is fully expected since "hard tasks" (e.g., <i>ID</i> = 7 
bits) will obviously take longer than "easy tasks" (e.g., <i>ID</i> = 1 bit).
<p>

Fitts calculated throughput by dividing <i>ID</i> by <I>MT</I> (Eq. 17.3) for each task 
condition.  The mean throughput was 10.10 bits/s.  A quick glance at the <i>TP</i> 
column in Table 17.1 shows strong evidence for the thesis that the rate of 
information processing is relatively independent of task difficulty.  Despite 
the wide range of task difficulties, the standard deviation of the <i>TP</i> values 
was 1.33 bits/s, which is just 13.2% of the mean.  
<p>
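The calculation can be reproduced from the <i>ID</i> and <I>MT</I> columns of Table 17.1.  The sketch below is one way to do it; the variable names are illustrative, and the use of the sample standard deviation (<I>n</I> &minus; 1 denominator) is an assumption that happens to reproduce the tabled value.
<p>

```python
import statistics

# (ID in bits, MT in ms) for the 16 A-W conditions in Table 17.1.
CONDITIONS = [
    (4, 392), (3, 281), (2, 212), (1, 180),
    (5, 484), (4, 372), (3, 260), (2, 203),
    (6, 580), (5, 469), (4, 357), (3, 279),
    (7, 731), (6, 595), (5, 481), (4, 388),
]

# Eq. 17.3: TP = ID / MT, with MT converted from milliseconds to seconds.
throughputs = [id_bits / (mt_ms / 1000) for id_bits, mt_ms in CONDITIONS]

mean_tp = statistics.mean(throughputs)
sd_tp = statistics.stdev(throughputs)  # sample SD (assumption, see lead-in)

print(f"mean TP = {mean_tp:.2f} bits/s")
print(f"SD      = {sd_tp:.2f} bits/s ({100 * sd_tp / mean_tp:.1f}% of the mean)")
```
<p>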

One way to visualize the data in Table 17.1 and the independence of <i>TP</i> from <i>ID</i> is 
through a scatter plot showing the <I>MT</I>-<i>ID</i> point for each task condition.  
Figure 17.2 shows such a plot for the data in Table 17.1.  The figure also includes 
the best-fitting line (via least-squares regression), the linear equation, and 
the squared correlation.  The independence of <i>TP</i> from <i>ID</i> is reflected in the 
closeness of the points to the regression line (indicating a constant <i>ID</i> / <I>MT</I> 
ratio).  Indeed, the fit is very good with 96.6% of the variance explained by 
the model.
<p>

<center>
<blockquote>
<a name="figure2"></a>
<a href="hhci2018-f2.jpg"><img src="hhci2018-f2.jpg" alt="figure 2" width=500></a><br>
<B>Figure 17.2.</B> Scatter plot and least-squares regression analysis for the data in 
Table 17.1.
</blockquote>
</center>
<p>

The linear equation in Figure 17.2 takes the following general form:
<p>


<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center"><I>MT</I> = <I>a</I> + <I>b</I> <I>ID</I>
<td width=10% align="right">(17.4)
</table>
<p>

The regression coefficients include an intercept <I>a</I> with units 
<I>seconds</I> and a slope <I>b</I> with units <I>seconds per bit</I>. 
(Several interesting yet difficult issues arise in interpreting the slope and 
intercept coefficients in Eq. 17.4.  Due to space limitations, these are not 
elaborated here.  The interested reader is directed to sections 3.4 and 3.5 in 
Soukoreff and MacKenzie, 2004.)  Eq. 17.4 exemplifies the 
use of Fitts' law for <I>predicting</I>.  This is in contrast with Eq. 17.3 which is 
the use of Fitts' law for <I>measuring</I>.
<p>
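A least-squares fit of Eq. 17.4 can be sketched in a few lines.  The helper <code>fit_line</code> below is a hypothetical name, and the coefficients it prints are simply those computed from the tabled <i>ID</i> and <I>MT</I> values; only the explained variance (96.6%) is stated in the text above.
<p>

```python
# ID (bits) and MT (ms) for the 16 conditions in Table 17.1.
IDS = [4, 3, 2, 1, 5, 4, 3, 2, 6, 5, 4, 3, 7, 6, 5, 4]
MTS = [392, 281, 212, 180, 484, 372, 260, 203,
       580, 469, 357, 279, 731, 595, 481, 388]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a + b*x.

    Returns (intercept a, slope b, squared correlation R^2).
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = sxy ** 2 / (sxx * syy)
    return intercept, slope, r_squared


intercept, slope, r_squared = fit_line(IDS, MTS)
print(f"MT = {intercept:.1f} + {slope:.1f} ID  (ms), R^2 = {r_squared:.4f}")

# Using the model for prediction: movement time for a 4-bit task.
print(f"predicted MT at ID = 4 bits: {intercept + slope * 4:.1f} ms")
```
<p>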

<H2>Refinements to Fitts' Law</H2>
<p>

In the years since the first publication in 1954, many changes or refinements 
to Fitts' law have been proposed.  While there are considerations in both 
theory and practice, a prevailing rationale is the need for precise 
mathematical formulations in HCI and other fields for the purpose of 
measurement.  One can imagine (and hope!) that different researchers using 
Fitts' law to examine similar phenomena should obtain similar results.  This is 
only possible if there is general agreement on the methods for gathering and 
applying data.
<p>

An early motivation for altering or improving Fitts' law stemmed from the 
observation that the <I>MT</I>-<i>ID</i> data points curved away from the regression line, 
with the most deviant point at <i>ID</i> = 1 bit.  This is clearly seen in the 
left-most point in Figure 17.2.  In an effort to improve the data-to-model fit, 
Welford (1960, 1968, p. 147) introduced the following formulation:
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
      <la-tex>
	  ID = \log_{2}\left(\frac{A}{W} + 0.5\right)
	  </la-tex>
<!-- <img src="hhci2018-eq5.jpg"> -->
<td width=10% align="right">(17.5)
</table>
<p>

This version of <i>ID</i> has been used frequently over the 
years, and in particular by Card et al. (1978) in their comparative evaluation 
of the computer mouse. 
(A re-analysis of the results reported by Card et al., 1978, is given by 
MacKenzie and Soukoreff, 2003, in view of a contemporary understanding of 
Fitts' law.)  Fitts also used the Welford formulation in a 1964 
paper and reported an improvement in the regression-line fit compared to the 
Fitts formulation (Fitts & Peterson, 1964, p. 110).
<p>

In 1989, it was shown that Fitts deduced his relationship citing an 
approximation of Shannon's theorem that only applies if the signal-to-noise 
ratio is large (Fitts, 1954, p. 388; Goldman, 1953, p. 157; MacKenzie, 1989, 
1992).  The signal-to-noise ratio in Shannon's theorem appears as the <I>A</I>-to-<I>W</I> 
ratio in Fitts' analogy.  As seen in Table 17.1, the <I>A</I>-to-<I>W</I> ratio in Fitts' 
stylus-tapping experiment extended as low as 1:1! The variation of Fitts' index 
of difficulty suggested by direct analogy with Shannon's information theorem is
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
      <la-tex>
	  ID = \log_{2}\left(\frac{A}{W} + 1\right)
	  </la-tex>
<!-- <img src="hhci2018-eq6.jpg"> -->
<td width=10% align="right">(17.6)
</table>
<p>

Besides the improved link with information theory, 
Eq. 17.6, known as the <I>Shannon formulation</I>, provides better correlations 
compared to the Fitts or Welford formulation (MacKenzie, 1989, Table 1 and Table 
2; 1991, Table 4; 2013, Table 3). 
<p>

An additional feature of the Shannon formulation is that <i>ID</i> cannot be negative. 
 Obviously, a negative rating for task difficulty presents a serious 
theoretical problem.  Although the prospect of a negative <i>ID</i> may seem unlikely, 
such conditions have actually been reported in the Fitts' law literature (Card 
et al., 1978; Crossman & Goodeve, 1983; Gillan, Holden, Adam, Rudisill, & 
Magee, 1990; Ware & Mikaelian, 1987).  With the Shannon formulation, a negative 
<i>ID</i> is simply not possible.  This is illustrated in Figure 17.3, which shows <i>ID</i> 
smoothly approaching 0 bits as <I>A</I> approaches 0.  With the Fitts and Welford 
formulations, <i>ID</i> dips negative for small <I>A</I>.
<p>

<center>
<blockquote>
<a name="figure3"></a>
<a href="hhci2018-f3.jpg"><img src="hhci2018-f3.jpg" alt="figure 3" width=600></a><br>
<B>Figure 17.3.</B> With the Shannon formulation, <i>ID</i> approaches 0 bits as <I>A</I> approaches 0. 
</blockquote>
</center>
<p>
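The behaviour of the three formulations for small amplitudes can be checked directly.  The following is a minimal sketch with illustrative function names; <I>W</I> is fixed at 1 so that <I>A</I> equals the <I>A</I>-to-<I>W</I> ratio.
<p>

```python
import math


def id_fitts(a, w):
    """Fitts formulation (Eq. 17.2)."""
    return math.log2(2 * a / w)


def id_welford(a, w):
    """Welford formulation (Eq. 17.5)."""
    return math.log2(a / w + 0.5)


def id_shannon(a, w):
    """Shannon formulation (Eq. 17.6)."""
    return math.log2(a / w + 1)


W = 1.0
print("  A/W    Fitts  Welford  Shannon")
for a in (0.25, 0.5, 1.0, 2.0, 8.0):
    print(f"{a / W:5.2f}  {id_fitts(a, W):7.3f}  {id_welford(a, W):7.3f}"
          f"  {id_shannon(a, W):7.3f}")
```

For <I>A</I> / <I>W</I> = 0.25, the Fitts and Welford values are negative while the Shannon value remains positive.  For large ratios, the Shannon value is close to one bit below the Fitts value.
<p>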

<H2>Adjustment for Accuracy</H2>
<p>

 Of greater practical importance is a technique to improve the 
 information-theoretic analogy in Fitts' law by adjusting the specified or set 
 target width (akin to noise) according to the spatial variability in the human 
 operator's output responses over a sequence of trials.   The idea was first 
 proposed by Crossman in 1957 in an unpublished report (cited in Welford, 1968, 
 p. 147).  Use of the adjustment was later examined and endorsed by Fitts 
 (Fitts & Peterson, 1964, p. 110).
<p>

The output or <I>effective target width</I> (<i>W</i><sub>e</sub>) is derived from the distribution of 
"hits" (see MacKenzie, 1992, section 3.4; Welford, 1968, pp. 147-148).  This 
adjustment lies at the very heart of the information-theoretic metaphor &ndash; that 
movement amplitudes are analogous to "signals" and that endpoint variability 
(viz., target width) is analogous to "noise."  In fact, the information theorem 
underlying Fitts' law assumes that the signal is "perturbed by white thermal 
noise" (Shannon & Weaver, 1949, p. 100).  The analogous requirement in motor 
tasks is a Gaussian or normal distribution of hits &ndash; a property observed by 
numerous researchers (e.g., Fitts & Radford, 1966; MacKenzie, 1991, p. 84; 
2015; Welford, 1968, p. 154).
<p>

The experimental implication of normalizing output measures is illustrated as 
follows. The entropy, or information, in a normal distribution is 
log<sub>2</sub>((2&pi;e)<sup>1/2</sup>&sigma;) = 
log<sub>2</sub>(4.133 &sigma;), where &sigma; is the standard deviation in the unit of 
measurement.  Splitting the constant 4.133 into a pair of <I>z</I>-scores for the 
unit-normal curve (i.e., &sigma; = 1), one finds that 96% of the total area is 
bounded by &minus;2.066 &lt; <i>z</i> &lt; +2.066.  In other words, a condition that target width 
is analogous to noise is that the distribution is normal with 96% of the hits 
falling within the target and 4% of the hits missing the target. See Figure 17.4a. 
When an error rate other than 4% is observed, target width is adjusted to form 
the effective target width in keeping with the underlying theory.
<p>
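The constants in this argument are easy to verify numerically.  The sketch below uses <code>statistics.NormalDist</code> from the Python standard library (Python 3.8 or later); the variable names are illustrative.
<p>

```python
import math
from statistics import NormalDist

# Constant in the entropy of a normal distribution: sqrt(2 * pi * e).
k = math.sqrt(2 * math.pi * math.e)

# Split the constant into a symmetric pair of z-scores.
z = k / 2

# Area under the unit-normal curve between -z and +z.
unit_normal = NormalDist(mu=0.0, sigma=1.0)
area = unit_normal.cdf(z) - unit_normal.cdf(-z)

print(f"sqrt(2*pi*e) = {k:.3f}")
print(f"z            = {z:.3f}")
print(f"area         = {100 * area:.1f}% of the distribution")
```
<p>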

<center>
<blockquote>
<a name="figure4"></a>
(a) <a href="hhci2018-f4a.jpg"><img src="hhci2018-f4a.jpg" alt="figure 4a" width=500></a>
(b) <a href="hhci2018-f4b.jpg"><img src="hhci2018-f4b.jpg" alt="figure 4b" width=500></a><br>
<B>Figure 17.4.</B> Method of adjusting target width based on the distribution of 
selections. (a) When 4% errors occur, the effective target width, <i>W</i><sub>e</sub> = <I>W</I>. (b) 
When the error rate is less than 4%, <i>W</i><sub>e</sub> &lt; <I>W</I>.
</blockquote>
</center>
<p>

There are two methods for determining the effective target width, the 
<I>standard-deviation method</I> and the <I>discrete-error method</I>.  If the standard 
deviation of the endpoint coordinates is known, just multiply <I>SD</I> by 4.133 to 
get <i>W</i><sub>e</sub>.  If only the percentage of errors is known, the method uses a table of 
<I>z</I>-scores for areas under the unit-normal curve.
(Such a table is found in the appendix of most statistics textbooks.  <I>z</I>-scores 
are also available using the NORM.S.INV function in Microsoft <I>Excel</I>.)
Here is the method:  If <I>n</I> 
percent errors are observed over a sequence of trials for a particular <I>A-W</I> 
condition, determine <I>z</I> such that &plusmn;<I>z</I> contains 100 - <I>n</I> percent of the area under 
the unit-normal curve.  Multiply <I>W</I> by 2.066 / <I>z</I> to get <i>W</i><sub>e</sub>.  As an example, if 2% 
errors occur on a sequence of trials when selecting a 5-cm wide target, then 
<I>W</I><sub>e</sub> =  2.066 / <I>z</I>  &times; <I>W</I> =  2.066 / 2.326 &times; 5 = 4.44 cm.  
See Figure 17.4b.  
Broadly, the figure 
illustrates that <i>W</i><sub>e</sub> &lt; <I>W</I> when error rates are less than 4% and that <i>W</i><sub>e</sub> &gt; <I>W</I> when error rates exceed 4%.
<p>
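Both methods can be sketched in a few lines.  The function names below are hypothetical, and <code>NormalDist.inv_cdf</code> from the Python standard library stands in for the table of <I>z</I>-scores.  The endpoint values passed to the standard-deviation method are made up for illustration.
<p>

```python
from statistics import NormalDist, stdev

ENTROPY_CONSTANT = 4.133  # sqrt(2 * pi * e), as given in the text
Z_NOMINAL = 2.066         # half of 4.133; +/- z bounds about 96% of the area


def we_from_sd(endpoints):
    """Standard-deviation method: W_e = 4.133 x SD of the selection endpoints."""
    return ENTROPY_CONSTANT * stdev(endpoints)


def we_from_error_rate(width, error_percent):
    """Discrete-error method: W_e = (2.066 / z) x W.

    z is chosen so that +/- z bounds (100 - error_percent)% of the
    unit-normal area.  error_percent must be strictly between 0 and 100;
    an error rate of exactly 0 has no finite z-score.
    """
    z = NormalDist().inv_cdf(1 - error_percent / 200)
    return width * Z_NOMINAL / z


# Worked example from the text: 2% errors on a 5-cm target (about 4.4 cm).
print(f"{we_from_error_rate(5.0, 2.0):.2f} cm")

# First row of Table 17.1: W = 0.25 in, ER = 3.35%.
print(f"{we_from_error_rate(0.25, 3.35):.3f} in")

# Standard-deviation method on made-up endpoint coordinates (cm).
print(f"{we_from_sd([-0.8, 0.3, 1.1, -0.2, 0.6, -1.0]):.2f} cm")
```
<p>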

Experiments using the adjusted or effective target width will typically find a 
reduced variation in <I>TP</I> because of the speed-accuracy tradeoff:  Participants 
who take longer are more accurate and demonstrate less endpoint variability.  
Reduced endpoint variability decreases the effective target width and therefore 
increases the effective index of difficulty (see Eq. 17.2).  The converse is 
also true.  On the whole, an increase (or decrease) in <I>MT</I> is accompanied by an 
increase (or decrease) in the effective <i>ID</i>, and this tends to lessen the 
variability in <I>TP</I> (see Eq. 17.3).
<p>

The technique just described dates to 1957, yet it was largely ignored in the 
published body of Fitts' law research that followed.<a href="#f1"><sup>1</sup></a>  There are several 
possible reasons.  First, the method is tricky and its derivation from 
information-theoretic principles is complicated (see Reza, 1961, pp. 278-282). 
 Second, selection coordinates must be recorded for each trial in order to 
calculate <i>W</i><sub>e</sub> from the standard deviation. This is feasible using a computer for 
data acquisition and statistical software for analysis, but manual measurement 
and data entry are extremely awkward.
<p>

Inaccuracy may enter when adjustments use the percent errors &ndash; the 
discrete-error method &ndash; because the extreme tails of the unit-normal 
distribution are involved.  It is necessary to use <I>z</I>-scores with at least three 
decimal places of accuracy for the factoring ratio (which is multiplied by <I>W</I> to 
yield <i>W</i><sub>e</sub>).  Manual look-up methods are prone to precision errors.  Furthermore, 
some of the easier experimental conditions may have error rates too low to 
reveal the true distribution of hits.  The technique cannot accommodate 
"perfect performance"!  An example appears in Table 17.1 for the condition <I>A</I> = <I>W</I> = 
2 in.  Fitts reported an error rate of 0.00%, which seems reasonable because 
the target edges were touching.  This observation implies a large adjustment 
because the distribution is very narrow in comparison to the target width over 
which the hits should have been distributed &ndash; with 4% errors!  A pragmatic 
approach in this case is to assume a worst-case error rate of 0.0049% (which 
rounds to 0.00%) and proceed to make the adjustment.
<p>

Introducing a post hoc adjustment on target width as just described is 
important to maintain the information-theoretic analogy.  There is a tacit 
assumption in Fitts' law that participants, although instructed to move "as 
quickly and accurately as possible," balance the demands of tasks to meet the 
spatial constraint that 96% of the hits fall within the target.  When this 
condition is not met, the adjustment should be introduced.  Note as well that 
if participants slow down and place undue emphasis on accuracy, the task 
changes; the constraints become temporal, and the prediction power of the model 
falls off (Meyer, Abrams, Kornblum, Wright, & Smith, 1988).  In summary, Fitts' 
law is a model for rapid, aimed movements, and the presence of a nominal yet 
consistent error rate in participants' behavior is assumed and arguably vital.
<p>

Table 17.1 includes an additional column for the effective target width (<i>W</i><sub>e</sub>), 
computed using the discrete-error method.  A re-analysis of the data in Table 17.1 
using <i>W</i><sub>e</sub> and the Shannon formulation for the index of difficulty is shown in 
Figure 17.5.  The fit of the model is improved (<I>R</I><sup>2</sup> = .9877) as the data points are 
now closer to the best-fitting line.  The curving away from the regression line 
for easy tasks appears corrected.  Note that the range of <i>ID</i>s is narrower using 
adjusted measures (cf. Figure 17.2 & Figure 17.5). This is due to the 1-bit decrease 
when <i>ID</i> is greater than about 2 bits (see Figure 17.3) and the general increase in 
<i>ID</i> for "easy" tasks because of the narrow distribution of hits.  
<p>

<center>
<blockquote>
<a name="figure5"></a>
<a href="hhci2018-f5.jpg"><img src="hhci2018-f5.jpg" alt="figure 5" width=500></a><br>
<B>Figure 17.5.</B> Re-analysis of data in Table 17.1 using the effective target width (<i>W</i><sub>e</sub>) 
and the Shannon formulation of index of difficulty (<i>ID</i><sub>e</sub>).
</blockquote>
</center>
<p>
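The re-analysis can be reproduced from the <I>A</I>, <i>W</i><sub>e</sub>, and <I>MT</I> columns of Table 17.1.  The sketch below is one possible implementation; <code>fit_line</code> is a hypothetical helper, and the slope and intercept it prints are computed from the tabled values rather than quoted from the figure.
<p>

```python
import math

# (A in inches, W_e in inches, MT in ms) for the 16 conditions in Table 17.1.
ROWS = [
    (2, 0.243, 392), (2, 0.444, 281), (2, 0.725, 212), (2, 1.020, 180),
    (4, 0.244, 484), (4, 0.468, 372), (4, 0.812, 260), (4, 1.233, 203),
    (8, 0.235, 580), (8, 0.446, 469), (8, 0.914, 357), (8, 1.576, 279),
    (16, 0.247, 731), (16, 0.468, 595), (16, 0.832, 481), (16, 1.519, 388),
]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a + b*x; returns (a, b, R^2)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope, sxy ** 2 / (sxx * syy)


# Effective index of difficulty: Shannon formulation (Eq. 17.6) with W_e.
id_e = [math.log2(a / w_e + 1) for a, w_e, _ in ROWS]
mt = [mt_ms for _, _, mt_ms in ROWS]

intercept_e, slope_e, r_squared_e = fit_line(id_e, mt)
print(f"ID_e range: {min(id_e):.2f} to {max(id_e):.2f} bits")
print(f"MT = {intercept_e:.1f} + {slope_e:.1f} ID_e  (ms), "
      f"R^2 = {r_squared_e:.4f}")
```
<p>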

Although Fitts' apparatus only recorded "hit" or "miss", modern computer-based 
systems are usually capable of recording the coordinate of target selection.
(There are exceptions.  Interaction methods that employ <I>dwell-time selection</I> 
perform target selection by maintaining the cursor within the target for a 
prescribed time interval.   There is no selection coordinate per se.  Examples 
of dwell-time selection include input using an eye tracker, such as MacKenzie, 2012 and 
Zhang & MacKenzie, 2007, or tilt-based input, such as Constantin & MacKenzie, 2014 and 
MacKenzie & Teather, 2012.)
<p>

As noted earlier, these data allow use of the standard-deviation method to 
calculate <i>W</i><sub>e</sub>.  It is also possible, therefore, to calculate an effective 
amplitude (<i>A</i><sub>e</sub>) &ndash; the actual distance moved.  Using <i>A</i><sub>e</sub> has little 
influence provided selections are distributed about the center of the targets. 
 However, it is important to use <i>A</i><sub>e</sub> to prevent "gaming the system."  For 
example, if all movements fall short and only traverse, say, &frac34; &times; <I>A</I>, <i>TP</i><sub>e</sub> is 
artificially inflated if calculated using <I>A</I>.  Using <i>A</i><sub>e</sub> prevents this.  This is 
part of the overall premise in using "effective" values:  Participants get 
credit for what they actually did, not for what they were asked to do. 
<p>
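The inflation is easy to demonstrate.  The following sketch uses made-up values for <I>A</I>, <i>W</i><sub>e</sub>, and <I>MT</I>, and assumes the Shannon formulation (Eq. 17.6) for the effective index of difficulty; the function name is illustrative.
<p>

```python
import math


def throughput(amplitude, effective_width, mt_seconds):
    """Throughput in bits/s using the Shannon formulation with W_e."""
    id_e = math.log2(amplitude / effective_width + 1)
    return id_e / mt_seconds


# Hypothetical sequence: nominal amplitude 16 cm, but every movement
# falls short and covers only three-quarters of that distance.
A_NOMINAL = 16.0             # cm, what participants were asked to do
A_EFFECTIVE = 0.75 * A_NOMINAL  # cm, what they actually did
W_E = 1.0                    # cm
MT = 0.5                     # s

tp_nominal = throughput(A_NOMINAL, W_E, MT)
tp_effective = throughput(A_EFFECTIVE, W_E, MT)

print(f"TP using A   = {tp_nominal:.2f} bits/s (inflated)")
print(f"TP using A_e = {tp_effective:.2f} bits/s")
```
<p>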

<H2>What is Fitts' Law?</H2>
<p>

At this juncture, it is worth stepping back and considering the big picture:  
What is Fitts' law?  Among the refinements to Fitts' index of difficulty noted 
earlier, only the Welford and Shannon formulations were presented.  Although 
other formulations exist, they are not reviewed here.  There is a reason.  In 
most cases, alternative formulations were introduced following a 
straightforward process:  A change was proposed and rationalized and then a 
new prediction equation was presented and empirically tested for goodness of 
fit.  Researchers often approach this exercise in a rather single-minded way.  
The goal is to improve the fit.  A higher correlation is deemed evidence that 
the change improves the model &ndash; period.   But, there is a problem.  The altered 
model often lacks any term with units "bits".  And so, the information metaphor 
is lost.  This can occur for a variety of reasons, such as using a non-log form 
of <i>ID</i> (e.g., power, linear), inserting new terms, or splitting the log-term 
into separate terms for <I>A</I> and <I>W</I>.  If there is no term with units "bits", there 
is no throughput.  While such models may indeed be valid, characterizing them 
as improvements to Fitts' law, or even as variations on Fitts' law is, 
arguably, wrong.  They are entirely different models.
<p>

The position taken in the above paragraph follows from two points.  First, the 
prediction form of Fitts' law (Eq. 17.4) does not appear in Fitts' original 
1954 publication.  Thus, it is questionable whether any effort motivated simply 
to improve the fit of the prediction equation falls within the realm of Fitts' 
law research.  Second, Fitts' law is fundamentally about the <I>information 
capacity of the human motor system</I>. 
(The title of Fitts' 1954 paper begins with the words set in italics.)  
The true embodiment of Fitts' law is 
Eq. 17.3 for throughput, which appears in the original paper, albeit with 
different labels (Fitts, 1954, Eq. 2).  Thus, retaining the information 
metaphor is central to Fitts' law.
<p>

<H2>ISO 9241-9</H2>
<p>

In the decades after the first publication (Fitts, 1954), numerous Fitts' law 
studies appeared &ndash; and in a great variety of forms.  While the internal 
validity of these studies is not in question, there is considerable 
inconsistency in this body of research, and this renders across-study 
comparisons a daunting task.  Simply put, it is often not possible to compare 
throughput values from one study to another.  On careful reading, details are 
often inadequately reported.  Where details are given, it is clear that throughput 
was often calculated in different ways.  Furthermore, inconsistencies exist in 
the data collected or in the way the data are put to work in building Fitts' 
law models or calculating throughput.  Clearly, Fitts' law research could 
benefit from a standardized methodology.  This is particularly true in HCI, 
where the practical benefits of new ideas must be assessed and compared with 
related ideas in other publications.  Enter ISO 9241-9.
<p>

ISO standards are written by technical committees drawn from the research and 
applied research communities.  One standard relevant to HCI is the multi-part 
ISO 9241, "Ergonomic requirements for office work with visual display terminals 
(VDTs)." Draft versions began to appear in the 1990s.  Part 9 is "Requirements 
for non-keyboard input devices" (ISO, 2000).  The standard has since been 
updated to the more generic title "Ergonomics of human-system interaction".
The parts have also been updated, renamed, and renumbered.  Part 9 is now Part 
411, "Evaluation methods for the design of physical input devices" (ISO, 
2012).
(References in this chapter to ISO 9241-9 also apply to ISO 9241-411.  With 
respect to the Fitts' law testing procedures, the two versions are the same.)
The standard is relevant to virtually any input mechanism that can 
perform point-select operations on a computer.   If there is one key benefit of 
ISO 9241-9, it is the standardization brought to the application of Fitts' law 
to input research in HCI. 
<p>

The two main performance testing procedures in ISO 9241-9 employ the Fitts' 
paradigm.  There is a one-dimensional (1D) task and a two-dimensional (2D) 
task, both using serial target selections.  Including a 2D task is a pragmatic 
extension to Fitts' law to support interactions  commonly found in computing 
systems.  Although the possibility of a discrete task was described by Fitts 
(see Figure 17.1b herein) and is used in some Fitts' law studies, discrete tasks 
are not included in ISO 9241-9. 
<p>

Screen snaps from the author's implementations are shown in Figure 17.6a for 
F<font size=-2>ITTS</font>T<font size=-2>ASK</font>O<font size=-2>NE</font> (1D) and in Figure 17.6b for 
F<font size=-2>ITTS</font>T<font size=-2>ASK</font>T<font size=-2>WO</font> (2D).
(Available as free downloads at <a href="http://www.yorku.ca/mack/FittsLawSoftware/"><code>http://www.yorku.ca/mack/FittsLawSoftware/</code></a>.  
The downloads include Java source and class files, executable JAR files, 
examples, and detailed APIs.)
For the 2D image, 
dashed lines are superimposed to show the sequence of target selections.  As 
each target is selected, the highlight moves to a position across the layout 
circle to reveal the next target to the participant.  Figure 17.6c shows a typical 
popup dialog after a sequence of trials using a mouse with 
F<font size=-2>ITTS</font>T<font size=-2>ASK</font>T<font size=-2>WO</font>.  The 
throughput value of 4.9 bits/s is typical for a mouse in this context.
<p>

<center>
<blockquote>
<a name="figure6"></a>
(a) <a href="hhci2018-f6a.jpg"><img src="hhci2018-f6a.jpg" alt="figure 6a" height=300></a>
(b) <a href="hhci2018-f6b.jpg"><img src="hhci2018-f6b.jpg" alt="figure 6b" height=300></a>
(c) <a href="hhci2018-f6c.jpg"><img src="hhci2018-f6c.jpg" alt="figure 6c" height=300></a><br>
<B>Figure 17.6.</B> Implementations of the (a) one-dimensional 
(F<font size=-2>ITTS</font>T<font size=-2>ASK</font>O<font size=-2>NE</font>) and (b) 
two-dimensional 
(F<font size=-2>ITTS</font>T<font size=-2>ASK</font>T<font size=-2>WO</font>) tasks in ISO 9241-9. (c) Popup dialog after a 
sequence of trials.
</blockquote>
</center>
<p>

ISO 9241-9 and the Fitts' paradigm have been used in many studies over the past 
15 or so years.  Examples of novel interactions or devices evaluated according 
to the standard include a trackball game controller (Natapov, Castellucci, & 
MacKenzie, 2009), smartphone touch input (MacKenzie, 2015), tabletop touch 
input (Sasangohar, MacKenzie, & Scott, 2009), <I>Wiimote</I> gun attachments 
(McArthur, Castellucci, & MacKenzie, 2009), eye tracking (Zhang & MacKenzie, 
2007), glove input (Calvo, Burnett, Finomore, & Perugini, 2012), and lip input 
(Jos&eacute; & de Deus Lopes, 2015).  Throughput values range from about 1 bit/s for 
lip input to about 7 bits/s for touch input.  Mouse values are typically in the 
4-5 bits/s range. 
<p>

<H2>Calculation of Throughput</H2>
<p>

Although ISO 9241-9 provides the correct formula for Fitts' throughput, little 
guidance is offered on data collection, data aggregation, or performing 
the adjustment for accuracy.  The latter presents a particular challenge when 
using the 2D task.  In this section we examine the best-practice method for 
calculating Fitts' throughput.  We begin with Figure 17.7 which shows the formula 
for throughput, expanded to reveal the Shannon formulation for <i>ID</i> and the use 
of effective values for target amplitude and target width.  The figure also 
highlights the presence of speed (1 / <I>MT</I>) and accuracy (<I>SD<sub>x</sub></i>) in the 
calculation.
<p>

<center>
<blockquote>
<a name="figure7"></a>
<a href="hhci2018-f7.jpg"><img src="hhci2018-f7.jpg" alt="figure 7" height=200></a><br>
<B>Figure 17.7.</B> Formula for throughput showing the Shannon formulation for <i>ID</i> and the 
adjustment for accuracy.  Speed (1 / <I>MT</I>) and accuracy (<I>SD<sub>x</sub></I>) are featured.
</blockquote>
</center>
<p>

 Whether using the 1D or the 2D task, the calculation of throughput requires 
 Cartesian coordinate data for each trial.  Data are required for three points: 
 the starting position ("from"), the target position ("to"), and the trial-end 
 position ("select").  See Figure 17.8.  Although the figure shows a trial with 
 horizontal movement to the right, the calculations described next are valid 
 for movements in any direction or angle.  Circular targets are shown to 
 provide a conceptual visualization of the task.  Other target shapes are 
 possible, depending on the setup in the experiment. 
<p>

<center>
<blockquote>
<a name="figure8"></a>
<a href="hhci2018-f8.jpg"><img src="hhci2018-f8.jpg" alt="figure 8" height=200></a><br>
<B>Figure 17.8.</B> Geometry for a trial.
</blockquote>
</center>
<p>

The calculation begins by computing the length of the sides connecting the 
<code>from</code>, <code>to</code>, and <code>select</code> points in the figure.  Using Java syntax,
<p>

<blockquote>
<pre>
double a = Math.hypot(x1 - x2, y1 - y2);
double b = Math.hypot(x - x2, y - y2);
double c = Math.hypot(x1 - x, y1 - y);
</pre>
</blockquote>
<p>

The <I>x-y</I> coordinates correspond to the <code>from</code> (<I>x</I><sub>1</sub>, <I>y</I><sub>1</sub>), 
<code>to</code> (<I>x</I><sub>2</sub>, <I>y</I><sub>2</sub>), and <code>select</code> 
(<I>x</I>, <i>y</i>) points in the figure.  Given <code>a</code>, <code>b</code>, and <code>c</code>, as above, 
<code>dx</code> and <code>ae</code> are then 
calculated:
<p>

<blockquote>
<pre>
double dx = (c * c - b * b - a * a) / (2.0 * a);
double ae = a + dx; 
</pre>
</blockquote>
<p>

Note that <code>dx</code> is 0 for a selection at the center of the target (as projected on 
the task axis), positive for a selection on the far side of center, and 
negative for a selection on the near side. It is an expected behaviour that 
some selections will miss the target.
<p>
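
The expression for <code>dx</code> follows from the Pythagorean relation.  As a brief 
sketch (the symbols <I>p</I> and <I>q</I> are introduced here for illustration only and do 
not appear in the figure), let <I>p</I> be the distance from the <code>from</code> point to the 
projection of the <code>select</code> point on the task axis, and let <I>q</I> be the 
perpendicular offset of the <code>select</code> point from that axis.  Then 
<I>c</I><sup>2</sup> = <I>p</I><sup>2</sup> + <I>q</I><sup>2</sup> and 
<I>b</I><sup>2</sup> = (<I>p</I> &minus; <I>a</I>)<sup>2</sup> + <I>q</I><sup>2</sup>, so that
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
<la-tex>
dx = \frac{c^2 - b^2 - a^2}{2a} = \frac{2ap - 2a^2}{2a} = p - a
</la-tex>
<td width=10%>&nbsp;
</table>
<p>

and <code>ae</code> = <I>a</I> + <code>dx</code> = <I>p</I>.  For example, with <code>from</code> at (0, 0), <code>to</code> at (100, 0), 
and <code>select</code> at (110, 20), we have <I>a</I> = 100, <I>b</I><sup>2</sup> = 500, and <I>c</I><sup>2</sup> = 12,500, giving 
<code>dx</code> = (12,500 &minus; 500 &minus; 10,000) / 200 = 10 and <code>ae</code> = 110.
<p>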

The effective target amplitude (<i>A</i><sub>e</sub>) is <code>ae</code> in the code above.  It is the actual 
point-to-point movement distance for the trial, as projected on the task axis. 
 For serial responses, an additional adjustment for <i>A</i><sub>e</sub> is to add <code>dx</code> from the 
previous trial (for all trials after the first).  This is necessary since each 
trial begins at the selection point of the previous trial.  For discrete 
responses, each trial begins at the center of the <code>from</code> target.
<p>

Given arrays for the <code>from</code>, <code>to</code>, and <code>select</code> points in a sequence of trials and 
the computed <code>ae</code> and <code>dx</code> for each trial, <i>A</i><sub>e</sub> is the mean of the <code>ae</code> values and 
<I>SD<sub>x</sub></I> 
is the standard deviation in the <code>dx</code> values. With these, <i>ID</i><sub>e</sub> is computed using 
Eq. 17.6 (substituting <i>A</i><sub>e</sub> and <i>W</i><sub>e</sub> = 4.133 &times; <I>SD<sub>x</sub></I>) and throughput (<i>TP</i>) is 
computed using Eq. 17.3 (using <i>ID</i><sub>e</sub>).  See also the equation in Figure 17.7.  Of 
course, movement time (<I>MT</I>) is the mean of the times recorded for all trials in 
the sequence.
<p>
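
The steps described above can be sketched for a whole sequence of serial trials 
as follows.  This is a minimal illustration, not the code in the downloads 
mentioned earlier: the class and method names are hypothetical, the sample 
standard deviation (dividing by <I>n</I> &minus; 1) is assumed for <I>SD<sub>x</sub></I>, and Eq. 17.6 is 
assumed to be the Shannon formulation, log<sub>2</sub>(<I>A</I> / <I>W</I> + 1).

```java
/** Illustrative sketch only; names are hypothetical, not from the FittsTask software. */
final class SequenceThroughput {

    /**
     * Throughput (bits/s) for one sequence of serial trials.
     * from, to, select hold {x, y} per trial; mtSeconds holds the time per trial.
     */
    static double throughput(double[][] from, double[][] to,
                             double[][] select, double[] mtSeconds) {
        int n = mtSeconds.length;
        double[] ae = new double[n];
        double[] dx = new double[n];
        for (int i = 0; i < n; i++) {
            double a = Math.hypot(from[i][0] - to[i][0], from[i][1] - to[i][1]);
            double b = Math.hypot(select[i][0] - to[i][0], select[i][1] - to[i][1]);
            double c = Math.hypot(from[i][0] - select[i][0], from[i][1] - select[i][1]);
            dx[i] = (c * c - b * b - a * a) / (2.0 * a);
            ae[i] = a + dx[i];
            if (i > 0) {
                ae[i] += dx[i - 1]; // serial task: the trial starts where the last one ended
            }
        }
        double we = 4.133 * sampleSd(dx);                           // effective target width
        double ide = Math.log(mean(ae) / we + 1.0) / Math.log(2.0); // Shannon form (assumed Eq. 17.6)
        return ide / mean(mtSeconds);
    }

    static double mean(double[] v) {
        double sum = 0.0;
        for (double x : v) {
            sum += x;
        }
        return sum / v.length;
    }

    /** Sample standard deviation (divides by n - 1); an assumption of this sketch. */
    static double sampleSd(double[] v) {
        double m = mean(v);
        double ss = 0.0;
        for (double x : v) {
            ss += (x - m) * (x - m);
        }
        return Math.sqrt(ss / (v.length - 1));
    }

    public static void main(String[] args) {
        // Hypothetical 1D sequence: four taps back and forth between x = 0 and x = 100.
        double[][] from   = {{0, 0}, {100, 0}, {0, 0}, {100, 0}};
        double[][] to     = {{100, 0}, {0, 0}, {100, 0}, {0, 0}};
        double[][] select = {{102, 0}, {-2, 0}, {98, 0}, {2, 0}};
        double[] mt = {0.5, 0.5, 0.5, 0.5};
        System.out.printf("TP = %.2f bits/s%n", throughput(from, to, select, mt));
    }
}
```

In the hypothetical data, the four adjusted <code>ae</code> values are 102, 104, 100, and 96, 
which match the actual tap-to-tap distances.  A real sequence would of course 
contain more trials, since <I>SD<sub>x</sub></I> is poorly estimated from only a few selections.
<p>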

One final point concerns the <I>unit of analysis</I> for calculating throughput.  The 
correct unit of analysis for throughput is an un-interrupted sequence of trials 
for a single participant.  The premise for this is twofold:
<p>

<ul>
<li>Throughput cannot be calculated on a single trial;
<p>

<li>A sequence of trials is the smallest unit of action for which throughput can 
 be attributed as a measure of performance.
<p>
</ul>
 
On the first point, the calculation of throughput includes the variability in 
selection coordinates, akin to "noise".  Thus, multiple selections are required 
and from the collected data the variability in the coordinates is computed.
<p>

The second point is of ecological concern.  After a sequence of trials, the 
participant pauses, stretches, adjusts the apparatus, has a sip of tea, adjusts 
her position on a chair, or something.  There is a demarcation between 
sequences and for no particular purpose other than to provide a break or pause, 
or perhaps to change to a different test condition.  It is reasonable to assert 
that once a sequence is over, it is over!  Behaviours were exhibited, observed, 
and measured and the next sequence is treated as a separate unit of action with 
separate performance measurements.
<p>

Given the above points, a closer look at the calculation of throughput is 
warranted.  Consider Table 17.1.  Each row in the table summarizes the results for 
16 participants performing two 15-second sequences of trials at the indicated <I>A</I> 
and <I>W</I>.  For each sequence, <I>MT</I> = 15 / <I>m</I>, where <I>m</I> is the number of stylus taps.  
<I>MT</I> in the table is the mean computed over 16 participants, two sequences each. 
 <i>ID</i> in the table is calculated from <I>A</I> and <I>W</I> using Eq. 17.2.  Throughput for 
each row is calculated once, as <i>ID</i> / <I>MT</I> from the values in that row.  The 
expanded formula for <i>TP</i> is as follows: 
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
<la-tex>
TP = \frac{\frac{1}{n}\sum_{i=1}^{n}ID_i}{\frac{1}{n}\sum_{i=1}^{n}MT_i}
</la-tex>
<!-- <img src="hhci2018-eq7.jpg"> -->
<td width=10% align="right">(17.7)
</table>
<p>

where <I>n</I> is the number 
of Participant &times; Sequence combinations &ndash; 32 in this case.  But, the correct 
calculation, respecting the appropriate unit of analysis, is
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
<la-tex>
TP = \frac{1}{n} \sum_{i=1}^{n} \frac{ID_i}{MT_i}
</la-tex>
<!-- <img src="hhci2018-eq8.jpg"> -->
<td width=10% align="right">(17.8)
</table>
<p>


With Eq. 17.8, throughput is calculated on each sequence of trials.  The 
overall throughput is the mean of <I>n</I> values.  Eq. 17.7 and Eq. 17.8 will 
yield the same value for the data in Table 17.1, because the iterated values for 
<i>ID</i> are the same across participants and sequences.  But, when Crossman's 
adjustment for accuracy is used, the situation is different.  The numerator in 
Eq. 17.7 is <i><i>ID</i></i><sub>e</sub> computed using <i>W</i><sub>e</sub>, as described earlier.  Spatial variability 
is distilled into a single value which in turn spawns a single <i><i>ID</i></i><sub>e</sub>.  Let's call 
this <i>ID'</i><sub>e</sub>.  Eq. 17.7, with the adjustment for accuracy, is then
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
<la-tex>
TP = \frac{ID'_{e}}{\frac{1}{n} \sum_{i=1}^{n} MT_i} 
</la-tex>
<!-- <img src="hhci2018-eq9.jpg"> -->
<td width=10% align="right">(17.9)
</table>
<p>

In essence, the accuracy component 
in a sequence of trials is deferred.  Accuracy is included at the end as a 
single composite adjustment applicable to all participants and trial sequences. 
 Given the complexity of the log-term for <i>ID</i>, this method is likely to 
introduce a bias in the calculation of throughput.  Again, respecting the unit 
of analysis, the correct calculation for throughput including the adjustment 
for accuracy is 
<p>

<table width=100%>
<tr>
<td width=10%>&nbsp;
<td width=80% align="center">
<la-tex>
TP = \frac{1}{n} \sum_{i=1}^{n} \frac{ID_{e_{i}}}{MT_i}
</la-tex>
<!-- <img src="hhci2018-eq10.jpg"> -->
<td width=10% align="right">(17.10)
</table>
<p>


Eq. 17.10 treats each sequence of 
trials as a separate unit of action.  Speed and accuracy come together into a 
single measure of participant behaviour, throughput.  These measures are then 
summed and averaged across participants and trial sequences.
<p>
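
The effect of the order of aggregation can be seen in a small sketch.  The 
code below contrasts a ratio of means (the aggregation order of Eq. 17.7) with 
a mean of per-sequence ratios (the aggregation order of Eq. 17.8 and Eq. 17.10). 
 The class name, method names, and data are hypothetical.  The sketch does not 
reproduce Eq. 17.9, which requires the pooled selection coordinates.

```java
/** Illustrative sketch only; names and data are hypothetical. */
final class ThroughputAggregation {

    /** Ratio of means: mean(ID) / mean(MT), the aggregation order of Eq. 17.7. */
    static double ratioOfMeans(double[] id, double[] mt) {
        return mean(id) / mean(mt);
    }

    /** Mean of per-sequence ratios, the aggregation order of Eq. 17.8 and Eq. 17.10. */
    static double meanOfRatios(double[] id, double[] mt) {
        double sum = 0.0;
        for (int i = 0; i < id.length; i++) {
            sum += id[i] / mt[i];
        }
        return sum / id.length;
    }

    static double mean(double[] v) {
        double sum = 0.0;
        for (double x : v) {
            sum += x;
        }
        return sum / v.length;
    }

    public static void main(String[] args) {
        // Hypothetical effective ID (bits) and MT (s) for three sequences.
        double[] ide = {2.0, 3.0, 4.0};
        double[] mt  = {0.4, 0.5, 1.0};
        System.out.printf("ratio of means = %.2f bits/s%n", ratioOfMeans(ide, mt));
        System.out.printf("mean of ratios = %.2f bits/s%n", meanOfRatios(ide, mt));
    }
}
```

For these values the per-sequence throughputs are 5, 6, and 4 bits/s, with a 
mean of 5 bits/s, whereas the ratio of means is 3 / 0.633, or about 4.74 bits/s.
<p>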

Eq. 17.9 and Eq. 17.10 will yield different values for throughput.   For 
the data in Table 17.1, <i>TP</i> = 8.97 bits/s using Eq. 17.9.  This is in contrast to 
the value of <i>TP</i> = 10.10 bits/s seen in Table 17.1, which uses Eq. 17.7.  It is 
not possible to recalculate throughput using Eq. 17.10 because the data from 
Fitts' experiment are not available for each participant on each trial 
sequence. 
<p>

In summary, reducing the data from a Fitts' law experiment as in Table 17.1, while 
useful for summarizing participant responses or building a Fitts' law 
prediction equation (see Eq. 17.4), is not recommended if the goal is to 
measure the rate of information transfer (i.e., throughput; see Eq. 17.3 and 
Figure 17.7).  For this, Eq. 17.10 should be used with each value for <i><i>ID</i></i><sub>e</sub> 
computed using Eq. 17.3 (as per Figure 17.7) on the data from a single sequence 
of trials.  Here again we see a distinction between Fitts' law as a model for 
predicting and Fitts' law as a model for measuring.  Let's continue with an 
example of the use of Fitts' throughput for interactions typically found in 
contemporary computing systems.
<p>
<H2>
Example User Study</H2>
<p>

We now put together the ideas above in an example user study investigating 
touch-based target selection on a smart phone.<sup><a href="#f2">2</a></sup>  Since 1D and 2D task types 
are both common in Fitts' law studies, it is worth asking whether there is an 
inherent difference in throughput for a 1D task compared to a 2D task.  It 
seems this question has not been explored in a systematic way, that is, using 
task type (1D vs. 2D) as an independent variable in a controlled experiment.
<p>

<H3>Participants</H3>
<p>

Participants were recruited from the local university campus.  The only 
stipulation was that participants were regular users of a touchscreen phone, 
pad, or tablet.  Sixteen participants were recruited from a wide range of 
disciplines.  Six were female.  The mean age was 24.3 years (<I>SD</I> = 3.0).  
Participants' average touchscreen experience was 22.9 months (<I>SD</I> = 15.8).  All 
participants were right-handed.
<p>

<H3>Apparatus (Hardware and Software)</H3>
<p>

The testing device was an LG <I>Nexus 4</I> touchscreen smartphone running Android OS 
version 4.2.2.  The display was 61 &times; 102 mm (2.4 in &times; 4.0 in) with a resolution 
of 768 &times; 1184 pixels and a pixel density of 320 dpi.  All communication with 
the phone was disabled during testing.
<p>

Custom Android software called F<font size=-2>ITTS</font>T<font size=-2>OUCH</font> was developed using Java SDK 1.6.  
The software implemented the serial 1D and 2D tasks commonly used in Fitts' law 
experiments and prescribed in ISO 9241-9.
(F<font size=-2>ITTS</font>T<font size=-2>OUCH</font> is available as a free download including source code.  See 
above.)
<P>

The same target amplitude and width conditions were used for both task types.  
The range was limited due to the small display and finger input.  In all, six 
combinations were used: <I>A</I> = { 156, 312, 624 } pixels &times; <I>W</I> = { 78, 130 } pixels. 
 These corresponded to task difficulties from <i>ID</i> = 1.14 bits to <i>ID</i> = 3.17 bits 
(see Eq. 17.6).  A wider range is desirable but pilot testing revealed very 
high error rates for smaller targets.
(This is due to a phenomenon of touch input known as the <I>fat-finger problem</I> &ndash; 
Wigdor, Forlines, Baudisch, Barnwell, & Shen, 2007.)
The scale of target conditions was 
chosen such that the widest condition (largest <I>A</I>, largest <I>W</I>) spanned the width 
of the display (portrait orientation) minus 10 pixels on each side.  Examples 
of target conditions are shown in Figure 17.9. 
<p>
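
The range of task difficulties quoted above can be checked with a few lines of 
code.  The sketch below assumes that Eq. 17.6 is the Shannon formulation, 
<i>ID</i> = log<sub>2</sub>(<I>A</I> / <I>W</I> + 1); the class and method names are hypothetical.

```java
/** Illustrative sketch only; names are hypothetical. */
final class IndexOfDifficulty {

    /** Shannon formulation of the index of difficulty, in bits (assumed Eq. 17.6). */
    static double shannonId(double amplitude, double width) {
        return Math.log(amplitude / width + 1.0) / Math.log(2.0);
    }

    public static void main(String[] args) {
        double[] amplitudes = {156, 312, 624}; // pixels
        double[] widths = {78, 130};           // pixels
        for (double a : amplitudes) {
            for (double w : widths) {
                System.out.printf("A = %.0f, W = %.0f, ID = %.2f bits%n", a, w, shannonId(a, w));
            }
        }
    }
}
```

The easiest condition is <I>A</I> = 156 with <I>W</I> = 130 (log<sub>2</sub> 2.2) and the hardest is 
<I>A</I> = 624 with <I>W</I> = 78 (log<sub>2</sub> 9).
<p>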

<center>
<blockquote>
<a name="figure9"></a>
(a) <a href="hhci2018-f9a.jpg"><img src="hhci2018-f9a.jpg" alt="figure 9a" height=300></a>
(b) <a href="hhci2018-f9b.jpg"><img src="hhci2018-f9b.jpg" alt="figure 9b" height=300></a>
(c) <a href="hhci2018-f9c.jpg"><img src="hhci2018-f9c.jpg" alt="figure 9c" height=300></a><br>
<B>Figure 17.9.</B> Example task conditions. (a) 1D with <I>A</I> = 312 & <I>W</I> = 78. (b) 2D with <I>A</I> 
= 156 & <I>W</I> = 130. (c) 2D with <I>A</I> = 624 & <I>W</I> = 78.  All units pixels.
</blockquote>
</center>
<p>

 The 2D conditions included 20 targets, which was the number of trials in a 
 sequence. The target to select was highlighted.  Upon selection, the highlight 
 moved to the opposite target. Selections proceeded in a rotating pattern 
 around the layout circle until all targets were selected. For the 1D task, 
 selections were back and forth.  Data collection for a sequence began on the 
 first tap and ended after 20 target selections (21 taps).  
<p>

<H3>Procedure</H3>
<p>

 After signing a consent form, participants were briefed on the goals of the 
 experiment.  The experiment task was demonstrated to participants, after which 
 they did a few practice sequences. They sat at a desk with the device 
 positioned on the desk surface.  They were allowed to anchor the device with 
 their non-dominant hand, if desired.  An example of a participant performing 
 trials in the 1D condition is shown in Figure 17.10a.  An auditory beep sounded 
 if a target was missed.  At the end of each sequence a dialog appeared showing 
 summary results for the sequence.  See Figure 17.10b for an example.  The dialog 
 is useful for demos and to help inform and motivate participants during 
 testing. 
<p>

<center>
<blockquote>
<a name="figure10"></a>
(a) <a href="hhci2018-f10a.jpg"><img src="hhci2018-f10a.jpg" alt="figure 10a" height=300></a>
(b) <a href="hhci2018-f10b.jpg"><img src="hhci2018-f10b.jpg" alt="figure 10b" height=300></a><br>
<B>Figure 17.10.</B> (a) A participant performing trials in the 1D condition. (b) Example 
dialog at the end of a sequence.
</blockquote>
</center>
<p>

Participants were asked to select targets as quickly and accurately as 
possible, at a comfortable pace. They were told that missing an occasional 
target was OK, but that if many targets were missed, they should slow down.
<p>

<H3>Design</H3>
<p>

 The experiment was fully within-subjects with the following independent 
 variables and levels:
<p>

<blockquote>
<table>
<tr><td>Task <td>		1D, 2D
<tr><td>Block <td>		1, 2, 3, 4, 5
<tr><td>Amplitude&nbsp;&nbsp;&nbsp; <td>	156, 312, 624 pixels
<tr><td>Width<td> 		78, 130 pixels
</table>
</blockquote>
<p>

The primary independent variable was task.  Block, amplitude, and width were 
included to gather a sufficient quantity of data over a reasonable range of 
task difficulties.
<p>

For each condition, participants performed a sequence of 20 trials. The task 
conditions were counterbalanced with 8 participants per order.  The amplitude 
and width conditions were randomized within blocks.
<p>

The dependent variable was throughput.  Testing lasted 
about 45 minutes per participant.  The total number of trials was 16 
Participants &times; 2 Tasks &times; 5 Blocks &times; 3 Amplitudes &times; 2 Widths &times; 20 Trials = 
19,200. 
<p>

<H2>Results and Discussion</H2>
<p>

The grand mean for throughput was 6.85 bits/s.  This result, in itself, is 
remarkable.  Here we see empirical evidence underpinning the tremendous success 
of contemporary touch-based interaction.  Not only is the touch <I>experience</I> 
appealing, touch <I>performance</I> is measurably superior compared to traditional 
interaction techniques.  For desktop interaction the mouse is well-known to 
perform best for most point-select interaction tasks.
(A possible exception is the stylus. Performance with a stylus is generally 
as good as, or sometimes slightly better than, a mouse &ndash; see MacKenzie et al., 
1991.)
In a review of Fitts' 
law studies following the ISO 9241-9 standard, throughput values for the mouse 
ranged from 3.7 bits/s to 4.9 bits/s (Soukoreff & MacKenzie, 2004, Table 5).
The value just reported for touch input reveals a performance advantage for 
touch in the range of 40% to 85% compared to a mouse.
(Of course, a direct comparison is not possible since mouse input is not 
supported on small touchscreen devices such as the LG <I>Nexus 4</I> used in this study.)
The most likely reason 
lies in the distinguishing properties of <I>direct input</I> vs. <I>indirect input</I>.  With 
a mouse or other traditional pointing device, the user manipulates a device to 
indirectly control an on-screen tracking symbol.  Selection requires pressing a 
button on the device.  With touch input there is neither a tracking symbol nor 
a button:  Input is direct! 
<p>

The results for throughput by participant and task are shown in Table 17.2.  The 
1D task yielded a throughput of  7.43 bits/s, which was 18.5% higher than the 
mean of 6.27 bits/s for the 2D task.  The difference was statistically 
significant (<I>F</I><sub>1,15</sub> = 29.8, <I>p</I> &lt; .0001).  All participants had higher throughput 
for the 1D task.  Throughput was fairly flat over the five blocks of testing 
with &lt; 3% change in throughput from block 1 to block 5.  Consequently, a 
breakdown of results by block is not given.
<p>

<center>
<blockquote>
<B>Table 17.2</B><br>
Throughput (bits/s) by participant and task.<br>

<table border="1px" cellspacing=0 cellpadding=5 width=30%>
<tr align=center bgcolor="#f0f0e0"><th rowspan=2>Participant<th colspan=2>Task
<tr align="center"  bgcolor="#f0f0e0"><th>1D<th>2D
<tr align="center"><td>P01<td>6.28<td>6.19
<tr align="center"><td>P02<td>4.83<td>4.79
<tr align="center"><td>P03<td>5.90<td>5.34
<tr align="center"><td>P04<td>7.05<td>5.42
<tr align="center"><td>P05<td>7.83<td>5.83
<tr align="center"><td>P06<td>6.72<td>5.65
<tr align="center"><td>P07<td>6.38<td>5.05
<tr align="center"><td>P08<td>7.45<td>6.62
<tr align="center"><td>P09<td>8.26<td>6.09
<tr align="center"><td>P10<td>6.42<td>6.40
<tr align="center"><td>P11<td>8.33<td>5.94
<tr align="center"><td>P12<td>9.37<td>8.30
<tr align="center"><td>P13<td>8.75<td>6.17
<tr align="center"><td>P14<td>7.26<td>5.88
<tr align="center"><td>P15<td>9.01<td>7.76
<tr align="center"><td>P16<td>8.97<td>8.84
<tr align="center"><td align="right">Mean<td>7.43<td>6.27
<tr align="center"><td align="right"><I>SD</I><td>1.30<td>1.13
</table>
</blockquote>
</center>
<p>
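
The summary statistics for the task effect can be recomputed from the values in 
Table 17.2.  The sketch below assumes that, for a two-level within-subjects 
factor, the <I>F</I> ratio for task equals the square of the paired <I>t</I> statistic 
computed on the participant means.  The class and method names are hypothetical, 
and small differences from the reported values are expected because the table 
entries are rounded to two decimal places.

```java
/** Illustrative sketch only; names are hypothetical. Data are from Table 17.2. */
final class TaskEffect {

    static final double[] ONE_D = {6.28, 4.83, 5.90, 7.05, 7.83, 6.72, 6.38, 7.45,
                                   8.26, 6.42, 8.33, 9.37, 8.75, 7.26, 9.01, 8.97};
    static final double[] TWO_D = {6.19, 4.79, 5.34, 5.42, 5.83, 5.65, 5.05, 6.62,
                                   6.09, 6.40, 5.94, 8.30, 6.17, 5.88, 7.76, 8.84};

    static double mean(double[] v) {
        double sum = 0.0;
        for (double x : v) {
            sum += x;
        }
        return sum / v.length;
    }

    /** F(1, n - 1) for a two-level within-subjects factor, as the squared paired t. */
    static double pairedF(double[] x, double[] y) {
        int n = x.length;
        double[] d = new double[n];
        for (int i = 0; i < n; i++) {
            d[i] = x[i] - y[i];
        }
        double m = mean(d);
        double ss = 0.0;
        for (double v : d) {
            ss += (v - m) * (v - m);
        }
        double variance = ss / (n - 1); // sample variance of the paired differences
        return (m * m) / (variance / n);
    }

    public static void main(String[] args) {
        double m1 = mean(ONE_D);
        double m2 = mean(TWO_D);
        System.out.printf("1D mean = %.2f, 2D mean = %.2f bits/s%n", m1, m2);
        System.out.printf("1D advantage = %.1f%%%n", 100.0 * (m1 / m2 - 1.0));
        System.out.printf("F(1,%d) = %.1f%n", ONE_D.length - 1, pairedF(ONE_D, TWO_D));
    }
}
```

Applied to the table values, this calculation gives means close to 7.43 and 
6.27 bits/s and an <I>F</I> ratio close to the reported 29.8.
<p>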

The higher throughput for the 1D condition is explained as follows.  With 
side-to-side movement only, the 1D condition is easier.  Movements in the 2D 
condition are more complicated, since the direction of movement changes by 360&deg; 
/ 20 = 18&deg; with each trial.  Furthermore, occlusion is unavoidable for some 
trials in a sequence. This does not occur for the 1D task.
<p>

Throughput was calculated using Eq. 17.3 with the Shannon formulation for <i>ID</i> 
along with <i>A</i><sub>e</sub> and <i>W</i><sub>e</sub> (as per Figure 17.7).  The unit of analysis for the 
calculation was a sequence of trials, as discussed earlier.  Each value of 
throughput in Table 17.2 is therefore the mean of 30 values of throughput, since 
each participant performed five sequences of trials (1 per block) for each of 
six <I>A-W</I> conditions.
<p>

Figure 17.11 shows a chart of the findings for throughput by task, as might appear 
in a research paper.  The error bars show &plusmn;1 <I>SD</I> using the values along the 
bottom row in Table 17.2.
<p>

<center>
<blockquote>
<a name="figure11"></a>
<a href="hhci2018-f11.jpg"><img src="hhci2018-f11.jpg" alt="figure 11" height=400></a><br>
<B>Figure 17.11.</B> Throughput (bits/s) by task.  Error bars show &plusmn;1 <I>SD</I>.
</blockquote>
</center>
<p>

<H2>Conclusion</H2>
<p>

This chapter has provided an overview of Fitts' law in view of current practice 
in human-computer interaction (HCI).  It is important to bear in mind the long 
history of Fitts' law research in other fields and in the early years of HCI.  
In the 1950s, when Fitts proposed his model of human movement, graphical user 
interfaces and computer pointing devices did not exist.  Yet, throughout the 
history of HCI (since Card et al., 1978), research on point-select computing 
tasks is inseparable from Fitts' law.  The initial studies focused on device 
comparisons and model conformity.  Since then &ndash; and partly due to the 
publication of ISO 9241-9 &ndash; focus has shifted to the use of Fitts' throughput 
(in "bits/s") as a dependent variable.  This is in keeping with Fitts' original 
intention to explore the information capacity of the human motor system.  Much 
of this research has seen Fitts' law applied to topics only peripherally 
related to pointing devices.  Examples include expanding targets, hidden 
targets, fish-eye targets, pointing on the move, eye tracking, force feedback, 
tilt input, gravity wells, multi-monitor displays, wearable computing, 
accessible computing, virtual reality, 3D, magic lenses, and so on.  Research in 
these topics, and others, has thrived on the theory and information metaphor 
inspired and guided by Fitts' law.  This is Fitts' legacy to research in 
human-computer interaction.
<p>

<H2>References</H2>
<p>

Calvo, A., Burnett, G., Finomore, V., & Perugini, S. (2012). The design, 
implementation, and evaluation of a pointing device for a wearable computer. 
<I>Proceedings of the Human Factors and Ergonomics Society 56th Annual Meeting - 
HFES 2012</I>, 521-525, Santa Monica, CA: HFES.
<p>

Card, S. K., English, W. K., & Burr, B. J. (1978). Evaluation of mouse, 
rate-controlled isometric joystick, step keys, and text keys for text selection 
on a CRT. <I>Ergonomics, 21</I>, 601-613.
<p>

Constantin, C., & MacKenzie, I. S. (2014). Tilt-controlled mobile games: 
Velocity-control vs. position-control. <I>Proceedings of the 6th IEEE Consumer 
Electronics Society Games, Entertainment, Media Conference - IEEE-GEM 2014</I>, 
24-30, New York: ACM.
<p>

Crossman, E. R. F. W., & Goodeve, P. J. (1983). Feedback control of 
hand-movement and Fitts' law: Communication to the Experimental Society. 
<I>Quarterly Journal of Experimental Psychology, 35A</I>, 251-278. 
<p>

Fitts, P. M. (1954). The information capacity of the human motor system in 
controlling the amplitude of movement. <I>Journal of Experimental Psychology, 47</I>, 
381-391. 
<p>

Fitts, P. M., & Peterson, J. R. (1964). Information capacity of discrete motor 
responses. <I>Journal of Experimental Psychology, 67</I>, 103-112.
<p>

Fitts, P. M., & Radford, B. K. (1966). Information capacity of discrete motor 
responses under different cognitive sets. <I>Journal of Experimental Psychology, 
71</I>, 475-482. 
<p>

Gillan, D. J., Holden, K., Adam, S., Rudisill, M., & Magee, L. (1990). How does 
Fitts' law fit pointing and dragging? <I>Proceedings of the ACM SIGCHI Conference 
on Human Factors in Computing Systems - CHI '90</I>, 227-234, New York: ACM.
<p>

Goldman, S. (1953). <I>Information Theory</I>. New York. Prentice-Hall.
<p>

Hick, W. E. (1952). On the rate of gain of information. <I>Quarterly Journal of 
Experimental Psychology, 4</I>, 11-36. 
<p>

Hyman, R. (1953). Stimulus information as a determinant of reaction time. 
<I>Journal of Experimental Psychology, 45</I>, 188-196. 
<p>

ISO. (2000). <I>Ergonomic requirements for office work with visual display 
terminals (VDTs) - Part 9: Requirements for non-keyboard input devices (ISO 
9241-9)</I>: International Organisation for Standardisation.
<p>

ISO. (2012). <I>Evaluation methods for the design of physical input devices - 
ISO/TS 9241-411: 2012(E)</I>: International Organisation for Standardisation.
<p>

Jos&eacute;, M. A., & de Deus Lopes, R. (2015). Human-computer interface controlled by 
the lip. <I>IEEE Journal of Biomedical and Health Informatics, 19</I>(1), 302-308.
<p>

MacKenzie, I. S. (1989). A note on the information-theoretic basis for Fitts' 
law. <I>Journal of Motor Behavior, 21</I>, 323-330. 
<p>

MacKenzie, I. S. (1991). <I>Fitts' law as a performance model in human-computer 
interaction</I>. (Doctoral Dissertation), University of Toronto 
(http://www.yorku.ca/mack/phd.html).
<p>

MacKenzie, I. S. (1992). Fitts' law as a research and design tool in 
human-computer interaction. <I>Human-Computer Interaction, 7</I>, 91-139.
<p>

MacKenzie, I. S. (2012). Evaluating eye tracking systems for computer input. In 
P. Majaranta, H. Aoki, M. Donegan, D. W. Hansen, J. P. Hansen, A. Hyrskykari & 
K.-J. R&auml;ih&auml; (Eds.), <I>Gaze interaction and applications of eye tracking: Advances 
in assistive technologies</I> (pp. 205-225): Hershey, PA: IGI Global.
<p>

MacKenzie, I. S. (2013). A note on the validity of the Shannon formulation for 
Fitts' index of difficulty. <I>Open Journal of Applied Science, 3</I>(6), 360-368.
<p>

MacKenzie, I. S. (2015). Fitts' throughput and the remarkable case of 
touch-based target selection. <I>Proceedings of HCI International - HCII 2015</I> 
(LNCS 9170), 238-249, Switzerland: Springer.
<p>

MacKenzie, I. S., Sellen, A., & Buxton, W. (1991). A comparison of input 
devices in elemental pointing and dragging tasks. <I>Proceedings of the ACM SIGCHI 
Conference on Human Factors in Computing Systems - CHI '91</I>, 161-166, New York: 
ACM.
<p>

MacKenzie, I. S., & Soukoreff, R. W. (2003). Card, English, and Burr (1978) - 
25 years later. <I>Extended Abstracts of the ACM SIGCHI Conference on Human 
Factors in Computing Systems - CHI 2003</I>, 760-761, New York: ACM.
<p>

MacKenzie, I. S., & Teather, R. J. (2012). FittsTilt: The application of Fitts' 
law to tilt-based interaction. <I>Proceedings of the 7th Nordic Conference on 
Human-Computer Interaction - NordiCHI 2012</I>, 568-577, New York: ACM.
<p>

McArthur, V., Castellucci, S. J., & MacKenzie, I. S. (2009). An empirical 
comparison of "Wiimote" gun attachments for pointing tasks. <I>Proceedings of the 
ACM Symposium on Engineering Interactive Computing Systems &ndash; EICS 2009</I>, 
203-208, New York: ACM.
<p>

Meyer, D. E., Abrams, R. A., Kornblum, S., Wright, C. E., & Smith, J. E. K. 
(1988). Optimality in human motor performance: Ideal control of rapid aimed 
movements. <I>Psychological Review, 95</I>, 340-370.
<p>

Natapov, D., Castellucci, S. J., & MacKenzie, I. S. (2009). ISO 9241-9 
evaluation of video game controllers. <I>Proceedings of Graphics Interface 2009</I>, 
223-230, Toronto: CIPS.
<p>

Reza, F. M. (1961). <I>An introduction to information theory</I>. New York: 
McGraw-Hill.
<p>

Sasangohar, F., MacKenzie, I. S., & Scott, S. (2009). Evaluation of mouse and 
touch input for a tabletop display using Fitts' reciprocal tapping task. 
<I>Proceedings of the 53rd Annual Meeting of the Human Factors and Ergonomics 
Society - HFES 2009</I>, 839-843, Santa Monica, CA: HFES.
<p>

Shannon, C. E., & Weaver, W. (1949). <I>The mathematical theory of communication</I>. 
Urbana, IL: University of Illinois Press.
<p>

Soukoreff, R. W., & MacKenzie, I. S. (2004). Towards a standard for pointing 
device evaluation: Perspectives on 27 years of Fitts' law research in HCI. 
<I>International Journal of Human-Computer Studies, 61</I>, 751-789.
<p>

Ware, C., & Mikaelian, H. H. (1987). An evaluation of an eye tracker as a 
device for computer input. <I>Proceedings of the CHI+GI '87 Conference on Human 
Factors in Computing Systems and Graphics Interface</I>, 183-188, New York: ACM.
<p>

Welford, A. T. (1960). The measurement of sensory-motor performance: Survey and 
reappraisal of twelve years' progress. <I>Ergonomics, 3</I>, 189-230.
<p>

Welford, A. T. (1968). <I>Fundamentals of skill</I>. London: Methuen.
<p>

Wigdor, D., Forlines, C., Baudisch, P., Barnwell, J., & Shen, C. (2007). 
LucidTouch: A see-through mobile device. <I>Proceedings of the ACM Symposium on User 
Interface Software and Technology - UIST 2007</I>, 269-278, New York: ACM.
<p>

Zhang, X., & MacKenzie, I. S. (2007). Evaluating eye tracking with ISO 9241 &ndash; 
Part 9. <I>Proceedings of HCI International 2007</I>, 779-788, Heidelberg: Springer.
<p>

<hr>
<p>

<B>Footnotes:</B>
<p>

<a name="f1">1</a>. Since the early 1990s, use of the effective target width has increased, 
particularly in human-computer interaction.  This is in part due to the 
recommended use of <i>W</i><sub>e</sub> in the performance evaluations described in ISO 9241-9 
(ISO, 2000). The first use of <i>W</i><sub>e</sub> in HCI is the Fitts' law study described by 
MacKenzie, Sellen, and Buxton (1991).
<p>

<a name="f2">2</a>.  The example is a subset of a larger user study (see MacKenzie, 2015).  The 
full study included an additional independent variable (device position: 
supported vs. mobile) and additional dependent variables (movement time, error 
rate).  The original study also examined results by participant finger size and 
tested the distribution characteristics of selection coordinates.  Consult 
MacKenzie (2015) for details.
<p>

</body>
</html>