Dept. of Computer Science and Engineering
York University
Toronto, Ontario, Canada M3J 1P3
mack@cse.yorku.ca
Last update: 12-May-08
Human-computer interaction research often involves experiments with human participants to test one or more hypotheses. One of the most common statistical tools for hypothesis testing is the analysis of variance (ANOVA). The ANOVA result is reported as an F statistic and its associated degrees of freedom and p value.
This research note does not explain the analysis of variance, or even the F statistic itself. Rather, we explain only the proper way to report an F statistic. "Proper way" refers to the formatting of the statistic and to the construction of a sentence that presents it. Simple as this seems, F statistics are often improperly formatted and poorly presented in research papers. Let's get to it.
Assume you conducted an experiment with ten participants to compare natural and abstract icons. The independent variable is Icon Type with two levels, Natural and Abstract. Participants completed a task where they associated the meaning of icons with icon images, grouped according to type. The dependent variable is Task Completion Time, in seconds.
After conducting the experiment, you have the following data:
656,702
259,339
612,658
609,645
1049,1129
1135,1179
542,604
495,551
905,893
715,803
Each row contains the responses for one participant. The left and right columns contain the Task Completion Times for the Natural and Abstract icons, respectively. Using your favourite statistics program, you run an analysis of variance on the data and obtain the following:

There was a significant effect of Icon Type on Task Completion Time (F1,9 = 33.4, p < .005).
And that's about it. Of course, there are many ways to construct a sentence reporting the result; this is just a simple example. Importantly, the sentence captures the relationship between the independent variable and the dependent variable; i.e., "There was a significant effect of [independent variable] on [dependent variable]".
Of more importance here is the formatting of the F statistic: Note the following:
· Set in parentheses
· Uppercase for F
· Lowercase for p
· Italics for F and p
· F statistic rounded to three (maybe four) significant digits
· F statistic followed by a comma, then a space
· Space on both sides of equal sign and both sides of less than sign
· Degrees of freedom set as subscript, plain, smaller font
· No space following the comma in the degrees of freedom
· Exact value of p not reported
· p rounded up to a more conservative value from the set {.05, .01, .005, .001, .0005, .0001}
· No zero before the decimal point for p (because it is constrained between 0 and 1)
There are a few minor and accepted variations to the formatting points above, but stick to these rules and you're in good shape.
If p is above .05, the result is not statistically significant. In this case, there are two possibilities. If p is greater than .05 and F is greater than 1, report the result something like this:
There was no significant effect of Icon Type on Task Completion Time (F1,9 = 2.34, p > .05).
If p is greater than .05 and F is less than 1, report the result something like this:
There was no significant effect of Icon Type on Task Completion Time (F1,9 = 0.876, ns).
Statistical significance is impossible if F is less than 1; hence, the convention of reporting the probability simply as "ns" for "not significant".
Acknowledgement
The motivation to put together this research note came from discussions with Wendy MacKay at CHI 2008. The hypothetical experiment and data are from Dix et al.'s Human Computer Interaction (Prentice Hall, 2004, 3rd ed., p. 337).