Tukey's B method, also known as the Tukey-Kramer B procedure, or Tukey's Wholly Significant Difference (WSD) is a post-hoc multiple comparison statistical test used to identify which specific group means differ significantly from each other after a statistically significant result has been obtained from an analysis of variance (ANOVA). It is considered a compromise between two other popular multiple comparison procedures: Tukey's range test and the Newman-Keuls method.
The primary purpose of post-hoc tests like Tukey's B is to control the family-wise error rate (FWER) when performing multiple comparisons. Without such control, the probability of making at least one Type I error increases with the number of comparisons made.
The development of multiple comparison procedures stems from the work of Ronald Fisher, John Tukey and others in the mid-20th century. Tukey's HSD test is a conservative method that guarantees the FWER does not exceed the chosen significance level (e.g., ). Conversely, the Newman-Keuls (NK) method, while providing higher statistical power, is known to be anti-conservative; that is, not strictly controlling the FWER as the number of groups increases.
Tukey's B method was introduced to provide an intermediate level of conservatism. It seeks to balance the strict error control of HSD with the greater sensitivity to differences offered by Newman-Keuls.
Tukey's B method operates by comparing all possible pairs of means. For each pair, it calculates a critical value based on the studentized range distribution.
While Tukey's HSD uses a single critical value derived from the total number of groups (), and Newman-Keuls uses critical values that vary depending on the number of steps between the ordered means (), Tukey's B calculates the critical value () as the simple arithmetic mean of the critical values obtained from those two procedures:
The absolute difference between two means, , is then compared against a critical difference value:
where:
If , the difference is declared statistically significant.
Tukey's B method is a standard post-hoc option in statistical packages such as SPSS, and provides a middle ground for researchers:
In contemporary statistical practice, the procedure has largely fallen out of favor due to several factors: