Targeted testing for bias in order assignment, with an application to Texas election ballots

doi:10.1016/j.jspi.2019.09.002

Journal of Statistical Planning and Inference

Volume 206, May 2020, Pages 12-28

https://doi.org/10.1016/j.jspi.2019.09.002 Get rights and content

Highlights

•
A priori information about bias in the ordering agents improves power.
•
Our Linear Concordance test is straightforward, robust, and effective.
•
Texas election officials failed to randomize candidate order on election ballots.

Abstract

Statistical methods are developed for assessing the likelihood of prejudicial bias in agent-assigned permutations, such as the ordering of candidates on an election ballot. The null hypothesis of an unbiased order assignment is represented by several forms of probabilistic exchangeability of the random orderings, while bias is represented either by compatibility with an assumed ranking of the items with respect to a hypothesized preference criterion (PC) or by linear concordance with assumed scores of the items on a PC scale. A power analysis indicates the superiority of these methods to a neutral alternative when appropriate a priori information is available; their usefulness is affirmed in an application to the ordering of candidates on 2014 Texas Republican primary election ballots. Significant evidence of bias is found in three of the five races studied, a finding that does not obtain using currently available tests.

Introduction

Each year an unusual ritual takes place in school district offices, city halls, and county courthouses across the state of Texas: the drawing of the order in which candidates for public office will be placed on the ballot, as required by state law. Candidates often attend these drawings to ensure that the agent conducting them does not manipulate the ordering and place a competitor higher on the ballot, conferring upon them an electoral advantage known as the ballot-order effect. In Texas primary and runoff elections for statewide office (Grant, 2017) find this effect to be sizeable and monotonic in ballot order, especially in low-profile or low-information races. This finding is corroborated for other states by several studies cited therein, while Meredith and Salant (2013) obtain broadly similar results for local elections in California. The possibility that such orderings might be prejudicially biased, either consciously or unconsciously, by the agents executing them is not far-fetched. Darcy and McAllister (1990) noted the evidence for such bias in their review of the early literature on the ballot order effect.³ One can test statistically for such bias when orderings (i.e. permutations) of the same set of $k$ items are conducted repeatedly and (presumably) independently $N$ times, as in Texas, where ballot order for primary and runoff elections is determined by agents at the county level, even for offices contested statewide. A test for uniform random ordering when $k = 2$ is elementary, but this is not the case when multiple items are being ordered.

The general problem of testing for the uniform randomness of permutations does not just arise in political science. A classic example from computer science is testing random number generators via the randomness of repeated sequences of digits (Knuth, 1981).⁴ In addition, when ordering contestants in musical or athletic contests, randomization helps ensure fairness because arbiters’ fastidiousness can vary over the course of a day or competition. It is desirable in sequencing courtroom trials for the same reason (Danziger et al., 2011). Most generally, the ballot order effect is an example of a more general psychological phenomenon, the primacy effect (cf. Murdock, 1962), in which the first-listed of a set of options tends to be chosen more frequently. Thus, in many scenarios in which a set of competing decisions must be made sequentially without prejudice, the agents ordering those decisions may be tempted to manipulate the orderings in accordance with their preferences. Testing for randomization should reduce the likelihood of such manipulations and can uncover them when they occur.

Despite the generality of the problem, however, a consensus on testing methodology has not emerged. Even within the ballot order literature, a variety of options are used. Grant (2017) applies Fisher’s Exact Test to the cross-tabulation of candidates and ballot positions in Texas, to determine if this cross-tabulation is likely to have occurred by random chance. Meredith and Salant (2013) apply Pearson’s chi-squared test to determine if incumbent candidates are equally likely to end up at any position on the ballot. And Ho and Imai (2008) apply a series of rank tests based on the average absolute difference in rank between pairs of letters to randomized alphabets that are used for ballot ordering in California.

Each approach has limitations. The first procedure, Fisher’s Exact Test, aggregates the $N$ observed orderings into a $k \times k$ contingency table, where $k$ is the number of items and order positions. Among other problems,⁵ such aggregation loses relevant information contained in the orderings themselves. It is possible – and, in political applications, probable – that agents with opposing prejudicial biases manipulate orderings in opposite directions, but these offsetting manipulations may not be apparent in the aggregate. Similarly, important information is lost by applying Pearson’s chi-squared test to incumbents alone: all information about non-incumbents is ignored.

In addition, unless $k$ is very small or $N$ very large, these tests, along with those of Ho and Imai (2008), may lack the sensitivity needed to detect the specific deviations from uniform randomness that may be encountered. This limited sensitivity would derive, in part, from the untargeted nature of such tests, which do not utilize a priori information that could increase their power. Such information is often available in political and other applications in which human agents perform the orderings; any deviations from uniform randomness are likely to reflect these agents’ preferences.

In this paper, procedures are developed that utilize the individual orderings, not their aggregation, and that will detect departures from uniform randomness that are to be expected from a priori information about the characteristics of the items and the preference criteria of the agents executing the ordering. In Section 2, the null hypothesis of an unbiased order assignment is represented by several forms of exchangeability of a random permutation. In Section 3, the alternative hypothesis of bias in order assignment is represented by compatibility with an assumed preferential ranking (ties permitted) of the items, while in Section 4 bias is represented by linear concordance with assumed preference scores of the items. In both cases methods for detecting the corresponding alternatives are obtained. Section 5 analyzes these tests’ power relative to one another and an neutral alternative – the rank test of Ho and Imai (2008) – and outlines their practical application when the true form of deviations from uniform randomness is unknown. In Section 6 these procedures are applied to five races in the 2014 Texas Republican primary. Significant evidence of bias in at least one of the approximately 245 reporting counties is found in three of the five races; in two of these, significant evidence is found for bias in at least six and ten counties.

The tests developed in this paper rely on assumptions about agents’ preferences that, while appropriate in the political context, are somewhat strong. A sequel develops power-enhancing tests for more general sets of preferences and for the most general case of all, in which no a priori knowledge is available.

Section snippets

Unbiased and biased order assignments

Suppose that $k$ items, arbitrarily labeled $1, \dots, k$ , are being ordered by each of $N$ agents. An ordering is simply a permutation $π \equiv (π_{1}, \dots, π_{k})$ of $(1, \dots, k)$ such that item $i$ is assigned position $π_{i}$ , $i = 1, \dots, k$ . The primacy effect implies that positions near 1 ( $π_{i} \approx 1$ ) in the ordering are advantageous, while positions near $k$ ( $π_{i} \approx k$ ) are disadvantageous.

We shall assume that the ordering $π$ selected by an agent is a realization of a random permutation $Π \equiv (Π_{1}, \dots, Π_{k})$ (quite possibly degenerate). The gold standard for

Preference criteria (PC) expressed by ranks

Suppose that an agent ranks the $k$ items (with ties permitted) in accordance with a particular PC, so that a low (high) rank indicates agreement (disagreement) with the PC. This yields a partitioning $B \equiv (B_{1}, \dots, B_{r})$ of the items into blocks $B_{1}, \dots, B_{r}$ of sizes $k_{1}, \dots, k_{r}$ ( $r \geq 2$ , $k_{1} + \dots + k_{r} = k$ ). Items in block $B_{j}$ have a lower (the same) PC ranking than (as) those in $B_{j^{'}}$ if $j < j^{'}$ ( $j = j^{'}$ ).

If each $k_{j} = 1$ then the items are totally ordered w.r.t. the PC, while if one or more $k_{j} > 1$ then they are partially ordered.⁹

Preference criteria (PC) expressed by scores

In Section 3 it was assumed that if an agent is unbiased, they will select a random ordering according to the exchangeable model $H_{0}$ , while if biased, they will select an ordering that conforms exactly to a partitioning $B$ or its opposite $\tilde{B}$ , specified by a uni-directional or bi-directional preference criterion (PC). In some cases, however, this may over-simplify the true behavioral processes that generate bias. Other processes may generate biased orderings that only partly conform with a PC, for

Power analysis and practical test usage

We now perform a simulation study to compare the statistical power of our rank-compatibility and linear concordance tests to a neutral alternative, the rank test used by Ho and Imai (2008).¹³ This test was chosen as the standard of comparison because it neither aggregates nor disregards information, obtains simulation-consistent size for a given $α$ level (the null distribution must be

Results from the 2014 Texas Republican primary election

Finally, we apply these three tests to ballot-order data from the 2014 Texas Republican primary elections (used in Grant, 2017) for $N = 245$ of the 254 counties in Texas.¹⁴

Concluding remarks

The ballot-order effect is an example of a more general psychological phenomenon, the primacy effect (cf. Murdock, 1962), in which the first-listed of a set of options tends to be chosen more frequently. Other situations in which the primacy effect may affect outcomes include athletic or artistic competitions, funding approval processes – such as those at the NIH and NSF – and college admissions decisions. In such scenarios one can often identify various preference criteria – political, social,

Acknowledgments

We would like to thank our reviewer for the insightful commentary on the first submission of the paper, as well as Kosuke Imai for his feedback on an early draft of this paper. Thanks also to the Department of Defense, the National Institutes of Health, and Sam Houston State University for funding that permitted this work. And cheers to interdisciplinary collaborations!

Funding

This work was supported by the National Institutes of Health, United States of America [Grant No. HSN268201-600310A] and the

References (13)

DarcyR. et al.
Ballot position effects
Elect. Stud.
(1990)
DanzigerS. et al.
Extraneous factors in judicial decisions
Proc. Natl. Acad. Sci.
(2011)
FellerW.
An Introduction to Probability Theory and Its Applications Vol. II
(1966)
GrantD.
The ballot order effect is huge: evidence from Texas
Public Choice
(2017)
HoD.E. et al.
Estimating causal effects of ballot order from a randomized natural experimentthe California alphabet lottery, 1978–2002
Public Opin. Q.
(2008)
KendallM.
Rank Correlation Methods
(1948)

There are more references available in the full text version of this article.

Cited by (2)

Uncovering bias in order assignment
2023, Economic Inquiry
Uncovering Bias in Order Assignment
2020, SSRN

¹: Research was supported in part by National Institutes of Health, United States of America Grant HHSN268201-600310A.

²: Research was supported in part by U.S. Department of Defense Grant H98230-10-C-0263.

View full text

Targeted testing for bias in order assignment, with an application to Texas election ballots

Highlights

Abstract

Introduction

Section snippets

Unbiased and biased order assignments

Preference criteria (PC) expressed by ranks

Preference criteria (PC) expressed by scores

Power analysis and practical test usage

Results from the 2014 Texas Republican primary election

Concluding remarks

Acknowledgments

Funding

Elect. Stud.

Extraneous factors in judicial decisions

Proc. Natl. Acad. Sci.

An Introduction to Probability Theory and Its Applications Vol. II

The ballot order effect is huge: evidence from Texas

Public Choice

Estimating causal effects of ballot order from a randomized natural experimentthe California alphabet lottery, 1978–2002

Public Opin. Q.

Rank Correlation Methods