An estimation of cattle movement parameters in the Central States of the US

https://doi.org/10.1016/j.compag.2015.06.016Get rights and content

Highlights

  • We estimate the movement parameters of cattle across 10 Central States.

  • Large USDA data sets were fed into an optimization procedure to extract parameters.

  • Comparison with other literature suggests a high epidemic risk for US cattle systems.

Abstract

The characterization of cattle demographics and especially movements is an essential component in the modeling of dynamics in cattle systems, yet for cattle systems of the United States (US), this is missing. Through a large-scale maximum entropy optimization formulation, we estimate cattle movement parameters to characterize the movements of cattle across 10 Central States and 1034 counties of the United States. Inputs to the estimation problem are taken from the United States Department of Agriculture National Agricultural Statistics Service database and are pre-processed in a pair of tightly constrained optimization problems to recover non-disclosed elements of data. We compare stochastic subpopulation-based movements generated from the estimated parameters to operation-based movements published by the United States Department of Agriculture. Our novel method to estimate cattle movements across large US regions characterizes county-level stratified subpopulations of cattle for data-driven livestock modeling. Our estimated movement parameters suggest a significant risk of a disease successfully invading the US cattle systems.

Introduction

Livestock systems serve significant roles for many regions across the world, yet past outbreaks of disease have demonstrated that they can possess a number of vulnerabilities (Brester et al., 2003, Ferguson et al., 2001, Keeling et al., 2001, Green et al., 2006, Kao et al., 2007, Bajardi et al., 2012). The livestock systems of the United States (US), though strictly regulated, may yet be found susceptible to foreign diseases such as Rift Valley Fever (Xue et al., 2013). The successful modeling and analysis of livestock epidemics for any region relies heavily on an understanding of the underlying system components. The three most critical elements in a practical epidemic model are the disease progression model, the geo-spatial characterization of the susceptible populations, and the spatial–temporal description of the interactions of individuals within the system (Colizza and Vespignani, 2008, Balcan et al., 2009). The models of disease progression are several and often independent of the region studied (May and Anderson, 1979, Barrat et al., 2008, Vespignani, 2012). Data-driven, spatial characterizations of populations are available through regularly conducted censuses (censi) (US census bureau, 2013, USDA-NASS, 2013). The third element, the interactions and movements of individuals within the system, has been shown to be as significant as that of epidemic parameters in metapopulation models (Balcan and Vespignani, 2012). Domestic livestock systems are well-suited for metapopulation models as the movements of livestock are controlled and the individuals are restricted to reside within populations rarely defined by their choice. Within the US, livestock movements are controlled by the cattle industries, primarily beef, dairy, breeding, and showmanship.

Within Europe, motivated by outbreaks of Foot and Mouth Disease, a number of governments have designed and implemented animal tracking systems, even to the resolution of individuals’ daily movements. The databases created by these studies have generated very detailed characterizations of livestock movements for a number of European nations (Green et al., 2006, Kao et al., 2007, Bajardi et al., 2012, European Council, 2000, Bigras-Poulin et al., 2006, Brennan et al., 2008, Keeling et al., 2010). No similar program has yet to be mandated for the United States, although the National Animal Identification System has long been in preparation (Animal and plant health inspection service, 2015). In the US, a cultural appreciation of personal privacy, competition between meat production companies, and a U.S. Federal privacy protection law restrict the ability of the government to collect and release livestock data at a finer spatial resolution than is currently done through the United States Department of Agriculture’s (USDA) Census of Agriculture (USDA-NASS, 2013). To address this challenge, a number of survey-based methods have been used to study livestock movements across small regions (Bailey et al., 1995, Forde et al., 1998, Bates et al., 2001, Dominguez, 2007). However, the national scale of US cattle trade and the potential for livestock diseases to impact the entire country necessitate movement data, models, or estimates to be determined for larger regions. Recently a study has been published of a nation-wide movement estimation based on a 10% sample of veterinary records from State border-crossing cattle shipments (Buhnerkempe et al., 2013). These confidential records were acquired from each State and measures were taken to protect the anonymity of the operations involved. This study could only capture shipments of cattle that crossed state borders and was unable to capture the livestock movements within each state. Further studies with these veterinary records have produced a model for generating model cattle networks and characterized a theoretical disease outbreak in the U.S. (Lindström et al., 2013, Buhnerkempe et al., 2014).

We formulate a large, convex optimization problem to estimate parameters describing the movements of cattle within 10 Central States of the United States. We collect cattle population and aggregated movement data from the United States Department of Agriculture’s database and optimally estimate anonymous data points to construct a database of inputs for an estimation of cattle movement parameters. We design the estimation method to produce a high resolution of cattle demographic and movement parameters and to include the minimal set of assumptions. Our results produce county-to-county movement probabilities among stratified subpopulations as well as birth, slaughter, and expiration rates of cattle for 1034 US counties. In Section 2, we describe the USDA data structures and challenges present in the database. We estimate non-disclosed data points and discuss the mapping of USDA data to inputs for our formulation. In Section 3, we formulate the estimation problem and describe the maximum-entropy objective and the flexible set of linear constraints with minimal assumptions. We solve the optimal estimation problem, display a subset of the results, and compare the results to both summary USDA data and a null model in Section 4. Section 5 summarizes this paper with a discussion of the results and a calculation of critical movement thresholds that demonstrate the potential for a disease outbreak in this region.

Section snippets

Data collection and structure

Every five years, the United States Department of Agriculture (USDA) conducts the United States Census of Agriculture (USDA-NASS, 2013). The National Agricultural Statistics Service (NASS) of USDA then summarizes and publishes a large set of data covering livestock, crops, operator demographics, and much more (USDA-NASS, 2013). As the most comprehensive database of US livestock statistics, the NASS database is used for our estimation of cattle movement parameters. In particular, we use data

Cattle movement parameter estimation

We formulate a non-linear, convex optimization problem with an objective to maximize the entropy of the out-going distributions of each subpopulation (Yeung, 2008). This formulation contains only linear constraints. The choice of maximum entropy for the problem objective aims to predict the solution having the minimal assumptions (maximum uncertainty) beyond the information contained within the set of constraints (Yeung, 2008). We chose this form of objective to not force any artificial

Optimization results

We solved the cattle movement parameter problem of Section 3 using the AIMMS Modeling System of Paragon Decision Technology. The complete formulation is composed of 81,142 constraints with 80,107 variables and the objective function. For the error limit, a value of fmin=0.012 was obtained, representing a limit of 1.2% of the total number of cattle, 51,252,890. The limits on the demographic probabilities attempt to capture loose bounds on feasible average rates of birth, culling, and slaughter.

Discussion and conclusions

We have designed and solved a large-scale optimal estimation problem in an attempt to address the privacy challenges and the need for livestock movement data in the United States. Given the resolution limitations of the data available, we do not try to estimate very precise parameters, but rather adopt a stratified metapopulation approach where we shape the structure of our variables around the structure of the NASS data. Our approach is limited by the timescale resolution of the census report.

Acknowledgments

The authors would like to express their gratitude to Dr. Sanderson, Dr. Schroeder, and local farmers and stockers for helpful insights into the beef cattle industry. The authors would further like to express their gratitude to the anonymous reviewers who have contributed significantly to the content. This material is based upon work supported by the U.S. Department of Homeland Security under Grant Award Number 2010-ST061-AG0001 and a grant through the Kansas Biosciences Authority. The views and

References (34)

  • Phillip Schumm et al.

    Global epidemic invasion thresholds in directed cattle subpopulation networks having source, sink, and transit nodes

    J. Theor. Biol.

    (2015)
  • “Animal and plant health inspection service: APHIS national animal health monitoring system: NAHMS,” March...
  • DeeVon Bailey et al.

    Identifying buyer market areas and the impact of buyer concentration in feeder cattle markets using mapping and spatial statistics

    Am. J. Agric. Econom.

    (1995)
  • Paolo Bajardi et al.

    Optimizing surveillance for livestock disease spreading through animal movements

    J. Royal Soc. Interface

    (2012)
  • Duygu Balcan et al.

    Multiscale mobility networks and the spatial spreading of infectious diseases

    Proc. Nat. Acad. Sci.

    (2009)
  • Alain Barrat et al.

    Dynamical Processes on Complex Networks

    (2008)
  • T.W. Bates et al.

    Direct and indirect contact rates among beef, dairy, goat, sheep, and swine herds in three california counties, with reference to control of potential foot-and-mouth disease transmission

    Am. J. Vet. Res.

    (2001)
  • Cited by (9)

    • Modeling U.S. cattle movements until the cows come home: Who ships to whom and how many?

      2022, Computers and Electronics in Agriculture
      Citation Excerpt :

      Given the lack of U.S. cattle shipment data, researchers in livestock epidemiology are forced to rely on either estimations of contact rates through surveys (e.g. McReynolds et al., 2014; Bates et al., 2001; Marshall et al., 2009) or numerical methods to infer information about cattle trade networks of interest from the limited data that is available. For instance Schumm et al. (2015) was able to estimate various parameters related to the movement of different sub-populations of cattle across ten U.S. states by formulating and solving an optimization problem, while Yang et al. (2019) developed an agent-based model capable of simulating the cattle industry and transport network of south-west Kansas. The data used to inform both of these studies was sourced from the National Agricultural Statistical Survey (NASS) census of agriculture (recurring every five years, e.g. USDA, 2014a) where various county-level statistics regarding the livestock industry are available.

    • Estimating and exploring the proportions of inter- and intrastate cattle shipments in the United States

      2019, Preventive Veterinary Medicine
      Citation Excerpt :

      In addition to these county characteristics, we examined four measures of the cattle industry in our analyses; these include: the number of operations with cattle inventory, including calves; the inventory of cattle, including calves; the proportion of operations that are beef operations; and the number of feedlots (operations with cattle on feed). These measures are publicly available through the National Agricultural Statistics Service (NASS) 2012 Census of Agriculture (USDA, 2014) and have been used to inform cattle shipment models (Lindström et al., 2013; Schumm et al., 2015). For each model, we conducted model selection using backwards elimination based on Akaike information criteria (AIC).

    • Genotypic and epidemiologic characterization of extended-spectrum cephalosporin resistant Salmonella enterica from US beef feedlots

      2017, Preventive Veterinary Medicine
      Citation Excerpt :

      While E. coli isolates bearing blaCTX-M have been cultured from the enteric flora of US feedlot cattle (Cottell et al., 2013), we did not recover any Salmonella isolates with this genotype from these 2011 study samples although the same selection pressures would be expected to apply to both the blaCMY-2 and blaCTX-M genotypes. Due to the dynamic nature of both the US beef feedlot industry (Schumm et al., 2015) and enteric flora of beef cattle (Noyes et al., 2016), we would expect to see some variability regarding the distribution of Salmonella strains and antimicrobial resistance mechanisms across feedlots and over time (Dargatz et al., 2016). However, this MDR-AmpC plasmid/gene combination has remained the dominant resistance genotype (Sjölund-Karlsson et al., 2013).

    View all citing articles on Scopus
    View full text