Empirical Identification in the Mixed Logit Model: Analysing the Effect of Data Richness

Cherchi, Elisabetta; de Dios Ortúzar, Juan

doi:10.1007/s11067-007-9045-4

Empirical Identification in the Mixed Logit Model: Analysing the Effect of Data Richness

Published: 29 December 2007

Volume 8, pages 109–124, (2008)
Cite this article

Networks and Spatial Economics Aims and scope Submit manuscript

Elisabetta Cherchi¹ &
Juan de Dios Ortúzar²

454 Accesses
32 Citations
Explore all metrics

Abstract

The flexible structure of the mixed logit (ML) model is at the root of the difficulties associated to its estimation. Major problems are parameter identification and the distinction between different substitution patterns. In this paper we focus on the empirical identification problem and investigate the effect of low information richness in the data on the capability of estimating a correct ML model (i.e. with identifiable parameters and free of confounding effects). In particular, we analyse to which extent the empirical identification problem depends on the variability of the data among alternatives, on the degree of heterogeneity of the taste parameters, on the dimension of the sample and on the number of choice tasks for each individual. To test for information richness of the data and its effect on the capability of the ML model to reproduce random heterogeneity in tastes, a collection of datasets was generated varying systematically (a) the standard deviation (SD) of the distribution of travel time differences between the two alternatives, (b) the SD of the random parameter, (c) the number of choice tasks for each individual and (d) the number of individuals in relation to the number of choice tasks. Then, several ML models allowing for random travel time parameters were estimated using different number of draws and results were compared in terms of model goodness of fit and, also, on the capability of reproducing the real parameters used to generate each dataset. Our results suggest that identification problems depend only on the (low) variability of the associated data and disappear as the richness of the data associated to the random parameter increases. However, rich enough data only allows obtaining good statistics but the estimated parameters do not always reproduce the correct values, as the capability of the ML to reproduce random heterogeneity depends on the random parameter distribution (degree of variability and symmetry). Moreover, the capability of the ML to reproduce random heterogeneity increases when more than one choice is available for each individual and the effect of sample size on the empirical identification reduces considerably.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The benefits of incorporating utility dependencies in finite mixture probit models

Article 06 May 2017

Friederike Paetz & Winfried J. Steiner

Logit Models of Individual Choice

How to generalize from a hierarchical model?

Article Open access 17 May 2020

Max J. Pachali, Peter Kurz & Thomas Otter

References

Bath C, Castelar S (2002) A unified mixed logit framework for modelling revealed and stated preferences: formulation and application to congestion pricing analysis in the San Francisco Bay Area. Transp Res 36B:593–616
Google Scholar
Brownstone D, Bunch D, Train K (2000) Joint mixed logit models of stated and revealed preferences for alternative-fuel vehicles. Transp Res 34B:315–338
Article Google Scholar
Cherchi E, Ortúzar JdeD (2006) Predicting best with Mixed Logit models: understanding some confounding effects. Environ Plan A (under review).
Cherchi E, Ortúzar JdeD (2007a) A Monte Carlo analysis to explore the effect of data richness in the empirical identification of the mixed logit model. 11th World Conference on Transport Research, Berkeley, California, 24–28 June 2007
Cherchi E, Ortúzar JdeD (2007b) On the efficiency of Mixed Logit parameters estimates: analysing the effect of data richness XIII Congreso Chileno de Ingeniería de Transporte, Santiago, 22–26 October 2007
Cherchi E, Polak JW (2005) The assessment of user benefits using discrete choice models: implications of specification errors under random taste heterogeneity. Transp Res Rec 1926:61–69
Article Google Scholar
Chiou L, Walker J (2006) Identification and estimation of mixed logit models under simulation methods. 25th Annual Meeting of the Transportation Research Board. Washington, DC, January 2006
Hensher DA (1998) Extending valuation to controlled value functions and non-uniform scaling with generalised unobserved variances. In: Gärling T, Laitila T, Westin K (eds) Theoretical foundations of travel choice modelling. Elsevier, Amsterdam
Google Scholar
Hensher DA, Greene WH (2003) The mixed logit model: the state of practice. Transportation 30:133–176
Article Google Scholar
Koopmans TC (1949) Identification problems in economic model construction. Econometrica 17:125–144
Article Google Scholar
Munizaga M, Alvarez R (2001) Mixed logit vs. nested logit and probit models. 5th Tri-annual invitational choice symposium. Workshop: hybrid choice models, formulation and practical issues. Asilomar, Texas
Munizaga M, Alvarez R (2005) Testing mixed logit and probit by simulation. Transp Res Rec 1921
Sillano M, Ortúzar JdeD (2005) Willingness-to-pay estimation with mixed logit models: some new evidence. Environ Plan 37A:525–550
Article Google Scholar
Swait JD, Bergantino A (2000) Distinguishing taste variation from error structure in discrete choice data. Transp Res 34B:1–15
Article Google Scholar
Train KE (2003) Discrete choice methods with simulation. Cambridge University Press, Cambridge
Google Scholar
Walker J (2001) Extended discrete choice models: integrated framework, flexible error structures, and latent variables. PhD Thesis, Department of Civil and Environmental Engineering, MIT
Walker J (2002) The mixed logit (or logit kernel) model: dispelling misconceptions of identification. Transp Res Rec 1805:86–98
Article Google Scholar
Williams HCWL, Ortúzar JdeD (1982) Behavioural theories of dispersion and the mis-specification of travel demand models. Transp Res 16B:167–219
Article Google Scholar

Download references

Author information

Authors and Affiliations

CRiMM - Dipartimento di Ingegneria del Territorio, Facoltà di Ingegneria - Università di Cagliari, Piazza d’Armi, 16, 09123, Cagliari, Italia
Elisabetta Cherchi
Departamento de Ingeniería de Transporte y Logística, Pontificia Universidad Católica de Chile, Casilla 306, Cod. 105, Santiago 22, Chile
Juan de Dios Ortúzar

Authors

Elisabetta Cherchi
View author publications
You can also search for this author in PubMed Google Scholar
Juan de Dios Ortúzar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elisabetta Cherchi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cherchi, E., de Dios Ortúzar, J. Empirical Identification in the Mixed Logit Model: Analysing the Effect of Data Richness. Netw Spat Econ 8, 109–124 (2008). https://doi.org/10.1007/s11067-007-9045-4

Download citation

Published: 29 December 2007
Issue Date: September 2008
DOI: https://doi.org/10.1007/s11067-007-9045-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Empirical Identification in the Mixed Logit Model: Analysing the Effect of Data Richness

Abstract

Access this article

Similar content being viewed by others

The benefits of incorporating utility dependencies in finite mixture probit models

Logit Models of Individual Choice

How to generalize from a hierarchical model?

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Empirical Identification in the Mixed Logit Model: Analysing the Effect of Data Richness

Abstract

Access this article

Similar content being viewed by others

The benefits of incorporating utility dependencies in finite mixture probit models

Logit Models of Individual Choice

How to generalize from a hierarchical model?

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation