🤖 AI Summary
Hybrid models (e.g., mixed logit and latent class models) are widely assumed to outperform standard discrete choice models in predicting market-level and individual-level choices, yet their generalizability remains untested. Method: Using both revealed and stated preference data, we systematically compare in-sample and out-of-sample predictive performance across multiple modeling approaches that accommodate random heterogeneity—focusing on individual choice accuracy and market share recovery. Contribution/Results: While hybrid models significantly improve in-sample fit and effectively uncover unobserved preference heterogeneity, their predictive advantage is confined to repeated predictions for the same individuals under identical choice contexts. In realistic out-of-sample forecasting—where predictions are made for new individuals or novel choice sets—they offer no consistent improvement over simpler models. This study challenges the prevailing assumption of inherent superiority of hybrid models and provides the first empirical evidence of their limited predictive generalizability, delivering a critical methodological caution for model selection in applied choice analysis.
📝 Abstract
Models allowing for random heterogeneity, such as mixed logit and latent class, are generally observed to obtain superior model fit and yield detailed insights into unobserved preference heterogeneity. Using theoretical arguments and two case studies on revealed and stated choice data, this paper highlights that these advantages do not translate into any benefits in forecasting, whether looking at prediction performance or the recovery of market shares. The only exception arises when using conditional distributions in making predictions for the same individuals included in the estimation sample, which obviously precludes any out-of-sample forecasting.