Flexibility without foresight: the predictive limitations of mixture models

📅 2025-10-10

📈 Citations: 0

✨ Influential: 0

career value

160K/year

🤖 AI Summary

Hybrid models (e.g., mixed logit and latent class models) are widely assumed to outperform standard discrete choice models in predicting market-level and individual-level choices, yet their generalizability remains untested. Method: Using both revealed and stated preference data, we systematically compare in-sample and out-of-sample predictive performance across multiple modeling approaches that accommodate random heterogeneity—focusing on individual choice accuracy and market share recovery. Contribution/Results: While hybrid models significantly improve in-sample fit and effectively uncover unobserved preference heterogeneity, their predictive advantage is confined to repeated predictions for the same individuals under identical choice contexts. In realistic out-of-sample forecasting—where predictions are made for new individuals or novel choice sets—they offer no consistent improvement over simpler models. This study challenges the prevailing assumption of inherent superiority of hybrid models and provides the first empirical evidence of their limited predictive generalizability, delivering a critical methodological caution for model selection in applied choice analysis.

Technology Category

Application Category

📝 Abstract

Models allowing for random heterogeneity, such as mixed logit and latent class, are generally observed to obtain superior model fit and yield detailed insights into unobserved preference heterogeneity. Using theoretical arguments and two case studies on revealed and stated choice data, this paper highlights that these advantages do not translate into any benefits in forecasting, whether looking at prediction performance or the recovery of market shares. The only exception arises when using conditional distributions in making predictions for the same individuals included in the estimation sample, which obviously precludes any out-of-sample forecasting.

Problem

Research questions and friction points this paper is trying to address.

Mixture models fail to improve forecasting accuracy despite better fit

They show no benefits in predicting out-of-sample market shares

Only work for same estimation sample using conditional distributions

Innovation

Methods, ideas, or system contributions that make the work stand out.

Mixture models reveal unobserved preference heterogeneity

Models show no forecasting benefits despite superior fit

Conditional distributions only work for in-sample predictions

🔎 Similar Papers

No similar papers found.