Bayesian analysis of mixtures of lognormal distribution with an unknown number of components from grouped data

📅 2022-10-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the challenge of determining the unknown number of components in a lognormal mixture model for grouped income data. We propose a Bayesian inference framework based on reversible jump Markov chain Monte Carlo (RJMCMC), innovatively extending RJMCMC to grouped-data settings for the first time. By integrating a Bayesian nonparametric prior with a grouped-likelihood construction, our method simultaneously selects the optimal number of mixture components and estimates all parameters, while yielding the full posterior distribution of the Gini coefficient. Simulation studies and empirical analysis of Japan’s 2020 household income data demonstrate that the method robustly identifies two distinct income subpopulations. The posterior mean Gini estimate exhibits absolute bias <0.005 relative to the true value, and its 95% credible interval covers the ground truth—outperforming fixed-component benchmark models. This framework establishes a generalizable Bayesian paradigm for inequality measurement under incompletely observed income data.
📝 Abstract
This study proposes a reversible jump Markov chain Monte Carlo method for estimating parameters of lognormal distribution mixtures for income. Using simulated data examples, we examined the proposed algorithm's performance and the accuracy of posterior distributions of the Gini coefficients. Results suggest that the parameters were estimated accurately. Therefore, the posterior distributions are close to the true distributions even when the different data generating process is accounted for. Moreover, promising results for Gini coefficients encouraged us to apply our method to real data from Japan. The empirical examples indicate two subgroups in Japan (2020) and the Gini coefficients' integrity.
Problem

Research questions and friction points this paper is trying to address.

Estimating lognormal mixture parameters for income data
Assessing Gini coefficient accuracy using Bayesian methods
Identifying income subgroups and inequality in real datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reversible jump MCMC for lognormal mixture estimation
Bayesian analysis of grouped income data components
Posterior distribution estimation for Gini coefficients
🔎 Similar Papers
No similar papers found.