🤖 AI Summary
This study addresses real-time forecasting of Eurozone GDP growth, focusing on three key questions: (i) whether disaggregated, country-level forecasts aggregated via optimal weights outperform direct aggregate forecasting; (ii) whether incorporating data from smaller member states enhances accuracy; and (iii) whether high-frequency news sentiment mitigates statistical lags in official macroeconomic releases. We propose a mixed-frequency modeling framework integrating MIDAS regressions, machine learning techniques, and panel regression to jointly exploit daily news text and lagged macroeconomic indicators. Empirical results show that: (i) weighted aggregation of country-specific forecasts consistently dominates direct aggregate prediction; (ii) information from smaller euro area countries provides indispensable predictive power—particularly during structural breaks such as the pandemic; and (iii) news-based features significantly improve short-horizon forecast accuracy by bridging informational gaps caused by publication delays in official statistics. The study contributes both methodological innovation in real-time regional macroeconomic monitoring and robust empirical evidence supporting granular, high-frequency data integration.
📝 Abstract
The paper studies the nowcasting of Euro area Gross Domestic Product (GDP) growth using mixed data sampling machine learning panel data regressions with both standard macro releases and daily news data. Using a panel of 19 Euro area countries, we investigate whether directly nowcasting the Euro area aggregate is better than weighted individual country nowcasts. Our results highlight the importance of the information from small- and medium-sized countries, particularly when including the COVID-19 pandemic period. The empirical analysis is supplemented by studying the so-called Big Four -- France, Germany, Italy, and Spain -- and the value added of news data when official statistics are lagging. From a theoretical perspective, we formally show that the aggregation of individual components forecasted with pooled panel data regressions is superior to direct aggregate forecasting due to lower estimation error.