The More the Merrier? A Machine Learning Algorithm for Optimal Pooling of Panel Data
Electronic Access:
Free Download. Use the free Adobe Acrobat Reader to view this PDF file
Disclaimer: IMF Working Papers describe research in progress by the author(s) and are published to elicit comments and to encourage debate. The views expressed in IMF Working Papers are those of the author(s) and do not necessarily represent the views of the IMF, its Executive Board, or IMF management.
Summary:
We leverage insights from machine learning to optimize the tradeoff between bias and variance when estimating economic models using pooled datasets. Specifically, we develop a simple algorithm that estimates the similarity of economic structures across countries and selects the optimal pool of countries to maximize out-of-sample prediction accuracy of a model. We apply the new alogrithm by nowcasting output growth with a panel of 102 countries and are able to significantly improve forecast accuracy relative to alternative pools. The algortihm improves nowcast performance for advanced economies, as well as emerging market and developing economies, suggesting that machine learning techniques using pooled data could be an important macro tool for many countries.
Series:
Working Paper No. 2020/044
Subject:
English
Publication Date:
February 28, 2020
ISBN/ISSN:
9781513529974/1018-5941
Stock No:
WPIEA2020044
Pages:
21
Please address any questions about this title to publications@imf.org