How Effectively Can Current LLMs Analyze Macrofinancial Issues?

February 27, 2026

Disclaimer: IMF Working Papers describe research in progress by the author(s) and are published to elicit comments and to encourage debate. The views expressed in IMF Working Papers are those of the author(s) and do not necessarily represent the views of the IMF, its Executive Board, or IMF management.

Summary

This paper empirically evaluates the ability of current Large Language Models (LLMs) to analyze macrofinancial coverage in IMF Article IV staff reports, using human economists' assessments as a benchmark. We test several GPT models on reports from 2016-2024, assessing their performance on both qualitative ratings and binary questions. Our findings indicate that the latest models can meaningfully assist economists, achieving an average accuracy of 71-75% on ratings and an average exact match rate of 76-81% on binary questions in 2024 across advanced GPT models. However, we find that LLMs tend to assign higher, less-dispersed ratings than human experts and struggle with open-ended questions that require deep contextual judgment. The paper provides quantitative evidence on current LLM accuracy in this domain, explores the drivers of its performance, and discusses key limitations such as optimistic bias.

Subject: Financial Sector Assessment Program, Financial sector policy and analysis, Macrofinancial analysis, Systemic risk

Keywords: AI, Financial Sector Assessment Program, Human-AI Comparison, IMF article IV, IMF seminar, IMF staff, IMF Staff Reports, IMF working papers, Large Language Model, Large Language Models, Macrofinancial analysis, Macrofinancial Surveillance, Systemic risk, Textual Analysis

Pages:
36
Volume:
2026
DOI:
https://doi.org/10.5089/9798229038935.001
Issue:
035
Series:
Working Paper No. 2026/035
Stock No:
WPIEA2026035
ISBN:
9798229038935
ISSN:
1018-5941

IMF’s Work

RESOURCES

TOPICS

Flagship Publications

Other Publications

IMF reports and publications by country

Regional Offices

All News

See Also

For Journalists

Press Center

RESOURCES

FLAGSHIPS

KEY SERIES

IMF NOTES

Loading component...

IMF Working Papers