IMF Working Papers

How Effectively Can Current LLMs Analyze Macrofinancial Issues?

By. Paola Ganum, Tohid Atashbar

February 27, 2026

Preview Citation

Format: Chicago

. Paola Ganum, and Tohid Atashbar. "How Effectively Can Current LLMs Analyze Macrofinancial Issues?", IMF Working Papers 2026, 035 (2026), accessed 2/27/2026, https://doi.org/10.5089/9798229038935.001

Export Citation

  • ProCite
  • RefWorks
  • Reference Manager
  • BibTex
  • Zotero
  • EndNote

Disclaimer: IMF Working Papers describe research in progress by the author(s) and are published to elicit comments and to encourage debate. The views expressed in IMF Working Papers are those of the author(s) and do not necessarily represent the views of the IMF, its Executive Board, or IMF management.

Summary

This paper empirically evaluates the ability of current Large Language Models (LLMs) to analyze macrofinancial coverage in IMF Article IV staff reports, using human economists' assessments as a benchmark. We test several GPT models on reports from 2016-2024, assessing their performance on both qualitative ratings and binary questions. Our findings indicate that the latest models can meaningfully assist economists, achieving an average accuracy of 71-75% on ratings and an average exact match rate of 76-81% on binary questions in 2024 across advanced GPT models. However, we find that LLMs tend to assign higher, less-dispersed ratings than human experts and struggle with open-ended questions that require deep contextual judgment. The paper provides quantitative evidence on current LLM accuracy in this domain, explores the drivers of its performance, and discusses key limitations such as optimistic bias.

Subject: Economic sectors, Financial institutions, Financial sector, Financial sector policy and analysis, Futures, Macrofinancial analysis, Systemic risk

Keywords: AI, Financial sector, Futures, Human-AI Comparison, IMF Staff Reports, Large Language Model, Macrofinancial analysis, Macrofinancial Surveillance, North America, Systemic risk, Textual Analysis

Supplemental Resources