Using GPT-4 for Conventional Metaphor Detection in English News Texts

Authors

  • Jiahui Liang University of Leiden
  • Aletta G. Dorst University of Leiden
  • Jelena Prokic University of Leiden
  • Stephan Raaijmakers University of Leiden

Abstract

Metaphor detection presents a significant challenge in natural language processing (NLP) due to the intrinsic complexity of metaphors. In this work, we apply a prompting approach to evaluate GPT-4’s performance on the conventional metaphor identification task. We specifically investigate the effects of prompt variation, output stability, and the role of n-shot prompting. The results indicate that GPT-4’s performance on the metaphor identification task is consistently low across all tested settings, significantly lagging behind the top-performing BERT model. Based on our findings and error analysis, we propose possible approaches for utilizing LLMs and AI assistants
in metaphor detection and analysis.

Downloads

Published

2025-07-15

How to Cite

Liang, J., Dorst, A. G., Prokic, J., & Raaijmakers, S. (2025). Using GPT-4 for Conventional Metaphor Detection in English News Texts. Computational Linguistics in the Netherlands Journal, 14, 307–341. Retrieved from https://clinjournal.org/clinj/article/view/203

Issue

Section

Articles

Most read articles by the same author(s)