Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Tug-of-war between idioms' figurative and literal interpretations in LLMs

Created by
  • Haebom

Author

Soyoung Oh, Xinting Huang, Mathis Pink, Michael Hahn, Vera Demberg

Outline

This paper analyzes how a language model processes idioms with non-constructive figurative interpretations through causal tracing. Specifically, we uncover three mechanisms by which a pre-trained causal transformer handles idiom ambiguity: (i) Early lower layers and specific attention heads retrieve figurative interpretations of idioms and suppress literal interpretations. (ii) If context appears before the idiom, the model utilizes it from the earliest layers onward, and if the context conflicts with the retrieved interpretation, it refines the interpretation in later layers. (iii) It propagates both interpretations through selective and competing paths, with the intermediate path prioritizing the figurative interpretation, and the parallel direct path favoring the literal interpretation, ensuring that both interpretations are preserved.

Takeaways, Limitations

Takeaways:
Providing mechanical evidence for idiom understanding within an unsupervised regression transformer.
A specific mechanism is presented for how language models handle the figurative interpretation of idioms.
Emphasize the role of the early layers and the importance of context in idiom processing.
Limitations:
Research results may be limited to specific model architectures.
Further verification of generalizability across various idioms and contexts is needed.
Absence of metrics to quantitatively assess the level of accurate understanding of the model
👍