Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Meaning-infused grammar: Gradient Acceptability Shapes the Geometric Representations of Constructions in LLMs

Created by
  • Haebom

Author

Supantho Rakshit, Adele Goldberg

Outline

This study, drawing on a usage-based constructivism (UCx) approach, investigates whether the internal representations of large-scale language models (LLMs) reflect feature-rich, hierarchical structures. Using the Pythia-1.4B model, we analyze the representations of English double-object (DO) and preposition-object (PO) phrases. We leverage a dataset of 5,000 sentence pairs in which human-rated preference for DO or PO was systematically varied. Geometric analysis reveals that the separability of two phrase representations, as measured by energy distance or Jensen-Shannon divergence, is systematically modulated by the gradient preference strength. That is, more typical exemplars of each phrase occupy more distinct regions in the activation space, whereas sentences that are equally likely to appear in either phrase do not. These results provide evidence that LLMs learn rich, hierarchical phrase representations and support the geometric measurement approach to LLM representations.

Takeaways, Limitations

Takeaways:
We present evidence that LLM learns hierarchical syntactic representations that incorporate features suggested by usage-based constructivism (UCx).
We demonstrate the usefulness of applying geometric measurement methods to the analysis of internal representations of LLM.
Provides new insights into the understanding of LLM syntax.
Limitations:
Further research is needed to determine whether the scale and structural characteristics of the LLM (Pythia-1.4B) used in the analysis can be generalized to other LLMs.
Subjectivity in human preference assessments may influence the results.
Because the analysis target sentences are limited to the English DO and PO sentences, generalizability to other languages or sentence types is required.
👍