Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Causal Interventions Reveal Shared Structure Across English Filler-Gap Constructions

Created by
  • Haebom

Author

Sasha Boguraev, Christopher Potts, Kyle Mahowald

Outline

Language models (LMs) have emerged as a powerful source of evidence for linguists seeking to develop syntactic theories. This paper argues that applying causal interpretability methods to LMs can significantly enhance the value of this evidence by characterizing the abstract mechanisms that LMs learn to use. We conduct experiments focusing on filler-gap dependency structures in English (e.g., questions, relative clauses). Using experiments based on distributed exchange interventions, we demonstrate that LMs converge on a similar abstract analysis of these structures. This analysis can reveal previously overlooked factors related to frequency, filler type, and surrounding context, potentially leading to changes in standard linguistic theory. Overall, our findings suggest that mechanistic internal analysis of LMs can advance linguistic theory.

Takeaways, Limitations

The application of causal interpretability methodology to LM can contribute to the development of linguistic theory.
We found a similar abstract analysis of LM for the English filler-gap dependence structure.
It suggests that factors such as frequency, filler type, and surrounding context may influence linguistic theory.
The specific Limitations of the paper was not specified.
👍