Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Evaluating Contrast Localizer for Identifying Causal Units in Social & Mathematical Tasks in Language Models

Created by
  • Haebom

Author

Yassine Jamaa, Badr AlKhamissi, Satrajit Ghosh, Martin Schrimpf

Outline

This study applies a neuroscientific contrastive localization technique to identify units causally relevant to theory of mind (ToM) and mathematical reasoning tasks in large-scale language models (LLMs) and visual-language models (VLMs). Using contrastive stimulus sets, we localized top-activated units across 11 LLMs and 5 VLMs, ranging from 3 billion to 90 billion parameters, and assessed their causal roles through targeted ablation. We compared the effects of functionally selected units on downstream accuracy in established ToM and mathematics benchmarks to those of low-activation and randomly selected units. Contrary to expectations, low-activation units sometimes resulted in greater performance impairment than high-activation units, and units derived from mathematical localizers often impaired ToM performance more than units derived from ToM localizers. These results question the causal relevance of contrast-based localizers and highlight the need to more accurately capture broader stimulus sets and task-specific units.

Takeaways, Limitations

Takeaways: A novel approach to identifying causally relevant units for theory of mind and mathematical reasoning tasks in large-scale language and visual-language models is presented. Experimental results demonstrating the limitations of contrast-based localization techniques are presented, highlighting the importance of low-activation units.
Limitations: The stimulus set used may be limited. It may not accurately capture task-specific units. This raises questions about the reliability of contrast-based localization techniques.
👍