Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

AUTALIC: A Dataset for Anti-AUTistic Ableist Language In Context

Created by
  • Haebom

Author

Naba Rizvi, Harper Strickland, Daniel Gitelman, Tristan Cooper, Alexis Morales-Flores, Michael Golden, Aekta Kallepalli, Akshat Alurkar, Haaset Owens, Saleha Ahmedi, Isha Khirwadkar, Imani Munyaka, Nedjma Ousidhoum

Outline

As understanding of autism and ableism grows, so does the understanding of ableist language associated with autism. This language presents significant challenges for NLP research due to its nuanced and context-dependent nature. However, detecting anti-autistic ableist language remains an unexplored area, and existing NLP tools often fail to capture its subtle expressions. In this paper, we address this critical gap by presenting AUTALIC, the first benchmark dataset dedicated to detecting anti-autistic ableist language in context. This dataset consists of 2,400 autism-related sentences and their surrounding context collected from Reddit, annotated by experienced experts with a background in neurodiversity. Comprehensive evaluations demonstrate that current language models, including state-of-the-art LLMs, struggle to reliably identify anti-autistic ableism and match human judgment, highlighting limitations in this area. By publicly releasing AUTALIC, along with its individual annotations, we provide a valuable resource for researchers studying ableism, neurodiversity, and the discrepancy in annotation efforts. This dataset is an important step toward developing more comprehensive and context-aware NLP systems that better reflect diverse perspectives.

Takeaways, Limitations

Takeaways: Made a significant contribution to NLP research by providing AUTALIC, the first benchmark dataset for autistic-ableistic language detection. It highlighted the limitations of existing language models and highlighted the need for more comprehensive NLP systems. It also provided valuable information for neurodiversity research and annotation task mismatch studies.
Limitations: Because the dataset is based on data collected from Reddit, it may reflect platform-specific characteristics. The issue of inconsistency in annotation tasks should be further addressed in future research. Currently, there is a lack of in-depth analysis of the causes of poor language model performance.
👍