Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Gender and Political Bias in Large Language Models: A Demonstration Platform

Created by
  • Haebom

Author

Wenjie Lin, Hange Liu, Xutao Mao, Yingying Zhuang, Jingwei Shi, Xudong Han, Tianyu Shi, Jinrui Yang

Outline

ParlAI Vote is an interactive system for exploring European Parliament debates and votes, testing Large Language Models (LLMs) for vote prediction and bias analysis. The platform links debate topics, speeches, and list voting results, including rich demographic data such as gender, age, country, and political group. Users can explore debates, review linked speeches, compare state-of-the-art LLM predictions with actual voting results, and view error breakdowns by demographic group. By visualizing the EuroParlVote benchmark and the core tasks of gender classification and vote prediction, ParlAI Vote highlights systematic performance biases in state-of-the-art LLMs. The system integrates data, models, and visual analytics into a single interface, lowering the barriers to result replication, behavioral audits, and running counter-empirical scenarios. It supports research, education, and public engagement in legislative decision-making, while also highlighting the strengths and limitations of current LLMs in political analysis.

Takeaways, Limitations

Takeaways:
Combining European Parliament debate and voting data with cutting-edge LLM expertise provides a powerful platform for vote prediction and bias analysis.
Increase transparency and accountability by visualizing performance bias in LLM.
Improving accessibility for research, education, and public engagement.
Enhancing understanding of LLM decision-making processes through semi-empirical scenarios.
Limitations:
The platform's data is limited to the European Parliament.
This is not a proposal to completely address the biases of LLM.
Further research is needed on the platform's usability and scalability.
👍