Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude

Created by
  • Haebom

Author

Yile Yan, Yuqi Zhu, Wentao Xu

Outline

This study systematically evaluates the ethical decision-making capabilities and potential biases of nine popular large-scale language models (LLMs). We evaluate the models' ethical preferences, sensitivity, stability, and clustering patterns across 50,400 trials, covering four ethical dilemma scenarios (protective vs. harmful) involving protected attributes, including single and cross-attribute combinations. Results reveal significant biases toward protected attributes across all models, with preferences varying across model type and dilemma contexts. Specifically, open-source LLMs exhibit stronger preferences for marginalized groups and greater sensitivity in harmful scenarios, whereas closed-source models are more selective in protective scenarios and tend to favor mainstream groups. Furthermore, ethical behavior varies across dilemmas. LLMs maintain consistent patterns in protective scenarios, but make more diverse and cognitively demanding decisions in harmful scenarios. Furthermore, models exhibit more pronounced ethical biases in cross-attribute settings than in single-attribute settings, suggesting that complex inputs reveal deeper biases. These results highlight the need for a multidimensional and context-aware assessment of ethical behavior in LLMs, and suggest a systematic assessment and approach to understanding and addressing fairness in LLM decision-making.

Takeaways, Limitations

Takeaways:
Providing a systematic evaluation framework for ethical decision-making in LLMs
Identifying Differences in Ethical Bias Between Open-Source and Closed-Source LLMs (Open-Source Programs Show Greater Favorability for Underserved Groups)
Analysis of ethical behavioral changes in LLM students according to dilemma type and attribute combination (bias is more severe in cross-attribute combinations)
Directions for Improving the Fairness of LLM
Limitations:
Limitations of the dilemma scenarios used in the evaluation
Limits the generalizability of the analysis results to specific LLM models.
Lack of consideration for the subjectivity and diversity of ethical judgments
Further research is needed to determine real-world applicability.
👍