Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Learning to Drive Ethically: Embedding Moral Reasoning into Autonomous Driving

Created by
  • Haebom

Author

Dianzhao Li, Ostap Okhrin

Outline

This paper presents a hierarchical Safe Reinforcement Learning (Safe RL) framework for ethical decision-making in autonomous vehicles. This framework is designed around a Safe RL agent that generates high-level action goals using ethical risk costs, which combine crash probability and damage severity. It leverages a dynamic prioritized experience replay mechanism to enhance learning about rare but critical high-risk events, and generates smooth, feasible trajectories through polynomial path planning and PID and Stanley controllers. Training and validation using a real-world traffic dataset demonstrates superior performance compared to existing methods in terms of ethical risk reduction and driving performance maintenance. Notably, this is the first Safe RL study evaluating ethical decision-making in autonomous vehicles in a real-world, mixed-traffic scenario.

Takeaways, Limitations

Takeaways:
Presentation and performance validation of a Safe RL framework for ethical decision-making in autonomous vehicles using real-world data.
Developing a decision-making system that explicitly considers ethical risks (probability of collision and severity of damage).
Enhanced learning of high-risk events through dynamic priority experience replay mechanisms.
Enhancing ethically responsible autonomy by combining formal control theory and data-driven learning.
Contributing to the protection of vulnerable road users such as pedestrians and cyclists.
Limitations:
Further research is needed on the limitations and generalizability of real-world datasets.
Further review is needed to determine whether comprehensive treatment of various ethical dilemmas is warranted.
Further analysis of the framework's computational cost and real-time performance is needed.
Further validation of long-term safety and reliability is needed.
👍