Assimilating Human Feedback from Autonomous Vehicle Interaction in Reinforcement Learning Models

Richard Fox; Elliot Ludvig

Category: Project

Description: A significant challenge for real-world automated vehicles (AVs) is their interaction with human pedestrians. Starting with a Deep Q Network (DQN) trained on a simple Pygame/Python-based pedestrian crossing environment, we adapted the reward structure so that it could be adjusted by human feedback. Feedback was collected by eliciting behavioural judgements from people in a controlled environment. The reward was shaped by the interaction vector, decomposed into feature aspects for relevant behaviours, thereby facilitating implicit preference selection and explicit task discovery in tandem. Combining computational RL and behavioural-science techniques, we used a formal iterative feedback loop in which the rewards were repeatedly adapted based on human behavioural judgements. Experiments with 124 participants showed strong initial improvement in the judgement of AV behaviours under the adaptive reward structure.
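
As an illustration of the feedback loop described above, the following is a minimal sketch, not the project's implementation. It assumes the reward is a weighted sum of per-episode behaviour features and that human judgements arrive as numeric ratings; the description does not specify either. The feature names, the function names `shaped_reward` and `update_weights`, the learning rate `lr`, and the update rule itself are all hypothetical.

```python
# Hypothetical feature names; the source does not list the actual features.
FEATURES = ("yielded_to_pedestrian", "harsh_braking", "progress")


def shaped_reward(weights, features):
    """Reward as a weighted sum of behaviour features (assumed form)."""
    return sum(w * f for w, f in zip(weights, features))


def update_weights(weights, episode_features, ratings, lr=0.1):
    """Nudge each weight toward features seen in better-rated episodes.

    episode_features: one feature vector per episode shown to people
    ratings: one numeric human judgement per episode (assumed scale)
    """
    mean_rating = sum(ratings) / len(ratings)
    new_weights = list(weights)
    for feats, rating in zip(episode_features, ratings):
        centred = rating - mean_rating  # above or below the average rating
        for i, f in enumerate(feats):
            new_weights[i] += lr * centred * f / len(ratings)
    return new_weights


# One round of the loop: two rated episodes adjust the reward weights.
weights = [0.0, 0.0, 0.0]
episodes = [[1, 0, 1], [0, 1, 1]]  # illustrative feature values only
ratings = [5, 1]                   # illustrative ratings only
weights = update_weights(weights, episodes, ratings)
```

In a full loop, the agent would be retrained or fine-tuned with the updated weights, new episodes would be shown to participants, and the update would be repeated.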
