Human-robot cooperation for ATARI

Department of Electrical Engineering

Control Robotics and Machine Learning Lab

Technion - Israel Institute of Technology

המעבדה לבקרה רובוטיקה ולמידה חישובית

Besting ATARI using human-robot cooperation

Background

Deep reinforcement learning (DRL) methods such as the Deep Q-Network (DQN) have achieved state-of-the-art results in a variety of challenging, high-dimensional domains. This success is mainly attributed to the power of deep neural networks to learn rich domain representations for approximating the value function or policy.

While current methods are able to achieve outstanding results, the current algorithms aren't sample efficient and are still unable to beat the top human scores.

We believe that by taking imitation learning a level further, we can not only learn complex problems in a fast and efficient manner but also reach high scores which are far beyond the abilities of current algorithms.

Project Goal

Develop an external control interface between the DQN and a human observer. This interface will allow you to replay trajectories, take control and even fix errors which may have been performed by the agent.

Project steps

Understand the DQN framework.
Understand the ALE framework.
Implement human-agent interface.
Implement and research different approaches and limitations of human-agent interaction.

Required knowledge

Strong programming skills.
Any knowledge in DL and RL is an advantage.
Any knowledge in optimization is an advantage.

Environment

Torch / PyTorch / TensorFlow

Comments and links

See the DQN paper by Google.
See the DQN code.
See the ALE code.