top of page
Department of Electrical Engineering
Control Robotics and Machine Learning Lab

Besting ATARI using human-robot cooperation

Background

 

Deep reinforcement learning (DRL) methods such as the Deep Q-Network (DQN) have achieved state-of-the-art results in a variety of challenging, high-dimensional domains. This success is mainly attributed to the power of deep neural networks to learn rich domain representations for approximating the value function or policy.

 

While current methods are able to achieve outstanding results, the current algorithms aren't sample efficient and are still unable to beat the top human scores.

 

We believe that by taking imitation learning a level further, we can not only learn complex problems in a fast and efficient manner but also reach high scores which are far beyond the abilities of current algorithms.

 

Project Goal

 

Develop an external control interface between the DQN and a human observer. This interface will allow you to replay trajectories, take control and even fix errors which may have been performed by the agent.

Project steps

 

  • Understand the DQN framework.

  • Understand the ALE framework.

  • Implement human-agent interface.

  • Implement and research different approaches and limitations of human-agent interaction.

 

Required knowledge

 

  • Strong programming skills.

  • Any knowledge in DL and RL is an advantage.

  • Any knowledge in optimization is an advantage.

 

Environment

  • Torch / PyTorch / TensorFlow

 

Comments and links

 

  • See the DQN paper by Google.

  • See the DQN code.

  • See the ALE code.

bottom of page