Adaptive Methods for Continuous Control

Department of Electrical Engineering

Control Robotics and Machine Learning Lab

Technion - Israel Institute of Technology

המעבדה לבקרה רובוטיקה ולמידה חישובית

Adaptive Methods for Continuous Control

Background

In robotics tasks, the actions that are performed at each timestep are complex high dimensional control commands. E.g., the amount of force (normalized to [-1,1]) to apply for each motor.

While there are algorithms capable of coping in such a setting, the complexity of the action space results in instability and longer training procedures.

Project Goal

Extend the Q estimation to better cope with continuous control tasks. Specifically, you will try to estimate the return using polynomials / adaptive discretization methods, such that finding the optimal action is possible analytically.

Project steps

Understand reinforcement learning and deep learning methodologies
Learn the basics of continuous control
Implement basic methods
Research and improve the method, compare to baselines

Required knowledge

Strong programming skills.
Any knowledge in DL and RL is an advantage.
Any knowledge in optimization is an advantage.

Environment

PyTorch (Python)

Comments and links

See the papers "Deep Deterministic Policy Gradients" and "Twin Delayed DDPG".
Potential publication if results are satisfying.