Reinforcement Learning for Position Control Problem of a Mobile Robot

Farias, Gonzalo; Garcia, Gonzalo; Montenegro, Guelis; Fabregas, Ernesto; Dormido-Canto, Sebastian; Dormido, Sebastian

Abstract

Due to the increase in complexity in autonomous vehicles, most of the existing control systems are proving to be inadequate. Reinforcement Learning is gaining traction as it is posed to overcome these difficulties in a natural way. This approach allows an agent that interacts with the environment to get rewards for appropriate actions, learning to improve its performance continuously. The article describes the design and development of an algorithm to control the position of a wheeled mobile robot using Reinforcement Learning. One main advantage of this approach concerning traditional control algorithms is that the learning process is carried out automatically with a recursive procedure forward in time. Moreover, given the fidelity of the model for the particular implementation described in this work, the whole learning process can be carried out in simulation. This fact avoids damages to the actual robot during the learning stage. It shows that the position control of the robot (or similar specific tasks) can be done without the need to know the dynamic model of the system explicitly. Its main drawback is that the learning stage can take a long time to finish and that it depends on the complexity of the task and the availability of adequate hardware resources. This work provides a comparison between the proposed approach and traditional existing control laws in simulation and real environments. The article also discusses the main effects of using different controlled variables in the performance of the developed control law.

Más información

Título según WOS: Reinforcement Learning for Position Control Problem of a Mobile Robot
Título de la Revista: IEEE ACCESS
Volumen: 8
Editorial: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Fecha de publicación: 2020
Página de inicio: 152941
Página final: 152951
DOI:

10.1109/ACCESS.2020.3018026

Notas: ISI