Multi-armed Bandit-Based Metaheuristic Operator Selection: The Pendulum Algorithm Binarization Case

Ábrego-Calderón, Pablo; Crawford, Broderick; Soto, Ricardo; MONFROY, ERIC BERNARD; Giachetti, Giovanni

Abstract

Multi-armed bandit (MAB) is a well-known reinforcement learning algorithm that has shown outstanding performance for recommendation systems and other areas. On the other hand, metaheuristic algorithms have gained much popularity due to their great performance in solving complex problems with endless search spaces. Pendulum Search Algorithm (PSA) is a recently created metaheuristic inspired by the harmonic motion of a pendulum. Its main limitation is to solve combinatorial optimization problems, characterized by using variables in the discrete domain. To overcome this limitation, we propose to use a two-step binarization technique, which offers a large number of possible options that we call scheme. For this, we use MAB as an algorithm that learns and recommends a binarization schemes during the execution of the iterations (online). With the experiments carried out, we show that it delivers better results in solving the Set Covering problem than using a fixed binarization scheme.

Más información

Título según SCOPUS: ID SCOPUS_ID:85163313412 Not found in local SCOPUS DB
Título de la Revista: Communications in Computer and Information Science
Volumen: 1824 CCIS
Editorial: Springer Nature
Fecha de publicación: 2023
Página de inicio: 248
Página final: 259
DOI:

10.1007/978-3-031-34020-8_19

Notas: SCOPUS - SCOPUS