AutoRL Hyperparameter Landscapes

verfasst von: Aditya Mohan, Carolin Benjamins, Konrad Wienecke, Alexander Dockhorn, Marius Lindauer
Abstract: Although Reinforcement Learning (RL) has shown to be capable of producing impressive results, its use is limited by the impact of its hyperparameters on performance. This often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. In view of existing AutoRL approaches dynamically adjusting hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points in time throughout training. Addressing an important open question on the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time across representative algorithms from RL literature (DQN and SAC) in different kinds of environments (Cartpole and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for more insights on AutoRL problems that can be gained through landscape analyses.
Organisationseinheit(en): Institut für Künstliche Intelligenz
Institut für Informationsverarbeitung
Typ: Aufsatz in Konferenzband
Anzahl der Seiten: 27
Publikationsdatum: 12.11.2023
Publikationsstatus: Veröffentlicht
Peer-reviewed: Ja
ASJC Scopus Sachgebiete: Artificial intelligence, Software, Steuerungs- und Systemtechnik, Statistik und Wahrscheinlichkeit
Elektronische Version(en): https://proceedings.mlr.press/v224/mohan23a/mohan23a.pdf (Zugang: Offen)
https://doi.org/10.48550/arXiv.2304.02396 (Zugang: Offen)

BibTeX

@inproceedings{13e8453dfc8c46b58ef9c3b318d5952a,
title = "AutoRL Hyperparameter Landscapes",
abstract = "Although Reinforcement Learning (RL) has shown to be capable of producing impressive results, its use is limited by the impact of its hyperparameters on performance. This often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. In view of existing AutoRL approaches dynamically adjusting hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points in time throughout training. Addressing an important open question on the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time across representative algorithms from RL literature (DQN and SAC) in different kinds of environments (Cartpole and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for more insights on AutoRL problems that can be gained through landscape analyses.",
keywords = "Reinforcement learning, AutoML, Hyperparameter optimization",
author = "Aditya Mohan and Carolin Benjamins and Konrad Wienecke and Alexander Dockhorn and Marius Lindauer",
note = "Publisher Copyright: {\textcopyright}2023 the authors.; 2nd International Conference on Automated Machine Learning, AutoML 2023, AutoML 2023 ; Conference date: 12-11-2023 Through 15-11-2023",
year = "2023",
month = nov,
day = "12",
doi = "10.48550/arXiv.2304.02396",
language = "English",
series = "Proceedings of Machine Learning Research",
publisher = "PMLR",
booktitle = "Conference proceeding",
}

Details zu Publikationen

AutoRL Hyperparameter Landscapes

Gefördert vom