Temporal Dependency‐Aware Trajectory‐Level Behavioural Metric for Exploration in Reinforcement Learning
Artículo
Acceso abierto
Artículo
DOAJ
ABSTRACT Intrinsic motivation serves as the predominant paradigm of exploration in reinforcement learning. In pursuit of an informative and robust state representation, the behavioural metric groups behaviourally equival...
LCC TENDOkNvbXB1dGF0aW9uYWwgbGluZ3Vpc3RpY3MuIE5hdHVyYWwgbGFuZ3VhZ2UgcHJvY2Vzc2luZw~~; LCC:Computer softwareIdioma eng
Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.
Open Access