NODOVOX | Discovery

Temporal Dependency‐Aware Trajectory‐Level Behavioural Metric for Exploration in Reinforcement Learning

Artículo

Acceso abierto Artículo DOAJ

Temporal Dependency‐Aware Trajectory‐Level Behavioural Metric for Exploration in Reinforcement Learning

Anjie Zhu et al · Wiley · 2026 · ISSN 2468-2322

ABSTRACT Intrinsic motivation serves as the predominant paradigm of exploration in reinforcement learning. In pursuit of an informative and robust state representation, the behavioural metric groups behaviourally equival...

LCC TENDOkNvbXB1dGF0aW9uYWwgbGluZ3Vpc3RpY3MuIE5hdHVyYWwgbGFuZ3VhZ2UgcHJvY2Vzc2luZw~~; LCC:Computer softwareIdioma eng

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Towards Generalisable and Explainable Traffic Signal Control via Deep Reinforcement Learning and Large Language Models

Artículo

Material complementario disponible Artículo DOAJ

Towards Generalisable and Explainable Traffic Signal Control via Deep Reinforcement Learning and Large Language Models

Hao Huang et al · Wiley · 2026 · ISSN 2468-2322

ABSTRACT As a government‐regulated public service, traffic signal control (TSC) requires reliable and transparent decision‐making. However, existing deep reinforcement learning (DRL) methods, despite improvements in ...

LCC TENDOkNvbXB1dGF0aW9uYWwgbGluZ3Vpc3RpY3MuIE5hdHVyYWwgbGFuZ3VhZ2UgcHJvY2Vzc2luZw~~; LCC:Computer softwareIdioma eng

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Hierarchical service chain orchestration for multi-cloud environments enabled by deep reinforcement learning

Artículo

Material complementario disponible Artículo DOAJ

Hierarchical service chain orchestration for multi-cloud environments enabled by deep reinforcement learning

Yuncheng Xie et al · SpringerOpen · 2026 · ISSN 2192-113X

Abstract With the rapid adoption of multi-cloud platforms, dynamic orchestration of service function chains faces coupled challenges. This study proposes a hierarchical service chain orchestration for multi-cloud environ...

LCC TENDOkNvbXB1dGVyIGVuZ2luZWVyaW5nLiBDb21wdXRlciBoYXJkd2FyZQ~~; LCC:Electronic computers. Computer scienceIdioma eng

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Adaptive emotion-aware chatbot for mental health diagnosis using recurrent reinforcement learning and transformer models

Artículo

Acceso abierto Artículo DOAJ

Adaptive emotion-aware chatbot for mental health diagnosis using recurrent reinforcement learning and transformer models

Sonia Dessai et al · Frontiers Media S.A · 2026 · ISSN 2624-8212

In the busy and stressful modern world, people tend to disregard mental health, still it is an important factor of overall health. The constant pressure to achieve success, the invasive nature of technology, and the cons...

LCC LCC:Electronic computers. Computer scienceIdioma eng

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Dynamic matching strategy for college students’ innovative training projects based on reinforcement learning optimization

Artículo

Acceso abierto Artículo DOAJ

Dynamic matching strategy for college students’ innovative training projects based on reinforcement learning optimization

Xiao Ju et al · Springer · 2026 · ISSN 2731-0809

Abstract Matching college students with appropriate innovative training projects is a challenging task that often relies on static assignment techniques, which overlook individual interests, skills, and learning styles. ...

LCC TENDOkNvbXB1dGF0aW9uYWwgbGluZ3Vpc3RpY3MuIE5hdHVyYWwgbGFuZ3VhZ2UgcHJvY2Vzc2luZw~~; LCC:Electronic computers. Computer sciIdioma eng

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

A hierarchical motion planning framework optimizing probabilistic roadmap, pure pursuit, and deep reinforcement learning for non-holonomic automated guided vehicles

Artículo

Material complementario disponible Artículo DOAJ

A hierarchical motion planning framework optimizing probabilistic roadmap, pure pursuit, and deep reinforcement learning for non-holonomic automated guided vehicles

Muhammad Aizat et al · Elsevier · 2026 · ISSN 1110-0168

The motion planning is a critical component of autonomous navigation, requiring the vehicle to reach a target location safely. Traditional navigation approaches for four-wheel differential drive automated guided vehicles...

LCC LCC:Engineering (General). Civil engineering (General)Idioma eng

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Maximum entropy inverse reinforcement learning for campus spatial optimization from Wi-Fi probe trajectories: a case study of Southeast University Wuxi Campus

Artículo

Acceso abierto Artículo DOAJ

Maximum entropy inverse reinforcement learning for campus spatial optimization from Wi-Fi probe trajectories: a case study of Southeast University Wuxi Campus

Guangjin Wang et al · Springer · 2026 · ISSN 2731-6726

Abstract With the outward expansion of university campuses toward suburban areas, the modern campus has evolved into an increasingly independent social space, providing an organizational setting in which users’ daily a...

LCC TENDOkFyY2hpdGVjdHVyZQ~~; LCC:Technology (General)Idioma eng

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Multi-objective reinforcement learning for electric vehicle charging

Artículo

Acceso abierto Artículo CONICET Digital

Multi-objective reinforcement learning for electric vehicle charging

Trimboli, Maximiliano Daniel et al · Elsevier · 2026 · ISSN 2352-4677

The transportation sector is a significant contributor to global greenhouse gas emissions, and Electric Vehicles (EVs) have emerged as a promising solution to mitigate this impact by reducing emissions and integrating re...

Idioma eng

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Human-level control through deep reinforcement learning

Artículo

Página del recurso disponible Artículo OpenAlex

Human-level control through deep reinforcement learning

Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A. Rusu; Joel Veness; Marc G. Bellemare; Alex Graves; Martin Riedmiller · Nature · 2015

Materias / palabras clave: Reinforcement learning; Computer science; Artificial intelligence; Variety (cybernetics); Deep learning; Control (management); Perception; Human–computer interaction; Machine learning; Neuroscience; Biology

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Using combination of actions in reinforcement learning

Artículo

Acceso abierto Artículo SEDICI UNLP

Using combination of actions in reinforcement learning

Karanik, Marcelo J. et al · SEDICI UNLP · 2010

Software agents are programs that can observe their environment and act in an attempt to reach their design goals. In most cases the selection of particular agent architecture determines the behaviour in response to the ...

Idioma en

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Training an agent on brainwaves: using brain signals as feedback for reinforcement learning

Texto / recurso

Acceso abierto Texto / recurso RI ITBA

Training an agent on brainwaves: using brain signals as feedback for reinforcement learning

Moreno, Juan et al · RI ITBA · 2019

"This thesis replicates and proposes an alternative method to train reinforcement learning algorithms with ErrP signals, captured through EEG, and validate the effectiveness of its use in a prototype application." Proyec...

Idioma es

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)

Texto / recurso

Página del recurso disponible Texto / recurso OpenAlex

Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)

Natan, Avraham; Stern, Roni; Kalech, Meir · arXiv (Cornell University) · 2017

Due to the safety risks and training sample inefficiency, it is often preferred to develop controllers in simulation. However, minor differences between the simulation and the real world can cause a significant sim-to-re...

Idioma en

Página del recurso disponiblePágina de referencia del recurso. El texto completo no está confirmado automáticamente.

Página del recurso

Abrir recurso Ficha bibliográfica

Advances in Reinforcement Learning

Libro electrónico

Material complementario disponible Libro electrónico DOAB

Advances in Reinforcement Learning

IntechOpen · ISBN 9789533073699;9789535155034

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Chapter Deep Multiagent Reinforcement Learning Methods Addressing the Scalability Challenge

Capítulo

Acceso abierto Capítulo DOAB

Chapter Deep Multiagent Reinforcement Learning Methods Addressing the Scalability Challenge

Vouros, George · InTechOpen · ISSN 10.5772/intechopen.105627

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Deep Learning and Reinforcement Learning

Libro electrónico

Material complementario disponible Libro electrónico DOAB

Deep Learning and Reinforcement Learning

IntechOpen · ISBN 9781803569512;9781803569505;9781803569529

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Deep Reinforcement Learning zur Steigerung von Energieeffizienz und Pünktlichkeit von Straßenbahnen

Libro electrónico

Material complementario disponible Libro electrónico DOAB

Deep Reinforcement Learning zur Steigerung von Energieeffizienz und Pünktlichkeit von Straßenbahnen

Tesar, Markus · KIT Scientific Publishing · ISSN 10.5445/KSP/1000155565

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Distributional Reinforcement Learning

Libro electrónico

Material complementario disponible Libro electrónico DOAB

Distributional Reinforcement Learning

Bellemare, Marc G · The MIT Press · ISBN 9780262374026;9780262048019

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Effectuation entwickeln : Ein auf Reinforcement Learning aufbauender agentenbasierter Modellierungsbeitrag zur Formalisierung unternehmerischen Verhaltens

Libro electrónico

Material complementario disponible Libro electrónico DOAB

Effectuation entwickeln : Ein auf Reinforcement Learning aufbauender agentenbasierter Modellierungsbeitrag zur Formalisierung unternehmerischen Verhaltens

Sterzel, Martin · Springer Nature · ISBN 9783658392512

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Efficient Reinforcement Learning using Gaussian Processes

Libro electrónico

Material complementario disponible Libro electrónico DOAB

Efficient Reinforcement Learning using Gaussian Processes

Deisenroth, Marc Peter · KIT Scientific Publishing · ISBN 9783866445697

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Entwicklung einer Methode zum Einsatz von Reinforcement Learning für die dynamische Fertigungsdurchlaufsteuerung

Libro electrónico

Material complementario disponible Libro electrónico DOAB

Entwicklung einer Methode zum Einsatz von Reinforcement Learning für die dynamische Fertigungsdurchlaufsteuerung

Lohse, Oliver · KIT Scientific Publishing · ISSN 10.5445/KSP/1000156002

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Learning and adaptation of a policy for dynamic order acceptance in make-to-order manufacturing

Artículo

Acceso abierto Artículo CONICET Digital

Learning and adaptation of a policy for dynamic order acceptance in make-to-order manufacturing

Arredondo, Facundo et al · Pergamon-Elsevier Science Ltd · 2010 · ISSN 0360-8352

Order acceptance under uncertainty is a critical decision-making problem at the interface between customer relationship management and production planning of order-driven manufacturing systems. In this work, a novel appr...

Idioma eng

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Learning by knowledge sharing in autonomous intelligent systems

Texto / recurso

Acceso abierto Texto / recurso RI ITBA

Learning by knowledge sharing in autonomous intelligent systems

García Martínez, Ramón et al · RI ITBA · 2018 · ISSN 0302-9743

"Very few learning systems applied to problem solving have focused on learning operator definitions from the interaction with a completely unknown environment. In order to achieve better learning convergence, several age...

Idioma en

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Bidirectional Q-learning for recycling path planning of used appliances under strong and weak constraints

Artículo

Acceso abierto Artículo DOAJ

Bidirectional Q-learning for recycling path planning of used appliances under strong and weak constraints

Yang Qi et al · Tsinghua University Press · 2024 · ISSN 2772-4247

With the continuous innovation in household appliance technology and the improvement of living standards, the production of discarded household appliances has rapidly increased, making their recycling increasingly signif...

LCC LCC:Transportation engineeringIdioma eng

Acceso abiertoRuta libre sin proxy. Acceso recomendado cuando no hay suscripción activa.

Open Access

Disponible en línea Ficha bibliográfica

Hidden Markov Model-Based Approach for Adaptive Learning Path Recommendation

Artículo

Material complementario disponible Artículo DOAJ

Hidden Markov Model-Based Approach for Adaptive Learning Path Recommendation

Miftah Farid Adiwisastra et al · IEEE · 2026 · ISSN 2169-3536

Adaptive learning systems require effective mechanisms to model students’ evolving learning states and provide personalized learning path recommendations. Traditional rule-based approaches are limited in capturing...

LCC LCC:Electrical engineering. Electronics. Nuclear engineeringIdioma eng

Material complementario disponibleEl enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.

Material complementario

Abrir material Ficha bibliográfica

Anterior 123 4 Siguiente

Buscar recursos académicos

Resultados