← Volver a resultados
Ficha bibliográfica · Consulta y acceso
Artículo

Bidirectional Q-learning for recycling path planning of used appliances under strong and weak constraints

Yang Qi et al · Tsinghua University Press · 2024

Acceso abierto disponible
Lectura rápida. Revisá los datos básicos del recurso y luego accedé al contenido desde el botón principal. En esta ficha solo se muestra la información necesaria para identificar la obra, citarla y abrirla.

Acceso al recurso

Entrá al contenido desde la opción principal o elegí otra fuente disponible.

Acceso principal

Acceso abierto disponible

Recurso identificado como acceso abierto, sin confirmar automáticamente si es texto completo directo.
Abrir recurso

Resumen

Descripción general del contenido del recurso.

With the continuous innovation in household appliance technology and the improvement of living standards, the production of discarded household appliances has rapidly increased, making their recycling increasingly significant. Traditional path planning algorithms encounter difficulties in balancing efficiency and constraints in addressing the multi-objective, multi-constraint challenge posed by discarded household appliance recycling routes. To tackle this issue, this study introduces a bi-directional Q-learning-based path planning algorithm. By developing a bi-directional Q-learning mechanism and enhancing the initialization method of Q-learning, the algorithm aims to achieve efficient and effective optimization of discarded household appliance recycling routes. It implements bidirectional updates of the state-action value function from both the starting point and the target point. Additionally, a hierarchical reinforcement learning strategy and guided rewards are introduced to minimize blind exploration and expedite convergence. By decomposing complex recycling tasks into multiple sub-tasks and seeking paths with superior performance at each sub-task level, the initial exploratory blindness is reduced. To validate the efficacy of the proposed algorithm, gridbased modeling of real-world environments is utilized. Comparative experiments reveal significant improvements in iteration counts and path lengths, thereby validating its practical applicability in path planning for recycling initiatives.

Cómo citar

Elegí el formato que necesitás y copiá la referencia al portapapeles.

APA 7

al, Y. Q. E. (2024). Bidirectional Q-learning for recycling path planning of used appliances under strong and weak constraints. https://doi.org/10.1016/j.commtr.2024.100153

MLA

al, Yang Qi et. "Bidirectional Q-learning for recycling path planning of used appliances under strong and weak constraints." 2024. https://doi.org/10.1016/j.commtr.2024.100153.

Chicago

al, Yang Qi et. 2024. "Bidirectional Q-learning for recycling path planning of used appliances under strong and weak constraints.". https://doi.org/10.1016/j.commtr.2024.100153.

Harvard

al, Y. Q. E. 2024, Bidirectional Q-learning for recycling path planning of used appliances under strong and weak constraints, Tsinghua University Press, available at: https://doi.org/10.1016/j.commtr.2024.100153 [Accessed 29 Jun. 2026].

Compartir e imprimir

Guardá la ficha, copiá su enlace permanente o imprimila como PDF.

Exportar referencia

Si usás un gestor bibliográfico, podés exportar el registro en los formatos más comunes.

Detalles del recurso

Información bibliográfica útil para confirmar que se trata del material correcto.

Título
Bidirectional Q-learning for recycling path planning of used appliances under strong and weak constraints
Autor / colaboradores
Yang Qi et al
Editorial
Tsinghua University Press
Año de publicación
2024
ISSN
2772-4247
ISSN
2772-4247
Idioma
eng

Materias

Explorá otros recursos relacionados a partir de estas materias.

Copiado