← Volver a resultados
Ficha bibliográfica · Consulta y acceso
Artículo

Scalable and reliable multi-agent reinforcement learning for traffic assignment

Leizhen Wang et al · Tsinghua University Press · 2025

Acceso abierto disponible
Lectura rápida. Revisá los datos básicos del recurso y luego accedé al contenido desde el botón principal. En esta ficha solo se muestra la información necesaria para identificar la obra, citarla y abrirla.

Acceso al recurso

Entrá al contenido desde la opción principal o elegí otra fuente disponible.

Acceso principal

Acceso abierto disponible

Recurso identificado como acceso abierto, sin confirmar automáticamente si es texto completo directo.
Abrir recurso

Resumen

Descripción general del contenido del recurso.

The evolution of metropolitan cities and increasing travel demand impose stringent requirements on traffic assignment methods. Multi-agent reinforcement learning (MARL) approaches outperform traditional methods in modeling adaptive routing behavior without requiring explicit system dynamics, making them attractive for real-world deployment. However, existing MARL frameworks face scalability and reliability challenges when managing large-scale networks with substantial and variable demand. This study proposes MARL-OD-DA, a novel framework that redefines agents as origin–destination (OD) pair routers and employs a continuous simplex-constrained action space. This reformulation reduces the agent population from O(N) (number of travelers) to O(|D|) (number of OD pairs), achieving at least two orders of magnitude fewer agents in practice while preserving convexity and enabling efficient adaptation to demand variation, thus significantly improving scalability. In contrast to prior MARL studies constrained to small-sized networks (up to 70 nodes, 2100 travelers) and fixed demand, MARL-OD-DA is validated on medium-sized networks (up to 416 nodes, 1406 OD pairs, and 360,600 travelers) under varying demand scenarios, demonstrating substantial improvements in scalability and applicability. To further enhance reliability, the framework integrates a Dirichlet-based policy, action pruning, and a relative gap-based reward. Theoretical analysis demonstrates that the Dirichlet-based policy reduces gradient bias, stabilizes variance, and enables sparse routing decisions, in contrast to the commonly used softmax-based policy. Experiments on three benchmark networks show that MARL-OD-DA significantly improves assignment quality and convergence speed. On the SiouxFalls network, the trained agents converge within 10 iterations during deployment, reducing the relative gap by 94.99% compared to conventional baselines.

Cómo citar

Elegí el formato que necesitás y copiá la referencia al portapapeles.

APA 7

al, L. W. E. (2025). Scalable and reliable multi-agent reinforcement learning for traffic assignment. https://doi.org/10.1016/j.commtr.2025.100225

MLA

al, Leizhen Wang et. "Scalable and reliable multi-agent reinforcement learning for traffic assignment." 2025. https://doi.org/10.1016/j.commtr.2025.100225.

Chicago

al, Leizhen Wang et. 2025. "Scalable and reliable multi-agent reinforcement learning for traffic assignment.". https://doi.org/10.1016/j.commtr.2025.100225.

Harvard

al, L. W. E. 2025, Scalable and reliable multi-agent reinforcement learning for traffic assignment, Tsinghua University Press, available at: https://doi.org/10.1016/j.commtr.2025.100225 [Accessed 28 Jun. 2026].

Compartir e imprimir

Guardá la ficha, copiá su enlace permanente o imprimila como PDF.

Exportar referencia

Si usás un gestor bibliográfico, podés exportar el registro en los formatos más comunes.

Detalles del recurso

Información bibliográfica útil para confirmar que se trata del material correcto.

Título
Scalable and reliable multi-agent reinforcement learning for traffic assignment
Autor / colaboradores
Leizhen Wang et al
Editorial
Tsinghua University Press
Año de publicación
2025
ISSN
2772-4247
ISSN
2772-4247
Idioma
eng

Materias

Explorá otros recursos relacionados a partir de estas materias.

Copiado