← Volver a resultados
Ficha bibliográfica · Consulta y acceso
Artículo de revista

Hierarchical multi-agent reinforcement learning for retrieval-augmented industrial document question answering

Yihong Qian et al · Nature Portfolio · 2026

Material complementario disponible
Lectura rápida. Revisá los datos básicos del recurso y luego accedé al contenido desde el botón principal. En esta ficha solo se muestra la información necesaria para identificar la obra, citarla y abrirla.
Publicación seriada

3D scan-based classification of Chinese young female hand morphology

Esta publicación seriada contiene 688 contenidos relacionados.

Acceso al recurso

Entrá al contenido desde la opción principal o elegí otra fuente disponible.

Acceso principal

Material complementario disponible

El enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.
Abrir material

Resumen

Descripción general del contenido del recurso.

Abstract Multimodal industrial documents–such as operation manuals, circuit diagrams, and parameter tables–contain domain knowledge distributed across text, images, and document layout. However, most existing retrieval-augmented generation (RAG) frameworks rely on static retrieval and fusion policies with fixed modality weights and uniform retrieval depth, making them less adaptable to diverse query intents and dynamic cross-modal dependencies. As a result, they often retrieve incomplete evidence and yield suboptimal reasoning in complex long-document scenarios. To address these challenges, we propose MARL-RAGDoc, a hierarchical multi-agent reinforcement learning framework for multimodal retrieval-augmented reasoning. A high-level coordinator agent dynamically allocates modality weights and retrieval depth based on query characteristics, while specialized text, image, and table agents perform fine-grained evidence selection within their respective candidate pools. A collaborative reasoning module integrates the retrieved evidence and provides hierarchical reward signals to continuously optimize retrieval policies. Experimental results on multiple multimodal document benchmarks demonstrate that MARL-RAGDoc consistently outperforms baselines in both retrieval accuracy and reasoning performance, while remaining computationally efficient. Our code and dataset are publicly available at https://github.com/Yihong-Q/MARL-RAGDoc .

Cómo citar

Elegí el formato que necesitás y copiá la referencia al portapapeles.

APA 7

al, Y. Q. E. (2026). Hierarchical multi-agent reinforcement learning for retrieval-augmented industrial document question answering. https://doi.org/10.1038/s41598-026-41684-z

MLA

al, Yihong Qian et. "Hierarchical multi-agent reinforcement learning for retrieval-augmented industrial document question answering." 2026. https://doi.org/10.1038/s41598-026-41684-z.

Chicago

al, Yihong Qian et. 2026. "Hierarchical multi-agent reinforcement learning for retrieval-augmented industrial document question answering.". https://doi.org/10.1038/s41598-026-41684-z.

Harvard

al, Y. Q. E. 2026, Hierarchical multi-agent reinforcement learning for retrieval-augmented industrial document question answering, Nature Portfolio, available at: https://doi.org/10.1038/s41598-026-41684-z [Accessed 30 Jun. 2026].

Compartir e imprimir

Guardá la ficha, copiá su enlace permanente o imprimila como PDF.

Exportar referencia

Si usás un gestor bibliográfico, podés exportar el registro en los formatos más comunes.

Detalles del recurso

Información bibliográfica útil para confirmar que se trata del material correcto.

Título
Hierarchical multi-agent reinforcement learning for retrieval-augmented industrial document question answering
Autor / colaboradores
Yihong Qian et al
Editorial
Nature Portfolio
Año de publicación
2026
ISSN
2045-2322
ISSN
2045-2322
Idioma
eng

Materias

Explorá otros recursos relacionados a partir de estas materias.

Copiado