← Volver a resultados
Ficha bibliográfica · Consulta y acceso
Artículo de revista

Multi-objective inventory optimization using reinforcement learning: a comparative study on profitability and carbon emissions

Abdulrahman Sorour et al · Nature Portfolio · 2026

Material complementario disponible
Lectura rápida. Revisá los datos básicos del recurso y luego accedé al contenido desde el botón principal. En esta ficha solo se muestra la información necesaria para identificar la obra, citarla y abrirla.
Publicación seriada

3D scan-based classification of Chinese young female hand morphology

Esta publicación seriada contiene 688 contenidos relacionados.

Acceso al recurso

Entrá al contenido desde la opción principal o elegí otra fuente disponible.

Acceso principal

Material complementario disponible

El enlace apunta a material asociado, anexos, tablas, datos o página complementaria. No se marca como libro/texto completo.
Abrir material

Resumen

Descripción general del contenido del recurso.

Abstract Inventory management is a core part of supply chains, and over the years it has been increasingly challenged by the need to balance economic performance with environmental considerations. While prior reinforcement learning (RL) studies have incorporated carbon emissions indirectly through cost penalties or regulatory constraints, this work addresses an existing gap by treating emissions as an independent optimization objective. This study examines RL as an adaptive decision‑making approach for inventory optimization with two objectives: maximizing profit and minimizing carbon emissions. The problem is formulated as a Markov Decision Process, and four RL algorithms Proximal Policy Optimization (PPO), Phasic Policy Gradient (PPG), Advantage Actor‑Critic (A2C), and Double Deep Q‑Network (DDQN) are evaluated under identical experimental conditions. Carbon emissions are explicitly modeled in the reward function rather than embedded within operating costs. The results show that PPG achieves the highest profitability with only a modest increase in emissions, while DDQN converges faster but yields lower profit overall. Sensitivity analysis indicates that reward weighting strongly influences policy behavior, with PPO providing the most stable trade‑off between profitability and emissions.

Cómo citar

Elegí el formato que necesitás y copiá la referencia al portapapeles.

APA 7

al, A. S. E. (2026). Multi-objective inventory optimization using reinforcement learning: a comparative study on profitability and carbon emissions. https://doi.org/10.1038/s41598-026-44293-y

MLA

al, Abdulrahman Sorour et. "Multi-objective inventory optimization using reinforcement learning: a comparative study on profitability and carbon emissions." 2026. https://doi.org/10.1038/s41598-026-44293-y.

Chicago

al, Abdulrahman Sorour et. 2026. "Multi-objective inventory optimization using reinforcement learning: a comparative study on profitability and carbon emissions.". https://doi.org/10.1038/s41598-026-44293-y.

Harvard

al, A. S. E. 2026, Multi-objective inventory optimization using reinforcement learning: a comparative study on profitability and carbon emissions, Nature Portfolio, available at: https://doi.org/10.1038/s41598-026-44293-y [Accessed 30 Jun. 2026].

Compartir e imprimir

Guardá la ficha, copiá su enlace permanente o imprimila como PDF.

Exportar referencia

Si usás un gestor bibliográfico, podés exportar el registro en los formatos más comunes.

Detalles del recurso

Información bibliográfica útil para confirmar que se trata del material correcto.

Título
Multi-objective inventory optimization using reinforcement learning: a comparative study on profitability and carbon emissions
Autor / colaboradores
Abdulrahman Sorour et al
Editorial
Nature Portfolio
Año de publicación
2026
ISSN
2045-2322
ISSN
2045-2322
Idioma
eng

Materias

Explorá otros recursos relacionados a partir de estas materias.

Copiado