Volltext-Downloads (blau) und Frontdoor-Views (grau)

The Role of Reinforcement Learning in Production Control: A Systematic Literature Review

  • The expansion of product portfolios, the reduction of product life cycles and the volatility of markets pose significant challenges for production systems and their control. Concurrently, these trends present a challenge in achieving an optimal balance between logistical performance and internal company costs. The discordance between the escalating demands of customers on logistics performance, exemplified by metrics such as throughput time and schedule reliability, and the cost-driven corporate objective of minimizing work-in-process and maximizing machine utilization, is becoming increasingly challenging to reconcile. The application of reinforcement learning (RL) is a significant machine learning (ML) approach for overcoming these challenges. In comparison with other ML approaches, RL facilitates direct interaction with production systems and is consequently well suited for controlling them in operational use. Despite the extensive body of research on RL approaches for production control tasks, there is a paucity of literature addressing the influence of these approaches on key logistical targets for logistics performance and costs. The article’s added value derives from the systematic literature review it conducts, which provides researchers and practitioners with an overview of how existing RL approaches influence central logistical target variables. Furthermore, it highlights blind spots in the research landscape. The results indicate the existence of a substantial number of approaches; however, their distribution across control tasks is disproportionate. Furthermore, it is evident that there are distinct discrepancies in the classification system with respect to the impact on logistical target variables.

Download full text files

Export metadata

Additional Services

Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Jonas SchneiderORCiD, Carl PfannschmidtORCiD, Peter NyhuisORCiD, Matthias SchmidtORCiD
URN:urn:nbn:de:bsz:960-opus4-38692
DOI:https://doi.org/10.25968/opus-3869
DOI original:https://doi.org/10.1109/ACCESS.2026.3668903
ISSN:2169-3536
Parent Title (English):IEEE Access
Publisher:IEEE
Document Type:Article
Language:English
Year of Completion:2026
Publishing Institution:Hochschule Hannover
Release Date:2026/04/30
Tag:Capacity control; machine learning; order release; production control; reinforcement learning; sequencing; systematic literature review
GND Keyword:ProduktionskontrolleGND; Operante KonditionierungGND; Maschinelles LernenGND; DurchlaufzeitGND; ArbeitsprozessGND
Volume:14
Page Number:15
First Page:34375
Last Page:34389
Institutes:Fakultät I - Elektro- und Informationstechnik
DDC classes:670 Industrielle und handwerkliche Fertigung
Licence (German):License LogoCreative Commons - CC BY - Namensnennung 4.0 International