Evaluating the Performance of Serial Production Lines Using Tree-Based Machine Learning Methods
DOI:
https://doi.org/10.31181/rme444Keywords:
Serial Production Line, Tree-based Models, Machine Learning, Multioutput Regression, Simulation, Synchronous, Buffered Systems, Operational Efficiency, Performance EvaluationAbstract
This study investigates the efficiency of serial production lines using advanced tree-based machine learning algorithms, including Random Forest (RF), XGBoost, LightGBM, and CatBoost. By analyzing various performance indicators such as throughput, machine reliability, and processing time, we construct predictive models to evaluate production line behavior. The comparative results demonstrate that LightGBM outperforms the other models in prediction accuracy and computational speed. These findings suggest that machine learning techniques, particularly ensemble tree models, can significantly enhance decision-making and operational performance in manufacturing systems.
References
Assaf, R., Colledani, M., & Matta, A. (2014). Analytical evaluation of the output variability in production systems with general Markovian structure. OR spectrum, 36(3), 799-835. https://doi.org/10.1007/s00291-013-0343-6
Bai, Y., Tu, J., Yang, M., Zhang, L., & Denno, P. (2021). A new aggregation algorithm for performance metric calculation in serial production lines with exponential machines: design, accuracy and robustness. International Journal of Production Research, 59(13), 4072-4089. https://doi.org/10.1080/00207543.2020.1757777
Biau, G., & Scornet, E. (2016). A random forest guided tour. Test, 25(2), 197-227. https://doi.org/10.1007/s11749-016-0481-7
Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32. https://doi.org/10.1023/A:1010933404324
Chen, S., Li, D., Gan, X., & Chiong, R. (2024). An Optimised Light Gradient Boosting Machine Model for Setup Time Prediction in Electronic Production Lines. In Proceedings of the Genetic and Evolutionary Computation Conference Companion (pp. 647-650). https://doi.org/10.1145/3638530.3654428
Chen, T., & Guestrin, C. (2016). Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining (pp. 785-794). https://doi.org/10.1145/2939672.2939785
Chen, X., & Ishwaran, H. (2012). Random forests for genomic data analysis. Genomics, 99(6), 323-329. https://doi.org/10.1016/j.ygeno.2012.04.003
Chen, Y., Khandelwal, M., Onifade, M., Zhou, J., Lawal, A. I., Bada, S. O., & Genc, B. (2025). Predicting the hardgrove grindability index using interpretable decision tree-based machine learning models. Fuel, 384, 133953. https://doi.org/10.1016/j.fuel.2024.133953
Chiang, S.-Y., Kuo, C.-T., & Meerkov, S. M. (2000). DT-bottlenecks in serial production lines: theory and application. IEEE Transactions on Robotics and Automation, 16(5), 567-580. https://doi.org/10.1109/70.880806
Colledani, M., & Tolio, T. (2005). A decomposition method to support the configuration/reconfiguration of production systems. CIRP annals, 54(1), 441-444. https://doi.org/10.1016/S0007-8506(07)60140-1
Dasari, S. K., Lavesson, N., Andersson, P., & Persson, M. (2015). Tree-based response surface analysis. In International workshop on machine learning, optimization and big data (pp. 118-129). Springer. https://doi.org/10.1007/978-3-319-27926-8_11
Demir, L., & Koyuncuoğlu, M. U. (2021). The impact of the optimal buffer configuration on production line efficiency: A VNS-based solution approach. Expert Systems with Applications, 172, 114631. https://doi.org/10.1016/j.eswa.2021.114631
Dey, R. (2024). Bagging v/s Boosting. In. Medium. https://medium.com/@roshmitadey/bagging-v-s-boosting-be765c970fd1
El-kenawy, E.-S. M., Abutarboush, H. F., Mohamed, A. W., & Ibrahim, A. (2021). Advance artificial intelligence technique for designing double T-shaped monopole antenna. Computers, Materials & Continua, 69(3). https://doi.org/10.32604/cmc.2021.019114
El-Kenawy, E.-S. M., Zerouali, B., Bailek, N., Bouchouich, K., Hassan, M. A., Almorox, J., Kuriqi, A., Eid, M., & Ibrahim, A. (2022). Improved weighted ensemble learning for predicting the daily reference evapotranspiration under the semi-arid climate conditions. Environmental Science and Pollution Research, 29(54), 81279-81299. https://doi.org/10.1007/s11356-022-21410-8
Elshabrawy, M. (2025). A review on waste management techniques for sustainable energy production. Metaheuristic Optim. Rev., 3(2), 47-58. https://www.americaspg.com/article/pdf/3493
Galelli, S., & Castelletti, A. (2013). Assessing the predictive capability of randomized tree-based ensembles in streamflow modelling. Hydrology and Earth System Sciences, 17(7), 2669-2684. https://doi.org/10.5194/hess-17-2669-2013, 2013
Galuzzi, B. G., Giordani, I., Candelieri, A., Perego, R., & Archetti, F. (2020). Hyperparameter optimization for recommender systems through Bayesian optimization. Computational Management Science, 17(4), 495-515. https://doi.org/10.1007/s10287-020-00376-3
Gershwin, S. B. (1987). An efficient decomposition method for the approximate evaluation of tandem queues with finite storage space and blocking. Operations research, 35(2), 291-305. https://doi.org/10.1287/opre.35.2.291
Gershwin, S. B., & Berman, O. (1981). Analysis of transfer lines consisting of two unreliable machines with random processing times and finite storage buffers. AIIE transactions, 13(1), 2-11. https://doi.org/10.1080/05695558108974530
Geurts, P., Ernst, D., & Wehenkel, L. (2006). Extremely randomized trees. Machine learning, 63(1), 3-42. https://doi.org/10.1007/s10994-006-6226-1
Hon, K. (2005). Performance and evaluation of manufacturing systems. CIRP annals, 54(2), 139-154. https://doi.org/10.1016/S0007-8506(07)60023-7
Hu, L., & Li, L. (2022). Using tree-based machine learning for health studies: literature review and case series. International journal of environmental research and public health, 19(23), 16080. https://doi.org/10.3390/ijerph192316080
Kavlakoglu, E., & Russi, E. (2024). What is XGBoost? IBM. https://www.ibm.com/think/topics/xgboost
Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., & Liu, T.-Y. (2017). Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems, 30. https://proceedings.neurips.cc/paper_files/paper/2017/file/6449f44a102fde848669bdd9eb6b76fa-Paper.pdf
Khaled, K., & Singla, M. (2025). Predictive analysis of groundwater resources using random forest regression. J. Artif. Intell. Metaheuristics, 9(1), 11-19. https://www.researchgate.net/publication/388403901_Predictive_Analysis_of_Groundwater_Resources_Using_Random_Forest_Regression
Kim, S., Won, Y., Park, K. J., & Eun, Y. (2024). Throughput Approximation by Neural Network for Serial Production Lines With High Up/Downtime Variability. IEEE Transactions on Industrial Informatics, 20(3), 4227-4235. https://doi.org/10.1109/TII.2023.3321026
Kuo, C., Lim, J., & Meerkov, S. (1996). Bottlenecks in serial production lines: A system-theoretic approach. Arbor, 1050, 48109-42122. https://doi.org/10.1155/S1024123X96000348
Le Bihan, H., & Dallery, Y. (2000). A robust decomposition method for the analysis of production lines with unreliable machines and finite buffers. Annals of Operations Research, 93(1), 265-297. https://doi.org/10.1023/A:1018996428429
Li, J., E. Blumenfeld, D., Huang, N., & M. Alden, J. (2009). Throughput analysis of production systems: recent advances and future topics. International Journal of Production Research, 47(14), 3823-3851. https://doi.org/10.1080/00207540701829752
Li, X., Li, W., & Xu, Y. (2018). Human age prediction based on DNA methylation using a gradient boosting regressor. Genes, 9(9), 424. https://doi.org/10.3390/genes9090424
Liu, C.-M., Su, S.-F., & Lin, C.-L. (1996). Predictive models for performance evaluation of serial production lines. International Journal of Production Research, 34(5), 1279-1291. https://doi.org/10.1080/00207549608904965
Machello, C., Baghaei, K. A., Bazli, M., Hadigheh, A., Rajabipour, A., Arashpour, M., Rad, H. M., & Hassanli, R. (2024). Tree-based machine learning approach to modelling tensile strength retention of Fibre Reinforced Polymer composites exposed to elevated temperatures. Composites Part B: Engineering, 270, 111132. https://doi.org/10.1016/j.compositesb.2023.111132
MultiOutputRegressor, Scikit-learn. (2024). https://scikit-learn.org/stable/modules/generated/sklearn.multioutput.MultiOutputRegressor.html
NA. (2024). LightGMB Features. Microsoft Corporation. https://lightgbm.readthedocs.io/en/stable/Features.html
Natekin, A., & Knoll, A. (2013). Gradient boosting machines, a tutorial. Frontiers in neurorobotics, 7, 21. https://doi.org/10.3389/fnbot.2013.00021
Nguyen, V. (2019). Bayesian optimization for accelerating hyper-parameter tuning. In 2019 IEEE second international conference on artificial intelligence and knowledge engineering (AIKE) (pp. 302-305). IEEE. https://doi.org/10.1109/AIKE.2019.00060
P. Bruce, A. Bruce, & P. Gedeck. (2020). Bagging and boosting. In Practical Statistics for Data Scientists (SECOND ed.). O’Reilly Media, Chapter 6. https://www.oreilly.com/library/view/practical-statistics-for/9781492072935/
Samee, N. A., El-Kenawy, E.-S. M., Atteia, G., Jamjoom, M. M., Ibrahim, A., Abdelhamid, A. A., El-Attar, N. E., Gaber, T., Slowik, A., & Shams, M. Y. (2022). Metaheuristic Optimization Through Deep Learning Classification of COVID-19 in Chest X-Ray Images. Computers, Materials & Continua, 73(2). https://doi.org/10.32604/cmc.2022.031147
Tan, B., & Gershwin, S. B. (2009). Analysis of a general Markovian two-stage continuous-flow production system with a finite buffer. International Journal of Production Economics, 120(2), 327-339. https://doi.org/10.1016/j.ijpe.2008.05.022
Tan, B., & Gershwin, S. B. (2011). Modelling and analysis of Markovian continuous flow systems with a finite buffer. Annals of Operations Research, 182(1), 5-30. https://doi.org/10.1007/s10479-009-0612-6
Trawiński, B., Smętek, M., Telec, Z., & Lasota, T. (2012). Nonparametric statistical analysis for multiple comparison of machine learning regression algorithms. https://doi.org/10.2478/v10006-012-0064-z
Wang, Z., Mu, H., & You, J. (2014). Analysis of a serial production line with single part-type and multiple parallel-machine workstations. IFAC Proceedings Volumes, 47(2), 306-313. https://doi.org/10.3182/20140514-3-FR-4046.00033
Wehenkel, L., Ernst, D., & Geurts, P. (2006). Ensembles of extremely randomized trees and some generic applications. In Robust methods for power system state estimation and load forecasting. https://orbi.uliege.be/bitstream/2268/13447/1/robust-trees.pdf
Wong, Q. Y., & Chu, Y. B. (2021). A mobile production monitoring system based on internet of thing (IoT) and random forest classification. International Journal of Electrical and Electronic Engineering & Telecommunications, 10(4), 243-250. https://doi.org/10.18178/ijeetc.10.4.243-250
Wu, D., Jennings, C., Terpenny, J., Gao, R. X., & Kumara, S. (2017). A comparative study on machine learning algorithms for smart manufacturing: tool wear prediction using random forests. Journal of Manufacturing Science and Engineering, 139(7), 071018. https://doi.org/10.1115/1.4036350
Wu, J., Chen, X.-Y., Zhang, H., Xiong, L.-D., Lei, H., & Deng, S.-H. (2019). Hyperparameter optimization for machine learning models based on Bayesian optimization. Journal of Electronic Science and Technology, 17(1), 26-40. https://doi.org/10.11989/JEST.1674-862X.80904120
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Reports in Mechanical Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.