Evaluating the Performance of Serial Production Lines Using Tree-Based Machine Learning Methods

Authors

  • Feras Mabrouke Department of Industrial and Mechanical Engineering, Faculty of Engineering, An-Najah National University, Nablus, Palestine
  • Basel Al-Sahili Department of Industrial and Mechanical Engineering, Faculty of Engineering, An-Najah National University, Nablus, Palestine
  • Ramiz Assaf Department of Industrial and Mechanical Engineering, Faculty of Engineering, An-Najah National University, Nablus, Palestine
  • Abdalmuttaleb Al-Sartawi Accounting, Finance and Banking Department, Ahlia University, Bahrain
  • Salem Aljazzar Industrial Engineering Department, Jeddah College of Engineering, University of Business and Technology, Jeddah 21448, Kingdom of Saudi Arabia
  • Siraj Zahran Industrial Engineering Department, Jeddah College of Engineering, University of Business and Technology, Jeddah 21448, Kingdom of Saudi Arabia
  • Mohammad Kanan Industrial Engineering Department, Jeddah College of Engineering, University of Business and Technology, Jeddah 21448, Kingdom of Saudi Arabia

DOI:

https://doi.org/10.31181/rme444

Keywords:

Serial Production Line, Tree-based Models, Machine Learning, Multioutput Regression, Simulation, Synchronous, Buffered Systems, Operational Efficiency, Performance Evaluation

Abstract

This study investigates the efficiency of serial production lines using advanced tree-based machine learning algorithms, including Random Forest (RF), XGBoost, LightGBM, and CatBoost. By analyzing various performance indicators such as throughput, machine reliability, and processing time, we construct predictive models to evaluate production line behavior. The comparative results demonstrate that LightGBM outperforms the other models in prediction accuracy and computational speed. These findings suggest that machine learning techniques, particularly ensemble tree models, can significantly enhance decision-making and operational performance in manufacturing systems.

References

Assaf, R., Colledani, M., & Matta, A. (2014). Analytical evaluation of the output variability in production systems with general Markovian structure. OR spectrum, 36(3), 799-835. https://doi.org/10.1007/s00291-013-0343-6

Bai, Y., Tu, J., Yang, M., Zhang, L., & Denno, P. (2021). A new aggregation algorithm for performance metric calculation in serial production lines with exponential machines: design, accuracy and robustness. International Journal of Production Research, 59(13), 4072-4089. https://doi.org/10.1080/00207543.2020.1757777

Biau, G., & Scornet, E. (2016). A random forest guided tour. Test, 25(2), 197-227. https://doi.org/10.1007/s11749-016-0481-7

Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32. https://doi.org/10.1023/A:1010933404324

Chen, S., Li, D., Gan, X., & Chiong, R. (2024). An Optimised Light Gradient Boosting Machine Model for Setup Time Prediction in Electronic Production Lines. In Proceedings of the Genetic and Evolutionary Computation Conference Companion (pp. 647-650). https://doi.org/10.1145/3638530.3654428

Chen, T., & Guestrin, C. (2016). Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining (pp. 785-794). https://doi.org/10.1145/2939672.2939785

Chen, X., & Ishwaran, H. (2012). Random forests for genomic data analysis. Genomics, 99(6), 323-329. https://doi.org/10.1016/j.ygeno.2012.04.003

Chen, Y., Khandelwal, M., Onifade, M., Zhou, J., Lawal, A. I., Bada, S. O., & Genc, B. (2025). Predicting the hardgrove grindability index using interpretable decision tree-based machine learning models. Fuel, 384, 133953. https://doi.org/10.1016/j.fuel.2024.133953

Chiang, S.-Y., Kuo, C.-T., & Meerkov, S. M. (2000). DT-bottlenecks in serial production lines: theory and application. IEEE Transactions on Robotics and Automation, 16(5), 567-580. https://doi.org/10.1109/70.880806

Colledani, M., & Tolio, T. (2005). A decomposition method to support the configuration/reconfiguration of production systems. CIRP annals, 54(1), 441-444. https://doi.org/10.1016/S0007-8506(07)60140-1

Dasari, S. K., Lavesson, N., Andersson, P., & Persson, M. (2015). Tree-based response surface analysis. In International workshop on machine learning, optimization and big data (pp. 118-129). Springer. https://doi.org/10.1007/978-3-319-27926-8_11

Demir, L., & Koyuncuoğlu, M. U. (2021). The impact of the optimal buffer configuration on production line efficiency: A VNS-based solution approach. Expert Systems with Applications, 172, 114631. https://doi.org/10.1016/j.eswa.2021.114631

Dey, R. (2024). Bagging v/s Boosting. In. Medium. https://medium.com/@roshmitadey/bagging-v-s-boosting-be765c970fd1

El-kenawy, E.-S. M., Abutarboush, H. F., Mohamed, A. W., & Ibrahim, A. (2021). Advance artificial intelligence technique for designing double T-shaped monopole antenna. Computers, Materials & Continua, 69(3). https://doi.org/10.32604/cmc.2021.019114

El-Kenawy, E.-S. M., Zerouali, B., Bailek, N., Bouchouich, K., Hassan, M. A., Almorox, J., Kuriqi, A., Eid, M., & Ibrahim‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬, A. (2022). Improved weighted ensemble learning for predicting the daily reference evapotranspiration under the semi-arid climate conditions. Environmental Science and Pollution Research, 29(54), 81279-81299. https://doi.org/10.1007/s11356-022-21410-8

Elshabrawy, M. (2025). A review on waste management techniques for sustainable energy production. Metaheuristic Optim. Rev., 3(2), 47-58. https://www.americaspg.com/article/pdf/3493

Galelli, S., & Castelletti, A. (2013). Assessing the predictive capability of randomized tree-based ensembles in streamflow modelling. Hydrology and Earth System Sciences, 17(7), 2669-2684. https://doi.org/10.5194/hess-17-2669-2013, 2013

Galuzzi, B. G., Giordani, I., Candelieri, A., Perego, R., & Archetti, F. (2020). Hyperparameter optimization for recommender systems through Bayesian optimization. Computational Management Science, 17(4), 495-515. https://doi.org/10.1007/s10287-020-00376-3

Gershwin, S. B. (1987). An efficient decomposition method for the approximate evaluation of tandem queues with finite storage space and blocking. Operations research, 35(2), 291-305. https://doi.org/10.1287/opre.35.2.291

Gershwin, S. B., & Berman, O. (1981). Analysis of transfer lines consisting of two unreliable machines with random processing times and finite storage buffers. AIIE transactions, 13(1), 2-11. https://doi.org/10.1080/05695558108974530

Geurts, P., Ernst, D., & Wehenkel, L. (2006). Extremely randomized trees. Machine learning, 63(1), 3-42. https://doi.org/10.1007/s10994-006-6226-1

Hon, K. (2005). Performance and evaluation of manufacturing systems. CIRP annals, 54(2), 139-154. https://doi.org/10.1016/S0007-8506(07)60023-7

Hu, L., & Li, L. (2022). Using tree-based machine learning for health studies: literature review and case series. International journal of environmental research and public health, 19(23), 16080. https://doi.org/10.3390/ijerph192316080

Kavlakoglu, E., & Russi, E. (2024). What is XGBoost? IBM. https://www.ibm.com/think/topics/xgboost

Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., & Liu, T.-Y. (2017). Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems, 30. https://proceedings.neurips.cc/paper_files/paper/2017/file/6449f44a102fde848669bdd9eb6b76fa-Paper.pdf

Khaled, K., & Singla, M. (2025). Predictive analysis of groundwater resources using random forest regression. J. Artif. Intell. Metaheuristics, 9(1), 11-19. https://www.researchgate.net/publication/388403901_Predictive_Analysis_of_Groundwater_Resources_Using_Random_Forest_Regression

Kim, S., Won, Y., Park, K. J., & Eun, Y. (2024). Throughput Approximation by Neural Network for Serial Production Lines With High Up/Downtime Variability. IEEE Transactions on Industrial Informatics, 20(3), 4227-4235. https://doi.org/10.1109/TII.2023.3321026

Kuo, C., Lim, J., & Meerkov, S. (1996). Bottlenecks in serial production lines: A system-theoretic approach. Arbor, 1050, 48109-42122. https://doi.org/10.1155/S1024123X96000348

Le Bihan, H., & Dallery, Y. (2000). A robust decomposition method for the analysis of production lines with unreliable machines and finite buffers. Annals of Operations Research, 93(1), 265-297. https://doi.org/10.1023/A:1018996428429

Li, J., E. Blumenfeld, D., Huang, N., & M. Alden, J. (2009). Throughput analysis of production systems: recent advances and future topics. International Journal of Production Research, 47(14), 3823-3851. https://doi.org/10.1080/00207540701829752

Li, X., Li, W., & Xu, Y. (2018). Human age prediction based on DNA methylation using a gradient boosting regressor. Genes, 9(9), 424. https://doi.org/10.3390/genes9090424

Liu, C.-M., Su, S.-F., & Lin, C.-L. (1996). Predictive models for performance evaluation of serial production lines. International Journal of Production Research, 34(5), 1279-1291. https://doi.org/10.1080/00207549608904965

Machello, C., Baghaei, K. A., Bazli, M., Hadigheh, A., Rajabipour, A., Arashpour, M., Rad, H. M., & Hassanli, R. (2024). Tree-based machine learning approach to modelling tensile strength retention of Fibre Reinforced Polymer composites exposed to elevated temperatures. Composites Part B: Engineering, 270, 111132. https://doi.org/10.1016/j.compositesb.2023.111132

MultiOutputRegressor, Scikit-learn. (2024). https://scikit-learn.org/stable/modules/generated/sklearn.multioutput.MultiOutputRegressor.html

NA. (2024). LightGMB Features. Microsoft Corporation. https://lightgbm.readthedocs.io/en/stable/Features.html

Natekin, A., & Knoll, A. (2013). Gradient boosting machines, a tutorial. Frontiers in neurorobotics, 7, 21. https://doi.org/10.3389/fnbot.2013.00021

Nguyen, V. (2019). Bayesian optimization for accelerating hyper-parameter tuning. In 2019 IEEE second international conference on artificial intelligence and knowledge engineering (AIKE) (pp. 302-305). IEEE. https://doi.org/10.1109/AIKE.2019.00060

P. Bruce, A. Bruce, & P. Gedeck. (2020). Bagging and boosting. In Practical Statistics for Data Scientists (SECOND ed.). O’Reilly Media, Chapter 6. https://www.oreilly.com/library/view/practical-statistics-for/9781492072935/

Samee, N. A., El-Kenawy, E.-S. M., Atteia, G., Jamjoom, M. M., Ibrahim, A., Abdelhamid, A. A., El-Attar, N. E., Gaber, T., Slowik, A., & Shams, M. Y. (2022). Metaheuristic Optimization Through Deep Learning Classification of COVID-19 in Chest X-Ray Images. Computers, Materials & Continua, 73(2). https://doi.org/10.32604/cmc.2022.031147

Tan, B., & Gershwin, S. B. (2009). Analysis of a general Markovian two-stage continuous-flow production system with a finite buffer. International Journal of Production Economics, 120(2), 327-339. https://doi.org/10.1016/j.ijpe.2008.05.022

Tan, B., & Gershwin, S. B. (2011). Modelling and analysis of Markovian continuous flow systems with a finite buffer. Annals of Operations Research, 182(1), 5-30. https://doi.org/10.1007/s10479-009-0612-6

Trawiński, B., Smętek, M., Telec, Z., & Lasota, T. (2012). Nonparametric statistical analysis for multiple comparison of machine learning regression algorithms. https://doi.org/10.2478/v10006-012-0064-z

Wang, Z., Mu, H., & You, J. (2014). Analysis of a serial production line with single part-type and multiple parallel-machine workstations. IFAC Proceedings Volumes, 47(2), 306-313. https://doi.org/10.3182/20140514-3-FR-4046.00033

Wehenkel, L., Ernst, D., & Geurts, P. (2006). Ensembles of extremely randomized trees and some generic applications. In Robust methods for power system state estimation and load forecasting. https://orbi.uliege.be/bitstream/2268/13447/1/robust-trees.pdf

Wong, Q. Y., & Chu, Y. B. (2021). A mobile production monitoring system based on internet of thing (IoT) and random forest classification. International Journal of Electrical and Electronic Engineering & Telecommunications, 10(4), 243-250. https://doi.org/10.18178/ijeetc.10.4.243-250

Wu, D., Jennings, C., Terpenny, J., Gao, R. X., & Kumara, S. (2017). A comparative study on machine learning algorithms for smart manufacturing: tool wear prediction using random forests. Journal of Manufacturing Science and Engineering, 139(7), 071018. https://doi.org/10.1115/1.4036350

Wu, J., Chen, X.-Y., Zhang, H., Xiong, L.-D., Lei, H., & Deng, S.-H. (2019). Hyperparameter optimization for machine learning models based on Bayesian optimization. Journal of Electronic Science and Technology, 17(1), 26-40. https://doi.org/10.11989/JEST.1674-862X.80904120

Downloads

Published

2025-06-24

How to Cite

Evaluating the Performance of Serial Production Lines Using Tree-Based Machine Learning Methods. (2025). Reports in Mechanical Engineering, 6(1), 27-45. https://doi.org/10.31181/rme444