Skip to Main content Skip to Navigation
Journal articles

A Comparative Study of Machine Learning Models for Predicting Length of Stay in Hospitals

Abstract : There has been a growing interest in recent years in correctly predicting the Length of Stay (LoS) in a hospital setting. Estimating the LoS on patient' admission helps hospitals in planning, controlling costs and, providing better services. In this paper, we consider predicting the LoS as a regression problem for which we implement and compare different Machine Learning (ML) algorithms. Multiple Linear Regression (MLR), Support Vector Machines (SVM), Random Forests (RF), and Gradient Boosting model (GBM) are implemented using an open-source dataset. The methodological process involves a preprocessing step combining data transformation, data standardization, and categorical data encoding. Moreover, the Synthetic Minority Over Sampling Technique for Regression (SMOTER) is applied to handle unbalanced data. Then, ML algorithms are employed, with a hyperpa-rameter tuning phase to obtain optimal coefficients. Finally, Mean Absolute Error (MAE), R-squared (R 2), and Adjusted R-squared (Adjusted R 2) metrics are selected to evaluate the model with parameters.
Document type :
Journal articles
Complete list of metadata
Contributor : Kathleen TORCK Connect in order to contact the contributor
Submitted on : Friday, October 22, 2021 - 3:07:07 PM
Last modification on : Thursday, June 23, 2022 - 6:23:54 PM




Rachda Naila Mekhaldi, Patrice Caulier, Sondès Chaabane, Abdelahad Chraibi, Sylvain Piechowiak. A Comparative Study of Machine Learning Models for Predicting Length of Stay in Hospitals. Journal of Information Science and Engineering, Academia Sinica, 2021, 37 (5), pp.1025-1038. ⟨10.6688/JISE.202109_37(5).0003⟩. ⟨hal-03395940⟩



Record views