Setiadi, De Rosal Ignatius Moses and Muslikh, Ahmad Rofiqul and Iriananda, Syahroni Wahyu and Warto, Warto and Gondohanindijo, Jutono and Ojugo, Arnold Adimabua (2024) Outlier Detection Using Gaussian Mixture Model Clustering to Optimize XGBoost for Credit Approval Prediction. Journal of Computing Theories and Applications, 2 (2). pp. 244-255. ISSN 3024-9104
11638-Article Text-40742-1-10-20241101.pdf - Published Version
Download (377kB) | Preview
Abstract
Credit approval prediction is one of the critical challenges in the financial industry, where the accuracy and efficiency of credit decision-making can significantly affect business risk. This study proposes an outlier detection method using the Gaussian Mixture Model (GMM) combined with Extreme Gradient Boosting (XGBoost) to improve prediction accuracy. GMM is used to detect outliers with a probabilistic approach, allowing for finer-grained anomaly identification compared to distance- or density-based methods. Furthermore, the data cleaned through GMM is processed using XGBoost, a decision tree-based boosting algorithm that efficiently handles complex datasets. This study compares the performance of XGBoost with various outlier detection methods, such as LOF, CBLOF, DBSCAN, IF, and K-Means, as well as various other classification algorithms based on machine learning and deep learning. Experimental results show that the combination of GMM and XGBoost provides the best performance with an accuracy of 95.493%, a recall of 91.650%, and an AUC of 95.145%, outperforming other models in the context of credit approval prediction on an imbalanced dataset. The proposed method has been proven to reduce prediction errors and improve the model's reliability in detecting eligible credit applications.
Item Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Depositing User: | Unnamed user with email cute.moses89@gmail.com |
Date Deposited: | 17 Nov 2024 16:06 |
Last Modified: | 17 Nov 2024 16:33 |
URI: | https://dl.futuretechsci.org/id/eprint/6 |