Aly, Samar and Alfonse, Marco and Salem, Abdel-Badeeh (2022) Intelligent Model for Enhancing the Bankruptcy Prediction with Imbalanced Data Using Oversampling and CatBoost. International Journal of Intelligent Computing and Information Sciences, 22 (3). pp. 92-108. ISSN 2535-1710
IJICIS_Volume 22_Issue 3_Pages 92-108.pdf - Published Version
Download (832kB)
Abstract
Bankruptcy prediction is one of the most significant financial decision-making problems, which prevents financial institutions from sever risks. Most of bankruptcy datasets suffer from imbalanced distribution between output classes, which could lead to misclassification in the prediction results. This research paper presents an efficient bankruptcy prediction model that can handle imbalanced dataset problem by applying Synthetic Minority Oversampling Technique (SMOTE) as a pre-processing step. It applies ensemble-based machine learning classifier, namely, Categorical Boosting (CatBoost) to classify between active and inactive classes. Moreover, the proposed model reduces the dimensionality of the used dataset to increase predictive performance by using three different feature selection techniques. The proposed model is evaluated across the most popular imbalanced bankrupt dataset, which is the Polish dataset. The obtained results proved the efficiency of the applied model, especially in terms of the accuracy. The accuracies ofthe proposed model in predicting bankruptcy on the Polish five years datasets are 98%, 98%, 97%, 97% and 95%, respectively.
Item Type: | Article |
---|---|
Subjects: | OA Digital Library > Computer Science |
Depositing User: | Unnamed user with email support@oadigitallib.org |
Date Deposited: | 29 Jun 2023 04:19 |
Last Modified: | 15 May 2024 10:11 |
URI: | http://library.thepustakas.com/id/eprint/1643 |