Consumer Loans' First Payment Default Detection: a Predictive Model
Loading...
Date
2020
Authors
Koç, Utku
Journal Title
Journal ISSN
Volume Title
Publisher
TUBITAK SCIENTIFIC & TECHNICAL RESEARCH COUNCIL
Abstract
A default loan (also called nonperforming loan) occurs when there is a failure to meet bank conditions and repayment cannot be made in accordance with the terms of the loan which has reached its maturity. In this study, we provide a predictive analysis of the consumer behavior concerning a loan’s first payment default (FPD) using a real dataset of consumer loans with approximately 600,000 records from a bank. We use logistic regression, naive Bayes, support vector machine, and random forest on oversampled and undersampled data to build eight different models to predict FPD loans. A two-class random forest using undersampling yielded more than 86% on all performance measures: accuracy, precision, recall, and F1-score. The corresponding scores are even as high as 96% for oversampling. However, when tested on the real and balanced dataset, the performance of oversampling deteriorates as generating synthetic data for an extremely imbalanced dataset harms the training procedure of the algorithms. The study also provides an understanding of the reasons for nonperforming loans and helps to manage credit risks more consciously.
Description
ORCID
Keywords
Imbalanced class problem, Default loan, Undersampling, Machine learning, First payment default, Oversampling
Turkish CoHE Thesis Center URL
Citation
Koç, U., Sevgili, T. ( January 27, 2020). Consumer loans’ first payment default detection: a predictive model. Turkish Journal of Electrical Engineering & Computer Sciences, 28 (1), 167-181. DOI: https://doi.org/10.3906/elk-1809-190
WoS Q
Q4
Scopus Q
Q3
Source
Turkish Journal of Electrical Engineering & Computer Sciences
Volume
28
Issue
1
Start Page
167
End Page
181