Credit Default Risk Analysis Using Machine Learning Algorithms with Hyperparameter Optimization


Abstract:

Machine learning models give financial companies a systematic way to identify potential debtors early and to predict which clients are more likely to default, improving the accuracy of credit risk assessment. This study analyzes the performance of three gradient boosting algorithms (CatBoost, LightGBM, and XGBoost) in predicting customer default risk, and the ability of the RandomUnderSampler sampling technique to address class imbalance in credit risk data. An exploratory analysis of the data set was carried out first, followed by data preprocessing and, finally, model training with hyperparameters tuned using the GridSearchCV method, with the aim of identifying as many clients with credit risk as possible. The models were evaluated on a consumer credit data set using sensitivity, specificity, and precision. Among the algorithms compared, XGBoost outperformed LightGBM and CatBoost, and the experimental results confirmed that it performs better for credit risk prediction with historical data.
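The two ideas in the abstract that are easiest to misread are random undersampling of the majority class and the three evaluation metrics. The sketch below illustrates both using only the Python standard library. It is not the authors' code: the study used the RandomUnderSampler and GridSearchCV tools, whereas the function names `random_undersample` and `binary_metrics`, the toy labels, and the seed here are all assumptions made for illustration.

```python
import random
from collections import Counter


def random_undersample(X, y, seed=0):
    """Randomly drop rows so every class keeps as many rows as the rarest class.

    Hypothetical stand-in for a library undersampler; illustration only.
    """
    rng = random.Random(seed)
    by_class = {}
    for idx, label in enumerate(y):
        by_class.setdefault(label, []).append(idx)
    n_min = min(len(indices) for indices in by_class.values())
    kept = []
    for indices in by_class.values():
        kept.extend(rng.sample(indices, n_min))
    kept.sort()  # preserve the original row order
    return [X[i] for i in kept], [y[i] for i in kept]


def binary_metrics(y_true, y_pred, positive=1):
    """Sensitivity, specificity, and precision from true and predicted labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    return {
        # share of actual defaulters that were flagged
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        # share of non-defaulters correctly left unflagged
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
        # share of flagged clients that actually defaulted
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
    }


# Toy, made-up data: 8 non-defaulters (0) and 2 defaulters (1).
X = [[i] for i in range(10)]
y = [0] * 8 + [1] * 2
X_bal, y_bal = random_undersample(X, y, seed=42)
print(Counter(y_bal))

# Made-up predictions, used only to exercise the metric definitions.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
print(binary_metrics(y_true, y_pred))
```

Sensitivity is the metric that matches the stated goal of identifying as many at-risk clients as possible, which is why balancing the classes before training matters: a model trained on heavily imbalanced data can score well overall while missing most defaulters.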

Publication year:

2023

Keywords:

  • Gradient boosting
  • Binary classification
  • Credit risk
  • Machine learning

Source:

Scopus
Google
ORCID

Document type:

Conference Object

Status:

Restricted access

Knowledge areas:

  • Machine learning
  • Algorithm
  • Finance

Dewey subject areas:

  • Computer science

Sustainable Development Goals:

  • SDG 8: Decent work and economic growth
  • SDG 17: Partnerships for the goals
  • SDG 9: Industry, innovation and infrastructure