A dimension reduction assisted credit scoring method for big data with categorical features

In the past decade, financial institutions have invested significant efforts in the development of accurate analytical credit scoring models. The evidence suggests that even small improvements in the accuracy of existing credit-scoring models may optimize profits while effectively managing risk expo...

Full description

Saved in:
Bibliographic Details
Published in:Financial innovation (Heidelberg) Vol. 11; no. 1; pp. 29 - 30
Main Authors: Miljkovic, Tatjana, Wang, Pei
Format: Journal Article
Language:English
Published: Heidelberg Springer Nature B.V 01.12.2025
SpringerOpen
Subjects:
ISSN:2199-4730, 2199-4730
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In the past decade, financial institutions have invested significant efforts in the development of accurate analytical credit scoring models. The evidence suggests that even small improvements in the accuracy of existing credit-scoring models may optimize profits while effectively managing risk exposure. Despite continuing efforts, the majority of existing credit scoring models still include some judgment-based assumptions that are sometimes supported by the significant findings of previous studies but are not validated using the institution’s internal data. We argue that current studies related to the development of credit scoring models have largely ignored recent developments in statistical methods for sufficient dimension reduction. To contribute to the field of financial innovation, this study proposes a Dimension Reduction Assisted Credit Scoring (DRA-CS) method via distance covariance-based sufficient dimension reduction (DCOV-SDR) in Majorization-Minimization (MM) algorithm. First, in the presence of a large number of variables, the DRA-CS method results in greater dimension reduction and better prediction accuracy than the other methods used for dimension reduction. Second, when the DRA-CS method is employed with logistic regression, it outperforms existing methods based on different variable selection techniques. This study argues that the DRA-CS method should be used by financial institutions as a financial innovation tool to analyze high-dimensional customer datasets and improve the accuracy of existing credit scoring methods.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2199-4730
2199-4730
DOI:10.1186/s40854-024-00689-1