[2509.22381] Enhancing Credit Risk Prediction: A Multi-stage Ensemble Pipeline

arXiv - Machine Learning · 4 min read

Computer Science > Machine Learning
arXiv:2509.22381 (cs)
[Submitted on 26 Sep 2025 (v1), last revised 27 Mar 2026 (this version, v2)]

Title: Enhancing Credit Risk Prediction: A Multi-stage Ensemble Pipeline

Authors: Haibo Wang, Jun Huang, Lutfu S. Sua, Figen Balo, Burak Dolar

Abstract: Effective credit risk management is fundamental to financial decision-making, requiring robust models to predict default probabilities and classify financial entities. Traditional machine learning approaches face significant challenges when confronted with high-dimensional data, limited interpretability, rare-event detection, and multi-class risk imbalance. This research proposes a comprehensive multi-stage ensemble pipeline that synthesizes multiple complementary models: econometric models, including ordered logit and ordered probit; supervised learning algorithms, including XGBoost, Random Forest, Support Vector Machine, and Decision Tree; unsupervised methods such as K-Nearest Neighbors; deep learning architectures like Multilayer Perceptron; LASSO regularization for feature selection and dimensionality reduction; and Error-Correcting Output Codes as an ensemble classifier for handling imbalanced multi-class problems. We implement Permutation Feature Importance analysis for each prediction class across all constituent models to enhance model t...

Originally published on March 31, 2026. Curated by AI News.
