[2501.10677] Class-Imbalanced-Aware Adaptive Dataset Distillation for

[2501.10677] Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring

arXiv - Machine Learning March 31, 2026 4 min read

About this article

Abstract page for arXiv paper 2501.10677: Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring

Computer Science > Machine Learning arXiv:2501.10677 (cs) [Submitted on 18 Jan 2025 (v1), last revised 29 Mar 2026 (this version, v3)] Title:Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring Authors:Xia Li, Hanghang Zheng, Xiwei Zhuang, Zhong Wang, Xiao Chen, Hong Liu, Jasmine Bai, Mao Mao View a PDF of the paper titled Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring, by Xia Li and 7 other authors View PDF Abstract:The advent of artificial intelligence has significantly enhanced credit scoring technologies. Despite the remarkable efficacy of advanced deep learning models, mainstream adoption continues to favor tree-structured models due to their robust predictive performance on tabular data. Although pretrained models have seen considerable development, their application within the financial realm predominantly revolves around question-answering tasks and the use of such models for tabular-structured credit scoring datasets remains largely unexplored. Tabular-oriented large models, such as TabPFN, has made the application of large models in credit scoring feasible, albeit can only processing with limited sample sizes. This paper provides a novel framework to combine tabular-tailored dataset distillation technique with the pretrained model, empowers the scalability for TabPFN. Furthermore, though class imbalance distribution is the common nature in financial datasets, its...

Originally published on March 31, 2026. Curated by AI News.

Machine Learning

AI for Materials Science starter kit [D]

Hi everyone, I've been close to Deep Learning for a while now, and have a good grasp of the fundamentals. So for the computational chemis...

Reddit - Machine Learning · 1 min · 15 minutes ago

Llms

‘AI-based super attacker’ threat looms as top crypto exchanges scramble for access to powerful Claude model

Anthropic’s new AI model found vulnerabilities in code that has existed for years. The company said it had to restrict public access sin...

AI Tools & Products · 4 min · 30 minutes ago

Machine Learning

My bets on open models, mid-2026

What I expect to come next and why, focused on the open-closed gap.

AI Tools & Products · 7 min · 30 minutes ago

Machine Learning

Pennsylvania expanded generative AI to 3,000 employees, with thousands more in training

The Pennsylvania state government has continued to expand its use of generative AI among its workforce, fulfilling a 2023 governor's order.

AI Tools & Products · 4 min · 30 minutes ago

[2501.10677] Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring

About this article

Related Articles

AI for Materials Science starter kit [D]

‘AI-based super attacker’ threat looms as top crypto exchanges scramble for access to powerful Claude model

My bets on open models, mid-2026

Pennsylvania expanded generative AI to 3,000 employees, with thousands more in training

No comments

Stay updated with AI News