[2602.21130] An Enhanced Projection Pursuit Tree Classifier with Visual Methods for Assessing Algorithmic Improvements
Summary
This paper introduces enhancements to the projection pursuit tree classifier, focusing on visual methods to assess algorithmic improvements in high-dimensional data classification.
Why It Matters
The advancements in projection pursuit tree classifiers address limitations in handling complex classification problems, particularly in multi-class scenarios with unequal variances. This research is significant for data scientists and machine learning practitioners seeking to improve classification accuracy and interpretability in high-dimensional datasets.
Key Takeaways
- Enhancements allow for more flexible splits and class groupings in classifiers.
- Visual diagnostic methods are developed to verify algorithm performance.
- The R package PPtreeExt implements the proposed enhancements for practical use.
Statistics > Machine Learning arXiv:2602.21130 (stat) [Submitted on 24 Feb 2026] Title:An Enhanced Projection Pursuit Tree Classifier with Visual Methods for Assessing Algorithmic Improvements Authors:Natalia da Silva, Dianne Cook, Eun-Kyung Lee View a PDF of the paper titled An Enhanced Projection Pursuit Tree Classifier with Visual Methods for Assessing Algorithmic Improvements, by Natalia da Silva and 1 other authors View PDF HTML (experimental) Abstract:This paper presents enhancements to the projection pursuit tree classifier and visual diagnostic methods for assessing their impact in high dimensions. The original algorithm uses linear combinations of variables in a tree structure where depth is constrained to be less than the number of classes -- a limitation that proves too rigid for complex classification problems. Our extensions improve performance in multi-class settings with unequal variance-covariance structures and nonlinear class separations by allowing more splits and more flexible class groupings in the projection pursuit computation. Proposing algorithmic improvements is straightforward; demonstrating their actual utility is not. We therefore develop two visual diagnostic approaches to verify that the enhancements perform as intended. Using high-dimensional visualization techniques, we examine model fits on benchmark datasets to assess whether the algorithm behaves as theorized. An interactive web application enables users to explore the behavior of both t...