[2602.16503] Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects
Summary
This paper introduces Conditionally Additive Local Models (CALMs), which enhance the interpretability of Generalized Additive Models (GAMs) while maintaining predictive accuracy by allowing multiple univariate shape functions that adapt to different regions of the input space.
Why It Matters
As machine learning models become more complex, the need for interpretability grows. CALMs provide a solution that balances accuracy and interpretability, making them valuable for practitioners who require both performance and transparency in their models, especially in sensitive applications.
Key Takeaways
- CALMs improve upon GAMs by allowing multiple univariate shape functions for different input regions.
- The model maintains interpretability while achieving accuracy comparable to more complex models like GA²Ms.
- A distillation-based training pipeline is proposed to identify homogeneous regions with limited interactions.
Computer Science > Machine Learning arXiv:2602.16503 (cs) [Submitted on 18 Feb 2026] Title:Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects Authors:Vasilis Gkolemis, Loukas Kavouras, Dimitrios Kyriakopoulos, Konstantinos Tsopelas, Dimitrios Rontogiannis, Giuseppe Casalicchio, Theodore Dalamagas, Christos Diou View a PDF of the paper titled Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects, by Vasilis Gkolemis and 7 other authors View PDF HTML (experimental) Abstract:Generalized additive models (GAMs) offer interpretability through independent univariate feature effects but underfit when interactions are present in data. GA$^2$Ms add selected pairwise interactions which improves accuracy, but sacrifices interpretability and limits model auditing. We propose \emph{Conditionally Additive Local Models} (CALMs), a new model class, that balances the interpretability of GAMs with the accuracy of GA$^2$Ms. CALMs allow multiple univariate shape functions per feature, each active in different regions of the input space. These regions are defined independently for each feature as simple logical conditions (thresholds) on the features it interacts with. As a result, effects remain locally additive while varying across subregions to capture interactions. We further propose a principled distillation-based training pipeline that identifies homogeneous regions with limited interactions and fits ...