[2602.15298] X-MAP: eXplainable Misclassification Analysis and Profiling for Spam and Phishing Detection

[2602.15298] X-MAP: eXplainable Misclassification Analysis and Profiling for Spam and Phishing Detection

arXiv - AI 3 min read Article

Summary

The paper presents X-MAP, a framework for analyzing and profiling misclassifications in spam and phishing detection, enhancing interpretability and detection accuracy.

Why It Matters

As spam and phishing attacks continue to evolve, effective detection mechanisms are crucial for user safety. X-MAP addresses the challenge of misclassifications, which can lead to severe consequences, by providing a transparent analysis of model failures and improving detection reliability.

Key Takeaways

  • X-MAP combines SHAP-based feature attributions with non-negative matrix factorization for topic profiling.
  • The framework significantly reduces false rejection rates in spam/phishing detection.
  • X-MAP demonstrates high effectiveness with an AUROC of up to 0.98.
  • Misclassified messages show greater divergence from topic profiles, aiding in identifying errors.
  • The framework can serve as a repair layer for existing detection systems, recovering many falsely rejected messages.

Computer Science > Artificial Intelligence arXiv:2602.15298 (cs) [Submitted on 17 Feb 2026] Title:X-MAP: eXplainable Misclassification Analysis and Profiling for Spam and Phishing Detection Authors:Qi Zhang, Dian Chen, Lance M. Kaplan, Audun Jøsang, Dong Hyun Jeong, Feng Chen, Jin-Hee Cho View a PDF of the paper titled X-MAP: eXplainable Misclassification Analysis and Profiling for Spam and Phishing Detection, by Qi Zhang and 6 other authors View PDF HTML (experimental) Abstract:Misclassifications in spam and phishing detection are very harmful, as false negatives expose users to attacks while false positives degrade trust. Existing uncertainty-based detectors can flag potential errors, but possibly be deceived and offer limited interpretability. This paper presents X-MAP, an eXplainable Misclassification Analysis and Profilling framework that reveals topic-level semantic patterns behind model failures. X-MAP combines SHAP-based feature attributions with non-negative matrix factorization to build interpretable topic profiles for reliably classified spam/phishing and legitimate messages, and measures each message's deviation from these profiles using Jensen-Shannon divergence. Experiments on SMS and phishing datasets show that misclassified messages exhibit at least two times larger divergence than correctly classified ones. As a detector, X-MAP achieves up to 0.98 AUROC and lowers the false-rejection rate at 95% TRR to 0.089 on positive predictions. When used as a repair l...

Related Articles

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Improving AI models’ ability to explain their predictions
Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min ·
AI Hiring Growth: AI and ML Hiring Surges 37% in Marche
Machine Learning

AI Hiring Growth: AI and ML Hiring Surges 37% in Marche

AI News - General · 1 min ·
[2603.29171] Segmentation of Gray Matters and White Matters from Brain MRI data
Llms

[2603.29171] Segmentation of Gray Matters and White Matters from Brain MRI data

Abstract page for arXiv paper 2603.29171: Segmentation of Gray Matters and White Matters from Brain MRI data

arXiv - Machine Learning · 4 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime