[2507.14899] InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis
Summary
The paper presents InsightX Agent, an LMM-based framework that enhances X-ray non-destructive testing (NDT) by improving reliability, interpretability, and interactivity in defect analysis.
Why It Matters
This research addresses critical limitations in existing deep-learning approaches for X-ray NDT, such as lack of interactivity and interpretability, which are essential for operator trust and effective quality assurance in industrial applications.
Key Takeaways
- InsightX Agent positions a Large Multimodal Model (LMM) as the central orchestrator of X-ray NDT analysis rather than as one stage in a sequential pipeline.
- The framework integrates two tools, the Sparse Deformable Multi-Scale Detector (SDMSD) and the Evidence-Grounded Reflection (EGR) tool, and reports an F1-score of 96.54%.
- It emphasizes active reasoning over passive data processing, enhancing diagnostic reliability.
- The approach improves interpretability and trustworthiness in defect analysis.
- Experimental evaluations demonstrate the framework's transformative potential for industrial inspection tasks.
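For reference, the F1-score cited in the takeaways is the harmonic mean of precision and recall. The sketch below shows the standard computation; the function name `f1_score` and the counts in the usage line are illustrative assumptions and are not taken from the paper.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Standard F1 from true positives, false positives, and false negatives."""
    # Precision: fraction of predicted defects that are real defects.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: fraction of real defects that were detected.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0.0:
        return 0.0
    # Harmonic mean of precision and recall.
    return 2.0 * precision * recall / (precision + recall)


# Illustrative counts only (hypothetical, not from the paper).
example = f1_score(tp=8, fp=2, fn=2)
```

Because F1 penalizes both missed defects and false alarms, it is a common single-number summary for detection tasks where the two error types both matter.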
Computer Science > Artificial Intelligence
arXiv:2507.14899 (cs)
[Submitted on 20 Jul 2025 (v1), last revised 25 Feb 2026 (this version, v3)]
Title: InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis
Authors: Jiale Liu, Huan Wang, Yue Zhang, Xiaoyu Luo, Jiaxiang Hu, Zhiliang Liu, Min Xie
Abstract: Non-destructive testing (NDT), particularly X-ray inspection, is vital for industrial quality assurance, yet existing deep-learning-based approaches often lack interactivity, interpretability, and the capacity for critical self-assessment, limiting their reliability and operator trust. To address these shortcomings, this paper proposes InsightX Agent, a novel LMM-based agentic framework designed to deliver reliable, interpretable, and interactive X-ray NDT analysis. Unlike typical sequential pipelines, InsightX Agent positions a Large Multimodal Model (LMM) as a central orchestrator, coordinating between the Sparse Deformable Multi-Scale Detector (SDMSD) and the Evidence-Grounded Reflection (EGR) tool. The SDMSD generates dense defect region proposals from multi-scale feature maps and sparsifies them through Non-Maximum Suppression (NMS), optimizing detection of small, dense targets in X-ray images while maintaining computational efficiency. ...
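The sparsification step named in the abstract, Non-Maximum Suppression, can be illustrated with a generic greedy version. This is a minimal sketch of textbook NMS, not the SDMSD implementation: the box format `(x1, y1, x2, y2)`, the function names `iou` and `nms`, and the 0.5 IoU threshold are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float],
        iou_threshold: float = 0.5) -> List[int]:
    """Greedy NMS: return indices of kept boxes, highest score first."""
    # Visit proposals in descending score order.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap any already-kept box too much.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Hypothetical proposals: two heavily overlapping boxes and one distant box.
proposals = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
confidences = [0.9, 0.8, 0.7]
kept = nms(proposals, confidences)
```

In this toy input the second box overlaps the first with an IoU of about 0.68, so it is suppressed and only the first and third proposals survive. A lower threshold suppresses more aggressively, which matters for small, densely packed targets where genuine neighbours can overlap.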