[2603.29403] Security in LLM-as-a-Judge: A Comprehensive SoK
About this article
Abstract page for arXiv paper 2603.29403: Security in LLM-as-a-Judge: A Comprehensive SoK
Computer Science > Cryptography and Security arXiv:2603.29403 (cs) [Submitted on 31 Mar 2026] Title:Security in LLM-as-a-Judge: A Comprehensive SoK Authors:Aiman Almasoud, Antony Anju, Marco Arazzi, Mert Cihangiroglu, Vignesh Kumar Kembu, Serena Nicolazzo, Antonino Nocera, Vinod P., Saraga Sakthidharan View a PDF of the paper titled Security in LLM-as-a-Judge: A Comprehensive SoK, by Aiman Almasoud and 8 other authors View PDF HTML (experimental) Abstract:LLM-as-a-Judge (LaaJ) is a novel paradigm in which powerful language models are used to assess the quality, safety, or correctness of generated outputs. While this paradigm has significantly improved the scalability and efficiency of evaluation processes, it also introduces novel security risks and reliability concerns that remain largely unexplored. In particular, LLM-based judges can become both targets of adversarial manipulation and instruments through which attacks are conducted, potentially compromising the trustworthiness of evaluation pipelines. In this paper, we present the first Systematization of Knowledge (SoK) focusing on the security aspects of LLM-as-a-Judge systems. We perform a comprehensive literature review across major academic databases, analyzing 863 works and selecting 45 relevant studies published between 2020 and 2026. Based on this study, we propose a taxonomy that organizes recent research according to the role played by LLM-as-a-Judge in the security landscape, distinguishing between attacks ta...