[2601.01331] AppellateGen: A Benchmark for Appellate Legal Judgment Generation
About this article
Abstract page for arXiv paper 2601.01331: AppellateGen: A Benchmark for Appellate Legal Judgment Generation
Computer Science > Computers and Society arXiv:2601.01331 (cs) [Submitted on 4 Jan 2026 (v1), last revised 29 Mar 2026 (this version, v3)] Title:AppellateGen: A Benchmark for Appellate Legal Judgment Generation Authors:Hongkun Yang, Lionel Z. Wang, Wei Fan, Yiran Hu, Lixu Wang, Chenyu Liu, Yu Zeng, Shenghong Fu, Lei Gong, Zhengxin Zhang, Haoyang Li, Jiexin Zheng, Xin Xu View a PDF of the paper titled AppellateGen: A Benchmark for Appellate Legal Judgment Generation, by Hongkun Yang and 12 other authors View PDF HTML (experimental) Abstract:Legal judgment generation is a critical task in legal intelligence. However, existing research in legal judgment generation has predominantly focused on first-instance trials, relying on static fact-to-verdict mappings while neglecting the dialectical nature of appellate (second-instance) review. To address this, we introduce AppellateGen, a benchmark for second-instance legal judgment generation comprising 7,351 case pairs. The task requires models to draft legally binding judgments by reasoning over the initial verdict and evidentiary updates, thereby modeling the causal dependency between trial stages. We further propose a judicial Standard Operating Procedure (SOP)-based Legal Multi-Agent System (SLMAS) to simulate judicial workflows, which decomposes the generation process into discrete stages of issue identification, retrieval, and drafting. Experimental results indicate that while SLMAS improves logical consistency, the complexity...