[2509.13471] An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software

arXiv - AI March 05, 2026 4 min read

About this article

Abstract page for arXiv paper 2509.13471: An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software

Computer Science > Software Engineering

arXiv:2509.13471 (cs) [Submitted on 16 Sep 2025 (v1), last revised 4 Mar 2026 (this version, v2)]

Title: An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software

Authors: Sina Gogani-Khiabani (University of Illinois Chicago), Ashutosh Trivedi (University of Colorado Boulder), Diptikalyan Saha (IBM Research), Saeid Tizpaz-Niari (University of Illinois Chicago)

Abstract: Large language models (LLMs) show promise for translating natural-language statutes into executable logic, but reliability in legally critical settings remains challenging due to ambiguity and hallucinations. We present an agentic approach for developing legal-critical software, using U.S. federal tax preparation as a case study. The key challenge is test-case generation under the oracle problem, where correct outputs require interpreting law. Building on metamorphic testing, we introduce higher-order metamorphic relations that compare system outputs across structured shifts among similar individuals. Because authoring such relations is tedious and error-prone, we use an LLM-driven, role-based framework to automate test generation and code synthesis. We implement a multi-agent system that translates tax code into execut...
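To make the testing idea in the abstract concrete, here is a minimal Python sketch of metamorphic testing without an oracle. It is an illustration under stated assumptions, not the authors' implementation: the bracket schedule, the deduction, the `Taxpayer` type, and every function name are hypothetical and do not reflect real tax law. The "higher-order" check shown is one possible reading of the abstract's phrase: apply the same income shift to two similar taxpayers and compare the resulting changes in tax, rather than the tax amounts themselves.

```python
from dataclasses import dataclass, replace

# Hypothetical schedule invented for illustration; NOT real tax law.
# Each entry is (lower bound of taxable income, marginal rate in percent).
BRACKETS = [(0, 10), (10_000, 20)]
STANDARD_DEDUCTION = 5_000


@dataclass(frozen=True)
class Taxpayer:
    income: int


def toy_tax(tp: Taxpayer) -> float:
    """Progressive tax: each bracket's rate applies only to income inside it."""
    taxable = max(0, tp.income - STANDARD_DEDUCTION)
    tax = 0.0
    for i, (lower, rate_pct) in enumerate(BRACKETS):
        upper = BRACKETS[i + 1][0] if i + 1 < len(BRACKETS) else float("inf")
        if taxable > lower:
            tax += (min(taxable, upper) - lower) * rate_pct / 100
    return tax


def cliff_tax(tp: Taxpayer) -> float:
    """Deliberately buggy variant: the top rate is applied to ALL taxable income."""
    taxable = max(0, tp.income - STANDARD_DEDUCTION)
    rate_pct = 10 if taxable <= 10_000 else 20
    return taxable * rate_pct / 100


def shifted(tp: Taxpayer, delta: int) -> Taxpayer:
    """Structured shift: the same taxpayer with `delta` more income."""
    return replace(tp, income=tp.income + delta)


def first_order_mr(system, tp: Taxpayer, delta: int) -> bool:
    """Raising income must never lower the tax owed."""
    return system(shifted(tp, delta)) >= system(tp)


def higher_order_mr(system, low: Taxpayer, high: Taxpayer, delta: int) -> bool:
    """Under a progressive schedule, the same raise should cost the higher
    earner at least as much extra tax as it costs the lower earner."""
    gain_low = system(shifted(low, delta)) - system(low)
    gain_high = system(shifted(high, delta)) - system(high)
    return gain_high >= gain_low - 1e-9  # small tolerance for float arithmetic


low, high = Taxpayer(income=14_000), Taxpayer(income=16_000)
for system in (toy_tax, cliff_tax):
    print(system.__name__,
          "first-order:", first_order_mr(system, low, 2_000),
          "higher-order:", higher_order_mr(system, low, high, 2_000))
```

Neither check needs to know the correct tax for any individual, which is the point of metamorphic testing under the oracle problem. In this toy setup the buggy `cliff_tax` passes the first-order relation and fails only the higher-order one, which suggests why comparing shifts across similar individuals can expose errors that single-individual relations miss.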

Originally published on March 05, 2026. Curated by AI News.
