[2603.22714] PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset
About this article
Abstract page for arXiv paper 2603.22714: PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset
Computer Science > Computers and Society arXiv:2603.22714 (cs) [Submitted on 24 Mar 2026] Title:PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset Authors:Sumin Yu, Juhyeon Park, Taesup Moon View a PDF of the paper titled PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset, by Sumin Yu and 2 other authors View PDF Abstract:We present PopResume, a population-representative resume dataset for causal fairness auditing of LLM- and VLM-based resume screening systems. Unlike existing benchmarks that rely on manually injected demographic information and outcome-level disparities, PopResume is grounded in population statistics and preserves natural attribute relationships, enabling path-specific effect (PSE)-based fairness evaluation. We decompose the effect of a protected attribute on resume scores into two paths: the business necessity path, mediated by job-relevant qualifications, and the redlining path, mediated by demographic proxies. This distinction allows auditors to separate legally permissible from impermissible sources of disparity. Evaluating four LLMs and four VLMs on PopResume's 60.8K resumes across five occupations, we identify five representative discrimination patterns that aggregate metrics fail to capture. Our results demonstrate that PSE-based evaluation reveals fairness issues masked by outcome-level measures, underscoring the need for causally-grounded a...