[2505.06046] Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

arXiv - Machine Learning March 05, 2026 4 min read

About this article

Computer Science > Computation and Language

arXiv:2505.06046 (cs)

[Submitted on 9 May 2025 (v1), last revised 4 Mar 2026 (this version, v3)]

Title: Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

Authors: Joshua Harris, Fan Grayson, Felix Feldman, Timothy Laurence, Toby Nonnenmacher, Oliver Higgins, Leo Loman, Selina Patel, Thomas Finnie, Samuel Collins, Michael Borowitz

Abstract: As Large Language Models (LLMs) become widely accessible, a detailed understanding of their knowledge within specific domains becomes necessary for successful real world use. This is particularly critical in the domains of medicine and public health, where failure to retrieve relevant, accurate, and current information could significantly impact UK residents. However, while there are a number of LLM benchmarks in the medical domain, currently little is known about LLM knowledge within the field of public health. To address this issue, this paper introduces a new benchmark, PubHealthBench, with over 8000 questions for evaluating LLMs' Multiple Choice Question Answering (MCQA) and free form responses to public health queries. To create PubHealthBench we extract free text from 687 current UK government guidance documents and implement an automated pipeline for generating MCQA samples. Ass...
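To make the MCQA evaluation mentioned in the abstract more concrete, here is a minimal Python sketch of how multiple-choice accuracy might be scored. It is an illustration only, not the paper's pipeline: the MCQASample structure, the extract_choice and accuracy helpers, and the placeholder questions are all hypothetical, and the actual PubHealthBench data format and answer-parsing rules are not given in this excerpt.

```python
import re
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class MCQASample:
    """Hypothetical container for one multiple-choice item (not the paper's schema)."""
    question: str
    options: Dict[str, str]  # option letter -> option text
    answer: str              # gold option letter


def extract_choice(response: str, valid_letters: str) -> Optional[str]:
    """Return the first standalone valid option letter in a model response.

    Simplified parser: a response that opens with the article "A" would be
    misread as choosing option A, so real evaluations usually constrain the
    output format more tightly.
    """
    match = re.search(rf"\b([{valid_letters}])\b", response)
    return match.group(1) if match else None


def accuracy(samples: List[MCQASample], responses: List[str]) -> float:
    """Fraction of responses whose extracted letter equals the gold letter."""
    if len(samples) != len(responses):
        raise ValueError("samples and responses must have the same length")
    if not samples:
        return 0.0
    correct = sum(
        extract_choice(resp, "".join(sample.options)) == sample.answer
        for sample, resp in zip(samples, responses)
    )
    return correct / len(samples)


# Placeholder items, invented for illustration (not drawn from PubHealthBench).
demo_samples = [
    MCQASample("Placeholder question 1?", {"A": "x", "B": "y", "C": "z", "D": "w"}, "B"),
    MCQASample("Placeholder question 2?", {"A": "x", "B": "y", "C": "z", "D": "w"}, "A"),
]
demo_responses = ["The answer is B.", "C"]
demo_accuracy = accuracy(demo_samples, demo_responses)
```

Unparseable responses return None from extract_choice and therefore count as incorrect, which is one common convention; a benchmark could equally report them separately.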

Originally published on March 05, 2026. Curated by AI News.

