[2603.03352] Perfect score on IPhO 2025 theory by Gemini agent

[2603.03352] Perfect score on IPhO 2025 theory by Gemini agent

arXiv - AI 3 min read

About this article

Abstract page for arXiv paper 2603.03352: Perfect score on IPhO 2025 theory by Gemini agent

Physics > Physics Education arXiv:2603.03352 (physics) [Submitted on 26 Feb 2026] Title:Perfect score on IPhO 2025 theory by Gemini agent Authors:Yichen Huang View a PDF of the paper titled Perfect score on IPhO 2025 theory by Gemini agent, by Yichen Huang View PDF HTML (experimental) Abstract:The International Physics Olympiad (IPhO) is the world's most prestigious and renowned physics competition for pre-university students. IPhO problems require complex reasoning based on deep understanding of physical principles in a standard general physics curriculum. On IPhO 2025 theory problems, while gold medal performance by AI models was reported previously, it falls behind the best human contestant. Here we build a simple agent with Gemini 3.1 Pro Preview. We run it five times and it achieved a perfect score every time. However, data contamination could occur because Gemini 3.1 Pro Preview was released after the competition. Subjects: Physics Education (physics.ed-ph); Artificial Intelligence (cs.AI) Cite as: arXiv:2603.03352 [physics.ed-ph]   (or arXiv:2603.03352v1 [physics.ed-ph] for this version)   https://doi.org/10.48550/arXiv.2603.03352 Focus to learn more arXiv-issued DOI via DataCite (pending registration) Submission history From: Yichen Huang [view email] [v1] Thu, 26 Feb 2026 18:53:05 UTC (833 KB) Full-text links: Access Paper: View a PDF of the paper titled Perfect score on IPhO 2025 theory by Gemini agent, by Yichen HuangView PDFHTML (experimental)TeX Source view li...

Originally published on March 05, 2026. Curated by AI News.

Related Articles

Llms

What's your "When Language Model AI can do X, I'll be impressed"?

I have two at the top of my mind: When it can read musical notes. I will be mildly impressed when I can paste in a picture of musical not...

Reddit - Artificial Intelligence · 1 min ·
Google’s Gemini AI can answer your questions with 3D models and simulations
Llms

Google’s Gemini AI can answer your questions with 3D models and simulations

Google's latest upgrade for Gemini will allow the chatbot to generate interactive 3D models and simulations in response to your questions...

The Verge - AI · 4 min ·
Moody’s Integrates AI Agents With Anthropic’s Claude
Llms

Moody’s Integrates AI Agents With Anthropic’s Claude

AI Tools & Products · 4 min ·
AI on the couch: Anthropic gives Claude 20 hours of psychiatry
Llms

AI on the couch: Anthropic gives Claude 20 hours of psychiatry

AI Tools & Products · 6 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime