[2603.28130] MDPBench: A Benchmark for Multilingual Document Parsing

[2603.28130] MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

arXiv - AI March 31, 2026 4 min read

About this article

Abstract page for arXiv paper 2603.28130: MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Computer Science > Computer Vision and Pattern Recognition arXiv:2603.28130 (cs) [Submitted on 30 Mar 2026] Title:MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios Authors:Zhang Li, Zhibo Lin, Qiang Liu, Ziyang Zhang, Shuo Zhang, Zidun Guo, Jiajun Song, Jiarui Zhang, Xiang Bai, Yuliang Liu View a PDF of the paper titled MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios, by Zhang Li and 9 other authors View PDF HTML (experimental) Abstract:We introduce Multilingual Document Parsing Benchmark, the first benchmark for multilingual digital and photographed document parsing. Document parsing has made remarkable strides, yet almost exclusively on clean, digital, well-formatted pages in a handful of dominant languages. No systematic benchmark exists to evaluate how models perform on digital and photographed documents across diverse scripts and low-resource languages. MDPBench comprises 3,400 document images spanning 17 languages, diverse scripts, and varied photographic conditions, with high-quality annotations produced through a rigorous pipeline of expert model labeling, manual correction, and human verification. To ensure fair comparison and prevent data leakage, we maintain separate public and private evaluation splits. Our comprehensive evaluation of both open-source and closed-source models uncovers a striking finding: while closed-source models (notably Gemini3-Pro) prove relatively robust, open-source alternative...

Originally published on March 31, 2026. Curated by AI News.

Machine Learning

[R], 31 MILLIONS High frequency data, Light GBM worked perfectly

We just published a paper on predicting adverse selection in high-frequency crypto markets using LightGBM, and I wanted to share it here ...

Reddit - Machine Learning · 1 min · 30 minutes ago

Machine Learning

[D] Those of you with 10+ years in ML — what is the public completely wrong about?

For those of you who've been in ML/AI research or applied ML for 10+ years — what's the gap between what the public thinks AI is doing vs...

Reddit - Machine Learning · 1 min · 30 minutes ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 3 hours ago

Machine Learning

AI assistants are optimized to seem helpful. That is not the same thing as being helpful.

RLHF trains models on human feedback. Humans rate responses they like. And it turns out humans consistently rate confident, fluent, agree...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

[2603.28130] MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

About this article

Related Articles

[R], 31 MILLIONS High frequency data, Light GBM worked perfectly

[D] Those of you with 10+ years in ML — what is the public completely wrong about?

UMKC Announces New Master of Science in Artificial Intelligence

AI assistants are optimized to seem helpful. That is not the same thing as being helpful.

No comments

Stay updated with AI News