[2602.14285] FMMD: A multimodal open peer review dataset based on F1000Research

[2602.14285] FMMD: A multimodal open peer review dataset based on F1000Research

arXiv - AI 4 min read Article

Summary

The paper introduces FMMD, a multimodal open peer review dataset from F1000Research, addressing limitations in current datasets by integrating visual, structural, and reviewer data for enhanced analysis of the peer review process.

Why It Matters

FMMD is significant as it fills a critical gap in peer review research by providing a comprehensive dataset that includes visual and structural elements alongside reviewer comments. This enables more nuanced analyses of the peer review lifecycle, which is essential for improving automated scholarly review systems and understanding the evolution of scientific manuscripts.

Key Takeaways

  • FMMD integrates visual and structural data with reviewer comments.
  • It addresses the limitations of existing text-centric peer review datasets.
  • The dataset supports tasks like multimodal issue detection and comment generation.
  • FMMD enhances the understanding of the peer review lifecycle across disciplines.
  • It provides a valuable resource for developing automated peer review systems.

Computer Science > Digital Libraries arXiv:2602.14285 (cs) [Submitted on 15 Feb 2026] Title:FMMD: A multimodal open peer review dataset based on F1000Research Authors:Zhenzhen Zhuang, Yuqing Fu, Jing Zhu, Zhangping Zhou, Jialiang Lin View a PDF of the paper titled FMMD: A multimodal open peer review dataset based on F1000Research, by Zhenzhen Zhuang and 4 other authors View PDF HTML (experimental) Abstract:Automated scholarly paper review (ASPR) has entered the coexistence phase with traditional peer review, where artificial intelligence (AI) systems are increasingly incorporated into real-world manuscript evaluation. In parallel, research on automated and AI-assisted peer review has proliferated. Despite this momentum, empirical progress remains constrained by several critical limitations in existing datasets. While reviewers routinely evaluate figures, tables, and complex layouts to assess scientific claims, most existing datasets remain overwhelmingly text-centric. This bias is reinforced by a narrow focus on data from computer science venues. Furthermore, these datasets lack precise alignment between reviewer comments and specific manuscript versions, obscuring the iterative relationship between peer review and manuscript evolution. In response, we introduce FMMD, a multimodal and multidisciplinary open peer review dataset curated from F1000Research. The dataset bridges the current gap by integrating manuscript-level visual and structural data with version-specific rev...

Related Articles

Machine Learning

Ml project user give dataset and I give best model [D] [P]

Tl,dr : suggest me a solution to create a ai ml project where user will give his dataset as input and the project should give best model ...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

[D] ICML 2026 Average Score

Hi all, I’m curious about the current review dynamics for ICML 2026, especially after the rebuttal phase. For those who are reviewers (or...

Reddit - Machine Learning · 1 min ·
Accelerating science with AI and simulations
Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
More in Data Science: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime