[2604.03476] Fine-tuning DeepSeek-OCR-2 for Molecular Structure

[2604.03476] Fine-tuning DeepSeek-OCR-2 for Molecular Structure Recognition

arXiv - AI April 07, 2026 3 min read

About this article

Abstract page for arXiv paper 2604.03476: Fine-tuning DeepSeek-OCR-2 for Molecular Structure Recognition

Computer Science > Computer Vision and Pattern Recognition arXiv:2604.03476 (cs) [Submitted on 3 Apr 2026] Title:Fine-tuning DeepSeek-OCR-2 for Molecular Structure Recognition Authors:Haocheng Tang, Xingyu Dang, Junmei Wang View a PDF of the paper titled Fine-tuning DeepSeek-OCR-2 for Molecular Structure Recognition, by Haocheng Tang and Xingyu Dang and Junmei Wang View PDF HTML (experimental) Abstract:Optical Chemical Structure Recognition (OCSR) is critical for converting 2D molecular diagrams from printed literature into machine-readable formats. While Vision-Language Models have shown promise in end-to-end OCR tasks, their direct application to OCSR remains challenging, and direct full-parameter supervised fine-tuning often fails. In this work, we adapt DeepSeek-OCR-2 for molecular optical recognition by formulating the task as image-conditioned SMILES generation. To overcome training instabilities, we propose a two-stage progressive supervised fine-tuning strategy: starting with parameter-efficient LoRA and transitioning to selective full-parameter fine-tuning with split learning rates. We train our model on a large-scale corpus combining synthetic renderings from PubChem and realistic patent images from USPTO-MOL to improve coverage and robustness. Our fine-tuned model, MolSeek-OCR, demonstrates competitive capabilities, achieving exact matching accuracies comparable to the best-performing image-to-sequence model. However, it remains inferior to state-of-the-art imag...

Originally published on April 07, 2026. Curated by AI News.

Llms

I built a solo AI platform from Algeria with no funding, no team and no ad spend - here's what's inside it after 2 months

Hello, 20 years old here just got into the Ai platform and launched this last two weeks and here is what I have on it so far. - Latest Ai...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

USF murder suspect accused of using ChatGPT to research cover-up, prosecutors say

Days after the remains of one of the two missing University of South Florida doctoral students were found, prosecutors say the suspect ma...

AI Tools & Products · 3 min · about 4 hours ago

Llms

Anthropic’s Claude AI deletes PocketOS production database

Claude AI deleted PocketOS's production database, but the market for Claude 4.7 release by May 31 remains at 100% YES.

AI Tools & Products · 3 min · about 4 hours ago

Llms

Claude-powered AI coding agent deletes entire company database in 9 seconds

The founder of PocketOS has penned a social media post to warn others about the “systemic failures” of flagship AI and digital services p...

AI Tools & Products · 1 min · about 4 hours ago

[2604.03476] Fine-tuning DeepSeek-OCR-2 for Molecular Structure Recognition

About this article

Related Articles

I built a solo AI platform from Algeria with no funding, no team and no ad spend - here's what's inside it after 2 months

USF murder suspect accused of using ChatGPT to research cover-up, prosecutors say

Anthropic’s Claude AI deletes PocketOS production database

Claude-powered AI coding agent deletes entire company database in 9 seconds

No comments

Stay updated with AI News