[2604.04403] MolDA: Molecular Understanding and Generation via Large Language Diffusion Model
Computer Science > Artificial Intelligence
arXiv:2604.04403 (cs)
[Submitted on 6 Apr 2026]

Title: MolDA: Molecular Understanding and Generation via Large Language Diffusion Model
Authors: Seohyeon Shin, HanJun Choi, Jun-Hyung Park, Hongkook Kim, Mansu Kim

Abstract: Large Language Models (LLMs) have significantly advanced molecular discovery, but existing multimodal molecular architectures fundamentally rely on autoregressive (AR) backbones. This strict left-to-right inductive bias is sub-optimal for generating chemically valid molecules, as it struggles to account for non-local global constraints (e.g., ring closures) and often accumulates structural errors during sequential generation. To address these limitations, we propose MolDA (Molecular language model with masked Diffusion with mAsking), a novel multimodal framework that replaces the conventional AR backbone with a discrete Large Language Diffusion Model. MolDA extracts comprehensive structural representations using a hybrid graph encoder, which captures both local and global topologies, and aligns them into the language token space via a Q-Former. Furthermore, we mathematically reformulate Molecular Structure Preference Optimization specifically for the masked diffusion. Through bidirectional iterative denoising, MolDA ensures global structur...
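The "bidirectional iterative denoising" the abstract describes can be illustrated with a minimal sketch. This is not the authors' implementation: the vocabulary, the confidence-based unmasking schedule, and the `toy_denoiser` stand-in for the diffusion language model are all illustrative assumptions. The point is the control flow, where generation starts from a fully masked sequence and each step commits only the highest-confidence tokens while the rest remain masked and are revisited with full bidirectional context.

```python
import random

MASK = "[MASK]"
VOCAB = ["C", "N", "O", "(", ")", "=", "1"]  # toy SMILES-like token set (assumption)

def toy_denoiser(seq):
    # Stand-in for the masked diffusion LM: for every masked position,
    # propose a token and a confidence score, conditioning (in the real
    # model) on the entire sequence in both directions at once.
    return {i: (random.choice(VOCAB), random.random())
            for i, tok in enumerate(seq) if tok == MASK}

def iterative_denoise(length=8, steps=4, seed=0):
    random.seed(seed)
    seq = [MASK] * length  # start from an all-mask sequence
    for step in range(steps):
        proposals = toy_denoiser(seq)
        if not proposals:
            break
        # Commit roughly an equal share of the remaining masks per step,
        # keeping the highest-confidence proposals; low-confidence
        # positions stay masked and can be repaired on a later pass.
        k = max(1, len(proposals) // (steps - step))
        keep = sorted(proposals, key=lambda i: proposals[i][1], reverse=True)[:k]
        for i in keep:
            seq[i] = proposals[i][0]
    return seq

print("".join(iterative_denoise()))
```

Unlike left-to-right AR decoding, every unmasking decision here sees the whole sequence, which is why this family of models can, in principle, respect non-local constraints such as ring closures.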