[2604.04488] A Patch-based Cross-view Regularized Framework for Backdoor Defense in Multimodal Large Language Models
Computer Science > Computer Vision and Pattern Recognition

arXiv:2604.04488 (cs) [Submitted on 6 Apr 2026]

Title: A Patch-based Cross-view Regularized Framework for Backdoor Defense in Multimodal Large Language Models

Authors: Tianmeng Fang, Yong Wang, Zetai Kong, Zengzhen Su, Jun Wang, Chengjin Yu, Wei Wang

Abstract: Multimodal large language models have become important infrastructure for the unified processing of visual and linguistic tasks. However, such models are highly susceptible to backdoor implantation during supervised fine-tuning: once a specific trigger pattern is activated, they reliably output the attacker's predefined harmful responses. The core challenge of backdoor defense lies in suppressing attack success under low poisoning ratios while preserving the model's normal generation ability. These two objectives are inherently in tension: strong suppression often degrades benign performance, whereas weak regularization fails to mitigate backdoor behaviors. To this end, we propose a unified defense framework based on patch augmentation and cross-view regularization, which constrains the model's anomalous behaviors in response to triggered patterns at both the feature-representation and output-distribution levels. Specifically, patch-level data augmentation...
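The abstract gives only the high-level idea: perturb inputs at the patch level, then penalize disagreement between the model's views of the clean and augmented input in both feature space and output space. The sketch below is a hypothetical illustration of that idea, not the paper's implementation; the patch-shuffle augmentation, the symmetric-KL output term, the L2 feature term, and the weight `lam` are all assumptions made for the example.

```python
import numpy as np

def patch_shuffle(image, grid=4, rng=None):
    """Patch-level augmentation (hypothetical stand-in for the paper's
    scheme): split a square image into a grid x grid tiling and randomly
    permute the tiles, which disrupts localized trigger patches while
    keeping the overall pixel statistics intact."""
    rng = rng if rng is not None else np.random.default_rng(0)
    h, w = image.shape
    ph, pw = h // grid, w // grid
    tiles = [image[i * ph:(i + 1) * ph, j * pw:(j + 1) * pw]
             for i in range(grid) for j in range(grid)]
    order = rng.permutation(len(tiles))
    rows = [np.hstack([tiles[k] for k in order[r * grid:(r + 1) * grid]])
            for r in range(grid)]
    return np.vstack(rows)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def cross_view_loss(logits_a, logits_b, feat_a, feat_b, lam=1.0):
    """Cross-view regularizer sketch: symmetric KL divergence between the
    two views' output distributions (output-distribution level) plus an
    L2 alignment term between their features (feature-representation
    level). `lam` balances the two terms; both are zero when the model
    treats the clean and augmented views identically."""
    p, q = softmax(logits_a), softmax(logits_b)
    kl = np.sum(p * np.log(p / q)) + np.sum(q * np.log(q / p))
    feat = np.mean((feat_a - feat_b) ** 2)
    return kl + lam * feat
```

On a backdoored model, a trigger patch makes the clean and patch-shuffled views diverge sharply, so this loss is large exactly on poisoned samples while staying near zero on benign ones, which is one plausible reading of how a cross-view penalty can suppress the backdoor without harming normal generation.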