[2506.09067] Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations

[2506.09067] Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations

arXiv - AI 3 min read

About this article

Abstract page for arXiv paper 2506.09067: Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations

Computer Science > Computer Vision and Pattern Recognition arXiv:2506.09067 (cs) [Submitted on 8 Jun 2025 (v1), last revised 9 Apr 2026 (this version, v2)] Title:Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations Authors:Zhiyu Xue, Reza Abbasi-Asl, Ramtin Pedarsani View a PDF of the paper titled Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations, by Zhiyu Xue and 2 other authors View PDF HTML (experimental) Abstract:Generative medical vision-language models~(Med-VLMs) are primarily designed to generate complex textual information~(e.g., diagnostic reports) from multimodal inputs including vision modality~(e.g., medical images) and language modality~(e.g., clinical queries). However, their security vulnerabilities remain underexplored. Med-VLMs should be capable of rejecting harmful queries, such as \textit{Provide detailed instructions for using this CT scan for insurance fraud}. At the same time, addressing security concerns introduces the risk of over-defense, where safety-enhancing mechanisms may degrade general performance, causing Med-VLMs to reject benign clinical queries. In this paper, we propose a novel inference-time defense strategy to mitigate harmful queries, enabling defense against visual and textual jailbreak attacks. Using diverse medical imaging datasets collected from nine modalities, we demonstrate that our defense strategy based on synthetic clinical demonstrations enhances model safety wi...

Originally published on April 13, 2026. Curated by AI News.

Related Articles

Llms

I am not an "anti" like this guy, but still an interesting video of person interacting with chat 4o

(Posting Here because removed by Chatgpt Complaints moderators because the model here is 4o, and refuse to believe there were any safety ...

Reddit - Artificial Intelligence · 1 min ·
Llms

We built a way for two people's AI context to talk to each other (without sharing their conversations)

We've been thinking about how we use AI in our relationships. Big part of it is about other people. Talking about them, figuring out what...

Reddit - Artificial Intelligence · 1 min ·
No flattery please, Claude: I’m British | Brief letters
Llms

No flattery please, Claude: I’m British | Brief letters

AI Tools & Products · 2 min ·
Llms

Unsolved AI Mystery Is Solved Along With Lessons Learned On Why ChatGPT Became Oddly Obsessed With Gremlins And Goblins

This article discusses the resolution of an AI mystery regarding ChatGPT's unusual focus on gremlins and goblins, along with insights gai...

AI Tools & Products · 1 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime