[2507.10610] LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
Computer Science > Cryptography and Security

arXiv:2507.10610 (cs)
[Submitted on 13 Jul 2025 (v1), last revised 31 Mar 2026 (this version, v2)]

Title: LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
Authors: Zihe Yan, Zhuosheng Zhang, Jiaping Gui, Gongshen Liu

Abstract: Graphical user interface (GUI) agents built on multimodal large language models (MLLMs) have recently demonstrated strong decision-making abilities in screen-based interaction tasks. However, they remain highly vulnerable to pop-up-based environmental injection attacks, where malicious visual elements divert model attention and lead to unsafe or incorrect actions. Existing defense methods either require costly retraining or perform poorly under inductive interference. In this work, we systematically study how such attacks alter the attention behavior of GUI agents and uncover a layer-wise attention divergence pattern between correct and incorrect outputs. Based on this insight, we propose LaSM, a Layer-wise Scaling Mechanism that selectively amplifies attention and MLP modules in critical layers. LaSM improves the alignment between model saliency and task-relevant regions without additional training. Extensive experiments across multiple datasets demonstrate that our method significantly improves ...
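The core idea in the abstract, selectively amplifying the outputs of attention and MLP modules in a chosen set of "critical" layers while leaving the rest of the forward pass untouched, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy layer, the `alpha` scaling factor, and the `critical_layers` set are all illustrative stand-ins for whatever module outputs and layer indices the actual method scales.

```python
import numpy as np

def layer_forward(x, attn_w, mlp_w, scale=1.0):
    # Toy transformer layer: `scale` amplifies the attention and MLP
    # contributions before the residual add. This mimics a layer-wise
    # scaling mechanism in spirit only (hypothetical sketch).
    attn_out = x @ attn_w          # stand-in for a self-attention block
    mlp_out = np.tanh(x @ mlp_w)   # stand-in for an MLP block
    return x + scale * (attn_out + mlp_out)

def forward(x, layers, critical_layers, alpha=1.5):
    # Amplify only the layers flagged as critical; all other layers
    # use the default scale of 1.0, so no retraining is involved --
    # the modification is purely a test-time rescaling.
    for i, (attn_w, mlp_w) in enumerate(layers):
        s = alpha if i in critical_layers else 1.0
        x = layer_forward(x, attn_w, mlp_w, scale=s)
    return x

# Example: a 3-layer toy model where only layer 1 is amplified.
rng = np.random.default_rng(0)
layers = [(0.1 * rng.standard_normal((4, 4)),
           0.1 * rng.standard_normal((4, 4))) for _ in range(3)]
x = np.ones((1, 4))
baseline = forward(x, layers, critical_layers=set())
scaled = forward(x, layers, critical_layers={1}, alpha=2.0)
```

Because only the chosen layers are rescaled, the intervention is training-free, consistent with the paper's claim that saliency alignment improves without additional fine-tuning.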