[2601.08011] TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models

[2601.08011] TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2601.08011: TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models

Computer Science > Computer Vision and Pattern Recognition arXiv:2601.08011 (cs) [Submitted on 12 Jan 2026 (v1), last revised 1 Mar 2026 (this version, v4)] Title:TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models Authors:Xin Jin, Yichuan Zhong, Yapeng Tian View a PDF of the paper titled TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models, by Xin Jin and 2 other authors View PDF HTML (experimental) Abstract:Current text-conditioned diffusion editors handle single object replacement well but struggle when a new object and a new style must be introduced simultaneously. We present Twin-Prompt Attention Blend (TP-Blend), a lightweight training-free framework that receives two separate textual prompts, one specifying a blend object and the other defining a target style, and injects both into a single denoising trajectory. TP-Blend is driven by two complementary attention processors. Cross-Attention Object Fusion (CAOF) first averages head-wise attention to locate spatial tokens that respond strongly to either prompt, then solves an entropy-regularised optimal transport problem that reassigns complete multi-head feature vectors to those positions. CAOF updates feature vectors at the full combined dimensionality of all heads (e.g., 640 dimensions in SD-XL), preserving rich cross-head correlations while keeping memory low. Self-Attention Style Fusion (SASF) injects style at every self-attent...

Originally published on March 03, 2026. Curated by AI News.

Related Articles

Machine Learning

[D] Got my first offer after months of searching — below posted range, contract-to-hire, and worried it may pause my search. Do I take it?

I could really use some outside perspective. I’m a senior ML/CV engineer in Canada with about 5–6 years across research and industry. Mas...

Reddit - Machine Learning · 1 min ·
Machine Learning

[Research] AI training is bad, so I started an research

Hello, I started researching about AI training Q:Why? R: Because AI training is bad right now. Q: What do you mean its bad? R: Like when ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] Unix philosophy for ML pipelines: modular, swappable stages with typed contracts

We built an open-source prototype that applies Unix philosophy to retrieval pipelines. Each stage (PII redaction, chunking, dedup, embedd...

Reddit - Machine Learning · 1 min ·
Machine Learning

Making an AI native sovereign computational stack

I’ve been working on a personal project that ended up becoming a kind of full computing stack: identity / trust protocol decentralized ch...

Reddit - Artificial Intelligence · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime