[2601.08011] TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models
Computer Science > Computer Vision and Pattern Recognition
arXiv:2601.08011 (cs)
[Submitted on 12 Jan 2026 (v1), last revised 1 Mar 2026 (this version, v4)]

Title: TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models
Authors: Xin Jin, Yichuan Zhong, Yapeng Tian

Abstract: Current text-conditioned diffusion editors handle single-object replacement well but struggle when a new object and a new style must be introduced simultaneously. We present Twin-Prompt Attention Blend (TP-Blend), a lightweight, training-free framework that receives two separate textual prompts, one specifying a blend object and the other defining a target style, and injects both into a single denoising trajectory. TP-Blend is driven by two complementary attention processors. Cross-Attention Object Fusion (CAOF) first averages head-wise attention to locate spatial tokens that respond strongly to either prompt, then solves an entropy-regularised optimal transport problem that reassigns complete multi-head feature vectors to those positions. CAOF updates feature vectors at the full combined dimensionality of all heads (e.g., 640 dimensions in SD-XL), preserving rich cross-head correlations while keeping memory low. Self-Attention Style Fusion (SASF) injects style at every self-attent...
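The CAOF step described in the abstract, locating strongly responding spatial tokens from head-averaged attention and then reassigning complete multi-head feature vectors via entropy-regularised optimal transport, can be sketched with a standard Sinkhorn iteration. This is a minimal illustration under assumed shapes and names (`sinkhorn`, `caof_reassign`, the threshold rule, and the barycentric projection are all illustrative choices, not the paper's implementation):

```python
import numpy as np

def sinkhorn(cost, eps=0.05, iters=200):
    """Entropy-regularised OT plan between uniform marginals (Sinkhorn)."""
    n, m = cost.shape
    K = np.exp(-cost / eps)                 # Gibbs kernel
    a, b = np.ones(n) / n, np.ones(m) / m   # uniform source/target masses
    u, v = np.ones(n), np.ones(m)
    for _ in range(iters):                  # alternating scaling updates
        u = a / (K @ v)
        v = b / (K.T @ u)
    return u[:, None] * K * v[None, :]      # transport plan P, sums to 1

def caof_reassign(feats_src, feats_tgt, attn_a, attn_b, thresh=0.5):
    """Reassign full multi-head feature vectors (CAOF-style sketch).

    feats_src: (N, D) spatial tokens at combined head dimensionality D
    feats_tgt: (M, D) prompt-conditioned features to transport in
    attn_a/attn_b: (N,) head-averaged responses to the two prompts
    """
    # Mask of tokens responding strongly to either prompt (assumed rule).
    resp = np.maximum(attn_a, attn_b)
    idx = np.where(resp > thresh * resp.max())[0]
    # Cost: squared distance between complete D-dim feature vectors,
    # so cross-head correlations enter the matching jointly.
    diff = feats_src[idx, None, :] - feats_tgt[None, :, :]
    P = sinkhorn((diff ** 2).sum(-1))
    # Barycentric projection of the plan gives the reassigned features.
    out = feats_src.copy()
    out[idx] = (P / P.sum(1, keepdims=True)) @ feats_tgt
    return out, idx
```

Only the masked positions are overwritten; all other spatial tokens keep their original feature vectors, which matches the abstract's claim of localized, memory-light updates at the full combined head dimensionality.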