[2602.14030] MC$^2$Mark: Distortion-Free Multi-Bit Watermarking for Long Messages

[2602.14030] MC$^2$Mark: Distortion-Free Multi-Bit Watermarking for Long Messages

arXiv - Machine Learning 3 min read Article

Summary

MC$^2$Mark introduces a novel watermarking framework that ensures reliable embedding of long messages in generated text while maintaining quality, addressing the growing need for provenance tracing in AI-generated content.

Why It Matters

As large language models produce increasingly human-like text, the ability to trace the origin and authenticity of this content becomes crucial. MC$^2$Mark offers a solution to effectively embed identifiers without compromising text quality, enhancing content integrity in various applications.

Key Takeaways

  • MC$^2$Mark utilizes Multi-Channel Colored Reweighting for effective watermarking.
  • The framework maintains text quality while embedding long messages.
  • Experiments show significant improvements in detectability and robustness over existing methods.

Computer Science > Cryptography and Security arXiv:2602.14030 (cs) [Submitted on 15 Feb 2026] Title:MC$^2$Mark: Distortion-Free Multi-Bit Watermarking for Long Messages Authors:Xuehao Cui, Ruibo Chen, Yihan Wu, Heng Huang View a PDF of the paper titled MC$^2$Mark: Distortion-Free Multi-Bit Watermarking for Long Messages, by Xuehao Cui and 3 other authors View PDF Abstract:Large language models now produce text indistinguishable from human writing, which increases the need for reliable provenance tracing. Multi-bit watermarking can embed identifiers into generated text, but existing methods struggle to keep both text quality and watermark strength while carrying long messages. We propose MC$^2$Mark, a distortion-free multi-bit watermarking framework designed for reliable embedding and decoding of long messages. Our key technical idea is Multi-Channel Colored Reweighting, which encodes bits through structured token reweighting while keeping the token distribution unbiased, together with Multi-Layer Sequential Reweighting to strengthen the watermark signal and an evidence-accumulation detector for message recovery. Experiments show that MC$^2$Mark improves detectability and robustness over prior multi-bit watermarking methods while preserving generation quality, achieving near-perfect accuracy for short messages and exceeding the second-best method by nearly 30% for long messages. Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG) Cite as: arXiv:2602.14030 ...

Related Articles

How to use the new ChatGPT app integrations, including DoorDash, Spotify, Uber, and others | TechCrunch
Llms

How to use the new ChatGPT app integrations, including DoorDash, Spotify, Uber, and others | TechCrunch

Learn how to use Spotify, Canva, Figma, Expedia, and other apps directly in ChatGPT.

TechCrunch - AI · 10 min ·
Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto
Llms

Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto

AI Tools & Products · 7 min ·
Is cutting ‘please’ when talking to ChatGPT better for the planet? An expert explains
Llms

Is cutting ‘please’ when talking to ChatGPT better for the planet? An expert explains

AI Tools & Products · 5 min ·
AI Desktop 98 lets you chat with Claude, ChatGPT, and Gemini through a Windows 98-inspired interface
Llms

AI Desktop 98 lets you chat with Claude, ChatGPT, and Gemini through a Windows 98-inspired interface

AI Tools & Products · 3 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime