[2604.02372] Backdoor Attacks on Decentralised Post-Training
About this article
Abstract page for arXiv paper 2604.02372: Backdoor Attacks on Decentralised Post-Training
Computer Science > Cryptography and Security arXiv:2604.02372 (cs) [Submitted on 31 Mar 2026] Title:Backdoor Attacks on Decentralised Post-Training Authors:Oğuzhan Ersoy, Nikolay Blagoev, Jona te Lintelo, Stefanos Koffas, Marina Krček, Stjepan Picek View a PDF of the paper titled Backdoor Attacks on Decentralised Post-Training, by O\u{g}uzhan Ersoy and 4 other authors View PDF HTML (experimental) Abstract:Decentralised post-training of large language models utilises data and pipeline parallelism techniques to split the data and the model. Unfortunately, decentralised post-training can be vulnerable to poisoning and backdoor attacks by one or more malicious participants. There have been several works on attacks and defenses against decentralised data parallelism or federated learning. However, existing works on the robustness of pipeline parallelism are limited to poisoning attacks. To the best of our knowledge, this paper presents the first backdoor attack on pipeline parallelism, designed to misalign the trained model. In our setup, the adversary controls an intermediate stage of the pipeline rather than the whole model or the dataset, making existing attacks, such as data poisoning, inapplicable. Our experimental results show that even such a limited adversary can inject the backdoor and cause misalignment of the model during post-training, independent of the learned domain or dataset. With our attack, the inclusion of the trigger word reduces the alignment percentage fr...