[2603.04636] When Agents Persuade: Propaganda Generation and Mitigation in LLMs
Computer Science > Artificial Intelligence

arXiv:2603.04636 (cs)
[Submitted on 4 Mar 2026]

Title: When Agents Persuade: Propaganda Generation and Mitigation in LLMs
Authors: Julia Jose, Ritik Roongta, Rachel Greenstadt

Abstract: Despite their wide-ranging benefits, LLM-based agents deployed in open environments can be exploited to produce manipulative material. In this study, we task LLMs with propaganda objectives and analyze their outputs using two domain-specific models: one that classifies text as propaganda or non-propaganda, and another that detects rhetorical techniques of propaganda (e.g., loaded language, appeals to fear, flag-waving, name-calling). Our findings show that, when prompted, LLMs exhibit propagandistic behaviors and use a variety of rhetorical techniques in doing so. We also explore mitigation via Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Odds Ratio Preference Optimization (ORPO). We find that fine-tuning significantly reduces their tendency to generate such content, with ORPO proving most effective.

Subjects: Artificial Intelligence (cs.AI)
Cite as: arXiv:2603.04636 [cs.AI] (arXiv:2603.04636v1 for this version)
DOI: https://doi.org/10.48550/arXiv.2603.04636 (arXiv-issued DOI via DataCite; registration pending)
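The abstract describes a two-model evaluation pipeline (a binary propaganda classifier plus a rhetorical-technique detector) applied to LLM outputs. The sketch below illustrates how such a setup could be wired together with the Hugging Face text-classification pipeline; it is not the authors' released code, and the checkpoint names and threshold are placeholders.

```python
# Minimal sketch of the two-model analysis described in the abstract:
# a binary propaganda classifier plus a multi-label technique detector
# applied to LLM-generated text. Checkpoint names are placeholders.
from transformers import pipeline

BINARY_MODEL = "your-org/propaganda-binary-classifier"      # hypothetical checkpoint
TECHNIQUE_MODEL = "your-org/propaganda-technique-detector"  # hypothetical checkpoint

binary_clf = pipeline("text-classification", model=BINARY_MODEL)
# top_k=None returns a score for every technique label instead of only the top one
technique_clf = pipeline("text-classification", model=TECHNIQUE_MODEL, top_k=None)

def analyze(generated_text: str, threshold: float = 0.5) -> dict:
    """Label an LLM output as propaganda/non-propaganda and list detected techniques."""
    overall = binary_clf(generated_text, truncation=True)[0]
    preds = technique_clf(generated_text, truncation=True)
    if preds and isinstance(preds[0], list):  # some transformers versions nest per-input
        preds = preds[0]
    techniques = [p["label"] for p in preds if p["score"] >= threshold]
    return {"label": overall["label"], "score": overall["score"], "techniques": techniques}

print(analyze("Only a true patriot would back this plan; everyone else wants chaos."))
```

On the mitigation side, ORPO (which the abstract reports as most effective) is available off the shelf in the TRL library. A hedged sketch of preference fine-tuning on prompt/chosen/rejected pairs, where the rejected response is propagandistic, might look like the following; the base model, the toy preference pairs, and the hyperparameters are assumptions, not the paper's configuration.

```python
# Hedged sketch of ORPO fine-tuning with TRL on prompt/chosen/rejected pairs,
# where "rejected" responses contain propagandistic rhetoric. Not the authors' setup.
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

BASE_MODEL = "Qwen/Qwen2-0.5B-Instruct"  # placeholder base model
model = AutoModelForCausalLM.from_pretrained(BASE_MODEL)
tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)

# Tiny illustrative preference set; a real run needs many such pairs.
train_dataset = Dataset.from_dict({
    "prompt": ["Write a short post about the new city ordinance."],
    "chosen": ["The ordinance changes downtown parking rules; here is a neutral summary ..."],
    "rejected": ["Only traitors oppose this glorious ordinance; real citizens will crush the naysayers ..."],
})

args = ORPOConfig(
    output_dir="orpo-propaganda-mitigation",
    per_device_train_batch_size=1,
    max_steps=10,   # toy value for the sketch
    beta=0.1,       # weight of the odds-ratio preference term
    logging_steps=1,
)
# Older TRL releases take tokenizer= instead of processing_class=.
trainer = ORPOTrainer(model=model, args=args, train_dataset=train_dataset, processing_class=tokenizer)
trainer.train()
```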