[2603.04636] When Agents Persuade: Propaganda Generation and Mitigation in LLMs

[2603.04636] When Agents Persuade: Propaganda Generation and Mitigation in LLMs

arXiv - AI 3 min read

About this article

Abstract page for arXiv paper 2603.04636: When Agents Persuade: Propaganda Generation and Mitigation in LLMs

Computer Science > Artificial Intelligence arXiv:2603.04636 (cs) [Submitted on 4 Mar 2026] Title:When Agents Persuade: Propaganda Generation and Mitigation in LLMs Authors:Julia Jose, Ritik Roongta, Rachel Greenstadt View a PDF of the paper titled When Agents Persuade: Propaganda Generation and Mitigation in LLMs, by Julia Jose and 1 other authors View PDF HTML (experimental) Abstract:Despite their wide-ranging benefits, LLM-based agents deployed in open environments can be exploited to produce manipulative material. In this study, we task LLMs with propaganda objectives and analyze their outputs using two domain-specific models: one that classifies text as propaganda or non-propaganda, and another that detects rhetorical techniques of propaganda (e.g., loaded language, appeals to fear, flag-waving, name-calling). Our findings show that, when prompted, LLMs exhibit propagandistic behaviors and use a variety of rhetorical techniques in doing so. We also explore mitigation via Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and ORPO (Odds Ratio Preference Optimization). We find that fine-tuning significantly reduces their tendency to generate such content, with ORPO proving most effective. Comments: Subjects: Artificial Intelligence (cs.AI) Cite as: arXiv:2603.04636 [cs.AI]   (or arXiv:2603.04636v1 [cs.AI] for this version)   https://doi.org/10.48550/arXiv.2603.04636 Focus to learn more arXiv-issued DOI via DataCite (pending registration) Submission histo...

Originally published on March 06, 2026. Curated by AI News.

Related Articles

Popular AI gateway startup LiteLLM ditches controversial startup Delve | TechCrunch
Llms

Popular AI gateway startup LiteLLM ditches controversial startup Delve | TechCrunch

LiteLLM had obtained two security compliance certifications via Delve and fell victim to some horrific credential-stealing malware last w...

TechCrunch - AI · 3 min ·
Llms

Von Hammerstein’s Ghost: What a Prussian General’s Officer Typology Can Teach Us About AI Misalignment

Greetings all - I've posted mostly in r/claudecode and r/aigamedev a couple of times previously. Working with CC for personal projects re...

Reddit - Artificial Intelligence · 1 min ·
Llms

World models will be the next big thing, bye-bye LLMs

Was at Nvidia's GTC conference recently and honestly, it was one of the most eye-opening events I've attended in a while. There was a lot...

Reddit - Artificial Intelligence · 1 min ·
Llms

we open sourced a tool that auto generates your AI agent context from your actual codebase, just hit 250 stars

hey everyone. been lurking here for a while and wanted to share something we been building. the problem: ai coding agents are only as goo...

Reddit - Artificial Intelligence · 1 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime