Breaking AI on purpose: How researchers are helping make artificial intelligence safer

UF scientists are focused on strengthening the security measures built into AI tools to ensure they are safe for all to use.

Harrem Monkhorst, February 19, 2026

Nullspace steering. Red teaming. Jailbreaking the matrix.

A paper written by University of Florida Computer & Information Science & Engineering, or CISE, Professor Sumit Kumar Jha, Ph.D., contains so many science fiction terms, you’d be forgiven for thinking it’s a Hollywood script.

But Jha’s work is decidedly focused on real life, most notably strengthening the security measures built into AI tools to ensure they are safe for all to use.

“We are popping the hood, pulling on the internal wires and checking what breaks. That's how you make it safer. There's no shortcut for that.”
- Sumit Kumar Jha, Ph.D., a UF professor in the Department of Computer & Information Science & Engineering

As AI assistants move from novelty to infrastructure, helping write code, summarizing medical notes and answering customer questions, the biggest question isn't just what these systems can do, but what happens when they are pushed to do what they shouldn't.

“By showing exactly how these defenses break, we give AI developers the information they need to build defenses that actually hold up,” Jha said. “The public release of powerful A...

Originally published on March 27, 2026. Curated by AI News.
