[2505.05619] LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities

[2505.05619] LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2505.05619: LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities

Computer Science > Cryptography and Security arXiv:2505.05619 (cs) [Submitted on 8 May 2025 (v1), last revised 3 Mar 2026 (this version, v3)] Title:LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities Authors:Kalyan Nakka, Jimmy Dani, Ausmit Mondal, Nitesh Saxena View a PDF of the paper titled LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities, by Kalyan Nakka and 3 other authors View PDF HTML (experimental) Abstract:The growing adoption of Large Language Models (LLMs) has influenced the development of Small Language Models (SLMs) for on-device deployment across smartphones and edge devices, offering enhanced privacy, reduced latency, server-free functionality, and improved user experience. However, due to on-device resource constraints, SLMs undergo size optimization through compression techniques like quantization, which inadvertently introduce fairness, ethical and privacy risks. Critically, quantized SLMs may respond to harmful queries directly, without requiring adversarial manipulation, raising significant safety and trust concerns. To address this, we propose LiteLMGuard, an on-device guardrail that provides real-time, prompt-level defense for quantized SLMs. Additionally, our guardrail is designed to be model-agnostic such that it can be seamlessly integr...

Originally published on March 04, 2026. Curated by AI News.

Related Articles

[2506.21138] Multi-Sample Prompting and Actor-Critic Prompt Optimization for Diverse Synthetic Data Generation
Llms

[2506.21138] Multi-Sample Prompting and Actor-Critic Prompt Optimization for Diverse Synthetic Data Generation

Abstract page for arXiv paper 2506.21138: Multi-Sample Prompting and Actor-Critic Prompt Optimization for Diverse Synthetic Data Generation

arXiv - AI · 4 min ·
[2505.16377] VLM-SAFE: Vision-Language Model-Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving
Llms

[2505.16377] VLM-SAFE: Vision-Language Model-Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving

Abstract page for arXiv paper 2505.16377: VLM-SAFE: Vision-Language Model-Guided Safety-Aware Reinforcement Learning with World Models fo...

arXiv - AI · 4 min ·
[2505.07372] Self-Bootstrapping Automated Program Repair: Using LLMs to Generate and Evaluate Synthetic Training Data for Bug Repair
Llms

[2505.07372] Self-Bootstrapping Automated Program Repair: Using LLMs to Generate and Evaluate Synthetic Training Data for Bug Repair

Abstract page for arXiv paper 2505.07372: Self-Bootstrapping Automated Program Repair: Using LLMs to Generate and Evaluate Synthetic Trai...

arXiv - AI · 4 min ·
[2505.03821] Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models
Llms

[2505.03821] Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models

Abstract page for arXiv paper 2505.03821: Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models

arXiv - AI · 4 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime