Machine Learning Nlp Ai Safety

[2602.20634] Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches

arXiv - AI February 25, 2026 3 min read Article

Summary

This article evaluates various machine learning models for hate speech detection on social media, comparing traditional and advanced techniques, and introduces innovative text transformation methods to mitigate harmful content.

Why It Matters

With the rise of hate speech on social media, effective detection and moderation tools are crucial. This study provides insights into the performance of different machine learning models, highlighting their strengths and limitations, which can inform future developments in content moderation technologies.

Key Takeaways

Advanced models like BERT show superior accuracy in detecting hate speech.
Hybrid models combining different architectures can enhance detection capabilities.
Innovative text transformation techniques can neutralize harmful expressions.
The study highlights the strengths and limitations of current hate speech detection technologies.
Future research directions are proposed for improving detection systems.

Computer Science > Computation and Language arXiv:2602.20634 (cs) [Submitted on 24 Feb 2026] Title:Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches Authors:Saurabh Mishra, Shivani Thakur, Radhika Mamidi View a PDF of the paper titled Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches, by Saurabh Mishra and 2 other authors View PDF Abstract:The proliferation of hate speech on social media platforms has necessitated the development of effective detection and moderation tools. This study evaluates the efficacy of various machine learning models in identifying hate speech and offensive language and investigates the potential of text transformation techniques to neutralize such content. We compare traditional models like CNNs and LSTMs with advanced neural network models such as BERT and its derivatives, alongside exploring hybrid models that combine different architectural features. Our results indicate that while advanced models like BERT show superior accuracy due to their deep contextual understanding, hybrid models exhibit improved capabilities in certain scenarios. Furthermore, we introduce innovative text transformation approaches that convert negative expressions into neutral ones, thereby potentially mitigating the impact of harmful content. The implications of these findings are discussed, highlighting ...

Read Original Article

[2602.20634] Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches

Summary

Why It Matters

Key Takeaways

Related Articles

[2603.29957] Think Anywhere in Code Generation

[2603.17790] The Convergence Frontier: Integrating Machine Learning and High Performance Quantum Computing for Next-Generation Drug Discovery

[2603.16880] NeuroNarrator: A Generalist EEG-to-Text Foundation Model for Clinical Interpretation via Spectro-Spatial Grounding and Temporal State-Space Reasoning

[2601.14674] LaVR: Scene Latent Conditioned Generative Video Trajectory Re-Rendering using Large 4D Reconstruction Models

No comments

Stay updated with AI News