[2509.18880] Diversity Boosts AI-Generated Text Detection

Summary

The paper presents DivEye, a framework that detects AI-generated text by measuring how lexical and structural unpredictability fluctuates across a text. It outperforms existing zero-shot detectors by up to 33.2%.

Why It Matters

As AI-generated text becomes more prevalent, effective detection is needed to limit misinformation and misuse in education, business compliance, journalism, and social media. DivEye's interpretable features address two limitations of current detectors: weak performance on high-quality generations and little interpretability.

Key Takeaways

  • DivEye improves AI-generated text detection by analyzing lexical and structural unpredictability.
  • The framework outperforms existing zero-shot detectors by up to 33.2%.
  • DivEye offers insights into why texts are flagged, enhancing interpretability.
  • It is robust against paraphrasing and adversarial attacks.
  • The method improves existing detectors' performance by up to 18.7% when used as an auxiliary signal.

arXiv:2509.18880 (cs), Computer Science > Computation and Language
Submitted on 23 Sep 2025 (v1), last revised 25 Feb 2026 (this version, v3)

Title: Diversity Boosts AI-Generated Text Detection

Authors: Advik Raj Basani, Pin-Yu Chen

Abstract: Detecting AI-generated text is an increasing necessity to combat misuse of LLMs in education, business compliance, journalism, and social media, where synthetic fluency can mask misinformation or deception. While prior detectors often rely on token-level likelihoods or opaque black-box classifiers, these approaches struggle against high-quality generations and offer little interpretability. In this work, we propose DivEye, a novel detection framework that captures how unpredictability fluctuates across a text using surprisal-based features. Motivated by the observation that human-authored text exhibits richer variability in lexical and structural unpredictability than LLM outputs, DivEye captures this signal through a set of interpretable statistical features. Our method outperforms existing zero-shot detectors by up to 33.2% and achieves competitive performance with fine-tuned baselines across multiple benchmarks. DivEye is robust to paraphrasing and adversarial attacks, generalizes well across domains and models, and improves the performance of existing detectors by up to 18.7% when used as ...
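To make the idea of surprisal-based features concrete, here is a minimal Python sketch. It is not the paper's implementation: the abstract does not list DivEye's exact features, so the function `diversity_features` and the four statistics it returns are illustrative assumptions. The sketch also assumes the per-token probabilities have already been obtained from some language model; no model is called here.

```python
import math
from statistics import fmean, pvariance


def surprisals(token_probs):
    """Convert per-token probabilities into surprisal values, -log p, in nats."""
    for p in token_probs:
        if not 0.0 < p <= 1.0:
            raise ValueError(f"probability out of range (0, 1]: {p}")
    return [-math.log(p) for p in token_probs]


def diversity_features(token_probs):
    """Summarize how surprisal varies across a text.

    Hypothetical feature set chosen for illustration only; the paper's
    actual features may differ.
    """
    s = surprisals(token_probs)
    if len(s) < 2:
        raise ValueError("need at least two tokens to measure fluctuation")
    # Changes in surprisal between consecutive tokens.
    deltas = [b - a for a, b in zip(s, s[1:])]
    return {
        "mean_surprisal": fmean(s),
        "var_surprisal": pvariance(s),
        "mean_abs_delta": fmean([abs(d) for d in deltas]),
        "var_delta": pvariance(deltas),
    }


# Toy inputs (made-up probabilities, not real model output).
uniform_text = diversity_features([0.5, 0.5, 0.5, 0.5])
varied_text = diversity_features([0.9, 0.01, 0.5, 0.001])
```

Under the abstract's hypothesis, human-authored text would tend to look more like `varied_text`, with larger variance and larger token-to-token swings in surprisal, while LLM output would sit closer to the flat profile. A real detector would feed such features into a classifier rather than compare them by eye.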
