[2603.04707] Detection of Illicit Content on Online Marketplaces using

[2603.04707] Detection of Illicit Content on Online Marketplaces using Large Language Models

arXiv - AI March 06, 2026 4 min read

About this article

Abstract page for arXiv paper 2603.04707: Detection of Illicit Content on Online Marketplaces using Large Language Models

Computer Science > Computation and Language arXiv:2603.04707 (cs) [Submitted on 5 Mar 2026] Title:Detection of Illicit Content on Online Marketplaces using Large Language Models Authors:Quoc Khoa Tran, Thanh Thi Nguyen, Campbell Wilson View a PDF of the paper titled Detection of Illicit Content on Online Marketplaces using Large Language Models, by Quoc Khoa Tran and 2 other authors View PDF HTML (experimental) Abstract:Online marketplaces, while revolutionizing global commerce, have inadvertently facilitated the proliferation of illicit activities, including drug trafficking, counterfeit sales, and cybercrimes. Traditional content moderation methods such as manual reviews and rule-based automated systems struggle with scalability, dynamic obfuscation techniques, and multilingual content. Conventional machine learning models, though effective in simpler contexts, often falter when confronting the semantic complexities and linguistic nuances characteristic of illicit marketplace communications. This research investigates the efficacy of Large Language Models (LLMs), specifically Meta's Llama 3.2 and Google's Gemma 3, in detecting and classifying illicit online marketplace content using the multilingual DUTA10K dataset. Employing fine-tuning techniques such as Parameter-Efficient Fine-Tuning (PEFT) and quantization, these models were systematically benchmarked against a foundational transformer-based model (BERT) and traditional machine learning baselines (Support Vector Mac...

Originally published on March 06, 2026. Curated by AI News.

Llms

One of The Worst AI's I've Ever Seen

I'm using Gemini just for they gave us a student-free-pro pack. It can't see the images I sent, most of the time it just rewrites the mes...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone 👋 I've set up a self-hosted API gateway using New-API to manage and distribute Claude Opus 4.6 access across multiple users....

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

The open-source AI system that beat Claude Sonnet on a $500 GPU just shipped a coding assistant

A week or two ago, an open-source project called ATLAS made the rounds for scoring 74.6% on LiveCodeBench with a frozen 9B model on a sin...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

Claude Max 20x usage hit 40% by Monday noon — how does Codex CLI compare?

I'm on Claude Max (the $100/mo plan) and noticed something that surprised me. By Monday noon I had already used 40% of the 20x monthly li...

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

[2603.04707] Detection of Illicit Content on Online Marketplaces using Large Language Models

About this article

Related Articles

One of The Worst AI's I've Ever Seen

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

The open-source AI system that beat Claude Sonnet on a $500 GPU just shipped a coding assistant

Claude Max 20x usage hit 40% by Monday noon — how does Codex CLI compare?

No comments

Stay updated with AI News