[2604.09016] Identification and Anonymization of Named Entities in

[2604.09016] Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection

arXiv - AI April 13, 2026 3 min read

About this article

Abstract page for arXiv paper 2604.09016: Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection

Computer Science > Machine Learning arXiv:2604.09016 (cs) [Submitted on 10 Apr 2026] Title:Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection Authors:Carlos Jimeno Miguel, Raul Orduna, Francesco Zola View a PDF of the paper titled Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection, by Carlos Jimeno Miguel and 2 other authors View PDF HTML (experimental) Abstract:This study addresses the challenge of creating datasets for cybercrime analysis while complying with the requirements of regulations such as the General Data Protection Regulation (GDPR) and Organic Law 10/1995 of the Penal Code. To this end, a system is proposed for collecting information from the Telegram platform, including text, audio, and images; the implementation of speech-to-text transcription models incorporating signal enhancement techniques; and the evaluation of different Named Entity Recognition (NER) solutions, including Microsoft Presidio and AI models designed using a transformer-based architecture. Experimental results indicate that Parakeet achieves the best performance in audio transcription, while the proposed NER solutions achieve the highest f1-score values in detecting sensitive information. In addition, anonymization metrics are presented that allow evaluation of the preservation of structural coherence in the data, while simultaneously guara...

Originally published on April 13, 2026. Curated by AI News.

Llms

I am not an "anti" like this guy, but still an interesting video of person interacting with chat 4o

(Posting Here because removed by Chatgpt Complaints moderators because the model here is 4o, and refuse to believe there were any safety ...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

Unsolved AI Mystery Is Solved Along With Lessons Learned On Why ChatGPT Became Oddly Obsessed With Gremlins And Goblins

This article discusses the resolution of an AI mystery regarding ChatGPT's unusual focus on gremlins and goblins, along with insights gai...

AI Tools & Products · 1 min · about 4 hours ago

Llms

[2602.06869] Uncovering Cross-Objective Interference in Multi-Objective Alignment

Abstract page for arXiv paper 2602.06869: Uncovering Cross-Objective Interference in Multi-Objective Alignment

arXiv - Machine Learning · 3 min · about 4 hours ago

Machine Learning

[2604.07401] Geometric Entropy and Retrieval Phase Transitions in Continuous Thermal Dense Associative Memory

Abstract page for arXiv paper 2604.07401: Geometric Entropy and Retrieval Phase Transitions in Continuous Thermal Dense Associative Memory

arXiv - Machine Learning · 4 min · about 4 hours ago

[2604.09016] Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection

About this article

Related Articles

I am not an "anti" like this guy, but still an interesting video of person interacting with chat 4o

Unsolved AI Mystery Is Solved Along With Lessons Learned On Why ChatGPT Became Oddly Obsessed With Gremlins And Goblins

[2602.06869] Uncovering Cross-Objective Interference in Multi-Objective Alignment

[2604.07401] Geometric Entropy and Retrieval Phase Transitions in Continuous Thermal Dense Associative Memory

No comments

Stay updated with AI News