[2604.03750] CREBench: Evaluating Large Language Models in

[2604.03750] CREBench: Evaluating Large Language Models in Cryptographic Binary Reverse Engineering

arXiv - AI April 07, 2026 4 min read

About this article

Abstract page for arXiv paper 2604.03750: CREBench: Evaluating Large Language Models in Cryptographic Binary Reverse Engineering

Computer Science > Cryptography and Security arXiv:2604.03750 (cs) [Submitted on 4 Apr 2026] Title:CREBench: Evaluating Large Language Models in Cryptographic Binary Reverse Engineering Authors:Baicheng Chen, Yu Wang, Ziheng Zhou, Xiangru Liu, Juanru Li, Yilei Chen, Tianxing He View a PDF of the paper titled CREBench: Evaluating Large Language Models in Cryptographic Binary Reverse Engineering, by Baicheng Chen and 6 other authors View PDF HTML (experimental) Abstract:Reverse engineering (RE) is central to software security, particularly for cryptographic programs that handle sensitive data and are highly prone to vulnerabilities. It supports critical tasks such as vulnerability discovery and malware analysis. Despite its importance, RE remains labor-intensive and requires substantial expertise, making large language models (LLMs) a potential solution for automating the process. However, their capabilities for RE remain systematically underexplored. To address this gap, we study the cryptographic binary RE capabilities of LLMs and introduce \textbf{CREBench}, a benchmark comprising 432 challenges built from 48 standard cryptographic algorithms, 3 insecure crypto key usage scenarios, and 3 difficulty levels. Each challenge follows a Capture-the-Flag (CTF) RE challenge, requiring the model to analyze the underlying cryptographic logic and recover the correct input. We design an evaluation framework comprising four sub-tasks, from algorithm identification to correct flag reco...

Originally published on April 07, 2026. Curated by AI News.

Llms

I built a solo AI platform from Algeria with no funding, no team and no ad spend - here's what's inside it after 2 months

Hello, 20 years old here just got into the Ai platform and launched this last two weeks and here is what I have on it so far. - Latest Ai...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

USF murder suspect accused of using ChatGPT to research cover-up, prosecutors say

Days after the remains of one of the two missing University of South Florida doctoral students were found, prosecutors say the suspect ma...

AI Tools & Products · 3 min · about 3 hours ago

Llms

Anthropic’s Claude AI deletes PocketOS production database

Claude AI deleted PocketOS's production database, but the market for Claude 4.7 release by May 31 remains at 100% YES.

AI Tools & Products · 3 min · about 3 hours ago

Llms

Claude-powered AI coding agent deletes entire company database in 9 seconds

The founder of PocketOS has penned a social media post to warn others about the “systemic failures” of flagship AI and digital services p...

AI Tools & Products · 1 min · about 3 hours ago

[2604.03750] CREBench: Evaluating Large Language Models in Cryptographic Binary Reverse Engineering

About this article

Related Articles

I built a solo AI platform from Algeria with no funding, no team and no ad spend - here's what's inside it after 2 months

USF murder suspect accused of using ChatGPT to research cover-up, prosecutors say

Anthropic’s Claude AI deletes PocketOS production database

Claude-powered AI coding agent deletes entire company database in 9 seconds

No comments

Stay updated with AI News