Depth-first pruning seems to transfer from GPT-2 to Llama (unexpectedly well)
TL;DR: Removing the right transformer layers (instead of shrinking all layers) gives smaller, faster models with minimal quality loss — a...
GPT, Claude, Gemini, and other LLMs
TL;DR: Removing the right transformer layers (instead of shrinking all layers) gives smaller, faster models with minimal quality loss — a...
Abstract page for arXiv paper 2603.23966: Policy-Guided Threat Hunting: An LLM enabled Framework with Splunk SOC Triage
Abstract page for arXiv paper 2603.16790: InCoder-32B: Code Foundation Model for Industrial Scenarios
Abstract page for arXiv paper 2603.22499: OrgForge-IT: A Verifiable Synthetic Benchmark for LLM-Based Insider Threat Detection
Abstract page for arXiv paper 2603.22593: Language Models Can Explain Visual Features via Steering
Abstract page for arXiv paper 2603.22582: Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?
Abstract page for arXiv paper 2603.22577: STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving
Abstract page for arXiv paper 2603.22528: GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs
Abstract page for arXiv paper 2603.22519: LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface
Abstract page for arXiv paper 2603.22510: Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals
Abstract page for arXiv paper 2603.22492: Tiny Inference-Time Scaling with Latent Verifiers
Abstract page for arXiv paper 2603.22479: Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games
Abstract page for arXiv paper 2603.22473: Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architec...
Abstract page for arXiv paper 2603.22355: Demystifying Low-Rank Knowledge Distillation in Large Language Models: Convergence, Generalizat...
Abstract page for arXiv paper 2603.22344: Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study
Abstract page for arXiv paper 2603.22459: LLM-guided headline rewriting for clickability enhancement without clickbait
Abstract page for arXiv paper 2603.22446: Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs
Abstract page for arXiv paper 2603.22330: Fair splits flip the leaderboard: CHANRG reveals limited generalization in RNA secondary-struct...
Abstract page for arXiv paper 2603.22376: AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Age...
Abstract page for arXiv paper 2603.23461: End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions
Abstract page for arXiv paper 2603.23414: SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling
Abstract page for arXiv paper 2603.22368: When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualiza...
Abstract page for arXiv paper 2603.23355: Off-Policy Value-Based Reinforcement Learning for Large Language Models
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime