Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Depth-first pruning seems to transfer from GPT-2 to Llama (unexpectedly well)

TL;DR: Removing the right transformer layers (instead of shrinking all layers) gives smaller, faster models with minimal quality loss — a...

Reddit - Artificial Intelligence · 1 min · 2 minutes ago

Llms

[2603.23966] Policy-Guided Threat Hunting: An LLM enabled Framework with Splunk SOC Triage

Abstract page for arXiv paper 2603.23966: Policy-Guided Threat Hunting: An LLM enabled Framework with Splunk SOC Triage

arXiv - AI · 4 min · about 1 hour ago

Llms

[2603.16790] InCoder-32B: Code Foundation Model for Industrial Scenarios

Abstract page for arXiv paper 2603.16790: InCoder-32B: Code Foundation Model for Industrial Scenarios

arXiv - AI · 4 min · about 1 hour ago

All Content

Llms

[2603.22499] OrgForge-IT: A Verifiable Synthetic Benchmark for LLM-Based Insider Threat Detection

Abstract page for arXiv paper 2603.22499: OrgForge-IT: A Verifiable Synthetic Benchmark for LLM-Based Insider Threat Detection

arXiv - Machine Learning · 4 min · 6 days ago

Llms

[2603.22593] Language Models Can Explain Visual Features via Steering

Abstract page for arXiv paper 2603.22593: Language Models Can Explain Visual Features via Steering

arXiv - AI · 3 min · 6 days ago

Llms

[2603.22582] Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?

Abstract page for arXiv paper 2603.22582: Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?

arXiv - AI · 4 min · 6 days ago

Llms

[2603.22577] STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving

Abstract page for arXiv paper 2603.22577: STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving

arXiv - AI · 4 min · 6 days ago

Llms

[2603.22528] GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs

Abstract page for arXiv paper 2603.22528: GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs

arXiv - AI · 4 min · 6 days ago

Llms

[2603.22519] LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

Abstract page for arXiv paper 2603.22519: LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

arXiv - AI · 4 min · 6 days ago

Llms

[2603.22510] Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals

Abstract page for arXiv paper 2603.22510: Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals

arXiv - AI · 3 min · 6 days ago

Llms

[2603.22492] Tiny Inference-Time Scaling with Latent Verifiers

Abstract page for arXiv paper 2603.22492: Tiny Inference-Time Scaling with Latent Verifiers

arXiv - AI · 4 min · 6 days ago

Llms

[2603.22479] Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games

Abstract page for arXiv paper 2603.22479: Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games

arXiv - AI · 3 min · 6 days ago

Llms

[2603.22473] Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures

Abstract page for arXiv paper 2603.22473: Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architec...

arXiv - AI · 3 min · 6 days ago

Llms

[2603.22355] Demystifying Low-Rank Knowledge Distillation in Large Language Models: Convergence, Generalization, and Information-Theoretic Guarantees

Abstract page for arXiv paper 2603.22355: Demystifying Low-Rank Knowledge Distillation in Large Language Models: Convergence, Generalizat...

arXiv - Machine Learning · 4 min · 6 days ago

Llms

[2603.22344] Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study

Abstract page for arXiv paper 2603.22344: Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study

arXiv - Machine Learning · 4 min · 6 days ago

Llms

[2603.22459] LLM-guided headline rewriting for clickability enhancement without clickbait

Abstract page for arXiv paper 2603.22459: LLM-guided headline rewriting for clickability enhancement without clickbait

arXiv - AI · 4 min · 6 days ago

Llms

[2603.22446] Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Abstract page for arXiv paper 2603.22446: Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

arXiv - AI · 4 min · 6 days ago

Llms

[2603.22330] Fair splits flip the leaderboard: CHANRG reveals limited generalization in RNA secondary-structure prediction

Abstract page for arXiv paper 2603.22330: Fair splits flip the leaderboard: CHANRG reveals limited generalization in RNA secondary-struct...

arXiv - Machine Learning · 3 min · 6 days ago

Llms

[2603.22376] AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access

Abstract page for arXiv paper 2603.22376: AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Age...

arXiv - AI · 4 min · 6 days ago

Llms

[2603.23461] End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

Abstract page for arXiv paper 2603.23461: End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

arXiv - Machine Learning · 3 min · 6 days ago

Llms

[2603.23414] SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

Abstract page for arXiv paper 2603.23414: SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

arXiv - AI · 4 min · 6 days ago

Llms

[2603.22368] When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations

Abstract page for arXiv paper 2603.22368: When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualiza...

arXiv - AI · 4 min · 6 days ago

Llms

[2603.23355] Off-Policy Value-Based Reinforcement Learning for Large Language Models

Abstract page for arXiv paper 2603.23355: Off-Policy Value-Based Reinforcement Learning for Large Language Models

arXiv - Machine Learning · 3 min · 6 days ago

Previous Page 36 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Depth-first pruning seems to transfer from GPT-2 to Llama (unexpectedly well)

[2603.23966] Policy-Guided Threat Hunting: An LLM enabled Framework with Splunk SOC Triage

[2603.16790] InCoder-32B: Code Foundation Model for Industrial Scenarios

All Content

[2603.22499] OrgForge-IT: A Verifiable Synthetic Benchmark for LLM-Based Insider Threat Detection

[2603.22593] Language Models Can Explain Visual Features via Steering

[2603.22582] Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?

[2603.22577] STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving

[2603.22528] GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs

[2603.22519] LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

[2603.22510] Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals

[2603.22492] Tiny Inference-Time Scaling with Latent Verifiers

[2603.22479] Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games

[2603.22473] Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures

[2603.22355] Demystifying Low-Rank Knowledge Distillation in Large Language Models: Convergence, Generalization, and Information-Theoretic Guarantees

[2603.22344] Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study

[2603.22459] LLM-guided headline rewriting for clickability enhancement without clickbait

[2603.22446] Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

[2603.22330] Fair splits flip the leaderboard: CHANRG reveals limited generalization in RNA secondary-structure prediction

[2603.22376] AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access

[2603.23461] End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

[2603.23414] SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

[2603.22368] When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations

[2603.23355] Off-Policy Value-Based Reinforcement Learning for Large Language Models

Related Topics

Stay updated with AI News