Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Reducing LLM hallucination by using a model-agnostic control layer [R]

We’ve been working on the hallucination problem from a systems perspective rather than a model perspective. Instead of trying to improve ...

Reddit - Machine Learning · 1 min ·
From LLMs to hallucinations, here’s a simple guide to common AI terms

The rise of AI has brought an avalanche of new terms and slang. Here is a glossary with definitions of some of the most important words a...

TechCrunch - AI · 19 min ·

LLM Guard scored 0/8 detecting a Crescendo multi-turn attack. Arc Sentry flagged it at Turn 3.

Crescendo (Russinovich et al., USENIX Security 2025) is a multi-turn jailbreak that starts with innocent questions and gradually steers a...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.04288] Contextual Drag: How Errors in the Context Affect LLM Reasoning

Abstract page for arXiv paper 2602.04288: Contextual Drag: How Errors in the Context Affect LLM Reasoning

arXiv - Machine Learning · 3 min ·
[2601.09566] Hot-Start from Pixels: Low-Resolution Visual Tokens for Chinese Language Modeling

Abstract page for arXiv paper 2601.09566: Hot-Start from Pixels: Low-Resolution Visual Tokens for Chinese Language Modeling

arXiv - AI · 3 min ·
[2511.12832] From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

Abstract page for arXiv paper 2511.12832: From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

arXiv - AI · 3 min ·
[2510.14686] xLLM Technical Report

Abstract page for arXiv paper 2510.14686: xLLM Technical Report

arXiv - AI · 4 min ·
[2510.14086] Every Language Model Has a Forgery-Resistant Signature

Abstract page for arXiv paper 2510.14086: Every Language Model Has a Forgery-Resistant Signature

arXiv - AI · 4 min ·
[2510.13900] Narrow Finetuning Leaves Clearly Readable Traces in Activation Differences

Abstract page for arXiv paper 2510.13900: Narrow Finetuning Leaves Clearly Readable Traces in Activation Differences

arXiv - AI · 4 min ·
[2510.13315] Self-Aug: Query and Entropy Adaptive Decoding for Large Vision-Language Models

Abstract page for arXiv paper 2510.13315: Self-Aug: Query and Entropy Adaptive Decoding for Large Vision-Language Models

arXiv - AI · 4 min ·
[2510.06084] Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

Abstract page for arXiv paper 2510.06084: Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

arXiv - AI · 4 min ·
[2509.22641] Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity

Abstract page for arXiv paper 2509.22641: Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity

arXiv - AI · 4 min ·
[2509.21091] Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

Abstract page for arXiv paper 2509.21091: Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

arXiv - AI · 3 min ·
[2509.20986] SiNGER: A Clearer Voice Distills Vision Transformers Further

Abstract page for arXiv paper 2509.20986: SiNGER: A Clearer Voice Distills Vision Transformers Further

arXiv - AI · 4 min ·
[2509.12610] ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

Abstract page for arXiv paper 2509.12610: ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

arXiv - Machine Learning · 4 min ·
[2509.10625] No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

Abstract page for arXiv paper 2509.10625: No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

arXiv - AI · 4 min ·
[2509.05425] No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata

Abstract page for arXiv paper 2509.05425: No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata

arXiv - AI · 3 min ·
[2511.10833] SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery

Abstract page for arXiv paper 2511.10833: SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery

arXiv - Machine Learning · 4 min ·
[2511.08939] TransactionGPT

Abstract page for arXiv paper 2511.08939: TransactionGPT

arXiv - Machine Learning · 4 min ·
[2507.05890] Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators

Abstract page for arXiv paper 2507.05890: Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators

arXiv - AI · 4 min ·
[2507.01335] LEDOM: Reverse Language Model

Abstract page for arXiv paper 2507.01335: LEDOM: Reverse Language Model

arXiv - AI · 3 min ·
[2510.15165] Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach

Abstract page for arXiv paper 2510.15165: Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation App...

arXiv - Machine Learning · 4 min ·
[2506.17871] LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

Abstract page for arXiv paper 2506.17871: LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

arXiv - Machine Learning · 4 min ·