Reducing LLM hallucination by using a model-agnostic control layer [R]
We’ve been working on the hallucination problem from a systems perspective rather than a model perspective. Instead of trying to improve ...
GPT, Claude, Gemini, and other LLMs
We’ve been working on the hallucination problem from a systems perspective rather than a model perspective. Instead of trying to improve ...
The rise of AI has brought an avalanche of new terms and slang. Here is a glossary with definitions of some of the most important words a...
Crescendo (Russinovich et al., USENIX Security 2025) is a multi-turn jailbreak that starts with innocent questions and gradually steers a...
Abstract page for arXiv paper 2602.04288: Contextual Drag: How Errors in the Context Affect LLM Reasoning
Abstract page for arXiv paper 2601.09566: Hot-Start from Pixels: Low-Resolution Visual Tokens for Chinese Language Modeling
Abstract page for arXiv paper 2511.12832: From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation
Abstract page for arXiv paper 2510.14686: xLLM Technical Report
Abstract page for arXiv paper 2510.14086: Every Language Model Has a Forgery-Resistant Signature
Abstract page for arXiv paper 2510.13900: Narrow Finetuning Leaves Clearly Readable Traces in Activation Differences
Abstract page for arXiv paper 2510.13315: Self-Aug: Query and Entropy Adaptive Decoding for Large Vision-Language Models
Abstract page for arXiv paper 2510.06084: Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability
Abstract page for arXiv paper 2509.22641: Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity
Abstract page for arXiv paper 2509.21091: Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute
Abstract page for arXiv paper 2509.20986: SiNGER: A Clearer Voice Distills Vision Transformers Further
Abstract page for arXiv paper 2509.12610: ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
Abstract page for arXiv paper 2509.10625: No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
Abstract page for arXiv paper 2509.05425: No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata
Abstract page for arXiv paper 2511.10833: SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery
Abstract page for arXiv paper 2511.08939: TransactionGPT
Abstract page for arXiv paper 2507.05890: Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators
Abstract page for arXiv paper 2507.01335: LEDOM: Reverse Language Model
Abstract page for arXiv paper 2510.15165: Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation App...
Abstract page for arXiv paper 2506.17871: LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime