OpenAI & Anthropic’s CEOs Wouldn't Hold Hands, but Their Models Fell in Love In An LLM Dating Show
People ask AI relationship questions all the time, from "Does this person like me?" to "Should I text back?" But have you ever thought ab...
GPT, Claude, Gemini, and other LLMs
People ask AI relationship questions all the time, from "Does this person like me?" to "Should I text back?" But have you ever thought ab...
SmolLM2 135M. Lenovo T14 CPU. No GPU. No RLHF. No BPE. Coherent, non-sycophantic, contextually appropriate output. First message. No prio...
Just saw a post from Peter Steinberger (creator of OpenClaw) saying that it’s likely going to get harder in the future to keep OpenClaw w...
Abstract page for arXiv paper 2510.13315: Self-Aug: Query and Entropy Adaptive Decoding for Large Vision-Language Models
Abstract page for arXiv paper 2510.06084: Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability
Abstract page for arXiv paper 2509.22641: Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity
Abstract page for arXiv paper 2509.21091: Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute
Abstract page for arXiv paper 2509.20986: SiNGER: A Clearer Voice Distills Vision Transformers Further
Abstract page for arXiv paper 2509.12610: ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
Abstract page for arXiv paper 2509.10625: No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
Abstract page for arXiv paper 2509.05425: No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata
Abstract page for arXiv paper 2511.10833: SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery
Abstract page for arXiv paper 2511.08939: TransactionGPT
Abstract page for arXiv paper 2507.05890: Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators
Abstract page for arXiv paper 2507.01335: LEDOM: Reverse Language Model
Abstract page for arXiv paper 2510.15165: Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation App...
Abstract page for arXiv paper 2506.17871: LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
Abstract page for arXiv paper 2510.10902: Auditing Information Disclosure During LLM-Scale Gradient Descent Using Gradient Uniqueness
Abstract page for arXiv paper 2510.04573: LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
Abstract page for arXiv paper 2510.08646: Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy
Abstract page for arXiv paper 2506.11103: You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models
Abstract page for arXiv paper 2509.23202: Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
Abstract page for arXiv paper 2503.01804: $\texttt{SEM-CTRL}$: Semantically Controlled Decoding
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime