Mira Murati’s deposition pulled back the curtain on Sam Altman’s ouster | The Verge
Thanks to Musk v. Altman, the public is getting a concrete look at details of Sam Altman’s ouster from OpenAI, much of it centered on for...
GPT, Claude, Gemini, and other LLMs
Thanks to Musk v. Altman, the public is getting a concrete look at details of Sam Altman’s ouster from OpenAI, much of it centered on for...
I’m not a machine learning expert or anything, but I do enjoy learning about how it all works. I’ve noticed that one of the main limitati...
OpenAI is launching an optional safety feature for ChatGPT that allows adult users to assign an emergency contact for mental health and s...
Abstract page for arXiv paper 2509.24385: Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy
Abstract page for arXiv paper 2509.24282: SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents
Abstract page for arXiv paper 2509.24245: Prompt and Parameter Co-Optimization for Large Language Models
Abstract page for arXiv paper 2509.24203: Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRP...
Abstract page for arXiv paper 2509.21029: FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction
Abstract page for arXiv paper 2509.23383: Train Once, Answer All: Many Pretraining Experiments for the Cost of One
Abstract page for arXiv paper 2509.22611: Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning
Abstract page for arXiv paper 2509.22299: HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space
Abstract page for arXiv paper 2509.22134: Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding
Abstract page for arXiv paper 2508.07697: Semantic-Enhanced Time-Series Forecasting via Large Language Models
Abstract page for arXiv paper 2508.07638: Data Selection for LLM Alignment Using Fine-Grained Preferences
Abstract page for arXiv paper 2508.04097: Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks
Abstract page for arXiv paper 2508.04865: Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Lear...
Abstract page for arXiv paper 2509.15888: Distribution-Aligned Decoding for Efficient LLM Task Adaptation
Abstract page for arXiv paper 2507.18553: The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm
Abstract page for arXiv paper 2507.06567: SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference
Abstract page for arXiv paper 2509.05608: BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Finger...
Abstract page for arXiv paper 2509.04784: Post-training Large Language Models for Diverse High-Quality Responses
Abstract page for arXiv paper 2508.18672: Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Abstract page for arXiv paper 2506.20746: Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime