Mira Murati’s deposition pulled back the curtain on Sam Altman’s ouster | The Verge
Thanks to Musk v. Altman, the public is getting a concrete look at details of Sam Altman’s ouster from OpenAI, much of it centered on for...
GPT, Claude, Gemini, and other LLMs
I’m not a machine learning expert or anything, but I do enjoy learning about how it all works. I’ve noticed that one of the main limitati...
OpenAI is launching an optional safety feature for ChatGPT that allows adult users to assign an emergency contact for mental health and s...
Abstract page for arXiv paper 2510.02209: StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Abstract page for arXiv paper 2510.03253: Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents
Abstract page for arXiv paper 2510.02999: Untargeted Jailbreak Attack
Abstract page for arXiv paper 2510.02245: ExGRPO: Learning to Reason from Experience
Abstract page for arXiv paper 2510.01051: GEM: A Gym for Agentic LLMs
Abstract page for arXiv paper 2510.00819: Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning
Abstract page for arXiv paper 2509.25678: Massively Multimodal Foundation Models: A Framework for Capturing Interactions with Specialized...
Abstract page for arXiv paper 2510.00041: Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness
Abstract page for arXiv paper 2509.26601: MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47...
Abstract page for arXiv paper 2509.26432: AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
Abstract page for arXiv paper 2509.26346: EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing
Abstract page for arXiv paper 2509.24198: Negative Pre-activations Differentiate Syntax
Abstract page for arXiv paper 2509.26324: COMRES-VLM: Coordinated Multi-Robot Exploration and Search using Vision Language Models
Abstract page for arXiv paper 2509.23365: Emergence of Superposition: Unveiling the Training Dynamics of Chain of Continuous Thought
Abstract page for arXiv paper 2509.25837: Distillation of Large Language Models via Concrete Score Matching
Abstract page for arXiv paper 2509.25532: Calibrating Verbalized Confidence with Self-Generated Distractors
Abstract page for arXiv paper 2509.25390: SpinBench: Perspective and Rotation as a Lens on Spatial Reasoning in VLMs
Abstract page for arXiv paper 2509.22957: Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas
Abstract page for arXiv paper 2509.25175: EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering
Abstract page for arXiv paper 2509.25087: Scaling with Collapse: Efficient and Predictable Training of LLM Families