Elon Musk confirms xAI used OpenAI’s models to train Grok | The Verge
During the Musk v. Altman trial, while on the stand, Elon Musk said it was “partly” true that xAI had used model distillation of OpenAI’s...
ML algorithms, training, and inference
During the Musk v. Altman trial, while on the stand, Elon Musk said it was “partly” true that xAI had used model distillation of OpenAI’s...
Hello r/MachineLearning! I work in the US transit industry and I went all-in on learning AI & ML a few months ago. When I heard about...
My 4444 (4443 pre-rebuttal) got rejected (as expected). Just copying a reply I wrote a couple of days ago before decisions were out: Ther...
Abstract page for arXiv paper 2604.04825: Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not
Abstract page for arXiv paper 2604.04815: LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection
Abstract page for arXiv paper 2604.04743: Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations
Abstract page for arXiv paper 2604.04741: Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emer...
Abstract page for arXiv paper 2604.04733: Discovering Failure Modes in Vision-Language Models using RL
Abstract page for arXiv paper 2604.04732: Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs
Abstract page for arXiv paper 2604.04723: Individual and Combined Effects of English as a Second Language and Typos on LLM Performance
Abstract page for arXiv paper 2604.04720: What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features
Abstract page for arXiv paper 2604.04664: ROSClaw: A Hierarchical Semantic-Physical Framework for Heterogeneous Multi-Agent Collaboration
Abstract page for arXiv paper 2604.04646: Training-Free Refinement of Flow Matching with Divergence-based Sampling
Abstract page for arXiv paper 2604.04634: Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale
Abstract page for arXiv paper 2604.04593: Ruling Out to Rule In: Contrastive Hypothesis Retrieval for Medical Question Answering
Abstract page for arXiv paper 2604.04565: PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervi...
Abstract page for arXiv paper 2604.04563: Temporal Inversion for Learning Interval Change in Chest X-Rays
Abstract page for arXiv paper 2604.04562: Paper Espresso: From Paper Overload to Research Insight
Abstract page for arXiv paper 2604.04561: Mapping the Exploitation Surface: A 10,000-Trial Taxonomy of What Makes LLM Agents Exploit Vuln...
Abstract page for arXiv paper 2604.04552: StableTTA: Training-Free Test-Time Adaptation that Improves Model Accuracy on ImageNet1K to 96%
Abstract page for arXiv paper 2604.04490: RAVEN: Radar Adaptive Vision Encoders for Efficient Chirp-wise Object Detection and Segmentation
Abstract page for arXiv paper 2604.04450: Conversational Control with Ontologies for Large Language Models: A Lightweight Framework for C...
Abstract page for arXiv paper 2604.04440: Training Transformers in Cosine Coefficient Space
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime