Elon Musk testifies that xAI trained Grok on OpenAI models | TechCrunch
"Distillation" is a hot topic as frontier labs try to prevent smaller competitors from copying their models.
ML algorithms, training, and inference
"Distillation" is a hot topic as frontier labs try to prevent smaller competitors from copying their models.
Goodfire wants to make training AI models more like good old-fashioned software engineering.
Disclosure: first author. The paper was just published in TMLR, and I figured it might be of interest to some people here. It is fairly d...
Abstract page for arXiv paper 2601.05656: HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation
Abstract page for arXiv paper 2512.13168: Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows
Abstract page for arXiv paper 2511.14130: PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval
Abstract page for arXiv paper 2510.09901: Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
Abstract page for arXiv paper 2508.02900: Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game
Abstract page for arXiv paper 2502.13388: Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
Abstract page for arXiv paper 2411.06498: Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible
Abstract page for arXiv paper 2604.04924: Your Pre-trained Diffusion Model Secretly Knows Restoration
Abstract page for arXiv paper 2604.04906: How AI Aggregation Affects Knowledge
Abstract page for arXiv paper 2604.04917: Vero: An Open RL Recipe for General Visual Reasoning
Abstract page for arXiv paper 2604.04895: Agentic Federated Learning: The Future of Distributed Training Orchestration
Abstract page for arXiv paper 2604.04901: FileGram: Grounding Agent Personalization in File-System Behavioral Traces
Abstract page for arXiv paper 2604.04891: Muon Dynamics as a Spectral Wasserstein Flow
Abstract page for arXiv paper 2604.04852: Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Promp...
Abstract page for arXiv paper 2604.04825: Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not
Abstract page for arXiv paper 2604.04815: LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection
Abstract page for arXiv paper 2604.04743: Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations
Abstract page for arXiv paper 2604.04741: Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emer...
Abstract page for arXiv paper 2604.04733: Discovering Failure Modes in Vision-Language Models using RL
Abstract page for arXiv paper 2604.04732: Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime