ICML final decisions rant [D]
So, ICML accepted ~6.5K of ~24K; obviously, it doesn't mean that all the rejected papers are "bad," and these rejected papers would casca...
ML algorithms, training, and inference
So, ICML accepted ~6.5K of ~24K; obviously, it doesn't mean that all the rejected papers are "bad," and these rejected papers would casca...
We shipped iFixAi earlier this week. An open-source diagnostic for AI misalignment. 32 tests across fabrication, manipulation, deception,...
For the past several years I've been quietly assembling and processing what I believe is one of the larger privately held pretraining cor...
Abstract page for arXiv paper 2511.16383: An Agent-Based Framework for the Automatic Validation of Mathematical Optimization Models
Abstract page for arXiv paper 2601.05656: HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation
Abstract page for arXiv paper 2512.13168: Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows
Abstract page for arXiv paper 2511.14130: PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval
Abstract page for arXiv paper 2510.09901: Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
Abstract page for arXiv paper 2508.02900: Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game
Abstract page for arXiv paper 2502.13388: Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
Abstract page for arXiv paper 2411.06498: Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible
Abstract page for arXiv paper 2604.04924: Your Pre-trained Diffusion Model Secretly Knows Restoration
Abstract page for arXiv paper 2604.04906: How AI Aggregation Affects Knowledge
Abstract page for arXiv paper 2604.04917: Vero: An Open RL Recipe for General Visual Reasoning
Abstract page for arXiv paper 2604.04895: Agentic Federated Learning: The Future of Distributed Training Orchestration
Abstract page for arXiv paper 2604.04901: FileGram: Grounding Agent Personalization in File-System Behavioral Traces
Abstract page for arXiv paper 2604.04891: Muon Dynamics as a Spectral Wasserstein Flow
Abstract page for arXiv paper 2604.04852: Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Promp...
Abstract page for arXiv paper 2604.04825: Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not
Abstract page for arXiv paper 2604.04815: LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection
Abstract page for arXiv paper 2604.04743: Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations
Abstract page for arXiv paper 2604.04741: Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emer...
Abstract page for arXiv paper 2604.04733: Discovering Failure Modes in Vision-Language Models using RL
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime