[P] ClaudeFormer: Building a Transformer Out of Claudes — Collaboration Request
I'm looking to work with people interested in math, machine learning, or agentic coding, on creating a multi-agent framework to do fronti...
ML algorithms, training, and inference
UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...
Hello! Recently I did a project where I initially had around 30 target classes. But at inference, the model had to be able to handle a lo...
Abstract page for arXiv paper 2410.15281: LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments...
Abstract page for arXiv paper 2410.10700: LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts
Abstract page for arXiv paper 2408.13366: CodeRefine: A Pipeline for Enhancing LLM-Generated Code Implementations of Research Papers
Abstract page for arXiv paper 2404.05290: MindSet: Vision. A toolbox for testing DNNs on key psychological experiments
Abstract page for arXiv paper 2401.11605: Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers
Abstract page for arXiv paper 2402.12760: A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis
Abstract page for arXiv paper 2603.19091: Position: Spectral GNNs Are Neither Spectral Nor Superior for Node Classification
Abstract page for arXiv paper 2603.24402: AI-Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model
Abstract page for arXiv paper 2603.16951: Minimum-Action Learning: Energy-Constrained Symbolic Model Selection for Physical Law Identific...
Abstract page for arXiv paper 2603.23610: Environment Maps: Structured Environmental Representations for Long-Horizon Agents
Abstract page for arXiv paper 2601.21747: Temporal Sepsis Modeling: a Fully Interpretable Relational Way
Abstract page for arXiv paper 2603.08561: RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback
Abstract page for arXiv paper 2601.18420: Gradient Regularized Natural Gradients
Abstract page for arXiv paper 2511.07436: Analysing Environmental Efficiency in AI for X-Ray Diagnosis
Abstract page for arXiv paper 2601.02856: Electricity Price Forecasting: Bridging Linear Models, Neural Networks and Online Learning
Abstract page for arXiv paper 2601.00428: Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classific...
Abstract page for arXiv paper 2510.18087: Planned Diffusion
Abstract page for arXiv paper 2509.23768: From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning
Abstract page for arXiv paper 2512.18951: Benchmarking Attribute Discrimination in Infant-Scale Vision-Language Models
Abstract page for arXiv paper 2509.03345: Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive ...