Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

What to expect from AlphaZero's value predictions [D]

An AlphaZero agent has learnt to predict the value of a game state by training on data generated by self-play by the model and a series o...

Reddit - Machine Learning · 1 min · about 3 hours ago

Machine Learning

Open Source Projects related to CNNs to Contribute To? [D]

Around a decade a go I was tinkering a lot with CNNs for real time event detection. I enjoyed that a lot and always wanted to get back in...

Reddit - Machine Learning · 1 min · about 3 hours ago

Machine Learning

I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI | WIRED

For screenwriters like me—and job seekers all over—AI gig work is the new waiting tables. In eight months, I’ve done 20 of these soul-cru...

Wired - AI · 27 min · about 4 hours ago

All Content

Llms

[2506.00886] Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary

Abstract page for arXiv paper 2506.00886: Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary

arXiv - AI · 4 min · about 9 hours ago

Llms

[2510.01569] InvThink: Premortem Reasoning for Safer Language Models

Abstract page for arXiv paper 2510.01569: InvThink: Premortem Reasoning for Safer Language Models

arXiv - AI · 3 min · about 9 hours ago

Machine Learning

[2605.08073] EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction

Abstract page for arXiv paper 2605.08073: EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction

arXiv - AI · 3 min · about 9 hours ago

Llms

[2605.08063] Flow-OPD: On-Policy Distillation for Flow Matching Models

Abstract page for arXiv paper 2605.08063: Flow-OPD: On-Policy Distillation for Flow Matching Models

arXiv - AI · 4 min · about 9 hours ago

Llms

[2605.08060] The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

Abstract page for arXiv paper 2605.08060: The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

arXiv - AI · 4 min · about 9 hours ago

Llms

[2605.08057] CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation

Abstract page for arXiv paper 2605.08057: CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute B...

arXiv - AI · 3 min · about 9 hours ago

Machine Learning

[2605.08043] SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation

Abstract page for arXiv paper 2605.08043: SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation

arXiv - AI · 4 min · about 9 hours ago

Llms

[2605.07985] Dooly: Configuration-Agnostic, Redundancy-Aware Profiling for LLM Inference Simulation

Abstract page for arXiv paper 2605.07985: Dooly: Configuration-Agnostic, Redundancy-Aware Profiling for LLM Inference Simulation

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2605.07955] TimeLesSeg: Unified Contrast-Agnostic Cross-Sectional and Longitudinal MS Lesion Segmentation via a Stochastic Generative Model

Abstract page for arXiv paper 2605.07955: TimeLesSeg: Unified Contrast-Agnostic Cross-Sectional and Longitudinal MS Lesion Segmentation v...

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2605.07931] One Token Per Frame: Reconsidering Visual Bandwidth in World Models for VLA Policy

Abstract page for arXiv paper 2605.07931: One Token Per Frame: Reconsidering Visual Bandwidth in World Models for VLA Policy

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2605.07903] BeeVe: Unsupervised Acoustic State Discovery in Honey Bee Buzzing

Abstract page for arXiv paper 2605.07903: BeeVe: Unsupervised Acoustic State Discovery in Honey Bee Buzzing

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2605.07897] Semantic-Aware Adaptive Visual Memory for Streaming Video Understanding

Abstract page for arXiv paper 2605.07897: Semantic-Aware Adaptive Visual Memory for Streaming Video Understanding

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2605.07821] Divide and Conquer: Object Co-occurrence Helps Mitigate Simplicity Bias in OOD Detection

Abstract page for arXiv paper 2605.07821: Divide and Conquer: Object Co-occurrence Helps Mitigate Simplicity Bias in OOD Detection

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2605.07870] Spectral Dynamics in Deep Networks: Feature Learning, Outlier Escape, and Learning Rate Transfer

Abstract page for arXiv paper 2605.07870: Spectral Dynamics in Deep Networks: Feature Learning, Outlier Escape, and Learning Rate Transfer

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2605.07872] Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models

Abstract page for arXiv paper 2605.07872: Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models

arXiv - AI · 3 min · about 9 hours ago

Llms

[2605.07830] CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios

Abstract page for arXiv paper 2605.07830: CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios

arXiv - AI · 3 min · about 9 hours ago

Llms

[2605.07817] GazeVLM: Active Vision via Internal Attention Control for Multimodal Reasoning

Abstract page for arXiv paper 2605.07817: GazeVLM: Active Vision via Internal Attention Control for Multimodal Reasoning

arXiv - AI · 4 min · about 9 hours ago

Machine Learning

[2605.07786] APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment

Abstract page for arXiv paper 2605.07786: APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment

arXiv - AI · 3 min · about 9 hours ago

Llms

[2605.07731] Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs

Abstract page for arXiv paper 2605.07731: Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs

arXiv - AI · 4 min · about 9 hours ago

Llms

[2605.07723] LLM hallucinations in the wild: Large-scale evidence from non-existent citations

Abstract page for arXiv paper 2605.07723: LLM hallucinations in the wild: Large-scale evidence from non-existent citations

arXiv - AI · 4 min · about 9 hours ago

Previous Page 3 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

What to expect from AlphaZero's value predictions [D]

Open Source Projects related to CNNs to Contribute To? [D]

I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI | WIRED

All Content

[2506.00886] Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary

[2510.01569] InvThink: Premortem Reasoning for Safer Language Models

[2605.08073] EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction

[2605.08063] Flow-OPD: On-Policy Distillation for Flow Matching Models

[2605.08060] The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

[2605.08057] CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation

[2605.08043] SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation

[2605.07985] Dooly: Configuration-Agnostic, Redundancy-Aware Profiling for LLM Inference Simulation

[2605.07955] TimeLesSeg: Unified Contrast-Agnostic Cross-Sectional and Longitudinal MS Lesion Segmentation via a Stochastic Generative Model

[2605.07931] One Token Per Frame: Reconsidering Visual Bandwidth in World Models for VLA Policy

[2605.07903] BeeVe: Unsupervised Acoustic State Discovery in Honey Bee Buzzing

[2605.07897] Semantic-Aware Adaptive Visual Memory for Streaming Video Understanding

[2605.07821] Divide and Conquer: Object Co-occurrence Helps Mitigate Simplicity Bias in OOD Detection

[2605.07870] Spectral Dynamics in Deep Networks: Feature Learning, Outlier Escape, and Learning Rate Transfer

[2605.07872] Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models

[2605.07830] CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios

[2605.07817] GazeVLM: Active Vision via Internal Attention Control for Multimodal Reasoning

[2605.07786] APEX: Assumption-free Projection-based Embedding eXamination Metric for Image Quality Assessment

[2605.07731] Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs

[2605.07723] LLM hallucinations in the wild: Large-scale evidence from non-existent citations

Related Topics

Stay updated with AI News