Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

How do you test AI agents in production? The unpredictability is overwhelming.[D]

I’ve been in QA for almost a decade. My mental model for quality was always: given input X, assert output Y. Now I’m on a team that’s shi...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

INT8 quantization gives me better accuracy than FP16 ! [D]

Hi everyone, I’m working on a deep learning model and I noticed something strange. When I compare different precisions: FP32 (baseline) F...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

The Download: DeepSeek’s latest AI breakthrough, and the race to build world models | MIT Technology Review

China has blocked Meta’s $2 billion acquisition of AI startup Manus.

MIT Technology Review · 6 min · about 3 hours ago

All Content

Llms

[2510.13851] EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing

Abstract page for arXiv paper 2510.13851: EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing

arXiv - Machine Learning · 4 min · 20 days ago

Machine Learning

[2510.03843] Smart Paste: Automatically Fixing Copy/Paste for Google Developers

Abstract page for arXiv paper 2510.03843: Smart Paste: Automatically Fixing Copy/Paste for Google Developers

arXiv - Machine Learning · 3 min · 20 days ago

Machine Learning

[2510.03152] Markovian Reeb Graphs for Simulating Spatiotemporal Patterns of Life

Abstract page for arXiv paper 2510.03152: Markovian Reeb Graphs for Simulating Spatiotemporal Patterns of Life

arXiv - Machine Learning · 3 min · 20 days ago

Machine Learning

[2509.11481] RAPTOR: A Foundation Policy for Quadrotor Control

Abstract page for arXiv paper 2509.11481: RAPTOR: A Foundation Policy for Quadrotor Control

arXiv - AI · 4 min · 20 days ago

Machine Learning

[2509.00472] Partially Functional Dynamic Backdoor Diffusion-based Causal Model

Abstract page for arXiv paper 2509.00472: Partially Functional Dynamic Backdoor Diffusion-based Causal Model

arXiv - Machine Learning · 4 min · 20 days ago

Llms

[2508.16703] ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference

Abstract page for arXiv paper 2508.16703: ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference

arXiv - AI · 3 min · 20 days ago

Llms

[2508.13998] Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation

Abstract page for arXiv paper 2508.13998: Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation

arXiv - AI · 4 min · 20 days ago

Machine Learning

[2508.12301] WhisperRT -- Turning Whisper into a Causal Streaming Model

Abstract page for arXiv paper 2508.12301: WhisperRT -- Turning Whisper into a Causal Streaming Model

arXiv - Machine Learning · 4 min · 20 days ago

Machine Learning

[2508.10208] CATNet: A geometric deep learning approach for CAT bond spread prediction in the primary market

Abstract page for arXiv paper 2508.10208: CATNet: A geometric deep learning approach for CAT bond spread prediction in the primary market

arXiv - AI · 4 min · 20 days ago

Machine Learning

[2506.19591] Vision Transformer-Based Time-Series Image Reconstruction for Cloud-Filling Applications

Abstract page for arXiv paper 2506.19591: Vision Transformer-Based Time-Series Image Reconstruction for Cloud-Filling Applications

arXiv - AI · 3 min · 20 days ago

Llms

[2506.17585] Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

Abstract page for arXiv paper 2506.17585: Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

arXiv - AI · 4 min · 20 days ago

Llms

[2506.15461] All is Not Lost: LLM Recovery without Checkpoints

Abstract page for arXiv paper 2506.15461: All is Not Lost: LLM Recovery without Checkpoints

arXiv - Machine Learning · 4 min · 20 days ago

Machine Learning

[2506.07816] Accelerating Constrained Sampling: A Large Deviations Approach

Abstract page for arXiv paper 2506.07816: Accelerating Constrained Sampling: A Large Deviations Approach

arXiv - Machine Learning · 4 min · 20 days ago

Machine Learning

[2506.01882] Learning thermodynamic master equations for open quantum systems

Abstract page for arXiv paper 2506.01882: Learning thermodynamic master equations for open quantum systems

arXiv - Machine Learning · 3 min · 20 days ago

Llms

[2506.00077] Gaussian mixture models as a proxy for interacting language models

Abstract page for arXiv paper 2506.00077: Gaussian mixture models as a proxy for interacting language models

arXiv - Machine Learning · 4 min · 20 days ago

Machine Learning

[2505.18288] Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization

Abstract page for arXiv paper 2505.18288: Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization

arXiv - Machine Learning · 3 min · 20 days ago

Machine Learning

[2505.17087] Informatics for Food Processing

Abstract page for arXiv paper 2505.17087: Informatics for Food Processing

arXiv - AI · 3 min · 20 days ago

Llms

[2505.08548] From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation

Abstract page for arXiv paper 2505.08548: From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation

arXiv - AI · 4 min · 20 days ago

Machine Learning

[2505.05375] Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks

Abstract page for arXiv paper 2505.05375: Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks

arXiv - AI · 4 min · 20 days ago

Machine Learning

[2503.08751] Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Abstract page for arXiv paper 2503.08751: Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for ...

arXiv - Machine Learning · 4 min · 20 days ago

Previous Page 247 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

How do you test AI agents in production? The unpredictability is overwhelming.[D]

INT8 quantization gives me better accuracy than FP16 ! [D]

The Download: DeepSeek’s latest AI breakthrough, and the race to build world models | MIT Technology Review

All Content

[2510.13851] EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing

[2510.03843] Smart Paste: Automatically Fixing Copy/Paste for Google Developers

[2510.03152] Markovian Reeb Graphs for Simulating Spatiotemporal Patterns of Life

[2509.11481] RAPTOR: A Foundation Policy for Quadrotor Control

[2509.00472] Partially Functional Dynamic Backdoor Diffusion-based Causal Model

[2508.16703] ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference

[2508.13998] Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation

[2508.12301] WhisperRT -- Turning Whisper into a Causal Streaming Model

[2508.10208] CATNet: A geometric deep learning approach for CAT bond spread prediction in the primary market

[2506.19591] Vision Transformer-Based Time-Series Image Reconstruction for Cloud-Filling Applications

[2506.17585] Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

[2506.15461] All is Not Lost: LLM Recovery without Checkpoints

[2506.07816] Accelerating Constrained Sampling: A Large Deviations Approach

[2506.01882] Learning thermodynamic master equations for open quantum systems

[2506.00077] Gaussian mixture models as a proxy for interacting language models

[2505.18288] Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization

[2505.17087] Informatics for Food Processing

[2505.08548] From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation

[2505.05375] Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks

[2503.08751] Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Related Topics

Stay updated with AI News