Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

How do you test AI agents in production? The unpredictability is overwhelming.[D]

I’ve been in QA for almost a decade. My mental model for quality was always: given input X, assert output Y. Now I’m on a team that’s shi...

Reddit - Machine Learning · 1 min ·
Machine Learning

INT8 quantization gives me better accuracy than FP16 ! [D]

Hi everyone, I’m working on a deep learning model and I noticed something strange. When I compare different precisions: FP32 (baseline) F...

Reddit - Machine Learning · 1 min ·
The Download: DeepSeek’s latest AI breakthrough, and the race to build world models | MIT Technology Review
Machine Learning

The Download: DeepSeek’s latest AI breakthrough, and the race to build world models | MIT Technology Review

China has blocked Meta’s $2 billion acquisition of AI startup Manus.

MIT Technology Review · 6 min ·

All Content

[2510.13851] EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing
Llms

[2510.13851] EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing

Abstract page for arXiv paper 2510.13851: EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing

arXiv - Machine Learning · 4 min ·
[2510.03843] Smart Paste: Automatically Fixing Copy/Paste for Google Developers
Machine Learning

[2510.03843] Smart Paste: Automatically Fixing Copy/Paste for Google Developers

Abstract page for arXiv paper 2510.03843: Smart Paste: Automatically Fixing Copy/Paste for Google Developers

arXiv - Machine Learning · 3 min ·
[2510.03152] Markovian Reeb Graphs for Simulating Spatiotemporal Patterns of Life
Machine Learning

[2510.03152] Markovian Reeb Graphs for Simulating Spatiotemporal Patterns of Life

Abstract page for arXiv paper 2510.03152: Markovian Reeb Graphs for Simulating Spatiotemporal Patterns of Life

arXiv - Machine Learning · 3 min ·
[2509.11481] RAPTOR: A Foundation Policy for Quadrotor Control
Machine Learning

[2509.11481] RAPTOR: A Foundation Policy for Quadrotor Control

Abstract page for arXiv paper 2509.11481: RAPTOR: A Foundation Policy for Quadrotor Control

arXiv - AI · 4 min ·
[2509.00472] Partially Functional Dynamic Backdoor Diffusion-based Causal Model
Machine Learning

[2509.00472] Partially Functional Dynamic Backdoor Diffusion-based Causal Model

Abstract page for arXiv paper 2509.00472: Partially Functional Dynamic Backdoor Diffusion-based Causal Model

arXiv - Machine Learning · 4 min ·
[2508.16703] ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference
Llms

[2508.16703] ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference

Abstract page for arXiv paper 2508.16703: ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference

arXiv - AI · 3 min ·
[2508.13998] Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation
Llms

[2508.13998] Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation

Abstract page for arXiv paper 2508.13998: Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation

arXiv - AI · 4 min ·
[2508.12301] WhisperRT -- Turning Whisper into a Causal Streaming Model
Machine Learning

[2508.12301] WhisperRT -- Turning Whisper into a Causal Streaming Model

Abstract page for arXiv paper 2508.12301: WhisperRT -- Turning Whisper into a Causal Streaming Model

arXiv - Machine Learning · 4 min ·
[2508.10208] CATNet: A geometric deep learning approach for CAT bond spread prediction in the primary market
Machine Learning

[2508.10208] CATNet: A geometric deep learning approach for CAT bond spread prediction in the primary market

Abstract page for arXiv paper 2508.10208: CATNet: A geometric deep learning approach for CAT bond spread prediction in the primary market

arXiv - AI · 4 min ·
[2506.19591] Vision Transformer-Based Time-Series Image Reconstruction for Cloud-Filling Applications
Machine Learning

[2506.19591] Vision Transformer-Based Time-Series Image Reconstruction for Cloud-Filling Applications

Abstract page for arXiv paper 2506.19591: Vision Transformer-Based Time-Series Image Reconstruction for Cloud-Filling Applications

arXiv - AI · 3 min ·
[2506.17585] Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models
Llms

[2506.17585] Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

Abstract page for arXiv paper 2506.17585: Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

arXiv - AI · 4 min ·
[2506.15461] All is Not Lost: LLM Recovery without Checkpoints
Llms

[2506.15461] All is Not Lost: LLM Recovery without Checkpoints

Abstract page for arXiv paper 2506.15461: All is Not Lost: LLM Recovery without Checkpoints

arXiv - Machine Learning · 4 min ·
[2506.07816] Accelerating Constrained Sampling: A Large Deviations Approach
Machine Learning

[2506.07816] Accelerating Constrained Sampling: A Large Deviations Approach

Abstract page for arXiv paper 2506.07816: Accelerating Constrained Sampling: A Large Deviations Approach

arXiv - Machine Learning · 4 min ·
[2506.01882] Learning thermodynamic master equations for open quantum systems
Machine Learning

[2506.01882] Learning thermodynamic master equations for open quantum systems

Abstract page for arXiv paper 2506.01882: Learning thermodynamic master equations for open quantum systems

arXiv - Machine Learning · 3 min ·
[2506.00077] Gaussian mixture models as a proxy for interacting language models
Llms

[2506.00077] Gaussian mixture models as a proxy for interacting language models

Abstract page for arXiv paper 2506.00077: Gaussian mixture models as a proxy for interacting language models

arXiv - Machine Learning · 4 min ·
[2505.18288] Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization
Machine Learning

[2505.18288] Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization

Abstract page for arXiv paper 2505.18288: Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization

arXiv - Machine Learning · 3 min ·
[2505.17087] Informatics for Food Processing
Machine Learning

[2505.17087] Informatics for Food Processing

Abstract page for arXiv paper 2505.17087: Informatics for Food Processing

arXiv - AI · 3 min ·
[2505.08548] From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
Llms

[2505.08548] From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation

Abstract page for arXiv paper 2505.08548: From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation

arXiv - AI · 4 min ·
[2505.05375] Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks
Machine Learning

[2505.05375] Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks

Abstract page for arXiv paper 2505.05375: Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks

arXiv - AI · 4 min ·
[2503.08751] Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning
Machine Learning

[2503.08751] Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Abstract page for arXiv paper 2503.08751: Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for ...

arXiv - Machine Learning · 4 min ·
Previous Page 247 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime