Content Feed

The latest content from across the network

[2604.00698] Learning to Hint for Reinforcement Learning

[2604.00698] Learning to Hint for Reinforcement Learning

Abstract page for arXiv paper 2604.00698: Learning to Hint for Reinforcement Learning

arXiv - AI · 4 min ·
[2604.00733] Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction
Llms

[2604.00733] Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction

Abstract page for arXiv paper 2604.00733: Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and S...

arXiv - AI · 4 min ·
[2604.00689] Performance of Neural and Polynomial Operator Surrogates
Machine Learning

[2604.00689] Performance of Neural and Polynomial Operator Surrogates

Abstract page for arXiv paper 2604.00689: Performance of Neural and Polynomial Operator Surrogates

arXiv - Machine Learning · 4 min ·
[2604.00726] Exploring Silent Data Corruption as a Reliability Challenge in LLM Training
Llms

[2604.00726] Exploring Silent Data Corruption as a Reliability Challenge in LLM Training

Abstract page for arXiv paper 2604.00726: Exploring Silent Data Corruption as a Reliability Challenge in LLM Training

arXiv - Machine Learning · 3 min ·
[2604.00686] Full-Gradient Successor Feature Representations

[2604.00686] Full-Gradient Successor Feature Representations

Abstract page for arXiv paper 2604.00686: Full-Gradient Successor Feature Representations

arXiv - Machine Learning · 3 min ·
[2604.00669] Embedded Variational Neural Stochastic Differential Equations for Learning Heterogeneous Dynamics
Machine Learning

[2604.00669] Embedded Variational Neural Stochastic Differential Equations for Learning Heterogeneous Dynamics

Abstract page for arXiv paper 2604.00669: Embedded Variational Neural Stochastic Differential Equations for Learning Heterogeneous Dynamics

arXiv - Machine Learning · 4 min ·
[2604.00653] Chameleons do not Forget: Prompt-Based Online Continual Learning for Next Activity Prediction
Machine Learning

[2604.00653] Chameleons do not Forget: Prompt-Based Online Continual Learning for Next Activity Prediction

Abstract page for arXiv paper 2604.00653: Chameleons do not Forget: Prompt-Based Online Continual Learning for Next Activity Prediction

arXiv - Machine Learning · 4 min ·
[2604.00626] A Survey of On-Policy Distillation for Large Language Models
Llms

[2604.00626] A Survey of On-Policy Distillation for Large Language Models

Abstract page for arXiv paper 2604.00626: A Survey of On-Policy Distillation for Large Language Models

arXiv - Machine Learning · 4 min ·
[2604.00599] Predicting Dynamics of Ultra-Large Complex Systems by Inferring Governing Equations
Machine Learning

[2604.00599] Predicting Dynamics of Ultra-Large Complex Systems by Inferring Governing Equations

Abstract page for arXiv paper 2604.00599: Predicting Dynamics of Ultra-Large Complex Systems by Inferring Governing Equations

arXiv - Machine Learning · 4 min ·
[2604.00580] Representation choice shapes the interpretation of protein conformational dynamics

[2604.00580] Representation choice shapes the interpretation of protein conformational dynamics

Abstract page for arXiv paper 2604.00580: Representation choice shapes the interpretation of protein conformational dynamics

arXiv - Machine Learning · 3 min ·
[2604.00556] HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation
Llms

[2604.00556] HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation

Abstract page for arXiv paper 2604.00556: HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation

arXiv - AI · 4 min ·
[2604.00531] Learning Shared Representations for Multi-Task Linear Bandits

[2604.00531] Learning Shared Representations for Multi-Task Linear Bandits

Abstract page for arXiv paper 2604.00531: Learning Shared Representations for Multi-Task Linear Bandits

arXiv - Machine Learning · 3 min ·
[2604.00533] Learning from Many and Adapting to the Unknown in Open-set Test Streams
Llms

[2604.00533] Learning from Many and Adapting to the Unknown in Open-set Test Streams

Abstract page for arXiv paper 2604.00533: Learning from Many and Adapting to the Unknown in Open-set Test Streams

arXiv - Machine Learning · 4 min ·
[2604.00529] MF-QAT: Multi-Format Quantization-Aware Training for Elastic Inference
Machine Learning

[2604.00529] MF-QAT: Multi-Format Quantization-Aware Training for Elastic Inference

Abstract page for arXiv paper 2604.00529: MF-QAT: Multi-Format Quantization-Aware Training for Elastic Inference

arXiv - Machine Learning · 3 min ·
[2604.00523] Lipschitz Dueling Bandits over Continuous Action Spaces

[2604.00523] Lipschitz Dueling Bandits over Continuous Action Spaces

Abstract page for arXiv paper 2604.00523: Lipschitz Dueling Bandits over Continuous Action Spaces

arXiv - Machine Learning · 3 min ·
[2604.00513] MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding
Llms

[2604.00513] MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding

Abstract page for arXiv paper 2604.00513: MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding

arXiv - Machine Learning · 4 min ·
[2604.00508] A Decoupled Basis-Vector-Driven Generative Framework for Dynamic Multi-Objective Optimization

[2604.00508] A Decoupled Basis-Vector-Driven Generative Framework for Dynamic Multi-Objective Optimization

Abstract page for arXiv paper 2604.00508: A Decoupled Basis-Vector-Driven Generative Framework for Dynamic Multi-Objective Optimization

arXiv - Machine Learning · 3 min ·
[2604.00505] Towards Initialization-dependent and Non-vacuous Generalization Bounds for Overparameterized Shallow Neural Networks
Machine Learning

[2604.00505] Towards Initialization-dependent and Non-vacuous Generalization Bounds for Overparameterized Shallow Neural Networks

Abstract page for arXiv paper 2604.00505: Towards Initialization-dependent and Non-vacuous Generalization Bounds for Overparameterized Sh...

arXiv - AI · 4 min ·
[2604.00499] Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions
Llms

[2604.00499] Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions

Abstract page for arXiv paper 2604.00499: Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions

arXiv - Machine Learning · 4 min ·
[2604.00485] The Rashomon Effect for Visualizing High-Dimensional Data
Nlp

[2604.00485] The Rashomon Effect for Visualizing High-Dimensional Data

Abstract page for arXiv paper 2604.00485: The Rashomon Effect for Visualizing High-Dimensional Data

arXiv - Machine Learning · 3 min ·