[2604.00698] Learning to Hint for Reinforcement Learning
Abstract page for arXiv paper 2604.00698: Learning to Hint for Reinforcement Learning
Abstract page for arXiv paper 2604.00733: Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and S...
Abstract page for arXiv paper 2604.00689: Performance of Neural and Polynomial Operator Surrogates
Abstract page for arXiv paper 2604.00726: Exploring Silent Data Corruption as a Reliability Challenge in LLM Training
Abstract page for arXiv paper 2604.00686: Full-Gradient Successor Feature Representations
Abstract page for arXiv paper 2604.00669: Embedded Variational Neural Stochastic Differential Equations for Learning Heterogeneous Dynamics
Abstract page for arXiv paper 2604.00653: Chameleons do not Forget: Prompt-Based Online Continual Learning for Next Activity Prediction
Abstract page for arXiv paper 2604.00626: A Survey of On-Policy Distillation for Large Language Models
Abstract page for arXiv paper 2604.00599: Predicting Dynamics of Ultra-Large Complex Systems by Inferring Governing Equations
Abstract page for arXiv paper 2604.00580: Representation choice shapes the interpretation of protein conformational dynamics
Abstract page for arXiv paper 2604.00556: HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation
Abstract page for arXiv paper 2604.00531: Learning Shared Representations for Multi-Task Linear Bandits
Abstract page for arXiv paper 2604.00533: Learning from Many and Adapting to the Unknown in Open-set Test Streams
Abstract page for arXiv paper 2604.00529: MF-QAT: Multi-Format Quantization-Aware Training for Elastic Inference
Abstract page for arXiv paper 2604.00523: Lipschitz Dueling Bandits over Continuous Action Spaces
Abstract page for arXiv paper 2604.00513: MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding
Abstract page for arXiv paper 2604.00508: A Decoupled Basis-Vector-Driven Generative Framework for Dynamic Multi-Objective Optimization
Abstract page for arXiv paper 2604.00505: Towards Initialization-dependent and Non-vacuous Generalization Bounds for Overparameterized Sh...
Abstract page for arXiv paper 2604.00499: Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions
Abstract page for arXiv paper 2604.00485: The Rashomon Effect for Visualizing High-Dimensional Data