[2604.08685] RAMP: Hybrid DRL for Online Learning of Numeric Action

[2604.08685] RAMP: Hybrid DRL for Online Learning of Numeric Action Models

arXiv - AI April 13, 2026 3 min read

About this article

Abstract page for arXiv paper 2604.08685: RAMP: Hybrid DRL for Online Learning of Numeric Action Models

Computer Science > Artificial Intelligence arXiv:2604.08685 (cs) [Submitted on 9 Apr 2026] Title:RAMP: Hybrid DRL for Online Learning of Numeric Action Models Authors:Yarin Benyamin, Argaman Mordoch, Shahaf S. Shperberg, Roni Stern View a PDF of the paper titled RAMP: Hybrid DRL for Online Learning of Numeric Action Models, by Yarin Benyamin and 3 other authors View PDF HTML (experimental) Abstract:Automated planning algorithms require an action model specifying the preconditions and effects of each action, but obtaining such a model is often hard. Learning action models from observations is feasible, but existing algorithms for numeric domains are offline, requiring expert traces as input. We propose the Reinforcement learning, Action Model learning, and Planning (RAMP) strategy for learning numeric planning action models online via interactions with the environment. RAMP simultaneously trains a Deep Reinforcement Learning (DRL) policy, learns a numeric action model from past interactions, and uses that model to plan future actions when possible. These components form a positive feedback loop: the RL policy gathers data to refine the action model, while the planner generates plans to continue training the RL policy. To facilitate this integration of RL and numeric planning, we developed Numeric PDDLGym, an automated framework for converting numeric planning problems to Gym environments. Experimental results on standard IPC numeric domains show that RAMP significantly outp...

Originally published on April 13, 2026. Curated by AI News.

Llms

Transformer Math Explorer [P]

This is an interactive math reference for transformer models, presented via dataflow graphs, all the way down to elementary math. Covers ...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

how much of your time goes into environment setup vs actual model work?

For most people I've talked to, it's embarrassingly high. New machine? Set up CUDA again. New team member? Good luck for reproducing the ...

Reddit - ML Jobs · 1 min · about 2 hours ago

Machine Learning

How much can a video generated by the same diffusion model differ across GPU architectures if the initial noise latent is fixed? [D]

Hi! I am trying to sanity-check an assumption for diffusion video generation reproducibility. Suppose I run the same video diffusion mode...

Reddit - Machine Learning · 1 min · about 2 hours ago

Llms

I am not an "anti" like this guy, but still an interesting video of person interacting with chat 4o

(Posting Here because removed by Chatgpt Complaints moderators because the model here is 4o, and refuse to believe there were any safety ...

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

[2604.08685] RAMP: Hybrid DRL for Online Learning of Numeric Action Models

About this article

Related Articles

Transformer Math Explorer [P]

how much of your time goes into environment setup vs actual model work?

How much can a video generated by the same diffusion model differ across GPU architectures if the initial noise latent is fixed? [D]

I am not an "anti" like this guy, but still an interesting video of person interacting with chat 4o

No comments

Stay updated with AI News