[2502.13388] Reflection of Episodes: Learning to Play Game from Expert

[2502.13388] Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

arXiv - AI April 07, 2026 3 min read

About this article

Abstract page for arXiv paper 2502.13388: Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

Computer Science > Artificial Intelligence arXiv:2502.13388 (cs) [Submitted on 19 Feb 2025 (v1), last revised 5 Apr 2026 (this version, v3)] Title:Reflection of Episodes: Learning to Play Game from Expert and Self Experiences Authors:Xiaojie Xu, Zongyuan Li, Chang Lu, Runnan Qi, Yanan Ni, Lumin Jiang, Xiangbei Liu, Xuebo Zhang, Yongchun Fang, Kuihua Huang, Xian Guo, Zhanghua Wu, Zhenya Li View a PDF of the paper titled Reflection of Episodes: Learning to Play Game from Expert and Self Experiences, by Xiaojie Xu and 12 other authors View PDF HTML (experimental) Abstract:StarCraft II is a complex and dynamic real-time strategy (RTS) game environment, which is very suitable for artificial intelligence and reinforcement learning research. To address the problem of Large Language Model(LLM) learning in complex environments through self-reflection, we propose a Reflection of Episodes(ROE) framework based on expert experience and self-experience. This framework first obtains key information in the game through a keyframe selection method, then makes decisions based on expert experience and self-experience. After a game is completed, it reflects on the previous experience to obtain new self-experience. Finally, in the experiment, our method beat the robot under the Very Hard difficulty in TextStarCraft II. We analyze the data of the LLM in the process of the game in detail, verified its effectiveness. Subjects: Artificial Intelligence (cs.AI) Cite as: arXiv:2502.13388 [cs.AI] (o...

Originally published on April 07, 2026. Curated by AI News.

Llms

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO | The Verge

Data from Sensor Tower shows ChatGPT’s growth is slowing down, as Claude and other competitors’ growth is increasing, just as OpenAI is p...

The Verge - AI · 4 min · about 1 hour ago

Llms

Larry Ellison’s betting everything on OpenAI. Will it pay off or pop the bubble? | The Verge

Larry Ellison and Oracle have staked their future on a data center deal with OpenAI and a big bet that enterprise AI will pay off.

The Verge - AI · 32 min · about 1 hour ago

Llms

Google just released Deep Research Max — an autonomous research agent that writes expert-grade reports on its own

Google quietly dropped something interesting last week. They updated their Deep Research agent (available via Gemini API) and introduced ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

When Robots Have Their ChatGPT Moment, Remember These Pincers | WIRED

From sorting chicken nuggets to screwing in light bulbs, Eka’s robots are eerily lifelike. But do they have real physical smarts?

Wired - AI · 13 min · about 4 hours ago

[2502.13388] Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

About this article

Related Articles

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO | The Verge

Larry Ellison’s betting everything on OpenAI. Will it pay off or pop the bubble? | The Verge

Google just released Deep Research Max — an autonomous research agent that writes expert-grade reports on its own

When Robots Have Their ChatGPT Moment, Remember These Pincers | WIRED

No comments

Stay updated with AI News