A Self-Evolving LLM Agent for Strategic Simulation Games 


Vol. 15,  No. 3, pp. 247-256, Mar.  2026
https://doi.org/10.3745/TKIPS.2026.15.3.247


PDF
  Abstract

Large Language Models (LLMs) have recently evolved into intelligent agents capable of interacting with external environments, with particular attention to self-evolving paradigms that improve performance without external supervision. This study proposes a self-evolving LLM agent for StarCraft II, a real-time strategy (RTS) game environment. The proposed agent employs a hierarchical architecture combining Qwen 3 1.7B as a small-scale model and GPT-3.5 Turbo as a large-scale model, where the small model learns domain-specific strategies through continual learning and the large model provides strategic judgment. The core mechanism is a replay buffer-based continual learning system that dynamically reorganizes training data centered on winning experiences. The agent accumulates successful gameplay experiences and progressively updates its training data to enhance strategic performance without external supervision. Experimental results demonstrate that the proposed method improves the win rate from 33% to 56% at difficulty level 6 over five rounds of learning, achieving 7 times higher performance than the existing baseline (8%). Notably, the agent exhibited the ability to autonomously discover and reinforce counter-strategies against the opponent AI's strategic patterns. This work presents a self-evolving framework that operates solely on win-loss signals without explicit reward functions, demonstrating the potential for autonomous learning of LLM agents in complex decision-making environments.

  Statistics


  Cite this article

[IEEE Style]

S. Kim, D. Ahn, J. Choi, "A Self-Evolving LLM Agent for Strategic Simulation Games," The Transactions of the Korea Information Processing Society, vol. 15, no. 3, pp. 247-256, 2026. DOI: https://doi.org/10.3745/TKIPS.2026.15.3.247.

[ACM Style]

San Kim, Daechul Ahn, and Jonghyun Choi. 2026. A Self-Evolving LLM Agent for Strategic Simulation Games. The Transactions of the Korea Information Processing Society, 15, 3, (2026), 247-256. DOI: https://doi.org/10.3745/TKIPS.2026.15.3.247.