MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning
概要
arXiv:2511.02805v2 Announce Type: replace-cross Abstract: LLM-based search agents often concatenate the full interaction history into the context, producing long and noisy inputs, and increasing compute cost and GPU memory overhead. To address this issue, we propose MemSearcher, an agent framework …