arXiv cs.AI by Synapse Flow 編集部

Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies

概要

arXiv:2605.03596v1 Announce Type: new Abstract: Workspace learning requires AI agents to identify, reason over, exploit, and update explicit and implicit dependencies among heterogeneous files in a worker's workspace, enabling them to complete both routine and advanced tasks effectively. Despite it…

元記事を読む →

関連記事