arXiv cs.AI by Synapse Flow Editorial Team

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Abstract

arXiv:2605.02910v2 (announce type: new)

Recent advances in large language models have led to strong performance on reasoning and environment-interaction tasks, yet their capacity for creative problem-solving remains underexplored. We study this capability through the lens of creative tool us…
