BioMedArena: An Open-source Toolkit for Building and Evaluating Biomedical Deep Research Agents
概要
arXiv:2605.06177v1 Announce Type: new Abstract: Building a deep research agent today is an exercise in glue code: the same backbone evaluated on the same benchmark can report different accuracies in different papers because harness and tool registry all differ, and integrating a new foundation mode…