VibeServe: Can AI Agents Build Bespoke LLM Serving Systems?
Summary
arXiv:2605.06068v1
Abstract: For years, we have built LLM serving systems like any other critical infrastructure: a single general-purpose stack, hand-tuned over many engineer-years, meant to support every model and workload. In this paper, we take the opposite bet: a multi-agent…