Mixture of Masters: Sparse Chess Language Models with Player Routing
Summary
arXiv:2602.04447v2 (announce type: replace-cross)
Abstract: Modern chess language models are dense transformers trained on millions of games played by thousands of high-rated individuals. However, these monolithic networks tend to collapse into mode-averaged behavior, where stylistic boundaries are b…