DirMoE: Dirichlet-routed Mixture of Experts

Published in International Conference on Learning Representations (ICLR), 2026

Recommended citation: Amirhossein Vahidi, Hesam Asadollahzadeh, Navid Akhavan Attar, Marie Moullet, Kevin Ly, Xingyi Yang, Mohammad Lotfollahi

[paper] [arxiv]