davidmrau / mixture-of-experts
PyTorch re-implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
License: GNU General Public License v3.0