gallaghercommajack / mixture-of-experts
This project is forked from lucidrains/mixture-of-experts
A PyTorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
License: MIT License