meganndare / transformer_generalization
This project is forked from robertcsordas/transformer_generalization.
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We significantly improve the systematic generalization of transformer models on a variety of datasets using simple tricks and careful considerations.
License: MIT License