cambridgeltl / ecnmt Goto Github PK
View Code? Open in Web Editor NEWEmergent Communication Pretraining for Few-Shot Machine Translation
Home Page: https://github.com/cambridgeltl/ECNMT
Emergent Communication Pretraining for Few-Shot Machine Translation
Home Page: https://github.com/cambridgeltl/ECNMT
Hi, I am re-running the ECPRETRAIN script and trying to reproduce the results.
However, I found the model tends to generate very short sentences as the training goes on:
This seems to be not aligned with the description in the paper.
Besides, it seems that we only save a model when it reaches 99% accuracy, however, none of the training epoch reaches that in my experiment.
I try to reproduce the environment, however, I found that Pytorch 1.3.1
does not seem to be available on Pytorch's official webpage anymore, so I use Pytorch 1.6
instead. I do not know whether this could be the reason. Any insight here would be helpful.
Hi,
I am trying to run the ECPRETRAINING script with default hparams. The model is able to reach 99% accuracy in 4k epochs but after that the prediction accuracy collapses. I am unable to find any bug in the code that might be causing this. I have attached the training logs for the same
Full log
log.log
It would great if you could help resolve this issue.
Thanks.
I see args.hard is default False, and when I tried with it it doesn't really work well?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.