paraphrasegen's People
Forkers
gissemari sohamparikh94 nininininini kamrankausar hanzhongyuan olaronning weifanjiang hvisvana llcruc mahmoudeid789 laraqianyang jackyuanjie1990 yxq0725paraphrasegen's Issues
Evaluation
Hi!
Would you give us more details on the way you prepare the outputs of the test to measure the metrics. I'm using MULTEVAL and I'm getting this results for Quora 50k (METEOR is slightly different and TER is different in ~13 units
n=5 BLEU (s_sel/s_opt/p) METEOR (s_sel/s_opt/p) TER (s_sel/s_opt/p)
baseline 17.6 (0.4/0.1/-) 19.6 (0.2/0.0/-) 74.4 (0.4/0.3/-)
About the perameters and data.
I am very interested in your work and want to follow your research. I read the code and have some questions. I would be very grateful if you can get your advice.
- The code needs two embeddings, one for the original sentences and the other for the generated sentences, why? If we use the same embedding, how will it affect the performance?
- In the line 156 in the rvae.py, you set loss = 79 * cross_entropy + kld_coef(i) * kld. And for kld_coef(i), the function defined two constants, 3500 and 1000. What factors are taken into account to set these parameters?
- Are the training document 'train.txt' and the test document 'test.txt' the same to the documents used in the paper (50K train data and 4K test data)? Can you give me 100K and 150K corpus?
Beam search issue
Hello,
Is there an issue with the beam_decode function in rvae.py? I think the output of decoder be converted to log_softmax instead of softmax out = F.log_softmax(self.decoder.fc(dec_out)).unsqueeze(0) in sample_decode function.
Thanks,
Jie
run error
Some question about the embedding, loss function and data.
I am very interested in your work and want to follow your research. I read the code and have some questions. I would be very grateful if you can get your advice.
- The code needs two embeddings, one for the original sentences and the other for the generated sentences, why? If we use the same embedding, how will it affect the performance?
- In the line 156 in the rvae.py, you set loss = 79 * cross_entropy + kld_coef(i) * kld. And for kld_coef(i), the function defined two constants, 3500 and 1000. What factors are taken into account to set these parameters?
- Are the training document 'train.txt' and the test document 'test.txt' the same to the documents used in the paper (50K train data and 4K test data)? Can you give me 100K and 150K corpus?
TypeError when ran train.py
Hi, when i ran train.py
it returns this error
[root @ ~]$ python2.7 train.py --num-iterations 140000
preprocessed data was found and loaded
preprocessed data was found and loaded
Traceback (most recent call last):
File "train.py", line 102, in <module>
cross_entropy, kld, coef = train_step(iteration, args.batch_size, args.use_cuda, args.dropout, start_index)
File "./paraphraseGen/model/rvae.py", line 124, in train
input = [Variable(t.from_numpy(var)) for var in input]
TypeError: can't convert np.ndarray of type numpy.object_. The only supported types are: double, float, float16, int64, int32, and uint8.
Please update to python3.5 atleast!
Pytorch does not support Python2.7 now.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.