Coder Social home page Coder Social logo

paraphrasegen's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

paraphrasegen's Issues

Evaluation

Hi!
Would you give us more details on the way you prepare the outputs of the test to measure the metrics. I'm using MULTEVAL and I'm getting this results for Quora 50k (METEOR is slightly different and TER is different in ~13 units
n=5 BLEU (s_sel/s_opt/p) METEOR (s_sel/s_opt/p) TER (s_sel/s_opt/p)
baseline 17.6 (0.4/0.1/-) 19.6 (0.2/0.0/-) 74.4 (0.4/0.3/-)

About the perameters and data.

I am very interested in your work and want to follow your research. I read the code and have some questions. I would be very grateful if you can get your advice.
  1. The code needs two embeddings, one for the original sentences and the other for the generated sentences, why? If we use the same embedding, how will it affect the performance?
  2. In the line 156 in the rvae.py, you set loss = 79 * cross_entropy + kld_coef(i) * kld. And for kld_coef(i), the function defined two constants, 3500 and 1000. What factors are taken into account to set these parameters?
  3. Are the training document 'train.txt' and the test document 'test.txt' the same to the documents used in the paper (50K train data and 4K test data)? Can you give me 100K and 150K corpus?

Beam search issue

Hello,

Is there an issue with the beam_decode function in rvae.py? I think the output of decoder be converted to log_softmax instead of softmax out = F.log_softmax(self.decoder.fc(dec_out)).unsqueeze(0) in sample_decode function.

Thanks,

Jie

run error

Hello!
When I was training the model, the program reported an error at the 1560th iteration, causing the operation to fail. I want to kown the reason. Thanks
Selection_008

Some question about the embedding, loss function and data.

I am very interested in your work and want to follow your research. I read the code and have some questions. I would be very grateful if you can get your advice.
  1. The code needs two embeddings, one for the original sentences and the other for the generated sentences, why? If we use the same embedding, how will it affect the performance?
  2. In the line 156 in the rvae.py, you set loss = 79 * cross_entropy + kld_coef(i) * kld. And for kld_coef(i), the function defined two constants, 3500 and 1000. What factors are taken into account to set these parameters?
  3. Are the training document 'train.txt' and the test document 'test.txt' the same to the documents used in the paper (50K train data and 4K test data)? Can you give me 100K and 150K corpus?

TypeError when ran train.py

Hi, when i ran train.py it returns this error

[root @ ~]$ python2.7 train.py --num-iterations 140000
preprocessed data was found and loaded
preprocessed data was found and loaded
Traceback (most recent call last):
  File "train.py", line 102, in <module>
    cross_entropy, kld, coef = train_step(iteration, args.batch_size, args.use_cuda, args.dropout, start_index)
  File "./paraphraseGen/model/rvae.py", line 124, in train
    input = [Variable(t.from_numpy(var)) for var in input]
TypeError: can't convert np.ndarray of type numpy.object_. The only supported types are: double, float, float16, int64, int32, and uint8.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.