natashamjaques / neural_chat Goto Github PK

Code to support training, evaluating and interacting neural network dialog models, and training them with reinforcement learning. Code to deploy a web server which hosts the models live online is available at: https://github.com/asmadotgh/neural_chat_web

License: MIT License

Python 62.22% Shell 0.14% sed 0.02% Jupyter Notebook 28.01% Makefile 0.01% CSS 0.43% JavaScript 4.70% HTML 4.47%

neural_chat's People

Contributors

Stargazers

Watchers

neural_chat's Issues

About the interaction data

May I know whether the raw batch of interaction data is publicly available?

Can't archive result similar to the paper when fine tune with Reinforcement Learning

I run VHRL model more times, and I can't get the result similar to the paper. Generated reponses is so short and repetitive although I set hyperparameter as in the paper. Thanks you!!

CUDNN_STATUS_EXECUTION_FAILED while training VHRED

Commend I use is python model/train.py --data=reddit_casual --model=VHRED --batch_size=2
Then it report CUDNN_STATUS_EXECUTION_FAILED

Training Start!
  0%|                                                 | 0/43573 [00:00<?, ?it/s]
Traceback (most recent call last):
  File "model/train.py", line 91, in <module>
    solver.train()
  File "/projects/da33/tao/test-project/neural_chat/model/utils/time_track.py", line 18, in timed
    result = method(*args, **kwargs)
  File "/projects/da33/tao/test-project/neural_chat/model/solver.py", line 905, in train
    mode='train', kl_mult=kl_mult)
  File "/projects/da33/tao/test-project/neural_chat/model/solver.py", line 1079, in _process_batch
    extra_context_inputs=extra_context_inputs)
  File "/projects/da33/tao/test-project/neural_chat/env/lib64/python3.6/site-packages/torch/nn/modules/module.py", line 491, in __call__
    result = self.forward(*input, **kwargs)
  File "/projects/da33/tao/test-project/neural_chat/model/models.py", line 361, in forward
    sentence_length)
  File "/projects/da33/tao/test-project/neural_chat/env/lib64/python3.6/site-packages/torch/nn/modules/module.py", line 491, in __call__
    result = self.forward(*input, **kwargs)
  File "/projects/da33/tao/test-project/neural_chat/model/layers/encoder.py", line 127, in forward
    outputs, hidden = self.rnn(rnn_input, hidden)
  File "/projects/da33/tao/test-project/neural_chat/env/lib64/python3.6/site-packages/torch/nn/modules/module.py", line 491, in __call__
    result = self.forward(*input, **kwargs)
  File "/projects/da33/tao/test-project/neural_chat/env/lib64/python3.6/site-packages/torch/nn/modules/rnn.py", line 192, in forward
    output, hidden = func(input, self.all_weights, hx, batch_sizes)
  File "/projects/da33/tao/test-project/neural_chat/env/lib64/python3.6/site-packages/torch/nn/_functions/rnn.py", line 323, in forward
    return func(input, *fargs, **fkwargs)
  File "/projects/da33/tao/test-project/neural_chat/env/lib64/python3.6/site-packages/torch/nn/_functions/rnn.py", line 275, in forward
    train, dropout_seed, dropout_state)
  File "/projects/da33/tao/test-project/neural_chat/env/lib64/python3.6/site-packages/torch/backends/cudnn/rnn.py", line 44, in init_dropout_state
    if dropout_p != 0 else None
RuntimeError: CUDNN_STATUS_EXECUTION_FAILED

My environment:
gpu: NVIDIA Tesla V100
python 3.6.8
cuda 10.1
torch 0.4.0

This problem spends me four days. Do you have any idea how to solve it?

A very encouraging work!

This "Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems" is a very encouraging work in the field of open-domain dialog system. I'm very happy to try your metric. I have a few questions here:

What's the weight of each individual metric in computing the final metric?
How did you collect the Reddit data mentioned in the paper? I'm aware of pushshift API and pushshift directory contents. However, I don't know exactly how to extract Reddit comments and process them into conversations for a specific subreddit, e.g., /r/CasualConversation.

Really appreciate your hard work :)

Thanks,
Peixiang

cornell folder should not be created before download

From the README:

Create directories ./datasets/reddit_casual/ and ./datasets/cornell/ respectively.

But when I do that and run the command:

python dataset_preprocess.py --dataset=cornell --shortcut

I receive the error:

Directory already exists. Aborting download.

When I delete the folder, there is no issue anymore.

natashamjaques / neural_chat Goto Github PK

neural_chat's People

Contributors

Stargazers

Watchers

Forkers

neural_chat's Issues

About the interaction data

Can't archive result similar to the paper when fine tune with Reinforcement Learning

CUDNN_STATUS_EXECUTION_FAILED while training VHRED

A very encouraging work!

cornell folder should not be created before download

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent