
License: MIT License


Explicit State Tracking with Semi-supervision for Neural Dialogue Generation

Code for CIKM'18 long paper: Explicit state tracking with semi-supervision for neural dialogue generation.

Paper on arXiv

Requirements

The project was developed on PyTorch 0.3.0 and has since been tested on PyTorch 0.4.0 with Python 3.6.

  • Experiments on task-oriented dialogues run well on a CPU.
  • Experiments on non-task-oriented dialogues require a GPU.

Dataset

Task-oriented datasets:

Non-task-oriented datasets:

Running Experiments

To run the model:

python [semi_sup_model.py|unsup_model.py] -mode [train|adjust|test] -model [camrest|kvret|ubuntu|jd] -c spv_proportion=XXX OTHER_CONFIGS_IN_CONFIG_PY=VALUE
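For example, a concrete invocation might look like the following (the value 50 for spv_proportion is illustrative only; see config.py for the available options and their defaults):

```shell
# Illustrative example: train the semi-supervised model on CamRest676
# with 50% state-label supervision (spv_proportion value is an example)
python semi_sup_model.py -mode train -model camrest -c spv_proportion=50
```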

To compute the metrics:

python metric.py -f RESULT_FILE -t [camrest|kvret]

For embedding-based metrics, please follow here

Directory

├───data
│   ├───CamRest676
│   ├───glove
│   ├───fasttext
│   ├───ubuntu
│   ├───jd
│   └───kvret
├───log
├───models
├───results
├───sheets
└───vocab

Use Case

  • Task-oriented dialogue systems, such as restaurant reservation systems, where tracking the user's intentions and knowledge-base interactions is necessary, but you only have a limited amount (though still some) of labeled data for this.

  • Non-task-oriented dialogue systems, such as technical question-answering conversations, where all the domain knowledge is embedded in the corpus and no knowledge-base interaction is required. User intentions are not annotated at all, but the model can still extract context-aware "state spans" that help response generation and provide a human-readable visualization. For example:

| Turn | User input | State span | Response |
|------|------------|------------|----------|
| 0 | I just realised nothing can access my mysql database anymore , it says cannot connect , why would this be ? I did set it to be remote | remote would anymore mysql access | you can use the mysql client to connect to the server |
| 1 | is it running ? | running I says mysql access | yes, I am running it now |

For Reference

Please consider citing:

@inproceedings{jin2018explicit,
  title={Explicit State Tracking with Semi-supervision for Neural Dialogue Generation},
  author={Jin, Xisen and Lei, Wenqiang and Ren, Zhaochun and Chen, Hongshen and Liang, Shangsong and Zhao, Yihong and Yin, Dawei},
  year={2018},
  booktitle={CIKM}
}

% additionally for experiments on task-oriented datasets

@inproceedings{lei2018sequicity,
  title={Sequicity: Simplifying task-oriented dialogue systems with single sequence-to-sequence architectures},
  author={Lei, Wenqiang and Jin, Xisen and Kan, Min-Yen and Ren, Zhaochun and He, Xiangnan and Yin, Dawei},
  booktitle={ACL},
  year={2018}
}

Easter Egg: The last sentence of Section 6 in our ACL 2018 paper Sequicity

Misc

  • Part of the code is built upon Sequicity.


sedst's Issues

Minor mistake with Evaluator.

In semi_sup_model.py, Line 153:

```python
ev = CamRestEvaluator(cfg.result_path)
```

I think it should be:

```python
if self.dataset == 'camrest':
    ev = CamRestEvaluator(cfg.result_path)
elif self.dataset == 'kvret':
    ev = KvretEvaluator(cfg.result_path)
```

Am I right?

question about the role of "shift"

Hi,

I'm confused about the role of the function "shift" in your code. It seems to move the generative probabilities of the z decoder back by one time step, and the shifted probabilities are later multiplied with the z decoder output to compute the implicit copy score. My question is: why should the probabilities be shifted back one step? Since the z decoder output at time t is used to compute the probability of z_t rather than z_(t-1), the shift operation is hard to understand.

However, when I tried removing "shift", performance dropped noticeably and training became much less stable. Could you explain the principle behind this operation? Thanks!
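For readers following this thread: the sketch below is not the repository's code, just an illustration of the shift operation the question describes, assuming the probabilities are represented as a list of per-timestep vectors. The output at step t is the input at step t-1, and step 0 is padded with zeros:

```python
def shift(probs):
    """Shift a sequence of per-timestep probability vectors back by one step.

    probs: list of T vectors (lists of floats).
    Returns a list of T vectors where output[t] == probs[t-1]
    and output[0] is all zeros.
    """
    if not probs:
        return []
    zero = [0.0] * len(probs[0])  # zero-pad the first timestep
    return [zero] + probs[:-1]    # drop the last timestep

seq = [[0.2, 0.8], [0.6, 0.4], [0.5, 0.5]]
print(shift(seq))  # [[0.0, 0.0], [0.2, 0.8], [0.6, 0.4]]
```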

Loss

Hi,

I tried to train the model on the Ubuntu dataset, but the loss stays very high (around 6-7) after 8,000 iterations. I kept the same settings except for a smaller batch size, because I only have one GPU with limited memory. Can you please advise? Thank you :)
