Coder Social home page Coder Social logo

salesforce / glad Goto Github PK

View Code? Open in Web Editor NEW
185.0 8.0 47.0 18 KB

Global-Locally Self-Attentive Dialogue State Tracker

License: BSD 3-Clause "New" or "Revised" License

Python 96.81% Dockerfile 3.19%
natural-language-processing machine-learning dialogue-systems pytorch

glad's Issues

RuntimeError while training: The expanded size of the tensor must match the existing size at non-singleton dimension

I am trying to run the train.py and the embedding I am using is Wikipedia embedding with 50 dimension size in python 3. I am getting this following error.

Namespace(batch_size=50, demb=400, dexp='exp', dhid=200, dout='exp/glad/default', dropout={'emb': 0.2, 'local': 0.2, 'global': 0.2}, epoch=50, gpu=0, lr=0.001, model='glad', nick='default', resume=None, seed=42, stop='joint_goal', test=False)

WARNING:root:loading split train
WARNING:root:loading split dev
WARNING:root:loading split test

INFO:root:dataset sizes: {'dev': 200, 'test': 400, 'train': 600}
INFO:root:loaded model <class 'models.glad.Model'>
INFO:root:saving config to exp/glad/default/config.json

Traceback (most recent call last):
File "train.py", line 66, in
run(args)
File "train.py", line 24, in run
model.load_emb(Eword)
File "/mount/studenten/SpokenLanguageProcessing/2019/WokeSpoke/SLU/glad_master/models/glad.py", line 151, in load_emb
self.emb_fixed.weight.data.copy_(new(Eword))

RuntimeError: The expanded size of the tensor (400) must match the existing size (150) at non-singleton dimension 1. Target sizes: [950, 400]. Tensor sizes: [950, 150]

MemoryError when preprocessing data (computing embeddings)

On my local machine, when I run python preprocess_data.py and the script computes word embeddings, it dies with a MemoryError. I've killed all other processes and have about 6GB free RAM, but that doesn't seem to be enough. Is this expected? Anything I can do against it? Perhaps download and use precomputed embeddings from somewhere?

Reproducing DSTC-2 evaluation results

I have downloaded the clean dstc-2 dataset and trained the system but unable to reproduce the results. Please share the hyper-parameters for the dstc-2 dataset.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.