Coder Social home page Coder Social logo

nn4nlp_project's People

Contributors

hainow avatar shuxinlin avatar yuhsianglin avatar

Watchers

 avatar  avatar  avatar  avatar

nn4nlp_project's Issues

Computing score in decoder

Currently we transform from hidden state to scores of all possible labels by a linear layer. Is this a good model?

If still use padding, do mask on several things

If use padding, mask the following:

  1. objective function: don't include cross entropy from pair of matching to input padding.
  2. attention: when computing alpha_{ij} (doing softmax), don't include padding. (softmax only over non-padding part.)

Attention

think about it carefully again: who attend to who?

most_common() not fully utilized?

In preprocess.py, in build_vocab(), there is a line of code:
all_words = collections.Counter(all_text).most_common()
It seems that at this moment the "most_common()" part is not really utilized. But could potentially pick the most common N words here.

Should not discard batches that have sizes different from the user-specified batch size

In preprocessor.py, in minibatch(), in the last two lines with filter:
X_batch = filter(lambda mini_batch: len(mini_batch) == batch_size, X_batch)
Y_batch = filter(lambda mini_batch: len(mini_batch) == batch_size, Y_batch)
They will discard all batches that have sizes different from the user-specified batch size (default 32). I don't think this is the correct thing to do? It would unnecessarily discard many data.

Fixed attention

"an LSTM decoder (1 layer with tanh activation function) with a fixed attention mechanism that deterministically attends to the i-th input token when decoding the i-th output, and hence does not involve learning of any attention parameters"

Not using <EOS>?

I don't think that in our tasks (NER and CCG supertagging) we would need . Can we simply remove it?

Discard long (char count) sentence?

Does it fit our use case that we discard any sentence that has character count larger than self.max_sentence_length? (In preprocessor.py, preprocess().)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.