Coder Social home page Coder Social logo

jiant-v1-legacy's People

Contributors

aalok-sathe avatar anhad13 avatar apsdehal avatar azpoliak avatar berlinchen7 avatar bordias avatar davidbenton avatar edouardgrave avatar epavlick avatar haokunliu avatar hyinghui avatar iftenney avatar iwontbecreative avatar kelina avatar najoungkim avatar narsil avatar njjiang avatar pappagari avatar phu-pmh avatar pitrack avatar pyeres avatar roma-patel avatar sheng-fu avatar shuningjin avatar sleepinyourhat avatar tommccoy1 avatar w4ngatang avatar woollysocks avatar yukatherin avatar zphang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

jiant-v1-legacy's Issues

[CLOSED] find sensible transformer params

Issue by W4ngatang
Monday Jun 25, 2018 at 17:16 GMT
Originally opened as nyu-mll/jiant#21


There are a a few transformer parameters we've been avoiding (projection dimension, feedforward dimension) by setting it all to the same value, in addition to training hyperparameters. We should probably go through what literature there is and find sensible defaults

[CLOSED] fastText embeddings

Issue by pitrack
Wednesday Jun 20, 2018 at 17:20 GMT
Originally opened as nyu-mll/jiant#3


@W4ngatang Ideally you would check out this branch + download fastText and confirm that the instructions are clear, and that the code still runs (with/without fastText). If you want fastText to be default, then some flags may need to get changed.

I'm not getting high accuracy on the demo task.

I also changed some things in the config file so that I can use my paths. You'll need to update your paths too if you checkout this branch/if this gets merged.


pitrack included the following code: https://github.com/nyu-mll/jiant/pull/3/commits

[CLOSED] separate training parameters for train tasks and eval tasks

Issue by W4ngatang
Monday Jun 25, 2018 at 05:22 GMT
Originally opened as nyu-mll/jiant#18


We want to allow for more fine-grained control between training parameters on the main task we're training on and auxiliary tasks we're evaluating on.

If we're not training on multiple tasks, it would probably make sense to switch to a deterministic trainer that validates after passing through the entire training set (as opposed to a fixed number of batches).

[CLOSED] Crashing when trying to use only char embs in demo.sh

Issue by sleepinyourhat
Saturday Jun 23, 2018 at 13:28 GMT
Originally opened as nyu-mll/jiant#12


bash ./demo.sh -w none
[...]
Traceback (most recent call last):
File "../src/main.py", line 236, in
sys.exit(main(sys.argv[1:]))
File "../src/main.py", line 186, in main
args.shared_optimizer, args.load_model)
File "/Users/Bowman/Drive/JSALT/jiant/src/trainer.py", line 273, in train
output_dict = self._forward(batch, task=task, for_training=True)
File "/Users/Bowman/Drive/JSALT/jiant/src/trainer.py", line 504, in _forward
return self._model.forward(task, tensor_batch) # , **tensor_batch)
File "/Users/Bowman/Drive/JSALT/jiant/src/models.py", line 291, in forward
out = self._single_classification_forward(batch, task)
File "/Users/Bowman/Drive/JSALT/jiant/src/models.py", line 309, in _single_classification_forward
sent_embs, sent_mask = self.sent_encoder(batch['input1'])
File "/Users/Bowman/anaconda3/envs/jiant/lib/python3.6/site-packages/torch/nn/modules/module.py", line 491, in call
result = self.forward(*input, **kwargs)
File "/Users/Bowman/Drive/JSALT/jiant/src/modules.py", line 75, in forward
sent_embs = self._highway_layer(self._text_field_embedder(sent))
File "/Users/Bowman/anaconda3/envs/jiant/lib/python3.6/site-packages/torch/nn/modules/module.py", line 491, in call
result = self.forward(*input, **kwargs)
File "/Users/Bowman/anaconda3/envs/jiant/lib/python3.6/site-packages/allennlp/modules/text_field_embedders/basic_text_field_embedder.py", line 63, in forward
raise ConfigurationError(message)
allennlp.common.checks.ConfigurationError: "Mismatched token keys: dict_keys(['chars']) and dict_keys(['words', 'chars'])"

[CLOSED] AllenNLP vocabulary warning.

Issue by sleepinyourhat
Monday Jun 18, 2018 at 22:10 GMT
Originally opened as nyu-mll/jiant#1


See if this is worth worrying about:

06/18 06:07:17 PM: Your label namespace was 'idxs'. We recommend you use a namespace ending with 'labels' or 'tags', so we don't add UNK and PAD tokens by default to your vocabulary. See documentation for non_padded_namespaces parameter in Vocabulary.

[CLOSED] Fix paths & allow for read-only data_dir

Issue by iftenney
Sunday Jun 24, 2018 at 20:38 GMT
Originally opened as nyu-mll/jiant#13


Fix paths to play nicer with GCP config.

  • Fix bug where run_dir was prepended to log.log twice
  • Use exp_dir/prepreproc as scratch directory for data pre-preprocessing, so that glue_data can be readonly
  • make user_config.sh optional, since GCP instances will have the required environment vars already set.

iftenney included the following code: https://github.com/nyu-mll/jiant/pull/13/commits

[CLOSED] Delete checkpoints when starting training from scratch

Issue by sleepinyourhat
Friday Jun 22, 2018 at 16:37 GMT
Originally opened as nyu-mll/jiant#5


If LOAD_MODEL is 0, all checkpoints should be deleted. Otherwise, you can wind up in an odd setup (easy to hit with demo experiments):

  1. Train model for 10 epochs with settings S.
  2. Save model checkpoints for epochs 1–10.
  3. Train new model for 5 epochs with settings S', and LOAD_MODEL=0.
  4. Save model checkpoints for epoch 1–5, overwriting old checkpoints.
  5. Try to continue training the new model with LOAD_MODEL=1. Wind up using settings S' but loading the highest-numbered epoch checkpoint, which was created with settings S. Crash.

[CLOSED] Masked transformer

Issue by pitrack
Saturday Jun 23, 2018 at 04:03 GMT
Originally opened as nyu-mll/jiant#10


Mostly code taken out from allennlp and modified with a function from the subsequent mask part of The Annotated Transformer.

Ideally this would just be a flag in the allennlp library.

Confirmed that this runs. Did not confirm that it is correct (there's a chance that it is off-by-one, but I don't think so). Also seems sensitive to hyperparameters (not doing better than chance with default setting)


pitrack included the following code: https://github.com/nyu-mll/jiant/pull/10/commits

[CLOSED] STS-B classifier is broken

Issue by sleepinyourhat
Wednesday Jun 27, 2018 at 14:30 GMT
Originally opened as nyu-mll/jiant#47


06/27 10:18:45 AM: Beginning training. Stopping metric: sts-b_corr
Traceback (most recent call last):
File "main.py", line 185, in
sys.exit(main(sys.argv[1:]))
File "main.py", line 162, in main
args.shared_optimizer, load_model=False, phase="eval")
File "/Users/Bowman/Drive/JSALT/jiant/src/trainer.py", line 275, in train
output_dict = self._forward(batch, task=task, for_training=True)
File "/Users/Bowman/Drive/JSALT/jiant/src/trainer.py", line 521, in _forward
return self._model.forward(task, tensor_batch) # , **tensor_batch)
File "/Users/Bowman/Drive/JSALT/jiant/src/models.py", line 355, in forward
out = self._pair_regression_forward(batch, task)
File "/Users/Bowman/Drive/JSALT/jiant/src/models.py", line 433, in _pair_regression_forward
logits = classifier(s1, s2, s1_mask, s2_mask)
File "/Users/Bowman/anaconda3/envs/jiant/lib/python3.6/site-packages/torch/nn/modules/module.py", line 491, in call
result = self.forward(*input, **kwargs)
TypeError: forward() takes 2 positional arguments but 5 were given

@hyinghui, @W4ngatang, anyone else who's seen this code—could you take a look?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.