Coder Social home page Coder Social logo

lil-lab / amr Goto Github PK

View Code? Open in Web Editor NEW
23.0 7.0 10.0 3.53 MB

Cornell AMR Semantic Parser (Artzi et al., EMNLP 2015)

Home Page: http://yoavartzi.com/amr

License: GNU General Public License v2.0

Java 49.99% Shell 1.96% PHP 0.69% C++ 24.59% Lex 2.21% Python 5.40% Makefile 0.20% C 0.15% Scala 13.97% Perl 0.69% NASL 0.12% Pawn 0.02%
amr abstract-meaning-representation semantic-parser natural-language-processing machine-learning

amr's Introduction

Cornell AMR Semantic Parser

Requirements

Java 8.

Preparing the Repository

  • Get all required resources: ./getres.sh (form the root of the repository)
  • Compile: ant dist

Pre-trained Models

A pre-trained model is available to download here.

Parsing

Given a file sentences.txt, which contains a sentence on each line, and a model file amr.sp, both located in the root of the repository:

java -Xmx8g -jar dist/amr-1.0.jar parse rootDir=`pwd` modelFile=`pwd`/amr.sp sentences=`pwd`/sentences.txt

The output files will be in experiments/parse/logs. To see the full set of options (including increasing the logging level), run:

java -jar dist/amr-1.0.jar

Preparing the data (required only for training and testing)

To re-create our experiments, obtain the AMR Bank release 1.0 (LDC2014T12) form LDC. Extract the corpus to the directory corpus/amr_anno_1.0.

Then run the following:

  • Compile the code: ant dist
  • Prepare the environment: utils/config.sh
  • Prepare the data: utils/prepdata-ldc.sh

Attribution

@InProceedings{artzi-lee-zettlemoyer:2015:EMNLP,
  author    = {Artzi, Yoav  and  Lee, Kenton  and  Zettlemoyer, Luke},
  title     = {Broad-coverage CCG Semantic Parsing with AMR},
  booktitle = {Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing},
  month     = {September},
  year      = {2015},
  address   = {Lisbon, Portugal},
  publisher = {Association for Computational Linguistics},
  pages     = {1699--1710},
  url       = {http://aclweb.org/anthology/D15-1198}
}

amr's People

Contributors

yoavartzi avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

amr's Issues

Training Not Working

I've been having a lot of trouble training new models. During the the training process,
"Extracting features for level 2 inference
Done - Extracting features for level 2 inference"
is printed over and over until at some point my CPU usage drops to almost 0% and then it says im out of memory in my shell. Then if you wait an incredibly long amount of time, training continues. I have not been able to train any models successfully yet on any of the data in the amr 1.0 release (accept one model which I trained on just two sentences from the bolt treebank). Have you run into this issue and/or what specs do you suggest I have to train a model in this way. I'm running on 12 CPU cores with 32 Gb of memory. I tried looking at the source code to see if I could fix the problem there but the issue is happening in one of the precompiled jar files (i'm gonna look at spf now to see which jar is giving me issues and see if I can create a new jar file from there) Any suggestions? Thank you so much.

binary files in resources/propbank

After running getres.sh, there are binary files in the resources/propbank folders with names like "._bore.xml". On Linux, these will cause the PropBankReader to crash, since it attempts to parse all files in the directory.

The ._*.xml files appear to be OSX metadata. Maybe there's a difference in the way Files.walk(...) and/or Files.isRegularFile(...) treat them, e.g. ignore on OSX, while they're just normal files on Linux?

Anyway, a simple workaround is to do rm resources/propbank/._*.xml after running getres.sh

Train

How to create a new model using new dataset..?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.