Light

lil-lab / amr Goto Github PK

View Code? Open in Web Editor NEW

23.0 7.0 10.0 3.53 MB

Cornell AMR Semantic Parser (Artzi et al., EMNLP 2015)

Home Page: http://yoavartzi.com/amr

License: GNU General Public License v2.0

Java 49.99% Shell 1.96% PHP 0.69% C++ 24.59% Lex 2.21% Python 5.40% Makefile 0.20% C 0.15% Scala 13.97% Perl 0.69% NASL 0.12% Pawn 0.02%

amr abstract-meaning-representation semantic-parser natural-language-processing machine-learning

amr's Introduction

Cornell AMR Semantic Parser

Requirements

Java 8.

Preparing the Repository

Get all required resources: ./getres.sh (form the root of the repository)
Compile: ant dist

Pre-trained Models

A pre-trained model is available to download here.

Parsing

Given a file sentences.txt, which contains a sentence on each line, and a model file amr.sp, both located in the root of the repository:

java -Xmx8g -jar dist/amr-1.0.jar parse rootDir=`pwd` modelFile=`pwd`/amr.sp sentences=`pwd`/sentences.txt

The output files will be in experiments/parse/logs. To see the full set of options (including increasing the logging level), run:

java -jar dist/amr-1.0.jar

Preparing the data (required only for training and testing)

To re-create our experiments, obtain the AMR Bank release 1.0 (LDC2014T12) form LDC. Extract the corpus to the directory corpus/amr_anno_1.0.

Then run the following:

Compile the code: ant dist
Prepare the environment: utils/config.sh
Prepare the data: utils/prepdata-ldc.sh

Attribution

@InProceedings{artzi-lee-zettlemoyer:2015:EMNLP,
  author    = {Artzi, Yoav  and  Lee, Kenton  and  Zettlemoyer, Luke},
  title     = {Broad-coverage CCG Semantic Parsing with AMR},
  booktitle = {Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing},
  month     = {September},
  year      = {2015},
  address   = {Lisbon, Portugal},
  publisher = {Association for Computational Linguistics},
  pages     = {1699--1710},
  url       = {http://aclweb.org/anthology/D15-1198}
}

amr's People

Contributors

Stargazers

Watchers

Forkers

xiaowanlinju namkhanhtran roma-patel shubhampachori12110095 holtspalding alecbrick scape1989 32de18 standardgalactic dopc

amr's Issues

Training Not Working

I've been having a lot of trouble training new models. During the the training process,
"Extracting features for level 2 inference
Done - Extracting features for level 2 inference"
is printed over and over until at some point my CPU usage drops to almost 0% and then it says im out of memory in my shell. Then if you wait an incredibly long amount of time, training continues. I have not been able to train any models successfully yet on any of the data in the amr 1.0 release (accept one model which I trained on just two sentences from the bolt treebank). Have you run into this issue and/or what specs do you suggest I have to train a model in this way. I'm running on 12 CPU cores with 32 Gb of memory. I tried looking at the source code to see if I could fix the problem there but the issue is happening in one of the precompiled jar files (i'm gonna look at spf now to see which jar is giving me issues and see if I can create a new jar file from there) Any suggestions? Thank you so much.

binary files in resources/propbank

After running getres.sh, there are binary files in the resources/propbank folders with names like "._bore.xml". On Linux, these will cause the PropBankReader to crash, since it attempts to parse all files in the directory.

The ._*.xml files appear to be OSX metadata. Maybe there's a difference in the way Files.walk(...) and/or Files.isRegularFile(...) treat them, e.g. ignore on OSX, while they're just normal files on Linux?

Anyway, a simple workaround is to do rm resources/propbank/._*.xml after running getres.sh

Train

How to create a new model using new dataset..?

AMR v.s SPF

Hi,

Not really an issue, but how does this repo relate to https://github.com/cornell-lic/spf ?

Thanks!

Pretrained Model

Was the pretrained model trained on AMR bank release 1.0?

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.