Coder Social home page Coder Social logo

comet-atomic-2020's Introduction

(Comet-) ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs

Example for ATOMIC2020

Paper

(Comet-) Atomic 2020: On Symbolic and Neural Commonsense Knowledge Graphs.
Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Jeff Da, Keisuke Sakaguchi, Antoine Bosselut, Yejin Choi
AAAI Conference on Artificial Intelligence, 2021

If you'd like to cite this paper, please use the reference below:

@inproceedings{Hwang2021COMETATOMIC2O,
  title={COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs},
  author={Jena D. Hwang and Chandra Bhagavatula and Ronan {Le Bras} and Jeff Da and Keisuke Sakaguchi and Antoine Bosselut and Yejin Choi},
  booktitle={AAAI},
  year={2021}
}

Data: ATOMIC 2020

The data for ATOMIC 2020 is available here. If you need the ATOMIC 2019 data (Sap et al., 2019), it is downloadable here.

Model: COMET-ATOMIC 2020

Trained COMET-BART model can be downloaded here.

Trained COMET-GPT2XL model can be downloaded here.

Codebase

We include code used in expirements in COMET-ATOMIC2020 for reproducibility, ease of use. Our models are based off the HuggingFace Transformers codebase, with minor adjustments to adapt the model for our data. Details can be found in the AAAI paper.

Setup

Run pip install -r requirements.txt to install requirements for your Python instance. We recommend Conda to manage Python installs. Our codebases is on Python 3.

It's recommended that you test that your enviroment is set up correctly before running modeling code. You can do this via python models/comet_atomic2020_gpt2/comet_gpt2.py --test_install

The code for modeling is located in mosaic/infra/modeling. mosaic/datasets/KGDataset is used to convert the ATOMIC2020 CSV into an HuggingFace Datasets object.

Directory Overview

beaker_exp: Contains files needed to run expirements using Beaker (https://beaker.org/) instead of on your local machine.

human_eval: Contains HTML files for human evaluation on Amazon MTurk, as described in the AAAI paper.

models: Contains additional modeling files to reproduce the GPT2 and BART expirements. models/comet_atomic2020_bart contains a README and code to run COMET-BART2020.

scripts: Contains additional scripts (e.g. utils.py) used during expirements in the COMET-ATOMIC2020 paper.

split: Contains code used to make the test, train, and dev splits of ATOMIC2020 with Stratified Random Sampling.

system_eval: Contains code for automatic evaluation of generated entities.

Contributions

We welcome contributions to the codebase of COMET-2020. We encourage pull requests instead of issues; and suggest filing a GitHub issue with questions / suggestions.

License

COMET-ATOMIC 2020 (codebase) is licensed under the Apache License 2.0. The ATOMIC 2020 dataset is licensed under CC-BY.

Contact

Email: jenah[at]allenai[dot]org

comet-atomic-2020's People

Contributors

csbhagav avatar jenahwang avatar keisks avatar rlebras avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.