This project forked from osu-nlp-group/llm-planner

[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

Home Page: https://osu-nlp-group.github.io/LLM-Planner/

License: MIT License

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

Code for LLM-Planner.

Check the project website for an overview and a demo.

News:

  • Dec 23: Released LLM-Planner with support for an oracle low-level planner and a new easy-setup framework built on an ALFWorld backbone. We hope this codebase can serve as a foundation for building LLM- or LMM-based methods on ALFRED.

Release process:

  • High-level planner
    • KNN dataset
    • KNN retriever
  • Low-level planner
    • Oracle low-level planner
    • HLSM low-level planner
  • Fine-grained control over visualization
  • Support for non-OpenAI foundation models
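
As a rough illustration of what a KNN retriever for few-shot prompting can look like, the sketch below ranks stored examples by cosine similarity to a query embedding and returns the closest ones. It is not the code shipped in this repository: the function names (`cosine_similarity`, `retrieve_top_k`), the `(text, vector)` example format, and the use of plain Python lists as embeddings are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve_top_k(query_vec, examples, k=2):
    """Return the texts of the k examples most similar to query_vec.

    `examples` is a hypothetical list of (text, vector) pairs; in practice the
    vectors would come from a sentence-embedding model.
    """
    ranked = sorted(
        examples,
        key=lambda example: cosine_similarity(query_vec, example[1]),
        reverse=True,
    )
    return [text for text, _ in ranked[:k]]
```

The retrieved texts would then be placed in the prompt as in-context examples before the new instruction.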

Quickstart

Clone repo:

git clone https://github.com/OSU-NLP-Group/LLM-Planner
cd LLM-Planner
export ALFWORLD_DATA="$(pwd)/alfworld/data"

Install requirements:

# A Conda or Python virtual environment is recommended
# Install requirements for the AI2Thor simulator and ALFRED
cd alfworld
pip install .
# Install requirements for LLM-Planner
cd ../src
pip install -r requirements.txt

Download data:

cd ../alfworld
alfworld-download

Sanity check on the AI2Thor simulator:

python scripts/check_thor.py
# This should report success; if it does not, your AI2Thor simulator is not set up correctly.

Start evaluation with GPT-4:

export OPENAI_KEY=<Your OpenAI Key>
cd ../src
python run_eval.py --config gpt4_base_config.yaml
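
Both environment variables exported in the steps above (`ALFWORLD_DATA` and `OPENAI_KEY`) need to be set in the shell that launches the evaluation. A minimal sketch of a pre-flight check follows. The helper name `missing_env_vars` is hypothetical and not part of this repository; only the two variable names come from the quickstart.

```python
import os

# Variable names taken from the quickstart commands above.
REQUIRED_VARS = ("OPENAI_KEY", "ALFWORLD_DATA")


def missing_env_vars(env=None, required=REQUIRED_VARS):
    """Return the names in `required` that are unset or empty in `env`.

    `env` defaults to the current process environment; passing a dict
    makes the check easy to try without touching the real environment.
    """
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]
```

Calling `missing_env_vars()` before `run_eval.py` returns an empty list when both variables are set, and otherwise lists the ones that still need to be exported.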

Common Questions and Answers

Coming soon.

Hardware

Tested on:

  • Mac M1
  • Ubuntu 18.04

Citation Information

If you find this code useful, please consider citing our paper:

@InProceedings{song2023llmplanner,
  author    = {Song, Chan Hee and Wu, Jiaman and Washington, Clayton and Sadler, Brian M. and Chao, Wei-Lun and Su, Yu},
  title     = {LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models},
  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  month     = {October},
  year      = {2023},
}

Acknowledgements

We thank the authors of ALFWorld for releasing their code.

License

  • LLM-Planner - MIT License
  • ALFWorld - MIT License

Contact

Questions or issues? File an issue or contact Luke Song.
