Coder Social home page Coder Social logo

zchoi / awesome-embodied-agent-with-llms Goto Github PK

View Code? Open in Web Editor NEW
741.0 32.0 42.0 1.8 MB

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!

embodied-agent embodied-ai scene-understanding navigation planning-algorithms manipulator-robotics awesome agent large-language-model

awesome-embodied-agent-with-llms's Introduction

🤖 Awesome-Embodied-Agent-with-LLMs Awesome

This is a curated list of "Embodied AI or agent with Large Language Models" research which is maintained by haonan.

Watch this repository for the latest updates and feel free to raise pull requests if you find some interesting papers!

News🔥

[2024/6/28] Created a new board about agent self-evolutionary research. 🤖
[2024/6/07] Add Mobile-Agent-v2, a mobile device operation assistant with effective navigation via multi-agent collaboration. 🚀
[2024/5/13] Add "Learning Interactive Real-World Simulators"——outstanding paper award in ICLR 2024 🥇.
[2024/4/24] Add "A Survey on Self-Evolution of Large Language Models", a systematic survey on self-evolution in LLMs! 💥
[2024/4/16] Add some CVPR 2024 papers.
[2024/4/15] Add MetaGPT, accepted for oral presentation (top 1.2%) at ICLR 2024, ranking #1 in the LLM-based Agent category. 🚀
[2024/3/13] Add CRADLE, an interesting paper exploring LLM-based agent in Red Dead Redemption II!🎮

Table of Contents 🍃

Trend and Imagination of LLM-based Embodied Agent

Figure 1. Trend of Embodied Agent with LLMs.[1]                        Figure 2. An envisioned Agent society.[2]

Methods

Survey

Self-Evolving Agents

Advanced Agent Applications

LLMs with RL or World Model

Planning and Manipulation or Pretraining

Multi-Agent Learning and Coordination

Vision and Language Navigation

Detection

  • DetGPT: Detect What You Need via Reasoning [arXiv 2023]
    Renjie Pi1∗ Jiahui Gao2* Shizhe Diao1∗ Rui Pan1 Hanze Dong1 Jipeng Zhang1 Lewei Yao1 Jianhua Han3 Hang Xu2 Lingpeng Kong2 Tong Zhang1
    1The Hong Kong University of Science and Technology 2The University of Hong Kong 3Shanghai Jiao Tong University

3D Grounding

Interactive Embodied Learning

Rearrangement

Benchmark

Simulator

Others

Acknowledge

[1] Trend pic from this repo.
[2] Figure from this paper: The Rise and Potential of Large Language Model Based Agents: A Survey.

awesome-embodied-agent-with-llms's People

Contributors

jameshujy avatar jeasinema avatar tinnke avatar zchoi avatar zhoues avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

awesome-embodied-agent-with-llms's Issues

Suggestion: Add MachinaScript for Robots

https://github.com/babycommando/machinascript-for-robots

MachinaScript is a dynamic set of tools and a LLM-JSON-based language designed to empower humans in the creation of their own robots.

It facilitates the animation of generative movements, the integration of personality, and the teaching of new skills with a high degree of autonomy. With MachinaScript, you can control a wide range of electronic components, including Arduinos, Raspberry Pis, servo motors, cameras, sensors, and much more.

MachinaScript's mission is to make cutting-edge intelligent robotics accessible for everyone.

Request for inclusion of our NeurIPS-23 paper

Dear authors,

We hope this comment finds you well. We wanted to take a moment to bring your attention to a relevant paper from our lab that has recently been accepted to NeurIPS 2023:

Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning — The key idea of this work is to extract an explicit domain with LLM and thereby allow the agent to use external symbolic task planner.

We would be grateful if you would consider including our papers in your survey. We believe it would greatly benefit the readers interested in this burgeoning area of LLM-driven agents.

Best regards
Lin

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.