This project forked from ai-engineer-foundation/agent-protocol

Common interface for interacting (and more in the future) with any agent. The protocol is tech stack agnostic - you can use it with any framework for building agents.

Home Page: https://e2b.dev

License: MIT License

Shell 1.94% JavaScript 0.48% Python 92.68% TypeScript 4.74% Makefile 0.17%

agent protocol

📚 Docs

You can find more info in the docs.

🧾 Summary

The AI agent space is young, and most developers build agents in their own way. This creates a challenge: it's hard to communicate with different agents because the interface is different almost every time. That in turn makes agents hard to compare. A single communication interface would also make it easier to develop devtools that work with agents out of the box.

We present the Agent Protocol, a single common interface for communicating with agents. Any agent developer can implement this protocol. The Agent Protocol is an API specification: a list of endpoints that the agent should expose, with predefined response models. The protocol is tech stack agnostic. Any agent can adopt it no matter what framework its developers are using (or not using).

We believe this will help the ecosystem grow faster and simplify integrations.

We're starting with a minimal core. We want to build upon that iteratively by learning from agent developers about what they actually need.

🚀 The incentives to adopt the protocol

  • You can use the benchmarks with little effort.
  • Other people can more easily use and integrate your agent.
  • General devtools (for development, deployment and monitoring) can be built on top of this protocol.
  • You don’t need to write boilerplate API code and can focus on developing your agent.

🎯 Immediate goals of the protocol

Set a simple, general standard that allows for easy-to-use benchmarking of agents. One of the primary goals of the protocol is a great developer experience and a simple implementation on the agent developer's side. You just start your agent, and that’s all you have to do.

🗣️ Request for Comments

If you'd like to propose a change or an improvement to the protocol, please follow the RFC template.

⚙️ Components

The protocol specification is the most important part. It specifies which endpoints the agent should expose. The protocol is defined as an OpenAPI specification.

How does the protocol work?

Right now the protocol is defined as a REST API (via the OpenAPI spec) with two essential routes for interaction with your agent:

  • POST /agent/tasks for creating a new task for the agent (for example giving the agent an objective that you want to accomplish)
  • POST /agent/tasks/{task_id}/steps for executing one step of the defined task

It also has a few additional routes for listing tasks and steps, and for downloading and uploading artifacts.
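As a rough illustration of the two essential routes, the sketch below builds the corresponding HTTP requests with Python's standard library. The base URL, the `input` field in the request body, and the helper names are assumptions made for this example; the OpenAPI specification remains the authority on the actual request and response models.

```python
import json
import urllib.request


def build_post(base_url: str, path: str, payload: dict) -> urllib.request.Request:
    """Build a JSON POST request for an agent route (nothing is sent yet)."""
    return urllib.request.Request(
        url=base_url.rstrip("/") + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def create_task_request(base_url: str, objective: str) -> urllib.request.Request:
    # POST /agent/tasks creates a new task; the "input" field name is an assumption.
    return build_post(base_url, "/agent/tasks", {"input": objective})


def execute_step_request(base_url: str, task_id: str) -> urllib.request.Request:
    # POST /agent/tasks/{task_id}/steps executes one step of an existing task.
    return build_post(base_url, f"/agent/tasks/{task_id}/steps", {})


def send(request: urllib.request.Request) -> dict:
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With an agent listening at a hypothetical `http://localhost:8000`, `send(create_task_request("http://localhost:8000", "Write a haiku"))` would create a task, and the task identifier from the response could then be passed to `execute_step_request`.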

The SDK is our implementation of the protocol. It’s a library that you can use to build your agent. You can use it, or you can implement the protocol on your own. It’s up to you.

Using the SDK should simplify the implementation of the protocol to the bare minimum, but at the same time it shouldn't tie your hands. The goal should be to allow agent builders to build their agents and the SDK should solve the rest.

In essence, the SDK wraps your agent in a web server that allows communication with your agent (and, in the future, between agents).
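To make the idea of wrapping an agent in a web server concrete, here is a minimal sketch using only Python's standard library. It is not the SDK's implementation: the in-memory `TASKS` store, the `my_agent_step` function, and the `task_id`, `input` and `output` field names are all assumptions chosen for illustration.

```python
import json
import uuid
from http.server import BaseHTTPRequestHandler, HTTPServer

TASKS: dict = {}  # in-memory task store: task_id -> {"input": str, "steps": int}


def my_agent_step(task_input: str, step_number: int) -> str:
    """Hypothetical agent logic; a real agent would do actual work here."""
    return f"step {step_number} for: {task_input}"


def route(method: str, path: str, body: dict):
    """Map a request onto the two essential routes; returns (status, payload)."""
    parts = [p for p in path.split("/") if p]
    if method == "POST" and parts == ["agent", "tasks"]:
        task_id = str(uuid.uuid4())
        TASKS[task_id] = {"input": body.get("input", ""), "steps": 0}
        return 200, {"task_id": task_id, "input": TASKS[task_id]["input"]}
    is_steps = len(parts) == 4 and parts[:2] == ["agent", "tasks"] and parts[3] == "steps"
    if method == "POST" and is_steps:
        task = TASKS.get(parts[2])
        if task is None:
            return 404, {"error": "unknown task"}
        task["steps"] += 1
        output = my_agent_step(task["input"], task["steps"])
        return 200, {"task_id": parts[2], "output": output}
    return 404, {"error": "unknown route"}


class AgentHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the JSON body, dispatch it to route(), and write a JSON response.
        length = int(self.headers.get("Content-Length") or 0)
        body = json.loads(self.rfile.read(length) or b"{}")
        status, payload = route("POST", self.path, body)
        data = json.dumps(payload).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)


def serve(port: int = 8000) -> None:
    """Start the wrapper on localhost; blocks until interrupted."""
    HTTPServer(("127.0.0.1", port), AgentHandler).serve_forever()
```

Calling `serve()` starts the server. Keeping the routing in a plain function separate from the HTTP handler means the agent logic can be exercised without opening a socket.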

A separate client library is meant for the users of agents. Your agent is deployed somewhere, and its users can use this library to interact with it.

Thanks to the standard, users can try multiple agents with little or no adjustment to their code.
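The following sketch shows why a shared interface lets the same client code drive different agents. It does not use the actual client library: the `run_task` helper, the in-memory fake agents, and the `task_id`, `output` and `is_last` response fields are assumptions made for this example.

```python
from typing import Callable, Dict

# A transport takes (path, payload) and returns the decoded JSON response.
Post = Callable[[str, dict], dict]


def run_task(post: Post, objective: str, max_steps: int = 10) -> list:
    """Create a task, then execute steps until the agent reports it is done."""
    task = post("/agent/tasks", {"input": objective})
    outputs = []
    for _ in range(max_steps):
        step = post(f"/agent/tasks/{task['task_id']}/steps", {})
        outputs.append(step.get("output"))
        if step.get("is_last"):  # assumed completion flag
            break
    return outputs


def make_fake_agent(name: str, total_steps: int) -> Post:
    """In-memory stand-in for a deployed agent, used only for illustration."""
    state = {"done": 0}

    def post(path: str, payload: dict) -> dict:
        if path == "/agent/tasks":
            return {"task_id": "t1"}
        state["done"] += 1
        return {
            "output": f"{name}:{state['done']}",
            "is_last": state["done"] >= total_steps,
        }

    return post


# Two different "agents" behind the same interface; run_task is unchanged.
agents: Dict[str, Post] = {
    "alpha": make_fake_agent("alpha", 2),
    "beta": make_fake_agent("beta", 3),
}
results = {name: run_task(post, "summarize the README") for name, post in agents.items()}
```

In a real setup, each `Post` callable would send HTTP requests to a deployed agent's base URL instead of an in-memory fake; the loop in `run_task` would stay the same.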

📦 How to use the protocol

If you're an agent developer, you can use the SDK to implement the protocol. You can find more info in the docs or in the SDK folder.

🤗 Adoption

Projects engaged in the development of Agent Protocol

Open-source agents and projects that have adopted Agent Protocol

📃 High-level future roadmap

  • Agent-to-agent communication
  • Connection to the outside world:
    • 3rd party services (= “Agent I/O”)
    • Authentication on behalf of users
  • Is there anything missing? Please submit an RFC with a proposed feature!

💬 Public discourse & development

  • PRs and issues are welcome!
  • Join Auto-GPT Discord and their dedicated agent-protocol channel
  • Join e2b Discord and their dedicated agent-protocol channel

Contributors

jakubno, valentatomas, mlejva, jzanecook, waynehamadi
