
kGPT (formerly known as kafkaGPT)

What if you could ask a question and get an answer from the Confluent documentation?

This repo implements a locally hosted chatbot focused on question answering over the Confluent documentation, built with the OpenAI ChatGPT API, LangChain, and FastAPI.

The app leverages LangChain's streaming support and async API to update the page in real time for multiple users.


✅ Running locally (macOS/Linux)

Note: this app requires OpenAI's ChatGPT API, which is a paid service. More information here: OpenAI ChatGPT

  1. Clone the repo:
    1. git clone git@github.com:mvfolino68/kGPT.git
  2. Install dependencies (tested on Python 3.10.9):
    1. from within the repo, set up a virtual environment: python3 -m venv _venv
    2. activate the virtual environment: source _venv/bin/activate
    3. install dependencies: pip install -r requirements.txt
  3. Set up an OpenAI API key:
    1. you can use the OpenAI API documentation to set up an API key.
    2. set the environment variables for your API key. See Environment Variables for more information.
  4. Set up a vectorstore:
    1. option 1: use the existing vectorstore:
      1. download the vectorstore from here and place it in the root directory of the repo.
    2. option 2: create a new vectorstore:
      1. Note: this method will take a while to run due to the size of the Confluent docs.
      2. run python ingest.py to ingest the Confluent docs into the vectorstore (only needs to be done once).
      3. you can use other Document Loaders to load your own data into the vectorstore.
  5. Run the app: make start
  6. Open localhost:9000 in your browser.
  7. Ask a question! 🎉

Environment Variables

You can set these environment variables in a .env file in the root directory of the project. See the .env.example file for an example.

If using the OpenAI API directly, set these variables:

  • OPENAI_API_KEY: your OpenAI API key (see OpenAI Authentication).
  • OPENAI_API_TYPE: set this to "open_ai".
  • OPENAI_API_BASE: leave blank when using the OpenAI API directly.
  • AZURE_OPENAI_DEPLOYMENT_NAME: leave blank when using the OpenAI API directly.
  • AZURE_OPENAI_MODEL: leave blank when using the OpenAI API directly.
  • OPENAI_API_VERSION: leave blank when using the OpenAI API directly.

If using Azure OpenAI, set these variables:

  • OPENAI_API_KEY: your Azure OpenAI API key (see the Azure OpenAI Quickstart).
  • OPENAI_API_TYPE: set this to "azure".
  • OPENAI_API_BASE: the base URL for your Azure OpenAI resource (see the Azure OpenAI Quickstart).
  • AZURE_OPENAI_DEPLOYMENT_NAME: the name of your Azure OpenAI deployment.
  • AZURE_OPENAI_MODEL: the name of the model you are using (see the Azure OpenAI Quickstart).
  • OPENAI_API_VERSION: the OpenAI API version; it should align with the model you are using.

If using Pinecone as the vectorstore, set these variables:

  • PINECONE_API_KEY: your Pinecone API key (see the Pinecone Quickstart).
  • PINECONE_ENVIRONMENT: the name of your Pinecone environment.
  • PINECONE_INDEX: the name of the Pinecone index.
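For example, a .env for the direct-OpenAI case might look like this (the key is a placeholder; the values follow the variables listed above):

```
# .env (direct OpenAI usage)
OPENAI_API_KEY=sk-your-key-here
OPENAI_API_TYPE=open_ai
OPENAI_API_BASE=
AZURE_OPENAI_DEPLOYMENT_NAME=
AZURE_OPENAI_MODEL=
OPENAI_API_VERSION=
```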

๐Ÿณ Running locally (Docker)

This method requires that you have Docker installed on your machine. To install Docker, follow the instructions here. This setup assumes that you set up your environment variables in a .env file in the root directory of the project.

docker build -t kgpt .
docker run --env-file .env -p 9000:9000 kgpt

📸 Screenshots

gif


🤔 How it works

๐Ÿ“ Ingestion (run once to create the vectorstore)

  1. Pull HTML from the Confluent documentation using sitemap.xml, and clean it with BeautifulSoup.
  2. Load the data into a DocumentStore using LangChain's UnstructuredHTMLLoader.
  3. Chunk the documents into smaller chunks using LangChain's TextSplitter.
  4. Create embeddings for each chunk using OpenAI embeddings.
  5. Load the embeddings into a vectorstore using LangChain's vectorstore wrapper.
    1. FAISS is used as the vectorstore in this example. More information on FAISS can be found here.
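The chunking step above can be sketched in plain Python. This is an illustrative stand-in for LangChain's TextSplitter, not the repo's actual implementation; the chunk size and overlap values are arbitrary:

```python
# Simplified stand-in for a character-based text splitter: fixed-size
# chunks with overlap so context isn't lost at chunk boundaries.
def split_into_chunks(text, chunk_size=100, overlap=20):
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break
    return chunks

doc = "".join(chr(65 + i % 26) for i in range(250))  # 250-char dummy document
chunks = split_into_chunks(doc)
print(len(chunks))                        # → 3
print(chunks[0][-20:] == chunks[1][:20])  # → True (adjacent chunks share 20 chars)
```

Each chunk is then embedded and stored, so a question can later be matched against small, focused pieces of the docs rather than whole pages.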

๐Ÿ“ Question-Answering

  1. The user enters a question (and sees the chat history) via the web app.
  2. The web app sends the chat history and user input to the backend, which uses LangChain's ConversationalRetrievalChain to:
    1. Condense the chat history and new input into a standalone question (using ChatGPT).
    2. Look up relevant documents from the vectorstore.
    3. Pass the standalone question and relevant documents to ChatGPT to generate a final answer.
  3. The backend returns the final answer to the web app and adds it to the chat history.
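The retrieval step boils down to nearest-neighbor search over embeddings. Here is a toy plain-Python illustration; the real app uses OpenAI embeddings with FAISS or Pinecone, and the document texts and vectors below are made up:

```python
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy "vectorstore": pre-computed embeddings for each document chunk.
vectorstore = {
    "Kafka brokers store topic partitions.": [0.9, 0.1, 0.0],
    "Schema Registry manages Avro schemas.": [0.1, 0.9, 0.0],
    "ksqlDB runs SQL over streams.":         [0.0, 0.2, 0.9],
}

def retrieve(query_embedding, k=1):
    """Return the k chunk texts most similar to the query embedding."""
    ranked = sorted(vectorstore.items(),
                    key=lambda item: cosine_similarity(query_embedding, item[1]),
                    reverse=True)
    return [text for text, _ in ranked[:k]]

# A query whose embedding is closest to the first chunk:
print(retrieve([0.8, 0.2, 0.1]))  # → ['Kafka brokers store topic partitions.']
```

The retrieved chunks are what get stuffed into the ChatGPT prompt alongside the standalone question, which is why chunking and embedding quality directly affect answer quality.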

Diagram

Troubleshooting

  1. If you receive an error like this:

    WARNING:/Users/<user>/Documents/kGPT/.venv/lib/python3.10/site-packages/langchain/chat_models/openai.py:Retrying langchain.chat_models.openai.acompletion_with_retry.<locals>._completion_with_retry in 4.0 seconds as it raised APIConnectionError: Error communicating with OpenAI.
    

    Check here to see the fix

  2. If you receive this error when using the downloaded vectorstore:

    ERROR:root:IndexFlat.search() missing 3 required positional arguments: 'k', 'distances', and 'labels'
    

    Rerun the ingest.py script to create a new vectorstore.

🚀 Helpful Links

Blog Posts:

Contributing to kGPT

I'm happy to accept contributions to this project. Please open an issue or a pull request.

Future Work

  • Make the app available online (currently only available locally). This will require a hosting service that can support the vectorstore and the web app.
  • Allow users to input their own OpenAI API key in the frontend.
  • Include more documentation sites.
  • Tune the ChatVectorDBChain to improve the quality of the answers.
  • Produce the chat history to a Kafka topic.

Follow me on Twitter & LinkedIn

Twitter: @mvfolino68 LinkedIn: mfolino
