Please provide us with the following information: <h

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Address bot mistakes/incorrect information about azure-search-openai-demo HOT 5 OPEN

bp3000bp commented on August 17, 2024

Address bot mistakes/incorrect information

from azure-search-openai-demo.

Comments (5)

pamelafox commented on August 17, 2024

I see that you read the doc on improving answer quality already. Have you identified if the incorrect answers are due to the retrieval step or the LLM step?

If they are due to the retrieval step, then you may want to try out different chunking strategies or try a content chunking strategy like concatenating the document title to each chunk. You can also try increasing the retrieval count and/or setting a minimum reranker score. Have you tried running evaluations with https://github.com/Azure-Samples/ai-rag-chat-evaluator to see if any parameters increase answer quality for you?

You can also try changing to a more powerful model like gpt-4. That won't help with the retrieval step, but that could help if the problem is that the model isn't distinguishing well between helpful results and irrelevant results in the sources.

from azure-search-openai-demo.

pamelafox commented on August 17, 2024

As for whether you can teach the bot: I would first start off with taking those examples that it does poorly on, and add them to an evaluation data set. Then you can run evaluations with different parameters and see whether you can improve results.
There's only so much context that you can fit into the prompt, so I'm not sure it'd work to include stuff in the system prompt like "vacation policy is in doc X". Perhaps it would if your number of documents is small enough.
The way to actually teach these models is fine-tuning, which you can certainly try, but the expense may not be worth it. There's a technique called RAFT from Berkeley which is designed for improving results for RAG applications. The retrieval still needs to be good, however.

from azure-search-openai-demo.

bp3000bp commented on August 17, 2024

@pamelafox It's with retrieval. I believe it's because there are some documents with overlapping content. For example, another one that the bot often gets wrong is confusing the "Company credit card" doc with the "Business card" doc.

How can I concatenate the document title to each chunk? Is there documentation on this? Or how else could I address the bot getting confused in retrieval with overlapping content? Already added metadata to these PDFs but that isn't seeming to help.

I don't know how complicated this would be (I'm just a novice IT intern college student), but ideally, I'd be able to tell the bot if the user says certain keywords like "company card" don't even bother going to the semantic search ranker thing, just skip that step, I'll tell you the exact file to reference and then you can take that info, send to LLM and complete process. This might be something I have to figure out on my own, but please let me know if you think there's a way that could work.

Thank you for the responses and helpful information, it is much appreciated.

from azure-search-openai-demo.

bp3000bp commented on August 17, 2024

@pamelafox Waiting to hear back on this, let me know thank you!

from azure-search-openai-demo.

pamelafox commented on August 17, 2024

To concatenate the title to each chunk, you would modify the code here:

azure-search-openai-demo/app/backend/prepdocslib/searchmanager.py

Line 211 in e8fd819

documents = [

Specifically, instead of "content": section.split_page.text, you would set that content field to something like "Document title: {title}\n Document chunk: {section.split_page.text}"

If you wanted it included in the vector embedding, you'd pass that string to the embeddings as well:

azure-search-openai-demo/app/backend/prepdocslib/searchmanager.py

Line 237 in e8fd819

texts=[section.split_page.text for section in batch]

As for skipping a search directly and going directly to that file, check out this blog post where I describe one way to do that:
https://blog.pamelafox.org/2024/03/rag-techniques-using-function-calling.html

from azure-search-openai-demo.

Address bot mistakes/incorrect information about azure-search-openai-demo HOT 5 OPEN

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent