Comments (2)
@ozgurdemir I was debugging something else and found your issue here. A quick question: are you sure the filtering is applied twice?
The first line in "VectorStore.similarity_search_with_relevance_scores" removes "score_threshold".
score_threshold = kwargs.pop("score_threshold", None)
This should prevent "FAISS.similarity_search_with_score_by_vector" (no preceding underscore here) from applying filtering...
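To illustrate what I mean, here is a minimal sketch of the pop behaviour. It is not LangChain's actual code: outer_search and inner_search are hypothetical stand-ins for the public method and the store-specific search, and the documents and scores are made up.

```python
# Hypothetical stand-in for a store-specific search that would filter
# if it ever received a score_threshold keyword argument.
def inner_search(query, k=4, **kwargs):
    threshold = kwargs.get("score_threshold")
    results = [("doc_a", 0.9), ("doc_b", 0.4), ("doc_c", 0.1)][:k]
    if threshold is not None:
        results = [(doc, score) for doc, score in results if score >= threshold]
    return results


# Hypothetical stand-in for the public method: it pops score_threshold
# before delegating, so the inner search never sees the threshold.
def outer_search(query, k=4, **kwargs):
    score_threshold = kwargs.pop("score_threshold", None)
    results = inner_search(query, k=k, **kwargs)
    return score_threshold, results


popped, unfiltered = outer_search("some query", score_threshold=0.5)
directly_filtered = inner_search("some query", score_threshold=0.5)
```

Because the key is removed from kwargs before the inner call, unfiltered still contains all three documents, while calling inner_search directly with the threshold keeps only doc_a.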
Just wanted to help.
Regards
from langchain.
Hey @habanoz,
Thanks for your reply. I'm not working on this particular project anymore. I remember that we stumbled on this because we saw weird behaviour when using the relevance_score_fn.
However, looking at the code again, it seems like you're right. VectorStoreRetriever calls VectorStore.similarity_search_with_relevance_scores, which removes the score_threshold parameter and then calls _similarity_search_with_relevance_scores without this parameter. So the filtering step should be circumvented.
In other words, the relevance_score_fn is applied to the returned scores, but the filtering happens on the raw scores.
All in all, it seems like a very confusing implementation and probably not what is intended. The filtering by score depends on the similarity_metric used, so the comparison should be either smaller_than or greater_than depending on the similarity_metric, but this code path is not executed due to the above-mentioned logic.
So I still think there's a bug here :(
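A rough sketch of what I would expect the metric-dependent filtering to look like. This is only an illustration under my own assumptions, not the library's implementation: filter_by_threshold and the higher_is_better flag are hypothetical names, and the scores are made up.

```python
import operator


def filter_by_threshold(docs_and_scores, score_threshold, higher_is_better):
    """Keep results on the 'good' side of the threshold.

    higher_is_better=True  -> similarity-style scores, keep score >= threshold
    higher_is_better=False -> distance-style scores, keep score <= threshold
    """
    keep = operator.ge if higher_is_better else operator.le
    return [
        (doc, score)
        for doc, score in docs_and_scores
        if keep(score, score_threshold)
    ]


# Made-up raw scores for two documents.
scores = [("doc_a", 0.2), ("doc_b", 0.9)]

as_distance = filter_by_threshold(scores, 0.5, higher_is_better=False)
as_similarity = filter_by_threshold(scores, 0.5, higher_is_better=True)
```

With the same raw scores and the same threshold, treating them as distances keeps doc_a and treating them as similarities keeps doc_b, which is why applying one fixed comparison to the wrong kind of score returns the wrong documents.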