Find answers from the community

s
F
Y
a
P
Home
Members
lucastonon
l
lucastonon
Offline, last seen last month
Joined September 25, 2024
l
lucastonon
ยท

Retrievers

One quick feedback:
I believe that most of the documentation is very centered in query engines (e.g. https://gpt-index.readthedocs.io/en/latest/examples/vector_stores/WeaviateIndexDemo-Hybrid.html)
In most of my applications I am not that interested in the query engines, but I am very interested in the retriever - specially because the query engine has its own prompt and although it is customizable I have other engines I want to couple with my prompts
I deeply believe that having this double standard (query engine as a 1st class citizen & retriever as 2nd class) can lead to some bias also in the code development
๐Ÿ™‚ idk if you guys agree with it
3 comments
l
L
Hello! I've been testing with hybrid search for both weaviate and pinecone and got very weird results.
I am doing a search for restaurants based on their descriptions - here I am printing the name of the restaurant and the score.
It seems that bm25 is simply not working when I setup for alpha>0.
When using Weaviate (exact same setup) I got score = 1 for every alpha > 0 - only regular got the right score.
obs: this happens for no matter what query I use
obs2: I am using both the free version of pinecone and weaviate

I am happy to share more code if necessary
Plain Text
ret = idx.as_retriever(similarity_top_k=5)
bm25 = idx.as_retriever(similarity_top_k=5, vector_store_query_mode="hybrid", alpha=0.0)
hret = idx.as_retriever(similarity_top_k=5, vector_store_query_mode="hybrid", alpha=0.75)
hsnw = idx.as_retriever(similarity_top_k=5, vector_store_query_mode="hybrid", alpha=1.0)
5 comments
L
l