Score

At a glance

The community member who posted the original question asks how to retrieve results by score directly during retrieval, instead of using the similarity_top_k parameter. The comments note that the scores of retrieved nodes are available, but there appears to be no straightforward way to retrieve nodes by a relevance score threshold rather than by a fixed number of results.

One community member suggests a workaround: set similarity_top_k to a very high value and then post-process the results. The comments do not identify a built-in option for threshold-based retrieval.

ttharak#3

Hi,

How can I get score-based results directly during the retrieval process, instead of using similarity_top_k?

4 comments

Logan M

I'm not sure what you mean exactly. You can get the scores of the nodes directly:

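# Assumes `index` is an already-built LlamaIndex index.
retriever = index.as_retriever(similarity_top_k=2)
nodes = retriever.retrieve("query")
# Each retrieved node carries its similarity score.
print(nodes[0].score)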

ttharak#3

So here we are specifying similarity_top_k, which takes the top n documents, where n is a predefined number.

I have a use case where users might select a value on a scale of 0 to 1, where 0 means less relevant data and 1 means more relevant data.
So here we would retrieve nodes based on relevance, not on the number of documents.

Retrieving search results based on relevance is more useful than limiting nodes with the top_k param.

ttharak#3

I want to retrieve nodes with a relevance score of 0.5 or greater, so it doesn't matter whether the search returns 20, 200, or 200k nodes.
I will just process all of those nodes and generate a response using a custom LLM.

Logan M

I don't think there's an option for this, other than setting the top k to some crazy value, and then doing some postprocessing on the results
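
The workaround described above (retrieve with a very large similarity_top_k, then filter by score afterwards) could look roughly like the sketch below. It is only an illustration: `ScoredNode` and `filter_by_score` are hypothetical names introduced here, and the hard-coded list stands in for whatever `retriever.retrieve("query")` returns. The only thing it relies on from the thread is that each retrieved node exposes a `.score` attribute.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass
class ScoredNode:
    """Hypothetical stand-in for a retrieved node; only `.score` matters here."""
    text: str
    score: Optional[float]


def filter_by_score(nodes: Iterable[ScoredNode], min_score: float) -> List[ScoredNode]:
    """Keep nodes whose score is at least `min_score`; drop nodes without a score."""
    return [n for n in nodes if n.score is not None and n.score >= min_score]


# Stand-in for the result of:
#   index.as_retriever(similarity_top_k=<some large value>).retrieve("query")
retrieved = [
    ScoredNode("a", 0.91),
    ScoredNode("b", 0.62),
    ScoredNode("c", 0.50),
    ScoredNode("d", 0.31),
    ScoredNode("e", None),
]

# Keep everything at or above the user-selected relevance threshold.
relevant = filter_by_score(retrieved, min_score=0.5)
print([n.text for n in relevant])
```

Because the filter only reads `.score`, the same list comprehension should work on the real retrieved nodes. LlamaIndex also appears to ship a `SimilarityPostprocessor` with a `similarity_cutoff` argument that does this kind of filtering; check it against the installed version before relying on it. Note that the meaning of a threshold such as 0.5 depends on the embedding model and similarity metric in use.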
