
Checking similarity

At a glance

The community members discuss how to fetch relevant source nodes without calling a large language model (LLM). They suggest index.query(..., response_mode="no_text") to fetch only the source nodes without an LLM call, and index.query(..., similarity_cutoff=0.5) to filter results by similarity, though the behavior when every node is filtered out is uncertain. They also discuss increasing the number of results returned with index.query(..., similarity_top_k=4), and confirm that using similarity_cutoff without response_mode="no_text" will not call the LLM, only the embeddings model.

can I do that before I start the whole LLM QA bit?
Yes! If you do something like index.query(..., response_mode="no_text") it will only fetch the source nodes and not call the LLM
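To make the flow concrete, here is a minimal pure-Python sketch of what response_mode="no_text" implies: retrieval happens, but the LLM synthesis step is skipped. All names here are illustrative, not the library's internals.

```python
def query(scored_nodes, response_mode="default"):
    """Toy model of the query flow: retrieve, then (maybe) synthesize."""
    # Retrieval: rank nodes by similarity score (embeddings only, no LLM).
    source_nodes = sorted(scored_nodes, key=lambda n: n["score"], reverse=True)
    if response_mode == "no_text":
        # No LLM call: only the retrieved source nodes come back.
        return {"response": None, "source_nodes": source_nodes}
    # In the real library, an LLM call would synthesize the answer here.
    return {"response": "<LLM-synthesized answer>", "source_nodes": source_nodes}

nodes = [{"text": "shipping policy", "score": 0.82},
         {"text": "holiday FAQ", "score": 0.31}]
result = query(nodes, response_mode="no_text")
print(result["response"])            # None -- the LLM was never called
print(len(result["source_nodes"]))   # 2
```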
You might also be interested in the similarity filtering option

index.query(..., similarity_cutoff=0.5)

But I'm not sure what the behavior is if all nodes get filtered out πŸ€”
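A quick pure-Python sketch of what a similarity cutoff does (function name and data shape are mine, not the library's), including the edge case where the cutoff filters out everything:

```python
def apply_cutoff(scored_nodes, similarity_cutoff):
    """Keep only nodes whose similarity score clears the threshold."""
    return [n for n in scored_nodes if n["score"] >= similarity_cutoff]

nodes = [{"text": "product specs", "score": 0.74},
         {"text": "bahamas holidays", "score": 0.22}]

print(apply_cutoff(nodes, 0.5))  # only the 0.74 node survives
print(apply_cutoff(nodes, 0.9))  # [] -- everything filtered out (the edge case)
```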
I'll test it, and set it to 0.0 🀣
yes, I think cutoff might be what I need
maybe it should be in the demonstration or higher up in the docs?
seems like a pretty valid use case: customers can ask about a company's products, not about holidays in the Bahamas
I'll poke about in the code to see exactly what they both do
Sounds good! πŸ‘
hmmmmm, I only ever get one item in response.source_nodes
is that expected behaviour for the default mode?
Yup! You can increase this though

index.query(..., similarity_top_k=4)
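A toy illustration (again, not the library's internals): a top-k parameter caps how many of the highest-scoring nodes come back, and a default of 1 would explain only ever seeing one item in response.source_nodes.

```python
def top_k(scored_nodes, similarity_top_k=1):
    """Return the highest-scoring nodes, capped at similarity_top_k."""
    ranked = sorted(scored_nodes, key=lambda n: n["score"], reverse=True)
    return ranked[:similarity_top_k]

nodes = [{"text": "a", "score": 0.9},
         {"text": "b", "score": 0.7},
         {"text": "c", "score": 0.4}]
print([n["text"] for n in top_k(nodes)])                      # ['a'] -- default k=1
print([n["text"] for n in top_k(nodes, similarity_top_k=4)])  # all 3; k caps at list length
```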
Will it call the LLM when I use index.query(..., similarity_cutoff=0.5) without specifying response_mode="no_text"?
It won't call the LLM, only the embeddings model (assuming you have a vector index)
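The point that only the embeddings model is involved can be seen from how similarity scoring works: it's just vector math over embeddings, sketched here with plain cosine similarity (no LLM anywhere in the loop).

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

query_emb = [1.0, 0.0]   # embedding of the query text
doc_emb = [0.6, 0.8]     # embedding of a document node
print(round(cosine_similarity(query_emb, doc_emb), 2))  # 0.6
```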