Hi, I'm having trouble tracing the cause of this issue in the source code (v0.9.14.post3).
I built a hybrid retriever:

```python
from llama_index.query_engine import RetrieverQueryEngine
from llama_index.retrievers import BM25Retriever, QueryFusionRetriever

vector_retriever = index.as_retriever(similarity_top_k=3)
bm25_retriever = BM25Retriever.from_defaults(docstore=index.docstore, similarity_top_k=3)
retriever = QueryFusionRetriever(
    retrievers=[bm25_retriever, vector_retriever],
    llm=llama,
    mode="reciprocal_rerank",
    num_queries=1,  # set to 1 to disable query generation
    similarity_top_k=3,
    use_async=True,
    verbose=True,
)
query_engine = RetrieverQueryEngine.from_args(retriever)
```
Note that the `llm` argument is initialized to `llama`, which is defined elsewhere as one of the Llama-2 chat models. Yet when I call `query_engine.query()`, it still sends API calls to OpenAI for completions. I thought it shouldn't be doing that, since I've passed a non-default model into the `llm` argument of the `QueryFusionRetriever` class.

What gives?
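To illustrate what I suspect is happening, here's a stripped-down toy sketch (plain Python, not the real llama_index API; all class names here are made up) of the behavior I'm seeing: the fusion retriever and the response synthesizer each hold their own LLM reference, so overriding the retriever's LLM leaves the synthesizer on its default.

```python
# Toy model of the suspected behavior -- NOT llama_index code.

class DefaultOpenAI:
    """Stand-in for the library's default LLM."""
    name = "openai"
    def complete(self, prompt):
        return f"[{self.name}] {prompt}"

class Llama2:
    """Stand-in for my locally configured Llama-2 chat model."""
    name = "llama-2"
    def complete(self, prompt):
        return f"[{self.name}] {prompt}"

class FusionRetriever:
    def __init__(self, llm):
        # This llm is only used for query generation, which is
        # disabled anyway when num_queries=1.
        self.llm = llm

class QueryEngine:
    def __init__(self, retriever, llm=None):
        self.retriever = retriever
        # Response synthesis falls back to the global default LLM,
        # not the retriever's llm -- mirroring what I observe.
        self.llm = llm or DefaultOpenAI()
    def query(self, text):
        return self.llm.complete(text)

engine = QueryEngine(FusionRetriever(llm=Llama2()))
print(engine.query("hello"))  # -> "[openai] hello", despite passing Llama2 to the retriever
```

If this mental model is right, the `llm` on `QueryFusionRetriever` only affects query generation, and synthesis still needs its own override.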