Can someone please explain the distinction between chat mode and query mode to me? Initially, I assumed the only difference was that chat mode retains the previous messages while the underlying process stays the same: context is provided, retrieval is performed using embeddings, and the top-k most relevant chunks are sent to the LLM. However, comparing the outputs of the two modes reveals real differences. Notably, chat mode seems to incorporate a significant amount of out-of-context information, likely drawn from the LLM's own training knowledge (OpenAI's model), which leads to longer responses.
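
For concreteness, here is roughly how I am comparing the two. This is a minimal sketch assuming the LlamaIndex `as_query_engine` / `as_chat_engine` APIs with a placeholder `data` directory and the default OpenAI LLM; depending on your llama-index version the imports may come from `llama_index` rather than `llama_index.core`:

```python
from llama_index.core import VectorStoreIndex, SimpleDirectoryReader

# Build a vector index over local documents ("data" is a placeholder path)
documents = SimpleDirectoryReader("data").load_data()
index = VectorStoreIndex.from_documents(documents)

# Query mode: stateless; retrieve the top-k chunks and answer from them
query_engine = index.as_query_engine(similarity_top_k=2)
print(query_engine.query("What does the document say about X?"))

# Chat mode: keeps conversation history; the chat_mode setting controls
# how retrieval is combined with the LLM's own knowledge
chat_engine = index.as_chat_engine(chat_mode="condense_question")
print(chat_engine.chat("What does the document say about X?"))
```

My understanding is that the answer may also depend on which `chat_mode` is in play (e.g. `condense_question` rewrites the question before retrieving, while other modes let the LLM answer more freely), so if the two behave differently by design, I would appreciate a pointer to where that happens.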