
Updated 2 months ago


Can someone please explain the distinction between chat mode and query mode? Initially, I believed the only difference was that chat mode retains the previous messages, while the underlying process stays the same: context is provided, retrieval is performed using embeddings, and the top k most relevant results are sent to the LLM. However, comparing the outputs of the two modes reveals differences. Notably, chat mode seems to incorporate a significant amount of out-of-context information, likely drawn from the OpenAI model's own knowledge rather than from the index, leading to longer responses.
3 comments
There are several different ways to do chat, hence there are several chat modes.

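The default is just an agent with the index as a tool. Given a user message and the chat history, it decides whether to query the index or respond without it, which is why some answers draw on the LLM's general knowledge instead of retrieved context.
Other chat modes offer different approaches.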
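
To make the difference concrete, here is a rough toy sketch in plain Python. It is not the library's actual implementation: `retrieve`, `fake_llm`, `query_mode`, and `AgentChat` are hypothetical names, word overlap stands in for embedding similarity, and the routing rule (retrieve only when the message contains a question mark) is a placeholder for the decision a real agent would leave to the LLM.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Tiny in-memory "index" (illustrative data only).
DOCS = [
    "Chat mode keeps the conversation history between turns.",
    "Query mode retrieves the top k chunks and sends them to the LLM.",
    "The default chat mode wraps the index as a tool for an agent.",
]


def retrieve(question: str, k: int = 2) -> List[str]:
    """Toy retriever: rank docs by word overlap (stand-in for embeddings)."""
    q_words = set(question.lower().split())
    ranked = sorted(
        DOCS,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def fake_llm(prompt: str) -> str:
    """Stand-in for a real LLM call; it only echoes the prompt it received."""
    return f"LLM({prompt})"


def query_mode(question: str) -> str:
    """Query mode: always retrieve, then answer from the retrieved context."""
    context = retrieve(question)
    return fake_llm(f"context={context} question={question}")


@dataclass
class AgentChat:
    """Agent-style chat: keeps history and only sometimes uses the index."""

    history: List[Tuple[str, str]] = field(default_factory=list)

    def should_use_index(self, message: str) -> bool:
        # Hypothetical routing rule; a real agent lets the LLM decide this.
        return "?" in message

    def chat(self, message: str) -> str:
        if self.should_use_index(message):
            context = retrieve(message)
            prompt = f"history={self.history} context={context} message={message}"
        else:
            # No retrieval: the answer comes from the model's own knowledge.
            prompt = f"history={self.history} message={message}"
        reply = fake_llm(prompt)
        self.history.append((message, reply))
        return reply
```

In this sketch, `query_mode` always puts retrieved context in the prompt, while `AgentChat.chat` can skip retrieval entirely. That skipped-retrieval path is where out-of-context answers would come from.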