Find answers from the community

Updated 5 months ago

i'm working on a RAG app using

At a glance
i'm working on a RAG app using LlamaIndex / llamacpp server / vectorpg / flask / react+ typescript chat UI

i've got a prototype working that:
  1. uses a custom VectorDBRetriever to retrieve first batch of nodes with scores
  2. response_syntheziser with custom prompt text_qa_template
  3. RetrieverQueryEngine that uses SentenceTransformerRerank
  4. and finally generating response via CondenseQuestionChatEngine.
problem: i'm unable to figure out how to roll in chat history so that the app functions like a true chat bot.

are there any documents / tutorials / code bases that would be a good reference to help me figure out this last piece?
L
J
4 comments
The condense question chat engine is already using chat history (abeit maybe not the best way)

We have several chat engines/chat modes though
will revisit the chat modules. i remember having issues trying to use the simple chat one with my reranker.

and i've been meaning to look into agents, will dig in!
and maybe condense question isn't the best but it was the only one i could get working. had trouble with context chat as well but i don't recall what it was
Add a reply
Sign up and join the conversation on Discord