Hi can anyone point me to anexample for a local llm chat bot with following steps-
- Retrieve documents from qdrant store (done)
- Rerank retrieved results with cross encoder ( I saw an example of hugging face but not sure how to apply this to retrieved results)
- Create an llm with chat history and context. It will have a custom prompt to use chat history and context for answering
4 Put all of above in a continuous chat bot experience with ollama