You can put a retriever into a condense plus context chat engine, or wrap it into a query engine and give it as a tool to an agent
Just have to Instansiate without as_chat_engine shortcuts
from llama_index.core.chat_engine import CondensePlusContextChatEngine
engine = CondensePlusContextChatEngine.from_defaults(retriever, llm=llm, ...)