Has anyone figured out how to trigger RAG retrieval inside a reasoning models <think> steps? Similar to the Search-o1 paper here:
https://search-o1.github.ioIn my opinion, this would be the best way to use deepseek to optimize RAG since new retrieve knowledge can push the reasoning chain in a different direction.
I tried to make the system prompt of deepseek llama 8b amd qwen distil 32b output <search> tokens but it does not like to follow instructions !