what do you think would be the best approach to do RAG over let's say 70k-100k documents? One approach as you mentioned is a Metadata filter so that we can do a pre-filter (in that case does llama-index support the in operator). Or do you think something like a Knowledge graph is a good approach?