Based on your seemingly endless wealth of knowledge, can you let me know your thoughts on this:
- A relatively large store of PDF/TXT documents (say 200 for this example).
- Answers need to retain as much detail as possible.
- Roughly a 50/50 split between answers drawn from a single document and answers that need to be synthesized across multiple documents.
With the above in mind, what is the best approach (in your opinion) available today to achieve this? I am currently using a graph to collate multiple simple vector indexes and then querying the graph. I'm still very early in my exploration of this framework, so I'm keen to get your thoughts.
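
For illustration only, the setup described above (one simple vector index per document, collated under a graph that is then queried) can be sketched in plain Python. This is a toy under stated assumptions, not the framework's API: `embed`, `DocIndex`, and `GraphIndex` are hypothetical names, and a bag-of-words counter stands in for a real embedding model.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class DocIndex:
    """Hypothetical per-document index: fixed-size word chunks and their vectors."""

    def __init__(self, name, text, chunk_size=40):
        words = text.split()
        self.name = name
        self.chunks = [
            " ".join(words[i:i + chunk_size])
            for i in range(0, len(words), chunk_size)
        ]
        self.vectors = [embed(c) for c in self.chunks]
        # Assumption: the whole-document vector stands in for a document summary.
        self.summary = embed(text)

    def top_chunks(self, query_vec, k):
        """Return the k best (score, doc_name, chunk) tuples for this document."""
        scored = [
            (cosine(query_vec, v), self.name, c)
            for c, v in zip(self.chunks, self.vectors)
        ]
        return sorted(scored, key=lambda s: s[0], reverse=True)[:k]


class GraphIndex:
    """Hypothetical top-level node that routes a query to the best documents."""

    def __init__(self, doc_indexes):
        self.docs = doc_indexes

    def retrieve(self, query, top_docs=2, chunks_per_doc=1):
        q = embed(query)
        # Step 1: rank documents by their summary vectors.
        ranked = sorted(self.docs, key=lambda d: cosine(q, d.summary), reverse=True)
        # Step 2: pull the best chunks from each selected document.
        hits = []
        for doc in ranked[:top_docs]:
            hits.extend(doc.top_chunks(q, chunks_per_doc))
        return sorted(hits, key=lambda h: h[0], reverse=True)


docs = [
    DocIndex("billing", "Invoices are issued monthly and payment is due within thirty days of the invoice date."),
    DocIndex("security", "Passwords must be rotated every ninety days and two factor authentication is required."),
    DocIndex("travel", "Travel expenses require manager approval and receipts must be submitted within thirty days."),
]
graph = GraphIndex(docs)

# Single-document question: route to one document only.
print(graph.retrieve("When is payment due on invoices?", top_docs=1))
# Cross-document question: gather chunks from several documents.
print(graph.retrieve("What must happen within thirty days?", top_docs=2))
```

In this sketch, `top_docs` is the knob that matters for the 50/50 split: setting it to 1 answers from a single document, while a larger value gathers chunks from several documents that a synthesis step (not shown here) would then combine.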