curious if anyone here has used gpt-index on top of a truly large corpus of data without training the model on your corpus, and whether the results were production quality? i'm in the process of trying to implement this on my corpus of data (115k docs, ~15k words per doc) using OpenAI, and I'm concerned I will need to change course and use a model I can actually train on my data because relevancy won't be good enough