I'm trying to index a taxonomy using GPTVectorStoreIndex. I have built a set of documents, each containing only one line, like:
Doc1: The name of group 00.01 is appliances. Doc2: The name of group 00.03 is housing. Doc3: The name of group 02.02 is animals.
From this I would like to be able to ask: What is the name of group 00.01?
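A minimal sketch of how such one-line documents and the lookup question could be generated, assuming the taxonomy is available as a mapping from group ID to name. The `taxonomy` dict and the `build_document_texts` helper are illustrative names only. Wrapping each string in a document object and passing it to GPTVectorStoreIndex is left out, since that step depends on the installed library version.

```python
# Illustrative sample data taken from the examples above;
# the real taxonomy would contain many more groups.
taxonomy = {
    "00.01": "appliances",
    "00.03": "housing",
    "02.02": "animals",
}


def build_document_texts(groups):
    """Return one sentence per group, in the same shape as the documents above."""
    return [f"The name of group {gid} is {name}." for gid, name in groups.items()]


# One string per document; each would be indexed separately.
texts = build_document_texts(taxonomy)

# The kind of question asked against the index.
question = "What is the name of group 00.01?"
```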
My issue is that for some numbers it works, and for others GPT just answers that it can't find anything. It seems kind of random. Is this not supposed to work?
I have also tried putting everything in one document, with one line per group, but the results are again hit or miss.
Thanks for your input. It does not match my results, though: numbers/IDs work to a certain extent when the index holds a few hundred elements. It only breaks down when I scale up to thousands.
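To put a number on "works to a certain extent", the hit rate could be measured at different index sizes. The sketch below assumes a callable `ask` that takes a question string and returns the answer string, for example a thin wrapper around the index's query call. Both `hit_rate` and `fake_ask` are hypothetical names, and `fake_ask` is only a stand-in so the example runs without an index.

```python
def hit_rate(groups, ask):
    """Fraction of groups whose name appears in the answer to the lookup question.

    `ask` is any callable mapping a question string to an answer string
    (an assumption; in practice it would wrap the index query).
    """
    if not groups:
        return 0.0
    hits = 0
    for gid, name in groups.items():
        answer = ask(f"What is the name of group {gid}?")
        # Count a hit when the expected name shows up anywhere in the answer.
        if name.lower() in answer.lower():
            hits += 1
    return hits / len(groups)


def fake_ask(question):
    """Stand-in for the real query: knows one group, gives up on the rest."""
    if "00.01" in question:
        return "The name of group 00.01 is appliances."
    return "I cannot find that group."


sample = {"00.01": "appliances", "00.03": "housing", "02.02": "animals"}
rate = hit_rate(sample, fake_ask)
print(f"hit rate: {rate:.2f}")
```

Running the same check against indexes built from a few hundred and from a few thousand groups would show where the lookups start to fail, rather than relying on spot checks.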
I'm using GPTVectorStoreIndex in conjunction with SentenceSplitter and SimpleNodeParser.