π Looking for guidance on the best method for structuring data for the index. I have an ecommerce website, that I'm looking to build a bot to answer customer questions about our products. I'd like to index the product information, use instructions, care instructions, customer reviews, and FAQs. What's the best way to ensure all the relevant text data for a given product is indexed together. Additionally, we have size and color variants that I want the output to consider when fed a query that references size or color. thanks!
hi casey! one thing you could consider is having a list index for every product, and a vector index over products. the vector index will help "route" the query to the right product, and a list index ensures that any query will go through all information in the list index
How do you define a vector index over multiple list indexes? I thought we had to do it the other way around (define a list index over multiple tree indexes)
@Kaveen there aren't really any rules for which indexes you can combine! (At least as far as I know... maybe a tree index can't be top level, but who knows lol)
In this scenario, the top level vector index would need a summary for each list index.
Then, queries will (hopefully) be matched to the appropriate product
Thank you! Would it make sense to do a tree index over attributes of the product, ex. reviews, care instructions, etc. then a list index with the product info over the tree?
Hmm it could work! I might start with the vector index over the list indexes first (since it's easier/more simple), to get a good baseline, and then see how it performs from there π
Hi @jerryjliu0 if the index is set up this way i.e. every list index represents a product, will it be able to answer questions like, say "What products do you have in red color?" My guess is it won't since data is only indexed by the products and not other characteristics of the products.