Building the index would be the same speed. I think
a) but loading will be faster with chroma (or any vector db really). Although chroma doesn't really advertise "speed" as a feature, as much as other vector dbs
b) it will scale to larger vector dbs better without using so much RAM
I think testing with
query()
is not quite the right approach. The majority of time spent will be in LLM calls, but quite a large margin. I would be testing retrieve instead
retriever = index.as_retriever(similarity_top_k=2)
nodes = retriever.retrieve("query")