Find answers from the community

Updated last year

Hey Has anyone experience with the

At a glance
Hey! Has anyone experience with the retrieval speed of the AutoMergingRetriever? It is taking for me about 1 min/query, with pinecone vector db. I am wondering if it's only for me so slow.
L
D
14 comments
Is it the retrieval speed or the overall query/synthesis speed?
It is the retrieval speed. Retriever engine takes about 1 min, to retrieve nodes, query engine is almost exactly the same
Does the speed improve if you just use the base vector store rather than pinecone?
(Just narrowing down the cause)
Give me a second to test it
10 min and the embedding will finish. What is your experience? Is it below 10s?
embedding can take a while if you have a lot of documents, but you can speed it up

see here
https://discord.com/channels/1059199217496772688/1147202918987071549/1147203675010375680
I changed batch size to 2000 and run it on cuda
Btw. without pinecone db the retrieval time is 6 sec
What do you think the reason could be? I am using the llama_index pinecone wrapper. With simple retrieval it was working super fast before.
Hmm, tbh I'm not 100% sure how this retriever works, I haven't looked at the code yet lol

Lets see if the source code reveals anything
Hmm nothing really that special, my only guess is there is some latency with the pinecone index πŸ€” Pinecone is only used once during the initial retrieve

initial_nodes = self._vector_retriever.retrieve(query_bundle)
Okay, I will try to work out something. Thanks!
Add a reply
Sign up and join the conversation on Discord