Log in
Log into community
Find answers from the community
View all posts
Related posts
Was this helpful?
π
π
π
Powered by
Hall
Inactive
Updated 2 years ago
0
Follow
What is the fastest way to build vector
What is the fastest way to build vector
Inactive
0
Follow
At a glance
D
Dove
2 years ago
Β·
What is the fastest way to build vector index from nodes with precomputed embeddings?
Is it just
index = VectorStoreIndex(nodes)
?
It still seems to take a long time (20mins for 400k nodes), considering that the embeddings for nodes are already computed.
L
D
12 comments
Share
Open in Discord
L
Logan M
2 years ago
So every node has
node.embedding
already set?
D
Dove
2 years ago
Yes
L
Logan M
2 years ago
hmm ok good. Although I just read your message again and you have 400K nodes lol
L
Logan M
2 years ago
at that point, loading that all into memory and moving stuff around is going to be the biggest bottleneck
L
Logan M
2 years ago
not much else to do to improve speeds
L
Logan M
2 years ago
with that many nodes I highly recommend using a vector DB integration
D
Dove
2 years ago
Which DB do you recommend?
L
Logan M
2 years ago
weaviate or pinecone is nice for hosted stuff
Self-hosted I would use qdrant or chroma
Tbh I think qdrant will be the fastest
D
Dove
2 years ago
Got it, thanks!
L
Logan M
2 years ago
Like, it might still be slow-ish to insert 400K nodes, but that should be a one time cost π
D
Dove
2 years ago
I tested the
:memory:
mode for qdrant, and it seemed to be sligtly faster than the vanilla in-memory storage
L
Logan M
2 years ago
I'd you can run a hosted version, like with their docker image, it may be faster too (pretty sure the backend is all written in rust then)
Add a reply
Sign up and join the conversation on Discord
Join on Discord