Log in
Log into community
Find answers from the community
View all posts
Related posts
Did this answer your question?
π
π
π
Powered by
Hall
Inactive
Updated 10 months ago
0
Follow
Questions
Questions
Inactive
0
Follow
v
valu
10 months ago
Β·
Hey if I have a an array of a thousand questions and I want to search for similarity to a specific question - what's the best way to approach this? Add the bank to a chromadb with metadata and then do a search?
L
v
13 comments
Share
Open in Discord
L
Logan M
10 months ago
Yea pretty much π
v
valu
10 months ago
so i make a document out of each question - put the question number/id as the metadata
v
valu
10 months ago
use a local embedding model to make an index
v
valu
10 months ago
then query
v
valu
10 months ago
will this do both vector and bip search?
L
Logan M
10 months ago
What is bip search? But otherwise yes
v
valu
10 months ago
like keyword search
v
valu
10 months ago
it seems the default vectorstoreindex doesnt have it
v
valu
10 months ago
i should use chromadb?
L
Logan M
10 months ago
Yea combing vector and keyword I usually referred to as hybrid search (at least lately in this space)
Only a few dbs have it built in tbh. Other times you have to setup yourself (coming bm25 with a vector index for example)
L
Logan M
10 months ago
https://docs.llamaindex.ai/en/stable/module_guides/storing/vector_stores.html#vector-store-options-feature-support
L
Logan M
10 months ago
From scratch with sparse vectors + qdrant
https://docs.llamaindex.ai/en/stable/examples/vector_stores/qdrant_hybrid.html#qdrant-hybrid-search
Using builtin with weaviate
https://docs.llamaindex.ai/en/stable/examples/vector_stores/WeaviateIndexDemo-Hybrid.html
Building your own with bm25
https://docs.llamaindex.ai/en/stable/examples/retrievers/reciprocal_rerank_fusion.html#reciprocal-rerank-fusion-retriever
v
valu
10 months ago
used the tutorial to build thx
Add a reply
Sign up and join the conversation on Discord
Join on Discord