
Updated 6 months ago

We are migrating from pgvector to qdrant

We have a DB of 200k chunks/vectors.
When we uploaded this database to pgvector, the complete embedding + DB upload took about 4 hours.
Now we want to migrate to Qdrant (retrieval speed is better), but the embedding + upload process is taking more than 10 days!! We have really searched everywhere and have no idea how to resolve this slowness issue.
You have enable_hybrid=True, which by default runs a model locally to generate sparse embeddings -- this will be super slow if it's running on CPU.
Turn that off and you will see it run 1000x faster.
Optionally, if you still want sparse embeddings, you can customize how they are generated (maybe you have an external API to call, or some other method that will be faster).
Thank you @Logan M for your answer.
Do you know why I'm getting this error now:
UnexpectedResponse: Unexpected Response: 400 (Bad Request)
Raw response content:
b'{"status":{"error":"Wrong input: Not existing vector name error: "},"time":0.018649379}'
Hmm, probably related to switching enable_hybrid=False on the same collection? (Assuming that's what you did)
Yes, I set enable_hybrid to False. I also created a new collection but still get the same error.
Hmmm, do you have an outdated version of the vector store? pip install -U llama-index-vector-stores-qdrant
I just re-installed it and still have the same error.
Hmm. I can try to replicate, but I'm like 99% sure it will work fine for me 😅

Just to confirm, what is the exact code you are running? I.e., how do you create the vector store, and how are you inserting?
I think I saw in your notebook that you were creating collections manually, which might be the issue.
I'm running exactly the code in this Jupyter notebook.
When you say I'm creating the collection manually, what can I do instead, please?
if not client.collection_exists(collection_name=COLLECTION_NAME):
    client.create_collection(
        collection_name=COLLECTION_NAME,
        optimizers_config=models.OptimizersConfigDiff(indexing_threshold=0,),
        hnsw_config=models.HnswConfigDiff(on_disk=True),
        vectors_config={
            "text-dense": models.VectorParams(
                size=3072, 
                distance=models.Distance.COSINE,
            )
        },
        sparse_vectors_config={
            "text-sparse": models.SparseVectorParams(
                index=models.SparseIndexParams()
            )
        },
    )


Remove this code. The vector store handles collection creation for you for new collections.
Yeaaaaaah, it works!! Thanks a lot Logan! I appreciate your help.
I had the same issue. Do you recommend a snippet of code to enable hybrid search later, after uploading all the documents?
You can't really enable it afterwards; if you want hybrid, you need to generate the sparse embeddings at upload time.
But it makes the upload process really slow!
Is there a way to reduce the latency?
Indeed it does. If you want hybrid search, you should have the hardware to generate the sparse embeddings 👀

You can completely customize how the sparse embeddings are generated
https://docs.llamaindex.ai/en/stable/examples/vector_stores/qdrant_hybrid/?h=qdrant+hyb#advanced-customizing-hybrid-search-with-qdrant
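As a shape-only sketch of what that docs page lets you plug in: the hashed bag-of-words encoder below is a toy stand-in (not a real sparse model), and the sparse_doc_fn / sparse_query_fn hook names are taken from that linked example, so check them against your installed version:

```python
from typing import List, Tuple


def sparse_doc_vectors(texts: List[str]) -> Tuple[List[List[int]], List[List[float]]]:
    """Toy sparse encoder: hashed bag-of-words term counts.

    Returns one (indices, values) pair per input text. Replace this with a
    real sparse model or an external embedding API for production use.
    """
    all_indices, all_values = [], []
    for text in texts:
        counts = {}
        for token in text.lower().split():
            idx = hash(token) % 100_000  # toy vocabulary of 100k hash buckets
            counts[idx] = counts.get(idx, 0) + 1
        all_indices.append(list(counts.keys()))
        all_values.append([float(c) for c in counts.values()])
    return all_indices, all_values


# In this toy setup, queries can reuse the same encoding.
sparse_query_vectors = sparse_doc_vectors

# Wiring it up (per the linked docs page; left commented so the sketch
# runs standalone):
# vector_store = QdrantVectorStore(
#     collection_name="my_collection",
#     client=client,
#     enable_hybrid=True,
#     sparse_doc_fn=sparse_doc_vectors,
#     sparse_query_fn=sparse_query_vectors,
# )
```

Anything cheaper than running a neural sparse model on CPU (a remote API, a batched GPU job, or a term-based scheme like BM25/SPLADE served externally) will speed up the upload.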