ishaand2

I am trying to work out why my GPU memory utilisation is so high when ingesting documents into the Qdrant vector store.
Despite calling .persist(), it appears that the full set of embeddings is being loaded into memory automatically.

Is there a way to offload the vector store (indices) from GPU memory and load it back when needed?

GPU