I am trying to check why my GPU memory utilisation is so high when ingesting documents into the qdrant vector store. Despite calling .persist() , it appears that the full set of embeddings are being auto-loaded in memory.
Is there a way to unload/offload the vector store (indices) from & to GPU memory?