Settings.chunk_size
to 128, it slows down my TEI embeddings waaay too much, from 2s to 40s.embed_batch_size
to 128, but that didn't help. Also, it seems like TEI is only using CPU and not my GPUs, even though I launched its docker image with "docker run --gpus all".VectorStoreIndex
that's getting bogged down and not TEI itself.insert
_batch_size set to 4096.index = VectorStoreIndex.from_documents(documents, show_progress=True, insert_batch_size=4096)
docker run --gpus "device=0"
, it does use the first GPU. Setting it to "all" per their directions doesn't seem to work, though.