
Updated 4 months ago

hey, I have a big list of documents and

Hey, I have a big list of documents and I'm trying to run VectorStoreIndex.from_documents on it, but generating the embeddings takes a very long time. How can I fix this? Thanks.
6 comments
Which embedding model are you using?
I'm using OpenAI's ada.
I tried with the default settings, and it took me 20 min for 75,000 pages of PDFs.
Is yours taking a similar time?
I'm using the paged CSV loader, which splits each row of my CSV into a document. For 100k+ documents, the process takes more than an hour.
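With 100k+ tiny row-documents, a lot of the time probably goes into the number of embedding requests rather than the amount of text. A common fix is to send more texts per request and run a few requests at once. Here is a minimal sketch of that idea in plain Python. Note that `embed_all`, `batched`, and `fake_embed_batch` are hypothetical names made up for this example, not LlamaIndex or OpenAI functions, and `fake_embed_batch` only stands in for a real embeddings request. The batch size and worker count are guesses to tune against your rate limits.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence

Vector = List[float]


def batched(items: Sequence[str], batch_size: int) -> List[List[str]]:
    """Split items into consecutive batches of at most batch_size."""
    return [list(items[i:i + batch_size]) for i in range(0, len(items), batch_size)]


def embed_all(
    texts: Sequence[str],
    embed_batch: Callable[[List[str]], List[Vector]],
    batch_size: int = 100,
    max_workers: int = 4,
) -> List[Vector]:
    """Embed texts in batches, running several batches concurrently.

    embed_batch is assumed to take one batch of texts and return one
    vector per text, in the same order (one API request per call).
    """
    batches = batched(texts, batch_size)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map yields results in input order, so vectors stay
        # aligned with the original texts.
        results = list(pool.map(embed_batch, batches))
    # Flatten the per-batch results into one list of vectors.
    return [vec for batch in results for vec in batch]


def fake_embed_batch(batch: List[str]) -> List[Vector]:
    """Hypothetical stand-in for a real embeddings request."""
    return [[float(len(text))] for text in batch]


if __name__ == "__main__":
    rows = [f"row {i}" for i in range(250)]
    vectors = embed_all(rows, fake_embed_batch, batch_size=100, max_workers=4)
    print(len(vectors))
```

As far as I know, LlamaIndex's embedding classes expose an `embed_batch_size` setting that covers the batching part, so it is worth checking that in the docs for your version before writing anything custom.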