Yeah I have been using external apis for running the language model.
I am using Weaviate for storing indexes. On a cpu instance, it seems to take a lot of time. As per their docs, gpu helps. My personal laptop has one, which is why I never ran into this problem before