It could be one of these reasons:
- Are you indexing large amount of data and keeping it in memory?
- Using a local LLM?
As having a large indexing in local memory can also cause slowness and same goes for LLM.
If it is the first case try using vector sctores like Chroma/Pinecone/Weaviate etc
For second, I guess you'll have to buy some more Gigs😅