I'm creating a FastAPI to use VectorStoreIndex with HuggingFace LLM and embed model. When I am running the FastAPI app on my local Mac M1 and sending a request to it, I am getting this error:
RuntimeError: MPS backend out of memory
. Has anyone dealt with this before?