llama.cpp: loading model from /Users/my_user/Library/Caches/llama_index/models/llama-2-13b-chat.ggmlv3.q4_0.bin error loading model: unknown (magic, version) combination: 67676a74, 00000003; is this really a GGML file? llama_init_from_file: failed to load model
index = VectorStoreIndex.from_documents(documents, service_context=service_context)
from_documents()
is creating a default service context because you didn't pass one in. And since you don't have an openai key set, it's defaulting to llama.cpp, and it seems like something there is not quite working π