I'm using Llama 2 70B as the LLM and "local:BAAI/bge-large-en-v1.5" as the embedding model, but all of my content is in German. I'm not sure the embedding model is really the best choice, since its name includes "en", but I can't find one that supports German or anything in that direction. Also, Llama 2 only responds in English; it can respond in German when asked to, but somehow doesn't when it's run through LlamaIndex.
Any advice on improving the whole pipeline when indexing and querying exclusively German data?
Yeah, if you're using open-source models, it should help to find a multilingual or language-specific model. One of the main reasons I use OpenAI is that their LLM and embeddings actually handle my language well.
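A rough sketch of what that could look like, not tested against any particular LlamaIndex version. It swaps the English-only BGE model for a multilingual one and uses a German prompt template, since the English answers are likely a side effect of the English default prompts. The model name `intfloat/multilingual-e5-large`, the constant names, and the helper `build_prompt` are assumptions for illustration; `{context_str}` and `{query_str}` are the placeholder names LlamaIndex's default QA prompts use.

```python
# Assumption: a multilingual embedding model from Hugging Face, written in the
# same "local:<model>" form as the BGE model in the question.
EMBED_MODEL = "local:intfloat/multilingual-e5-large"

# German question-answering template (hypothetical wording). It keeps the
# {context_str} / {query_str} placeholders so it can stand in for an English
# default prompt, and it explicitly asks for a German answer.
GERMAN_QA_TEMPLATE = (
    "Kontextinformationen stehen unten.\n"
    "---------------------\n"
    "{context_str}\n"
    "---------------------\n"
    "Beantworte die Frage ausschließlich anhand dieser Kontextinformationen "
    "und antworte immer auf Deutsch.\n"
    "Frage: {query_str}\n"
    "Antwort: "
)


def build_prompt(context_str: str, query_str: str) -> str:
    """Fill the German template with retrieved context and the user question."""
    return GERMAN_QA_TEMPLATE.format(context_str=context_str, query_str=query_str)


if __name__ == "__main__":
    # Print one filled-in prompt to check what the LLM would actually see.
    print(
        build_prompt(
            "Berlin ist die Hauptstadt von Deutschland.",
            "Was ist die Hauptstadt von Deutschland?",
        )
    )
```

The embedding string would go wherever you currently pass the BGE one, and the template would replace the default QA prompt of your query engine. How exactly to pass it depends on your LlamaIndex version, so check the prompt customization docs for that.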