I do have an NVIDIA RTX 3060. The prime reason for using LlamaCPP was running a language model stored locally. Since we are building a RAG chatbot that will not be connected to the internet at the client site, it needs to be bundled with everything it needs to run before shipping out. If I can do that with Ollama, I will never use LlamaCPP again.