has anyone had success using local hf models with rag evaluators (faithfulness, correctness, etc)? if so, which model(s) worked?
i know i can customize the prompt templates but im hoping to stay "out-of-box" so i can more easily stay up to date with upstream
Add a reply
Sign up and join the conversation on Discord