
Updated last year

Improving RAGs

Hi, can I just ask if anyone has found ways to improve the accuracy of a basic RAG system?

I know the three methods in the documentation are summarising, window and metadata search. However, summarising and creating metadata seem way too expensive for me because of the ChatGPT calls. I've tried adding a window, but the improvement seems pretty weak (I'm using top_k=4 and a window of ±3 sentences). Has anyone used any methods that aren't expensive?

2 comments

WhiteFang_Jr

For the expense part:

You could try using BAAI/bge-base-en-v1.5 or a similar open-source embedding model, which runs locally and so removes the per-call cost of a paid embedding API.
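To make the swap concrete, here is a minimal sketch of sentence-window retrieval where the embedder is just a function argument, so a local model can replace a paid API without touching the rest of the pipeline. Everything in it is an assumption for illustration: `toy_embed` is a hypothetical bag-of-words stand-in (not bge and not any library's API), and `window_retrieve` is a made-up helper that mirrors the top_k=4, ±3 sentence setup from the question. In practice you would pass a function that wraps a local model such as BAAI/bge-base-en-v1.5.

```python
import math
import re
from collections import Counter
from typing import Callable, Dict, List

Vector = Dict[str, float]


def toy_embed(text: str) -> Vector:
    """Hypothetical stand-in embedder: a bag-of-words count vector.

    Replace with a function wrapping a local embedding model in real use.
    """
    return dict(Counter(re.findall(r"[a-z0-9]+", text.lower())))


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def split_sentences(text: str) -> List[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def window_retrieve(
    text: str,
    query: str,
    top_k: int = 4,
    window: int = 3,
    embed: Callable[[str], Vector] = toy_embed,
) -> List[str]:
    """Score single sentences, then return each hit with +-window neighbours."""
    sentences = split_sentences(text)
    query_vec = embed(query)
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: cosine(embed(sentences[i]), query_vec),
        reverse=True,
    )
    results = []
    for i in ranked[:top_k]:
        start = max(0, i - window)
        end = min(len(sentences), i + window + 1)
        results.append(" ".join(sentences[start:end]))
    return results
```

Because matching happens on single sentences and only the returned context is widened, the quality of the embedder matters more than the window size, which may be why a bigger window alone did not help much.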

For improving the accuracy of the RAG system: LlamaIndex recently posted a new approach in a Medium article: https://blog.llamaindex.ai/evaluating-the-ideal-chunk-size-for-a-rag-system-using-llamaindex-6207e5d3fec5
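If you want to try different chunk sizes without paying for LLM calls, a rough proxy is to sweep chunk sizes and check how often the right passage is retrieved. The sketch below is not the method from the linked article. It is a cheap stand-in that assumes you have a few hand-written (question, expected answer string) pairs, and it uses a hypothetical word-overlap scorer in place of a real embedding model. All function names are made up for illustration.

```python
import re
from typing import Dict, List, Sequence, Tuple


def tokens(text: str) -> List[str]:
    """Lowercased alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def chunk_words(text: str, chunk_size: int) -> List[str]:
    """Split text into consecutive chunks of at most chunk_size words."""
    words = text.split()
    return [
        " ".join(words[i:i + chunk_size])
        for i in range(0, len(words), chunk_size)
    ]


def overlap_score(chunk: str, query: str) -> int:
    """Hypothetical scorer: number of distinct query tokens found in the chunk."""
    return len(set(tokens(query)) & set(tokens(chunk)))


def hit_rate(
    text: str,
    labelled: Sequence[Tuple[str, str]],
    chunk_size: int,
    top_k: int = 4,
) -> float:
    """Fraction of questions whose expected answer appears in the top_k chunks."""
    if not labelled:
        return 0.0
    chunks = chunk_words(text, chunk_size)
    hits = 0
    for question, answer in labelled:
        ranked = sorted(
            chunks, key=lambda c: overlap_score(c, question), reverse=True
        )
        if any(answer.lower() in c.lower() for c in ranked[:top_k]):
            hits += 1
    return hits / len(labelled)


def sweep(
    text: str,
    labelled: Sequence[Tuple[str, str]],
    sizes: Sequence[int],
    top_k: int = 4,
) -> Dict[int, float]:
    """Hit rate for each candidate chunk size."""
    return {size: hit_rate(text, labelled, size, top_k) for size in sizes}
```

Hit rate only tells you whether the answer was retrievable, not whether the final response is good, so treat it as a first filter before spending anything on LLM-based evaluation.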
