Find answers from the community

Updated last year

Improving RAGs

Hi, can I just ask if anyone has found ways to improve the accuracy of the basic RAG system.

I know the three methods on the documentation are summarising, window and metadata search. However, summarising and creating metadata seems to be way too expensive w chatGPT calls for me. I've tried adding a window but it seems pretty weak (I'm doing top_k=4 and window of +-3 sentences). Has anyone used any methods that aren't expensive?
W
2 comments
For expense part:
  • You could try using BAAI/bge-base-en-v1.5 or similar open-source embeddding model which can reduce the embedding cost.
For improving the accuracy of RAG system: llamaindexc recently posted new approach in a medium article: https://blog.llamaindex.ai/evaluating-the-ideal-chunk-size-for-a-rag-system-using-llamaindex-6207e5d3fec5
Add a reply
Sign up and join the conversation on Discord