Find answers from the community

Updated last year

Lots of more advanced RAG techniques e g

At a glance
Lots of more advanced RAG techniques (e.g. those outlined in https://gpt-index.readthedocs.io/en/latest/end_to_end_tutorials/dev_practices/production_rag.html) rely on generation of summaries from data that is being embedded. What do you use to generate these summaries? I find that GPT4 is not usable with larger datasets because of poor performance and too low rate limits and GPT3.5 sometimes does not generate good enough summaries. Are there any alternatives?
W
1 comment
In comparison to all the opensource LLMs out there, OpenAI GPT-4, GPT-3.5 seems like the best option to go with. There are other Paid LLMs as well like Claude/PaLM, You can try them in your use case as well for RAG products.

LlamaIndex provides way to connect to different LLM provider platforms, Have a look: https://gpt-index.readthedocs.io/en/latest/core_modules/model_modules/llms/root.html#modules
Add a reply
Sign up and join the conversation on Discord