Updated 2 months ago

Reducing Latency Issues with OpenAI and RAG

Hi everyone! My current solution suffers from latency issues that negatively affect the user experience. We are using OpenAI with RAG. I'm new to this space and the project was handed over to me directly, so I would appreciate any suggestions or advice on which areas to look at to reduce the latency.
5 comments
Hi, it would be helpful if you could describe your project/problem statement in more detail.
Thanks for the response, @WhiteFang_Jr. As I have just taken over the project, I'm not yet familiar with all of its internal workings, and I'm not allowed to disclose the implementation details. We are using the OpenAI APIs and Llama for RAG, feeding docs in for retrieval.

I would appreciate advice on areas to look at for improvement, or strategies for reducing the latency.
How much time is it taking before answering?
On the top:
Point one will help you understand where the actual problem lies, and then maybe I can help you more!
Thanks @WhiteFang_Jr, I will check the time consumed by each part of the process.
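
For anyone landing here with the same question, one way to check the time taken by each part of the process is to wrap every stage in a small timer. This is only a rough sketch: `retrieve` and `generate` are hypothetical stand-ins (simulated with `time.sleep`) for whatever retrieval and OpenAI calls the real pipeline makes, and the `timed` helper and `timings` dict are names made up for illustration, not part of any library.

```python
import time
from contextlib import contextmanager

# Accumulated wall-clock seconds per stage name.
timings = {}

@contextmanager
def timed(stage):
    """Record how long the wrapped block takes under the given stage name."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[stage] = timings.get(stage, 0.0) + time.perf_counter() - start

# Hypothetical stand-in for the retrieval step (vector search, doc loading, ...).
def retrieve(query):
    time.sleep(0.05)  # simulated retrieval latency
    return ["doc chunk 1", "doc chunk 2"]

# Hypothetical stand-in for the LLM call (e.g. a request to the OpenAI API).
def generate(query, chunks):
    time.sleep(0.10)  # simulated model latency
    return f"answer to {query!r} using {len(chunks)} chunks"

def answer(query):
    with timed("retrieval"):
        chunks = retrieve(query)
    with timed("generation"):
        reply = generate(query, chunks)
    return reply

reply = answer("example question")

# Print the slowest stage first.
for stage, seconds in sorted(timings.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{stage}: {seconds:.3f}s")
```

Swapping the two stand-ins for the real calls should show whether most of the delay sits in retrieval or in the model request, which is usually the first thing to establish before trying any optimization.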