Find answers from the community

Updated 3 months ago

any recommendation for a good solution

any recommendation for a good solution to monitor latency during RAG? currently it takes me 10~20 seconds to generate a response, and i want to figure out which stage is causing the latency
b
T
5 comments
That's some good stuff, here is our observability docs. https://gpt-index.readthedocs.io/en/stable/module_guides/observability/observability.html

I'm not familiar with all of the Partner's llama index supports but I bet some of them will handle latency
might be helpful!!
Thanks! thats helpful
Add a reply
Sign up and join the conversation on Discord