Log in
Log into community
Find answers from the community
View all posts
Related posts
Did this answer your question?
π
π
π
Powered by
Hall
Inactive
Updated 3 months ago
0
Follow
any recommendation for a good solution
any recommendation for a good solution
Inactive
0
Follow
T
Tony L
last year
Β·
any recommendation for a good solution to monitor latency during RAG? currently it takes me 10~20 seconds to generate a response, and i want to figure out which stage is causing the latency
b
T
5 comments
Share
Open in Discord
b
bmax
last year
That's some good stuff, here is our observability docs.
https://gpt-index.readthedocs.io/en/stable/module_guides/observability/observability.html
I'm not familiar with all of the Partner's llama index supports but I bet some of them will handle latency
b
bmax
last year
as well as you can tie into any event
https://gpt-index.readthedocs.io/en/stable/module_guides/observability/callbacks/root.html
b
bmax
last year
https://gpt-index.readthedocs.io/en/stable/examples/callbacks/LlamaDebugHandler.html
b
bmax
last year
might be helpful!!
T
Tony L
last year
Thanks! thats helpful
Add a reply
Sign up and join the conversation on Discord
Join on Discord