Find answers from the community

Updated 6 months ago

Latency

At a glance

The community members are discussing tools to track indexing latency and get maximum stats while indexing. One community member suggests using Arize and time.time(), while another agrees but wants to compare the latency of different embedding methods, such as TGI-based and OpenAI embeddings, especially when working with terabytes of data. The community members also note that embedding calls are typically batched, and Arize tracks each embedding call.

BBhavya Giri

is there a tool which i can use to track indexing latency and get maximum stats while indexing

6 comments

LLogan M

Arize? Using time.time()

LLogan M

Lol

BBhavya Giri

ageed but for something like Actually latency per embedding call comparing TGI Based Embedding to OpenAI one

BBhavya Giri

working with TB of Data

LLogan M

Arize tracks each embedding call

LLogan M

Keep in mind embedding calls are typically batched though

Add a reply