dependent on your chunk size + your similarity top k if you're using vector index, your total index size if using list index, other params if you're using other indices
@jerryjliu0 in your opinion, why sometimes, for the same prompt and query, same nodes and number of nodes and same token used, do I see differences in timings of responses?