Find answers from the community

Updated 6 months ago

Hi all.

At a glance

Hi all.
I have a question. why using service_context in CondensePlusContextChatEngine to serve for token is not working. When I run it always get 0 value


response = chat_engine.chat(input_text)
print(str(token_counter.total_llm_token_count))

Attachments

3 comments

WWhiteFang_Jr

Service_Context is completely removed from v0.11.x. It could be because of that ?

Try doing it this way once: https://docs.llamaindex.ai/en/stable/examples/observability/TokenCountingHandler/#setup

BBrent

If we do it like that when there are many requests at the same time, will it be affected? When using service_context with RetrieverQueryEngine.from_args(retriever, text_qa_template=prompt_tmpl, service_context=service_context), I get reasonable results, but with CondensePlusContextChatEngine, I don't.

BBrent

@WhiteFang_Jr I've tried the method you mentioned. It works well if I run it from the query function. However, if I call the query function multiple times at once, it throws an error.

Add a reply