Token usage

How can I extract the llama_index.token_counter.token_counter values from an index.query call? I need to know how many tokens I consume in every query.

Logan M

Try
index.service_context.llm_predictor._last_token_usage

And also
index.service_context.embed_model._last_token_usage
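A minimal sketch of reading both counters after a query, assuming the two attribute paths above exist on your index. They are underscore-prefixed, so they may change between versions. The `read_token_usage` helper and the `fake_index` stand-in are hypothetical and are only there so the snippet runs on its own.

```python
from types import SimpleNamespace


def read_token_usage(index):
    """Return the last recorded LLM and embedding token counts (None if missing)."""
    ctx = getattr(index, "service_context", None)
    llm = getattr(ctx, "llm_predictor", None)
    embed = getattr(ctx, "embed_model", None)
    return {
        "llm_tokens": getattr(llm, "_last_token_usage", None),
        "embedding_tokens": getattr(embed, "_last_token_usage", None),
    }


# Hypothetical stand-in for a real index; the numbers are made up.
fake_index = SimpleNamespace(
    service_context=SimpleNamespace(
        llm_predictor=SimpleNamespace(_last_token_usage=812),
        embed_model=SimpleNamespace(_last_token_usage=9),
    )
)

usage = read_token_usage(fake_index)
print(usage)
```

With a real index you would call `read_token_usage(index)` right after each `index.query(...)` and log or sum the values yourself.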

CrisTian

Ohh... let me try it, hahaha

CrisTian

It works perfectly, thanks!

CrisTian

I was reading the documentation and it said that I can adjust the output of the query with a chain of templates, but I cannot find an example. My idea is that the query returns the response and also a document related to the response. Do you know if that is possible?

Logan M

You can get the nodes used to create the response like this

response = index.query(...)
print(response.source_nodes)

However, if you want to get the name of the document it came from, there's some extra setup when creating the document objects

Something like this

for doc in documents:
    # "name" on the right is a placeholder; use each document's real name or path
    doc.extra_info = {"name": "name"}

Then that will show up in the source nodes
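A rough sketch of reading those names back from a response, assuming each entry in `response.source_nodes` exposes the `extra_info` dict set on its document. The `source_document_names` helper and the `fake_response` object are hypothetical stand-ins so the snippet runs without an index.

```python
from types import SimpleNamespace


def source_document_names(response, key="name"):
    """Collect unique document names from the source nodes, in first-seen order."""
    names = []
    for node in getattr(response, "source_nodes", None) or []:
        info = getattr(node, "extra_info", None) or {}
        name = info.get(key)
        if name is not None and name not in names:
            names.append(name)
    return names


# Hypothetical response; the file names are made up for illustration.
fake_response = SimpleNamespace(
    source_nodes=[
        SimpleNamespace(extra_info={"name": "handbook.pdf"}),
        SimpleNamespace(extra_info={"name": "faq.md"}),
        SimpleNamespace(extra_info={"name": "handbook.pdf"}),
        SimpleNamespace(extra_info=None),  # node without metadata is skipped
    ]
)

names = source_document_names(fake_response)
print(names)
```

You could then append the returned names to the answer text to show the user where the information came from.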

CrisTian

mmmmm ok ... let me test that ...

CrisTian

Hmm, it doesn't work as expected...

CrisTian

mmmm wait

CrisTian

It works, but I use a GPTListIndex, so when I query it the result contains all source documents, perhaps because I use mode="recursive". What do you think?

Logan M

Ah, a list index will always check each node.

If you only want to check the closest matching node(s), look into using GPTSimpleVectorIndex maybe?

CrisTian

Yes, but the thing is that I need to index and query a couple of documents. All of them have information that is relevant to the knowledge base, and my idea is that the response shows the user the documento where the information is.

Another way is to make an index of the documents in a new document, so that part of the answer contains a link or a data source for the documento that was queried.

CrisTian

documento = document 😛 (sorry)

Logan M

Haha no worries, that makes sense!

You could set the top_k in vector index to reduce the search space compared to a list index.

response = index.query(..., similarity_top_k=5)
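To illustrate what limiting the search to the top k matches means, here is a toy sketch in plain Python. It is not how the library computes similarity internally; the `cosine` and `top_k_nodes` helpers, the two-dimensional embeddings, and the file names are all made up for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k_nodes(query_vec, nodes, k):
    """Rank nodes by similarity to the query and keep only the k closest."""
    ranked = sorted(
        nodes, key=lambda n: cosine(query_vec, n["embedding"]), reverse=True
    )
    return ranked[:k]


# Made-up nodes with toy embeddings.
nodes = [
    {"name": "billing.md", "embedding": [1.0, 0.0]},
    {"name": "setup.md", "embedding": [0.0, 1.0]},
    {"name": "invoices.md", "embedding": [0.9, 0.1]},
]

closest = top_k_nodes([1.0, 0.0], nodes, k=2)
closest_names = [n["name"] for n in closest]
print(closest_names)
```

A list index, by contrast, behaves as if k were the total number of nodes, which is why every document shows up in the sources.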

CrisTian

testing

CrisTian

Hmm, it doesn't work; it always shows the same source_nodes. Let me read about it and I'll let you know if I find something. Thanks, by the way!

Logan M

Sounds good, good luck! 💪
