
Updated 4 months ago

Tool calls


Hey, so how would I estimate costs for my RAG pipeline if I use the OpenAIAgentRunner?
Are tool calls billed differently than completion calls? And how do I calculate the context?
What I mean is: if, for example, I have a chunk size of 512 and retrieve 30 nodes, do I calculate 512*30+len(tokenize(message)) as the average token count per message?

3 comments

Logan M

Tools are counted as input tokens. OpenAI doesn't make this easy, but you can get a rough estimate by counting the tokens in the output of `tool.metadata.to_openai_tool()`.
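A rough sketch of that idea, in case it helps: serialize each tool schema to JSON and count tokens in the resulting string. The `example_tool` dict is a hand-written stand-in for what `tool.metadata.to_openai_tool()` returns, and `rough_token_count` is a hypothetical placeholder using the "about 4 characters per token" rule of thumb, not a real tokenizer. OpenAI may format tool definitions differently internally, so treat the result as an approximation.

```python
import json
from typing import Callable, Dict, List


def rough_token_count(text: str) -> int:
    """Placeholder tokenizer (assumption): roughly 4 characters per token, rounded up."""
    return (len(text) + 3) // 4


def estimate_tool_tokens(
    tool_schemas: List[Dict],
    count_tokens: Callable[[str], int] = rough_token_count,
) -> int:
    """Sum the token counts of the JSON-serialized tool schemas."""
    return sum(count_tokens(json.dumps(schema)) for schema in tool_schemas)


# Hand-written stand-in for the dict returned by tool.metadata.to_openai_tool()
example_tool = {
    "type": "function",
    "function": {
        "name": "query_docs",  # hypothetical tool name
        "description": "Search the indexed documents for relevant passages.",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
}

tool_tokens = estimate_tool_tokens([example_tool])
```

For a closer number you could pass in a real tokenizer instead, for example `lambda s: len(enc.encode(s))` with `enc = tiktoken.encoding_for_model(...)` for your model.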

Logan M

So in your example, the input token estimate would be `512×30 + len(chat_history) + len(tools)`, where each `len` is a token count.
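Spelled out as a small sketch, with made-up numbers for the chat history and tool token counts (400 and 150 are placeholders, not measurements). `chunk_size * top_k` is an upper bound, since retrieved nodes can be shorter than the chunk size. The price argument is hypothetical too, so plug in the current rate for your model.

```python
def estimate_input_tokens(
    chunk_size: int,
    top_k: int,
    chat_history_tokens: int,
    tool_tokens: int,
) -> int:
    """Upper-bound input tokens for one LLM call: retrieved context + history + tools."""
    return chunk_size * top_k + chat_history_tokens + tool_tokens


def estimate_input_cost(input_tokens: int, price_per_million_tokens: float) -> float:
    """Input-side cost only; output tokens are billed separately and not included here."""
    return input_tokens / 1_000_000 * price_per_million_tokens


# 512 * 30 = 15360 context tokens, plus placeholder history and tool counts
tokens = estimate_input_tokens(
    chunk_size=512, top_k=30, chat_history_tokens=400, tool_tokens=150
)
```

Keep in mind an agent can make more than one LLM call per user message (for example one call that requests the tool and another that writes the final answer), so the per-message cost may be a multiple of this.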

Logan M

There is actually a util in llama-index you can use for this: https://github.com/run-llama/llama_index/blob/d4a31cf6ddb5dd7e2898ad3b33ff880aaa86de11/llama-index-core/llama_index/core/utilities/token_counting.py#L10

Assembled from some OpenAI forum discussions lol
