Is there any plans for standardizing the

At a glance

The post raises the issue of standardizing metadata in responses, as using raw response to count tokens may not scale across different providers. Community members suggest capturing metadata like input_tokens_count, output_tokens_count, stop_reason, and stop_sequence. They also discuss the possibility of exposing a TokenCounter in the handler, allowing each integration to have its own implementation, with defaults for missing metadata. The community members agree that this would be a useful long-term solution, especially for directed ReAct agents, and propose using glue code in the token counter to use the metadata if available, or fall back to the original logic.

ssansmoraxz

Is there any plans for standardizing the metadata in responses? Picking raw response for example to count the tokens will not scale across the many different providers.

18 comments

ssansmoraxz

I would suggest capturing input_tokens_count, output_tokens_count

ssansmoraxz

stop_reason, stop_sequence for guided capturing stop-words

ssansmoraxz

On the same thought, stop words parameter could be part of BaseLLM as there are some deviations in parameter names when passed to some providers.

LLogan M

Not really any plans beyond what's already there (capturing the text/message)

It would be a large effort. And not every api provides those items in the response

ssansmoraxz

OK. I was going through the bedrock API docs.

Thought to add token count capturing for whichever were supported. Saw the current implementation and thought an issue may arise.

ssansmoraxz

Is it OK if I still add those and do a slight refactor?

ssansmoraxz

Also just so we are clear the usage dict capture is for anthropic right?

LLogan M

which usage dict capture? In the token counter?

ssansmoraxz

yes

LLogan M

Mainly meant for openai, plus any other API that returns usage in a similar format

ssansmoraxz