Orion Pax

Best practices for testing workflows

Are there general platforms or best practices for testing workflows?

3 comments

The Multi Concierge Flow Issue

I've been digging into an issue with the multi concierge flow. I have 2 agents. Each can add or multiply 2 numbers. I ask the system to 'add 10 to 10 and then multiply the result by 20'.

My expectation is that the first agent says "10 + 10 is 20", then transfers. I see this from the tools and chat history:
tools input:
['transfer_to_agent']

chat history:

Plain Text

[ChatMessage(role=<MessageRole.SYSTEM: 'system'>, additional_kwargs={}, blocks=[TextBlock(block_type='text', text="You are on orchestration agent.\nYour job is to decide which agent to run based on the current state of the user and what they've asked to do.\nYou do not need to figure out dependencies between agents; the agents will handle that themselves.\nHere are the agents you can choose from:\nAddition Agent: Used to add two numbers together.\nMultiply Agent: Used to multiply two numbers together.\n\n\nHere is the current user state:\n\n\nPlease assist the user and transfer them as needed.\nDo not make up information or provide information that is not in the user state or agent context.\n")]), ChatMessage(role=<MessageRole.USER: 'user'>, additional_kwargs={}, blocks=[TextBlock(block_type='text', text='Add 10 to 10 and then multiply the result by 22')]), ChatMessage(role=<MessageRole.ASSISTANT: 'assistant'>, additional_kwargs={'tool_calls': [ChatCompletionMessageToolCall(id='call_SOOg9k2TNckhXV3MAITRo4Vs', function=Function(arguments='{"a":10,"b":10}', name='add_two_numbers'), type='function')]}, blocks=[]), ChatMessage(role=<MessageRole.TOOL: 'tool'>, additional_kwargs={'tool_call_id': 'call_SOOg9k2TNckhXV3MAITRo4Vs', 'name': 'add_two_numbers'}, blocks=[TextBlock(block_type='text', text='20')])]

Results in the following tool selection:

Plain Text

[ToolSelection(tool_id='call_34DqgTGSX5VhWVMmCiTgsFhQ', tool_name='multiply_two_numbers', tool_kwargs={'a': 20, 'b': 22})]

Despite not giving it a "multiply_two_numbers" tool, it says that's the next one it should run, so it fails on 'agent_name' missing.

18 comments

Find answers from the community

Best practices for testing workflows

The Multi Concierge Flow Issue

The Perils of Infinite Loops in a Multi-Agent System

Recurring issue with progress event and 429 errors in multi-agent workflow

@Logan M I'm not sure if this is related

Automatically converting name to email address

Injecting existing history into chat engine request fails due to missing index and id

Am I missing a concept. The

Let's say you have a pdf with a variety

Stable version

Has anyone tried to support multiple

When I build an index on 1 pdf file I

Has anyone seen an issue where they

I have 10 transcripts in an index. Each

This guide doesn't seem to work anymore

The guide on this page for structured

If I ask "Why is the sky blue" to a

I have a pretty simple use case where I'

The LLM_Predictor class doesn't seem to

If I insert the same document into a

Is there a documented method to use

Thanks again for this Logan M I started

In the query response you can get the

Eval

Eval notebooks