I have not come across any yet 🤔
Yea every open source model I've tried sucks at being an agent lol even zephyr
It could be a matter of prompt engineering, but that's also annoying
cool, will check it out.
The challenge I'm finding with ReAct is more in Langchain's specific implementation of the ReAct agent, specifically the need for the first "Thought:" prompt, even though it's empty.
tl;dr it has to do with python templating and the intermediary thoughts/agent_scratchpad implementation
I'm able to get many of the models to generate the first thought when the incomplete thought prompt is absent
but introducing the incomplete thought prompt throws them all off
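fwiw the difference is easy to reproduce outside Langchain. A minimal sketch in plain Python (not Langchain code; `build_prompt` is a hypothetical helper) of the two prompt tails:

```python
# Sketch of the two prompt variants discussed above; this is NOT
# Langchain code, just plain string assembly to show the difference.

def build_prompt(question: str, with_incomplete_thought: bool) -> str:
    """Return the tail of a ReAct prompt, optionally ending with the
    dangling 'Thought:' line that the model is expected to complete."""
    prompt = f"Question: {question}\n"
    if with_incomplete_thought:
        # Langchain's template leaves this incomplete line at the end;
        # many open models handle the prompt fine without it but get
        # thrown off when it's present.
        prompt += "Thought:"
    return prompt

print(build_prompt("Who was president when JFK was in middle school?", True))
```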
Was our react agent doing the same thing?
(I know barely anything about langchains version lol)
i haven't tried your react agent
does your agent expect to use the RAG?
oh, funny, someone at RH sent me THUDM/agentlm which I just tried - it also fails at react (in the 13b variant)
they all fail to follow the prompt
I think at least with that dataset you can fine-tune an LLM for react agents, right?
it looks like it would involve literally re-writing the datasets in ReAct format, I think
They are already in a react format, albeit one a little different from llamaindex's or langchain's
let me double check. it didn't look like agentinstruct was in react format
ok yeah it does look like react
i don't think this is a fine-tuning problem though
Answer the following questions as best you can. You have access to the following tools:
Search: A search engine. Useful for when you need to answer questions about current events. Input should be a search query.
Calculator: Useful for when you need to answer questions about math.
Use the following format:
Thought: you should always think about what to do
Action: the action to take, should be one of [Search, Calculator]
Action Input: the input to the action
Observation: the result of the action
... (this Thought/Action/Action Input/Observation can repeat N times)
Thought: I now know the final answer
Final Answer: the final answer to the original input question
Question: Who was president when John F. Kennedy was in middle school?
Thought:
this is vanilla react from langchain
langchain expects that the model will re-generate Thought
buried in langchain is a PromptTemplate that tries to inject the agent's intermediate thoughts (the agent_scratchpad) into that trailing Thought:, so if you don't include a value for that temp memory, langchain blows up
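in plain Python the failure mode looks roughly like this (a stand-in using str.format, not Langchain's actual PromptTemplate class):

```python
# Stand-in for the templating issue: the {agent_scratchpad} slot lives
# inside the trailing "Thought:" line of the template.
template = "Question: {input}\nThought:{agent_scratchpad}"

# Passing the scratchpad, even as an empty string, formats cleanly:
prompt = template.format(input="What is 2 + 2?", agent_scratchpad="")

# Omitting it is the "blows up" case: str.format raises KeyError,
# analogous to the missing-variable error from the real template.
try:
    template.format(input="What is 2 + 2?")
except KeyError as err:
    print("missing template variable:", err)
```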
need to try llamaindex here
your react agent expects json input, FWIW
i'm trying to debug as usual and struggling to get llamaindex to output stuff in a way I can understand
which part of the following is actually sent to the model?
Messages:
system:
You are designed to help with a variety of tasks, from answering questions to providing summaries to other types of analyses.
Tools
You have access to a wide variety of tools. You are responsible for using
the tools in any sequence you deem appropriate to complete the task at hand.
This may require breaking the task into subtasks and using different tools
to complete each subtask.
You have access to the following tools:
> Tool Name: multiply
Tool Description: multiply(a: int, b: int) -> int
Multiply two integers and returns the result integer
Tool Args: {'title': 'multiply', 'type': 'object', 'properties': {'a': {'title': 'A', 'type': 'integer'}, 'b': {'title': 'B', 'type': 'integer'}}, 'required': ['a', 'b']}
Output Format
To answer the question, please use the following format.
Thought: I need to use a tool to help me answer the question.
Action: tool name (one of multiply)
Action Input: the input to the tool, in a JSON format representing the kwargs (e.g. {"text": "hello world", "num_beams": 5})
Please use a valid JSON format for the action input. Do NOT do this {'text': 'hello world', 'num_beams': 5}.
If this format is used, the user will respond in the following format:
Observation: tool response
You should keep repeating the above format until you have enough information
to answer the question without using any more tools. At that point, you MUST respond
in the following format:
Thought: I can answer without using any more tools.
Answer: [your answer here]
Current Conversation
Below is the current conversation consisting of interleaving human and assistant messages.
user: What is 2123 * 215123
**
Response:
assistant: ?
**
seems like llama2-70b-chat and falcon-180b can deal with your react agent OK
i still think asking models like this to generate structured JSON is fragile. but it seems to work
That entire blob is the prompt. React is hard
It's either json parsing or regex parsing, pick your poison 🤷‍♂️ lol
This aligns with my experience. React is tough for small models these days
I don't know that you have to do much in the way of regex parsing
the list of tools has names. you ask the model to name the tool in Action:
and then the input to the tool in Action Input:
which is how Langchain does it
they're just too rigid in the expectation of the model generating a Thought:
as the first part of the response
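that label-based approach can be sketched in a few lines (a hypothetical parser for illustration, not Langchain's or llamaindex's actual one):

```python
import json

def parse_react_step(text: str) -> tuple[str, dict]:
    """Pull the tool name and its JSON kwargs out of a ReAct-style reply
    by splitting on the 'Action:' / 'Action Input:' labels."""
    action = text.split("Action:")[1].split("\n")[0].strip()
    raw_input = text.split("Action Input:")[1].split("\n")[0].strip()
    return action, json.loads(raw_input)

reply = (
    "Thought: I need to multiply these numbers.\n"
    "Action: multiply\n"
    'Action Input: {"a": 2123, "b": 215123}\n'
)
tool, kwargs = parse_react_step(reply)
# tool == "multiply", kwargs == {"a": 2123, "b": 215123}
```

this works because "Action Input:" doesn't contain the substring "Action:", so the first split only matches the tool-name line; the JSON half is still the fragile bit if the model emits single quotes.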