LlamaIndex

Updated 12 months ago

It claims to use GetDate but it's lying

It claims to use GetDate, but it's lying: there is a debug print in the GetDate function that never prints, so I assume the function is never called. The observation is therefore wrong. The Action Input into the Obsidian engine also looks strange. And of course the output is completely wrong; there is no /path/to/node2.md.

35 comments

lol yea it just hallucinated using the date tool

Looks like yes.
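A quick way to confirm this kind of hallucinated tool call is to count real invocations instead of relying on a debug print. The following is only a minimal sketch in plain Python: `count_calls`, `get_date`, and `tool_was_really_called` are hypothetical names for illustration, and the trace format is an assumption, not LlamaIndex's actual output.

```python
import functools
from datetime import date


def count_calls(fn):
    """Wrap fn so that every real invocation increments wrapper.calls."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        wrapper.calls += 1
        return fn(*args, **kwargs)

    wrapper.calls = 0
    return wrapper


@count_calls
def get_date() -> str:
    """Hypothetical stand-in for the GetDate tool: today's date as YYYY-MM-DD."""
    return date.today().isoformat()


def tool_was_really_called(agent_trace: str, tool_fn, tool_name: str) -> bool:
    """True only if the trace claims the tool was used AND the function ran."""
    claimed = f"Action: {tool_name}" in agent_trace
    return claimed and tool_fn.calls > 0
```

If the trace says `Action: GetDate` while `get_date.calls` is still 0, the observation was invented by the model rather than returned by the tool.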

I would modify the description to the obsidian tool to mention that it can be accessed by date

or something like that
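As a rough illustration of what a more specific description could look like, here is a sketch in plain Python. `ToolSpec` and `render_tool_line` are hypothetical stand-ins rather than LlamaIndex classes, and the claim that daily notes are named by date in YYYY-MM-DD format is an assumption about the vault that would need adjusting.

```python
from dataclasses import dataclass


@dataclass
class ToolSpec:
    """Hypothetical container for the name and description an agent sees."""
    name: str
    description: str


# A vague description gives the model little to reason with.
vague = ToolSpec("obsidian", "Query the Obsidian vault.")

# A more specific one says how dates relate to notes (assumed naming scheme).
specific = ToolSpec(
    name="obsidian",
    description=(
        "Search notes in the Obsidian vault. Daily notes are named by date "
        "in YYYY-MM-DD format, so the note for a given day can be found by "
        "passing that date as the query. Use GetDate first when the user "
        "says 'today' or 'yesterday'."
    ),
)


def render_tool_line(tool: ToolSpec) -> str:
    """Format the tool the way it might appear in the agent prompt."""
    return f"> {tool.name}: {tool.description}"
```

The description is the only thing the model knows about the tool, so spelling out the expected input tends to matter more for smaller models.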

Ok I can try that.

The correct place to do that is in the description of the tool?

yup!

No success.

Is there a "best practices" documentation?

not really -- a lot of this is just prompting 😅

It also looks like you are using a ReAct agent? Not sure which LLM you are using, but open-source LLMs are generally bad at being agents these days

If you are using openai, I highly recommend the OpenAIAgent

Yes, ReActAgent

LLM: currently llama-2-13b-chat

oof llama2

might be the worst agent lol

Tried mistral 7b before.

Maybe try zephyr?

What would you recommend for the agent and a local LLM? Checking zephyr right now...

zephyr was probably the best I've tried. Still slightly unreliable though, agentic behaviour seems tough for these LLMs

For example, some results from a recent paper benchmarking on a ton of datasets

https://github.com/THUDM/AgentBench

Ok, so you are saying it's basically an issue with the llm not understanding what's going on and not a code issue.

pretty much. Some prompt engineering may help of course, but feels like a general issue with open-source LLMs at the moment.

They can power little demos, but beyond that things get tricky with them

That's sad. I don't want to put my obsidian vault into openai or something.

Things will improve someday 🙂 They work fine for general Q/A and RAG though.

For example, in your case, I would just include today's date as part of the prompt
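A minimal sketch of that idea, using only the standard library. `with_today` is a hypothetical helper, and the prompt wording is an assumption; the weekday is included so the model does not have to infer it.

```python
from datetime import date
from typing import Optional


def with_today(question: str, today: Optional[date] = None) -> str:
    """Prefix the user's question with the current date and weekday."""
    today = today or date.today()
    # %A is the full weekday name (English in the default C locale).
    stamp = today.strftime("%A, %Y-%m-%d")
    return f"Today is {stamp}.\n\n{question}"


# Arbitrary example date, chosen only for illustration.
print(with_today("Show me yesterday's note.", date(2024, 1, 19)))
```

With the date already in the prompt, the agent no longer needs a separate date tool for simple "today" or "yesterday" questions.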

Ok hm lol. Now with llama-2-13b it "somehow" works, including some hallucination.

It shows the correct note for today, but thinks today is a Saturday.

My guess is that the "tool description" is used as part of the prompt?

If I ask for yesterday's note, the reasoning itself works well: it uses GetDate and then reasons that yesterday was the 18th. But then it crashes in react/step.py during a reasoning step with "could not parse output".

yea, the LLM is shown the chat history, list of tool names + descriptions, and some instructions for the ReAct format
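To make that concrete, here is a rough sketch of how such a prompt could be assembled. It is not LlamaIndex's actual template: `REACT_INSTRUCTIONS` and `build_react_prompt` are hypothetical, and the exact wording and ordering are assumptions.

```python
from typing import List, Tuple

# Assumed, simplified ReAct format instructions (not the library's real text).
REACT_INSTRUCTIONS = (
    "Answer using this format:\n"
    "Thought: <your reasoning>\n"
    "Action: <one tool name from the list>\n"
    "Action Input: <JSON arguments>\n"
    "or, when finished:\n"
    "Thought: I can answer now.\n"
    "Answer: <final answer>"
)


def build_react_prompt(
    tools: List[Tuple[str, str]],
    history: List[Tuple[str, str]],
) -> str:
    """Combine tool names + descriptions, format rules, and chat history."""
    tool_lines = "\n".join(f"> {name}: {desc}" for name, desc in tools)
    chat_lines = "\n".join(f"{role}: {text}" for role, text in history)
    return (
        f"You have access to these tools:\n{tool_lines}\n\n"
        f"{REACT_INSTRUCTIONS}\n\n"
        f"Conversation so far:\n{chat_lines}"
    )
```

Since the tool description is pasted into the prompt as text, editing it is a form of prompt engineering, which is why rewording it can change agent behaviour.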

Classic 😅
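The "could not parse output" error is easier to understand with a toy parser. The sketch below is an assumption about how a ReAct step might be parsed, not the code in react/step.py: `ACTION_RE` and `parse_step` are hypothetical, and it returns `None` instead of raising so a caller could retry.

```python
import json
import re
from typing import Optional, Tuple

# Expect "Action: <tool>" followed by "Action Input: {json}" on the next line.
ACTION_RE = re.compile(
    r"Action:\s*(?P<tool>[^\n]+?)\s*\nAction Input:\s*(?P<args>\{.*\})",
    re.DOTALL,
)


def parse_step(text: str) -> Optional[Tuple[str, dict]]:
    """Return (tool_name, args), or None if the output breaks the format."""
    match = ACTION_RE.search(text)
    if match is None:
        return None  # the model wrote free text instead of an action
    try:
        args = json.loads(match.group("args"))
    except json.JSONDecodeError:
        return None  # an action was given, but its input is not valid JSON
    return match.group("tool"), args
```

Smaller models often drift from the expected format mid-conversation, for example by writing a thought with no action or by emitting arguments that are not valid JSON, and a strict parser then has nothing to work with.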

Hm ok, anything you would recommend reading or watching on YouTube to get a better understanding of how things work?

If you wanted to dive in, we have some stuff

Low level agent guides
https://docs.llamaindex.ai/en/stable/module_guides/deploying/agents/modules.html#id1

Some recent videos on custom Agent stuff
https://www.youtube.com/watch?v=T0bgevj0vto

Thanks a lot. For the help and the links.
