Hi,

I've got an OpenAIAgent with 2 query engine tools.

Is it possible to call the tools in parallel to reduce latency?
@Roland Tannous from the docs, it seems that function calls are not parallel, but we can call multiple tools within a single turn of the User and Agent dialogue.
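For context, this is roughly what that single-turn multi-tool setup looks like. A minimal sketch, assuming llama-index >= 0.10 import paths; engine_a and engine_b are hypothetical query engines built elsewhere:

```python
from llama_index.agent.openai import OpenAIAgent
from llama_index.core.tools import QueryEngineTool
from llama_index.llms.openai import OpenAI

# Placeholder query engines; in practice these come from your own indexes.
tool_a = QueryEngineTool.from_defaults(
    query_engine=engine_a,
    name="docs_a",
    description="Answers questions about corpus A",
)
tool_b = QueryEngineTool.from_defaults(
    query_engine=engine_b,
    name="docs_b",
    description="Answers questions about corpus B",
)

# The LLM may request both tools within one turn, but the agent still
# executes the calls sequentially, so this alone does not cut latency.
agent = OpenAIAgent.from_tools(
    [tool_a, tool_b],
    llm=OpenAI(model="gpt-4"),
    verbose=True,
)
response = agent.chat("Compare what corpus A and corpus B say about X")
```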

My need is to call different tools in parallel to reduce latency
see if this still works
LLMCompiler allows, among other things, parallel function calling, and that's the integration connector with LlamaIndex
if you wanna know more about LLMCompiler, check it out here:
https://github.com/SqueezeAILab/LLMCompiler?tab=readme-ov-file
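On the LlamaIndex side the integration ships as an agent pack. A rough sketch, assuming the llama-index-packs-agents-llm-compiler package (import paths have moved between releases, so check the current docs); `tools` is the same list of query engine tools as above:

```python
from llama_index.core.agent import AgentRunner
from llama_index.llms.openai import OpenAI
from llama_index.packs.agents_llm_compiler import LLMCompilerAgentWorker

# The worker plans a DAG of tool calls and can execute independent
# calls in parallel, which is what should cut the latency here.
worker = LLMCompilerAgentWorker.from_tools(
    tools,
    llm=OpenAI(model="gpt-4"),
    verbose=True,
)
agent = AgentRunner(worker)
print(agent.chat("a question that needs both tools"))
```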
@Roland Tannous yes, I saw it and it seems very interesting... I'll give it a shot! Thanks
@Roland Tannous I tried using LLMCompiler and I've got 2 problems.

The first one is that I need to generate a longer response (right now it responds with just a couple of words). Is there a way to change the final prompt?
The second one is that I think the stream_chat method has not yet been implemented: I got a NotImplementedError.
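One workaround until streaming is implemented is to fall back to the blocking call. A sketch, assuming the `agent` from the pack above and that a supported `stream_chat` returns a response with a `response_gen` token generator:

```python
user_msg = "a question that needs both tools"

# Fall back to synchronous chat() where stream_chat() raises NotImplementedError.
try:
    streamed = agent.stream_chat(user_msg)
    for token in streamed.response_gen:
        print(token, end="", flush=True)
except NotImplementedError:
    print(agent.chat(user_msg))
```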
no idea. you gotta test, experiment and find out 🙂
and of course let us know 🙂
I'm trying to modify the joiner_prompt but with no success... I think it's not best suited for a good chatbot with detailed answers...
use single-turn multi-function calling for now, I guess. Better than nothing.
@Roland Tannous found an interesting tool: QueryPlanTool. It creates a DAG under the hood...
that's not parallel function calling though
this is more of a reasoning/planning core
If it creates a DAG from dependencies, there's no need to wait when a step has no dependencies... or at least that's what I think
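For anyone landing here, this is roughly how QueryPlanTool is wired up, following the llama-index docs. A sketch; tool_a and tool_b are the placeholder query engine tools from earlier, and note the DAG encodes dependencies for planning, which is not by itself a guarantee of parallel execution:

```python
from llama_index.agent.openai import OpenAIAgent
from llama_index.core import get_response_synthesizer
from llama_index.core.tools import QueryPlanTool
from llama_index.llms.openai import OpenAI

# Wrap the existing tools in a plan tool; the LLM then emits a DAG of
# sub-queries with explicit dependencies between plan nodes.
query_plan_tool = QueryPlanTool.from_defaults(
    query_engine_tools=[tool_a, tool_b],
    response_synthesizer=get_response_synthesizer(),
)
agent = OpenAIAgent.from_tools(
    [query_plan_tool],
    llm=OpenAI(model="gpt-4"),
    max_function_calls=10,
    verbose=True,
)
print(agent.chat("a question spanning both corpora"))
```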