Thanks for the heads up! I tried to get MultimodalReActAgentWorker to work with GeminiMultiModal, but it seems the format ist not what Gemini expects and only works with openAI models right now. MMGemini model returns that it expects a blog or image but go a list
class MultimodalReActAgentWorker(BaseAgentWorker):
...
self._add_user_step_to_reasoning = partial(
add_user_step_to_reasoning,
generate_chat_message_fn=generate_openai_multi_modal_chat_message, # type: ignore
)
and there is this todo in the MultimodalReActAgentWorker
# TODO: support gemini as well. Currently just supports OpenAI