Find answers from the community

Updated 4 days ago

Summarize flow in screenshot with figure title

Hi anyone experience this woith azuremultimodal chat?
msg = ChatMessage(
role=MessageRole.USER,
blocks=[
TextBlock(text="Summarize this flow in screenshot of application present in image and support the description along with surronding text with a figure title."),
ImageBlock(image=image_document.image_resource.data),
],
)

response = llm.chat(messages=[msg])
Attachment
image.png
L
D
W
13 comments
oh dang, this might be a bug, lemme look
Hmm, I think something is just out of date on your end -- pip install -U llama-index-llms-openai llama-index-multi-modal-llms-openai llama-index-multi-modal-llms-azure-openai ?
(I should bump the min-deps, thats a lot lol)
This is working with simple AzureOpenAI class but not with AzureMultiModal
Also index.as_query_engine()

When I pass llm = azuremultimodallmm its working
but when I pass multi_modal_llm = azuremultimodallmm its need settings.llm to be set but with the donwgraded version I'm not able to set AzureMultiModal in my settings
Can you run pip freeze | grep llama and show me what you have installed?
assert isinstance(llm, LLM) is false for this AzureMultiModal model class
Did you run the pip install -U llama-index-llms-openai llama-index-multi-modal-llms-openai llama-index-multi-modal-llms-azure-openai command? The multi-modal llm classes are not up to date
ok let me yesterday I was facing the issue so you gove me some downgraded version.
oh πŸ˜… lol
Oh! now I remember -- I'm about to fix your previous issue
The muli-modal assert one πŸ˜…
Add a reply
Sign up and join the conversation on Discord