Find answers from the community

Updated 4 months ago

I'm looking to use an LLM with

At a glance

I'm looking to use an LLM with unstructured data in an image that has hand written notes. Is it possible to extract the notes from the image and feed it into the LLM as text to then generate structured data? It feels similar to the example of working with unstructured data in a pdf to extract structured data.

2 comments

TTeemu

Yeah you could just follow a tutorial like this and modify the prompts to generate data in your desired format: https://docs.llamaindex.ai/en/stable/examples/multi_modal/openai_multi_modal.html

AAndrew

Ok so using GPT4V is the key. Thank you so much

Add a reply