Find answers from the community

Updated 2 months ago

I'm looking to use an LLM with

I'm looking to use an LLM with unstructured data in an image that has hand written notes. Is it possible to extract the notes from the image and feed it into the LLM as text to then generate structured data? It feels similar to the example of working with unstructured data in a pdf to extract structured data.
T
A
2 comments
Yeah you could just follow a tutorial like this and modify the prompts to generate data in your desired format: https://docs.llamaindex.ai/en/stable/examples/multi_modal/openai_multi_modal.html
Ok so using GPT4V is the key. Thank you so much
Add a reply
Sign up and join the conversation on Discord