Find answers from the community

Updated 6 months ago

Can I implement training of images from

At a glance

Can I implement training of images from PDF in llama-index?

3 comments

It's not supported right now, but you could always write your own loader for it, and create your own ImageDocument instances.

Right now there are two options for images in llama-index

Generating a caption for the image and retrieving based on the caption
Applying ocr on the image and retrieving based on the OCR

PyPDF can extract images it looks like, so this could be added to the loader in the future
https://pypdf.readthedocs.io/en/stable/user/extract-images.html

oopenmind

okay, I look forward you guys to imlement image training from pdf soon.

oopenmind

Nice work!

Add a reply