Find answers from the community

Updated 9 months ago

Hi everyone!

Hi everyone!
Does anyone have experience in extracting images from PDFs? I’m working with math PDF courses and some illustrations are interesting for other parts of my processes. I’ve tried generating them on the fly, low quality, and I’ve tried generating the code that generates shapes or curves (matplotlib) and it’s kinda hit or miss.
T
F
L
4 comments
You can use a library like PyMuPDF to extract all the images
that implies that you need to convert every page to an image, then run multimodal queries
Add a reply
Sign up and join the conversation on Discord