Log in
Log into community
Find answers from the community
View all posts
Related posts
Did this answer your question?
๐
๐
๐
Powered by
Hall
Inactive
Updated 4 months ago
0
Follow
Pa4de
Pa4de
Inactive
0
Follow
B
Bhavya Giri
4 months ago
ยท
is there a wat to parse a complex pdf extracting tables, images and text, maybe even llm (gpt4o or maybe a local multimodel one)?
maybe while reading files
@Logan M
L
B
10 comments
Share
Open in Discord
L
Logan M
4 months ago
I mean, this is what llama parse does.
If you want, you can send a pdf page by page to a multi modal llm and prompt it to extract too
B
Bhavya Giri
4 months ago
can i use my deployed locally deployed model?
L
Logan M
4 months ago
Yea why not. Just need to send it image of each page and prompt it to extract
B
Bhavya Giri
4 months ago
still i need to pay for llamaparse, see i am working 1 TB of data
the cost is gonna sky rocket
B
Bhavya Giri
4 months ago
pdfs are like these, i have 4 A100 to run some opensource llm
B
Bhavya Giri
4 months ago
any suggestion on approch i should follow
B
Bhavya Giri
4 months ago
@Logan M
L
Logan M
4 months ago
I meant if you have a local multimodal llm running, you could use that instead. Just might take a while
L
Logan M
4 months ago
Anything is going to be expensive with 1TB of data my guy lol
B
Bhavya Giri
4 months ago
do you have any suggested blog post for mulitmodel or something similar to this?
Add a reply
Sign up and join the conversation on Discord
Join on Discord