Find answers from the community

Updated 2 months ago

LlamaParse

Does anyone have advice on how I can implement data extraction from PDF? Not necessarily querying the PDF's text, but grabbing particular fields in the PDF. The PDF will have tables and fields and such, and we want to automate the extraction piece.

If you'v ever used ChatGPT on mobile, it'd be similar to how you can take a photo of something, annotate (draw a circle around it) then explain/grab that part of the image (which in our case would be some text).

I'm wondering if there's an out of the box API I can use, or if I need to go from pdf -> image -> text -> open ai api
W
S
n
3 comments
Have you tried LlamaParse? Its super cool and fits good for your case.
In addition to this, Llamaparse now provides support for extraction via GPT4-o
Definitely take a look at LlamaParse especially examples of using the custom parsing instructions!
I'll take a look, thank you everyone! I f'ing love this community
Add a reply
Sign up and join the conversation on Discord