Log in
Log into community
Find answers from the community
View all posts
Related posts
Did this answer your question?
๐
๐
๐
Powered by
Hall
Inactive
Updated 3 months ago
0
Follow
Does the PDF reader use OCR
Does the PDF reader use OCR
Inactive
0
Follow
v
vkdi5cord
2 years ago
ยท
Does the PDF reader use OCR?
j
S
r
7 comments
Share
Open in Discord
j
jerryjliu0
2 years ago
not yet, it uses a basic pdf2text parser
S
SJ
2 years ago
@conceptofmind in LangChain discord likes the following:
https://github.com/clovaai/donut
S
SJ
2 years ago
which looks like it should do wonders...
https://towardsdatascience.com/ocr-free-document-understanding-with-donut-1acfbdf099be
j
jerryjliu0
2 years ago
we use the donut model to parse images!
j
jerryjliu0
2 years ago
so i amend my previous statement. the image parser uses the donut model, the pdf parser does not
r
rtk
2 years ago
Sorry for the late response, but is there a reason you don't use the donut model to parse PDFs? And is there a simpler alternative for PDFs?
j
jerryjliu0
2 years ago
we have a donut model to process images, but if you want to add a donut pdf parser that would be appreciated too!
Add a reply
Sign up and join the conversation on Discord
Join on Discord