Find answers from the community

Home
Members
Tonka7su
T
Tonka7su
Offline, last seen 4 months ago
Joined September 25, 2024
flipping tables in PDFs! 😠 I tried https://github.com/VikParuchuri/marker to parse PDF's to markdown to improve parsing and chunking but tables i.e. budget documents with 'funky' formatting such as merged cells cause the markdown tables to be parsed incorrectly.... azure document intelligence works better.... but would like a local and/or open-source package instead....
14 comments
R
T
D
W