Log in
Log into community
Find answers from the community
View all posts
Related posts
Did this answer your question?
๐
๐
๐
Powered by
Hall
Inactive
Updated 2 months ago
0
Follow
I m trying to query some Data using this
I m trying to query some Data using this
Inactive
0
Follow
V
Vaylonn
last year
ยท
I'm trying to query some Data using this
https://gpt-index.readthedocs.io/en/latest/examples/output_parsing/LangchainOutputParserDemo.html
.
Unofrtunatly when I ingest some PDFs i get wrong results when I'm trying to query over data that is in a table like this: (see picture)
I get some resulsts that seems right but that are not.
Attachment
V
L
10 comments
Share
Open in Discord
V
Vaylonn
last year
In fact I get these results
Plain Text
Copy
{"Codes ISIN": "FR0010298596, FR0010298604, FR0010298612",
V
Vaylonn
last year
When i ask him to find the 4 numbers that are highlited.
V
Vaylonn
last year
My program is running the basic Vectore Store with top_k retrieval. For my pdf
V
Vaylonn
last year
Is it the right thing to do since i query over some text in the pdf too and that is working
V
Vaylonn
last year
Or do I need to implement another way to do it
V
Vaylonn
last year
Such as this one
https://gpt-index.readthedocs.io/en/latest/examples/query_engine/pdf_tables/recursive_retriever.html
(is it possible to be used with the langchain output parser ?). Or do I you know any way to do it better using the langchain output parser ?
L
Logan M
last year
Tough problem, tables in PDFs suuuuck to parse ๐ซ using Camelot to structure the table data might help, assuming Camelot works lol
V
Vaylonn
last year
assuming it haha
V
Vaylonn
last year
yeah it's starting to get a too much tricky for me from now on
V
Vaylonn
last year
but i'll manage to suceed
Add a reply
Sign up and join the conversation on Discord
Join on Discord