Find answers from the community

a
alx
Offline, last seen 4 months ago
Joined September 25, 2024
How can I debug / influence the extraction of metadata from PDFs? My Problem: When I break one of my PDFs into Nodes (Using the SimpleDirectoryReader and the SentenceSplitter, the Node metadata 'page_label' (document page number) is empty. This happens just with one pdf, but works fine for others. Any advice? Thanks!
2 comments
W
a