The community member is building a query engine with LlamaIndex over a large collection of PDFs and wants each source returned with its page number. The comments suggest using node metadata for this. One community member advises setting the metadata when building the nodes, then printing the metadata of the source nodes after getting the response. Another confirms that the PDF reader should split documents by page automatically and store the page number in the metadata; if it does not, a custom loader may be needed to set the metadata as desired.
I am building a query engine with LlamaIndex, based on a large database of PDFs, and I would like to be able to retrieve the page number along with the sources. @Logan M
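The metadata approach suggested in the comments could look roughly like the sketch below. It assumes each source node exposes `.node.metadata` (a dict) and `.score`, and that the PDF reader stored the page under a `page_label` key and the file under `file_name`; those key names are assumptions and may differ depending on the loader and version, so it is worth printing one node's metadata first to check. `format_sources` is a hypothetical helper, not part of LlamaIndex, and the stand-in objects are only there so the snippet runs without an index or API key.

```python
from types import SimpleNamespace


def format_sources(source_nodes):
    """Return one 'file, page N (score S)' line per retrieved source node."""
    lines = []
    for item in source_nodes:
        # Assumed shape: item.node.metadata is a dict, item.score is a float or None.
        meta = item.node.metadata or {}
        file_name = meta.get("file_name", "unknown file")  # assumed key name
        page = meta.get("page_label", "?")                 # assumed key name
        score = item.score
        score_text = f"{score:.2f}" if score is not None else "n/a"
        lines.append(f"{file_name}, page {page} (score {score_text})")
    return lines


# Stand-in objects that mimic the assumed shape of the retrieved sources.
fake_sources = [
    SimpleNamespace(
        node=SimpleNamespace(metadata={"file_name": "report.pdf", "page_label": "12"}),
        score=0.873,
    ),
    SimpleNamespace(node=SimpleNamespace(metadata={}), score=None),
]

for line in format_sources(fake_sources):
    print(line)
```

With a real query engine, the same helper would be called as `format_sources(response.source_nodes)` after `response = query_engine.query(...)`. If the page key turns out to be missing, that is the case where a custom loader that sets the metadata itself would be needed.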