doc_id
to the names of the files or utilizing metadata and extracting the information from there?(Optional[Callable[str, Dict]])
as an arg in SimpleDirectoryReader, or you can manually set doc_id on the documents after retrieving them with SimpleDirectoryReader. Another option is there's an extra_info
field on the Document you can set (it allows you to specify a general dictionary of metadata per Document)