How can I version my documents (Notion, PDF, etc.) for a RAG pipeline? Let's say there is an update to the documentation: right now I would have to vectorize the complete data set again.
I think you can pass filename_as_id=True when reading the docs.
This uses each file's name as its unique doc ID. When an existing file gets updated, you can remove all the nodes for that doc ID and insert the updated file, so only the changed file is re-vectorized instead of the whole data set.
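To make the delete-then-reinsert idea concrete, here is a rough, library-agnostic sketch in plain Python. Everything in it (`ToyIndex`, `Node`, `upsert_file`, the fixed-size chunking, and the content hash used to detect changes) is hypothetical and only stands in for whatever your reader and vector store actually provide. A real pipeline would also compute and store embeddings for each chunk.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Node:
    doc_id: str  # the filename, used as the document ID
    text: str    # chunk text (a real pipeline would also keep an embedding)


@dataclass
class ToyIndex:
    """Hypothetical in-memory stand-in for a vector index keyed by doc ID."""
    nodes: List[Node] = field(default_factory=list)
    doc_hashes: Dict[str, str] = field(default_factory=dict)

    def upsert_file(self, filename: str, content: str, chunk_size: int = 200) -> bool:
        """Re-index one file only if its content changed.

        Returns True if the file was (re)indexed, False if it was skipped.
        """
        new_hash = hashlib.sha256(content.encode("utf-8")).hexdigest()
        if self.doc_hashes.get(filename) == new_hash:
            return False  # unchanged since last run, nothing to re-vectorize

        # Remove every node that belongs to this file ID...
        self.nodes = [n for n in self.nodes if n.doc_id != filename]

        # ...then insert the chunks of the updated file.
        chunks = [content[i:i + chunk_size] for i in range(0, len(content), chunk_size)]
        self.nodes.extend(Node(doc_id=filename, text=c) for c in chunks)
        self.doc_hashes[filename] = new_hash
        return True


index = ToyIndex()
index.upsert_file("guide.pdf", "first version of the guide")
index.upsert_file("notes.md", "some notes")
index.upsert_file("guide.pdf", "second version of the guide")  # only guide.pdf is redone
```

The stored hash per filename is what gives you a cheap form of versioning here: unchanged files are skipped, and a changed file replaces only its own nodes.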