Give an ingestion pipeline with Vector+DocStore+IngestionCache with DocstoreStrategy=UPSERTS over Documents in a recursive directory.
If I run this same ingestion pipeline with Documents = 1 single file what would occur?
Will the other docs be deleted since (I know UPSERTS usually is just UPDATE+INSERT) but just checking.
If the single document file existed in the full processing, will it recognized and only perform the update.
---------------------
Similar question, if i wanted to run a completely different source of documents like youtube transcripts into the same Vector Collection would both ingestion pipelines be able to work without stepping on each others embeddings.
@Logan M - This probably warrants a seperate question but setup above gives context.
How would you manage the removal of a document in DocStore/Vector/Cache.
I can do this manually from the VectorStore by using search but then that leaves the DocStore with reference to a file that no longer exist.
Does llamaindex assist with this or does one need to manually call into the docstore/vectorstore and do the manual delection based on a file being deleted