@Logan M I found the bm25 retriever actually retrieves the document im expecting but its pulling the entire document from the doc store which is quite long. Is there a built in solution to provide it with nodes stored in the vector store? Im using postgres for context
I see. I was thinking about chunking before storing in the docstore but then the issue is that each node wouldnt have a unique id i can assign since im using the ingestion pipeline.
I was thinking Doc1(id=A123) could become TextNode(id=A123-0), TextNode(id=A123-1)...TextNode(id=A123-N) but then i could be left with some nodes not getting deduped if the document gets shorter for example