Is anyone aware of a convenient way to manually edit the Documents object? I’m still getting some errors in pdf extraction, and it would be nice to do a few edits prior to running it through the node parser.
I was thinking more like if someone has developed a text editor offline from Python. I’m not quite sure if it’s possible to save this Document object as a text file, save to disk and edit. But thanks for tip. I will look at copying document.text, editing in a text editor and then inserting it back in, document by document.