Find answers from the community

Updated 10 months ago

I guess I found a way, I check if doc.

I guess I found a way, I check if doc.text is != '' and then add it to a seperate docs array when I then path to the embedding process. so far it did get around the error
W
o
4 comments
Yeah that will work, But curiousAre you making document object by yourself?

If so then you can put a check before making Document object that if the text is empty dont make a document object and pass!
I do like: documents = SimpleDirectoryReader(folder).load_data()
It could be that some of your pages are empty thats why it is making document object for that.

In that case your way will work.
ok cool, its like unknown files with unknown content so I need to get around manually check for 0k files πŸ˜„
Add a reply
Sign up and join the conversation on Discord