Find answers from the community

Updated 6 months ago

Hi there, I learned that PDF parsing

At a glance

The community member is inquiring about the complexity of parsing different types of content, such as PDF, word documents, and other formats beyond pure text. They are wondering if word parsing is as complex as PDF parsing, or if it is less complex. The comments suggest that parsing tables and images can be challenging, and recommend trying tools like LlamaParse or Unstructured. However, there is no explicitly marked answer to the original question.

oosiworx

Hi there, I learned that PDF parsing seems to be a very complex task, how is it about word parsing? Is that the same story in different cloth or is that less complex? what would be the easiest to parse to get the most best results beside pure text?

2 comments

WWhiteFang_Jr

Table, images parsing using any parsing is hard.
I would recommend trying LlamaParse but if not then give try to Unstructured too.

oosiworx

OK thank you

Add a reply